BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy11641
         (277 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|195567537|ref|XP_002107316.1| GD15630 [Drosophila simulans]
 gi|194204722|gb|EDX18298.1| GD15630 [Drosophila simulans]
          Length = 411

 Score =  229 bits (584), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 106/203 (52%), Positives = 137/203 (67%), Gaps = 30/203 (14%)

Query: 73  PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
           PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +  
Sbjct: 239 PLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASDG 298

Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q             
Sbjct: 299 CLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------- 345

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
               G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T QL
Sbjct: 346 ----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQL 388

Query: 253 AMLRCDENNSYQQWRFKEVRPDW 275
           ++  CD N+SYQQW FKE+RP W
Sbjct: 389 SLGHCDVNDSYQQWWFKEIRPRW 411



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 303 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 347


>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon
           pisum]
          Length = 588

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 104/205 (50%), Positives = 140/205 (68%), Gaps = 30/205 (14%)

Query: 73  PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
           PLA +LDMGDIS+Q+ L+DKL CKDFSWFM++VAYDVYTKFPELPPNL+WGE +N+G   
Sbjct: 414 PLARYLDMGDISKQLELKDKLQCKDFSWFMENVAYDVYTKFPELPPNLYWGELRNIGKTT 473

Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           CLD+ G   P+++G   CHG G++QLFRLN +GQL  GERC+ AD+Q             
Sbjct: 474 CLDTRGHQPPSLVGLELCHGQGNNQLFRLNTKGQLSVGERCIFADRQ------------- 520

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
                VKL+ C L             GTVDGPW++ ++ + L H+I ++CLSVHP ++Q+
Sbjct: 521 ----NVKLVVCPL-------------GTVDGPWQYESTNKLLLHSITRKCLSVHPSSDQV 563

Query: 253 AMLRCDENNSYQQWRFKEVRPDWSS 277
            ++ CD NN+ QQW FK++ P W+S
Sbjct: 564 ILMPCDSNNANQQWIFKQIVPSWTS 588



 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 23/44 (52%), Positives = 32/44 (72%)

Query: 2   GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           G   P+++G   CHG G++QLFRLN +GQL  GERC+ AD+Q +
Sbjct: 479 GHQPPSLVGLELCHGQGNNQLFRLNTKGQLSVGERCIFADRQNV 522


>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae]
 gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae]
          Length = 591

 Score =  228 bits (582), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 107/204 (52%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGLQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPVTQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           LA+  CD N+SYQQW FKE+RP W
Sbjct: 568 LALGHCDVNDSYQQWWFKEIRPRW 591



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGLQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527


>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta]
 gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta]
          Length = 591

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527


>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide
           N-acetylgalactosaminyltransferase 2 [Drosophila
           melanogaster]
          Length = 591

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527


>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform A
           [Drosophila melanogaster]
 gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform B
           [Drosophila melanogaster]
 gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 7;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7;
           AltName: Full=dGalNAc-T2
 gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform A
           [Drosophila melanogaster]
 gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2, isoform B
           [Drosophila melanogaster]
 gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
 gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster]
 gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct]
          Length = 591

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527


>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba]
 gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba]
          Length = 591

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527


>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster]
          Length = 486

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 313 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 372

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 373 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 420

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 421 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 462

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+RP W
Sbjct: 463 LSLGHCDVNDSYQQWWFKEIRPRW 486



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 378 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 422


>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni]
 gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni]
          Length = 587

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 104/204 (50%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDI+EQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 414 EPLARYLDMGDITEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 473

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERC++AD+Q            
Sbjct: 474 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCIEADRQ------------ 521

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 522 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPTTQQ 563

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++ RCD N+SYQQW FKE+ P W
Sbjct: 564 LSLARCDVNDSYQQWWFKEIHPRW 587



 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 28/45 (62%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERC++AD+Q I
Sbjct: 479 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCIEADRQGI 523


>gi|16769336|gb|AAL28887.1| LD27021p [Drosophila melanogaster]
          Length = 214

 Score =  228 bits (580), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 41  EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 100

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 101 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 148

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 149 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 190

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+RP W
Sbjct: 191 LSLGHCDVNDSYQQWWFKEIRPRW 214



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 106 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 150


>gi|195172039|ref|XP_002026809.1| GL27027 [Drosophila persimilis]
 gi|194111748|gb|EDW33791.1| GL27027 [Drosophila persimilis]
          Length = 567

 Score =  226 bits (577), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 105/204 (51%), Positives = 136/204 (66%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDI+EQ+AL+ +L CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 394 EPLARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 453

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 454 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 501

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW++++ T+ L H + K+C+++HP T Q
Sbjct: 502 -----GIKLAVCRL-------------GTVDGPWQYNDHTKHLMHRVHKKCMALHPTTQQ 543

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L +  CD N+SYQQW FKE+RP W
Sbjct: 544 LTLGHCDVNDSYQQWWFKEIRPRW 567



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 459 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 503


>gi|125980684|ref|XP_001354365.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
 gi|54642673|gb|EAL31418.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
          Length = 591

 Score =  226 bits (577), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 105/204 (51%), Positives = 136/204 (66%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDI+EQ+AL+ +L CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW++++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNDHTKHLMHRVHKKCMALHPTTQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L +  CD N+SYQQW FKE+RP W
Sbjct: 568 LTLGHCDVNDSYQQWWFKEIRPRW 591



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527


>gi|195345467|ref|XP_002039290.1| GM22807 [Drosophila sechellia]
 gi|194134516|gb|EDW56032.1| GM22807 [Drosophila sechellia]
          Length = 591

 Score =  226 bits (576), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 105/204 (51%), Positives = 136/204 (66%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW+++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++  CD N+SYQQW FKE+ P W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIHPRW 591



 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 29/45 (64%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527


>gi|383860243|ref|XP_003705600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Megachile
           rotundata]
          Length = 581

 Score =  223 bits (568), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 100/212 (47%), Positives = 139/212 (65%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA  LD GDISEQ+A + +  CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 400 YKEFFYTREPLAQLLDHGDISEQLAFKKRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 459

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N+ T  CLD+M  + P+++ TS+CHG G++QL RLNA+GQLG GERC+ AD Q    
Sbjct: 460 ELRNVATGTCLDTMSHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCISADSQ---- 515

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        GVK +FCRL             GTVDGPW++   T+ L H + K+C+
Sbjct: 516 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 549

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
           ++HP+T QL+++ CD NN+YQQW F ++RP+W
Sbjct: 550 ALHPQTQQLSLMPCDINNTYQQWSFHQIRPEW 581



 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 25/43 (58%), Positives = 34/43 (79%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
           M  + P+++ TS+CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCISADSQ 515


>gi|170060398|ref|XP_001865784.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
 gi|167878898|gb|EDS42281.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
          Length = 356

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 104/212 (49%), Positives = 134/212 (63%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA FLDMGDISEQ+AL+++L CK F W+M +VAYDV  K+P LP NLFWG
Sbjct: 175 YKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPALPANLFWG 234

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E KN GT+KC+D++GR  PA+IG  +CHG G +QL RLNA GQLG GERC++AD Q    
Sbjct: 235 ELKNSGTEKCVDALGRQPPAIIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNQ---- 290

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        G+KL FCRL             GTVDGPW++  +T  L H   K C+
Sbjct: 291 -------------GIKLAFCRL-------------GTVDGPWQYDETTSTLLHRTYKMCM 324

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
             HP+T  LA++ CD +N+YQ W+F  + P W
Sbjct: 325 GYHPQTRALALMPCDVHNAYQSWKFHTITPRW 356



 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 28/45 (62%), Positives = 34/45 (75%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           +GR  PA+IG  +CHG G +QL RLNA GQLG GERC++AD Q I
Sbjct: 248 LGRQPPAIIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNQGI 292


>gi|195400935|ref|XP_002059071.1| GJ15190 [Drosophila virilis]
 gi|194141723|gb|EDW58140.1| GJ15190 [Drosophila virilis]
          Length = 591

 Score =  219 bits (559), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 102/204 (50%), Positives = 135/204 (66%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDI+EQ+AL+ +L+CK F WFM ++AYDV  KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDITEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW ++  T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAICRL-------------GTVDGPWSYNEHTKHLLHRVHKKCMALHPTTQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++ +CD N+ YQ+W FKE+RP W
Sbjct: 568 LSLGQCDVNDGYQKWLFKEIRPHW 591



 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 30/45 (66%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQGI 527


>gi|91081797|ref|XP_973938.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
 gi|270006291|gb|EFA02739.1| hypothetical protein TcasGA2_TC008465 [Tribolium castaneum]
          Length = 583

 Score =  219 bits (557), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 141/212 (66%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      P+A FLDMGDISEQ+AL+ +L+CK F WFM++VAYDV  K+PELPPN+ +G
Sbjct: 402 YKEFFYTREPMARFLDMGDISEQLALKKRLNCKSFEWFMENVAYDVLDKYPELPPNIHFG 461

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +++   KCLD++G   P+++   +CHG G++QL RLNA+GQLG GERC++AD Q    
Sbjct: 462 ELRSVAVSKCLDTLGHGPPSLMAIQHCHGYGNNQLIRLNAKGQLGVGERCIEADAQ---- 517

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        G+KL FCR+             GTVDGPW +  +T  + H + K+C+
Sbjct: 518 -------------GIKLAFCRM-------------GTVDGPWLYDENTHTILHRVHKKCI 551

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
           ++HP+T+ L+++ CD NN+YQQW FKE+RP W
Sbjct: 552 ALHPQTSHLSLMPCDINNAYQQWIFKEIRPKW 583



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 34/45 (75%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           +G   P+++   +CHG G++QL RLNA+GQLG GERC++AD Q I
Sbjct: 475 LGHGPPSLMAIQHCHGYGNNQLIRLNAKGQLGVGERCIEADAQGI 519


>gi|307212076|gb|EFN87959.1| N-acetylgalactosaminyltransferase 7 [Harpegnathos saltator]
          Length = 563

 Score =  219 bits (557), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 101/204 (49%), Positives = 134/204 (65%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA FLD GDISEQ+  +++  CK F W+M +VAYDV  KFPELPPN+ WGE +N+ T+
Sbjct: 390 EPLARFLDHGDISEQLMFKERRKCKSFQWYMDNVAYDVLDKFPELPPNIHWGELQNVATK 449

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLD+MG   P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q            
Sbjct: 450 SCLDTMGHAPPSLMATSHCHGFGNNQLMRLNAKGQLGVGERCVEADGQ------------ 497

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                GVK  FCRL             GT DGPW++   T+ L H + K+C+++HP+T Q
Sbjct: 498 -----GVKYAFCRL-------------GTADGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 539

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L+++ CD NNSYQQW F++V P W
Sbjct: 540 LSLMPCDINNSYQQWIFRQVHPQW 563



 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/45 (60%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q +
Sbjct: 455 MGHAPPSLMATSHCHGFGNNQLMRLNAKGQLGVGERCVEADGQGV 499


>gi|157106440|ref|XP_001649323.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108879843|gb|EAT44068.1| AAEL004538-PA [Aedes aegypti]
          Length = 596

 Score =  219 bits (557), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 102/212 (48%), Positives = 133/212 (62%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA FLDMGDISEQ+AL+++L CK F W+M +VAYDV  K+P LP NLFWG
Sbjct: 415 YKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPALPANLFWG 474

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E KN GT+KC+D++GR  PA+IG  +CHG G +QL RLNA GQLG GERC++AD      
Sbjct: 475 ELKNSGTEKCVDALGRQPPAMIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNM---- 530

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        G+KL FCR+             GTVDGPW++  +T  L H   K C+
Sbjct: 531 -------------GIKLAFCRM-------------GTVDGPWQYDENTSTLLHRTYKMCM 564

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
             HP+T  LA++ CD +N+YQ W+F  + P W
Sbjct: 565 GYHPQTRALALMPCDVHNAYQSWKFHTITPRW 596



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 27/45 (60%), Positives = 33/45 (73%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           +GR  PA+IG  +CHG G +QL RLNA GQLG GERC++AD   I
Sbjct: 488 LGRQPPAMIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNMGI 532


>gi|195039904|ref|XP_001990971.1| GH12336 [Drosophila grimshawi]
 gi|193900729|gb|EDV99595.1| GH12336 [Drosophila grimshawi]
          Length = 591

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 102/204 (50%), Positives = 134/204 (65%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA +LDMGDISEQ+AL+ +L+CK F WFM ++AYDV  KFP LP NL WGE +++ + 
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHWGELRSVASD 477

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLDSMG   PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q            
Sbjct: 478 GCLDSMGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQ------------ 525

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+KL  CRL             GTVDGPW ++  T+ L H + K+C+++HP   Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWSYNEHTKHLLHRVHKKCMALHPAGQQ 567

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L++ +CD N+ YQ+W FKE+RP W
Sbjct: 568 LSLGQCDVNDGYQKWLFKEIRPHW 591



 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 30/45 (66%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQGI 527


>gi|307169192|gb|EFN62008.1| N-acetylgalactosaminyltransferase 7 [Camponotus floridanus]
          Length = 580

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 133/204 (65%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA  LD GDISEQ+A +++  CK+F W+M +VAYDV  KFPELPPN+ WGE +N+ T 
Sbjct: 407 EPLARLLDHGDISEQLAFKERKKCKNFQWYMDNVAYDVLDKFPELPPNIHWGELRNVATG 466

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLD+MG   P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q            
Sbjct: 467 SCLDTMGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGIGERCVEADGQ------------ 514

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                G+K  FCRL             GT DGPW++   T+ L H + K+C+++HP+T Q
Sbjct: 515 -----GIKYAFCRL-------------GTADGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 556

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L+++ CD NN+YQQW F ++ P W
Sbjct: 557 LSLMLCDINNTYQQWNFHQIHPRW 580



 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 28/45 (62%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q I
Sbjct: 472 MGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGIGERCVEADGQGI 516


>gi|156537099|ref|XP_001602659.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Nasonia
           vitripennis]
          Length = 583

 Score =  215 bits (548), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 99/204 (48%), Positives = 133/204 (65%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA  LD GDI+EQ+  + +  CK F WFM ++AYDV  KFPELPPN+ WGE KN+ TQ
Sbjct: 410 EPLARLLDHGDITEQLEFKRRKGCKSFQWFMDNIAYDVLDKFPELPPNIHWGEMKNVATQ 469

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLD+MG   P ++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q            
Sbjct: 470 MCLDTMGHAPPNLMATSHCHGFGNNQLIRLNAKGQLGVGERCVEADGQ------------ 517

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                GVKL FCRL             GTVDGPW++   ++ L H + K+C+++HP+T+ 
Sbjct: 518 -----GVKLAFCRL-------------GTVDGPWQYDEKSKTLLHRVHKKCMALHPQTHH 559

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L+++ CD NN+YQQW F ++ P+W
Sbjct: 560 LSLMPCDMNNAYQQWTFVQISPNW 583



 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 27/45 (60%), Positives = 35/45 (77%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   P ++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q +
Sbjct: 475 MGHAPPNLMATSHCHGFGNNQLIRLNAKGQLGVGERCVEADGQGV 519


>gi|380013105|ref|XP_003690610.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
           7-like [Apis florea]
          Length = 581

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 135/212 (63%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA  LD GDISEQ+  + +  CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 400 YKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 459

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N+ T  CLD+M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q    
Sbjct: 460 ELRNVATGTCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ---- 515

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        GVK +FCRL             GTVDGPW++   T+ L H + K+C+
Sbjct: 516 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 549

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
           ++HP+T QL+++ CD NN+YQQW F +V P W
Sbjct: 550 ALHPQTQQLSLMPCDINNTYQQWSFHQVHPQW 581



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 33/43 (76%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
           M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ 515


>gi|242016390|ref|XP_002428804.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212513501|gb|EEB16066.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 579

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 99/212 (46%), Positives = 138/212 (65%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA +L+MGDIS+Q+A++++L CK F WFM+ VA D+ TKFPELPPN+ WG
Sbjct: 398 YKEFFYTREPLARYLNMGDISKQLAMKERLQCKSFQWFMEEVAPDLLTKFPELPPNVHWG 457

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +NL T KCLD++G   PA +  S CHG G++QL RLN++GQLG GERC+D  KQ    
Sbjct: 458 ELRNLATLKCLDTLGHGPPARMSVSECHGFGNNQLVRLNSQGQLGIGERCIDGGKQ---- 513

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        G++L+FCRL             G+VDGPW++    R L H  ++ CL
Sbjct: 514 -------------GIRLVFCRL-------------GSVDGPWQYDEKGRTLLHVKMQMCL 547

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
           ++HP+T+QL+++ CD NNS+ QW FKE+ P W
Sbjct: 548 ALHPQTSQLSLMPCDYNNSFHQWIFKEIHPKW 579



 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 25/45 (55%), Positives = 32/45 (71%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           +G   PA +  S CHG G++QL RLN++GQLG GERC+D  KQ I
Sbjct: 471 LGHGPPARMSVSECHGFGNNQLVRLNSQGQLGIGERCIDGGKQGI 515


>gi|350400167|ref|XP_003485756.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Bombus
           impatiens]
          Length = 582

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 135/212 (63%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA  LD GDISEQ+  + +  CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 401 YKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 460

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N+ T  CLD+M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q    
Sbjct: 461 ELRNIATGMCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADGQ---- 516

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        GVK +FCRL             GTVDGPW++   T+ L H + K+C+
Sbjct: 517 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 550

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
           ++HP+T QL+++ CD NN+YQQW F ++ P W
Sbjct: 551 ALHPQTQQLSLMPCDINNTYQQWSFHQIHPKW 582



 Score = 57.8 bits (138), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 33/43 (76%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
           M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 474 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADGQ 516


>gi|340718182|ref|XP_003397550.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Bombus
           terrestris]
          Length = 581

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 96/204 (47%), Positives = 132/204 (64%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA  LD GDISEQ+  + +  CK F W+M++VAYDV+ KFPELPPN+ WGE +N+ T 
Sbjct: 408 EPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWGELRNIATG 467

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLD+M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q            
Sbjct: 468 MCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADSQ------------ 515

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                GVK +FCRL             GTVDGPW++   T+ L H + K+C+++HP+T Q
Sbjct: 516 -----GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 557

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L+++ CD NN+YQQW F ++ P W
Sbjct: 558 LSLMPCDVNNTYQQWSFHQIHPKW 581



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 33/43 (76%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
           M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADSQ 515


>gi|332023194|gb|EGI63450.1| N-acetylgalactosaminyltransferase 7 [Acromyrmex echinatior]
          Length = 614

 Score =  214 bits (544), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 98/203 (48%), Positives = 131/203 (64%), Gaps = 30/203 (14%)

Query: 73  PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
           PLA  LD GDISEQ+A + +  CK F WF+ +VAYDV  KFPELPPN+ WGE +N  +  
Sbjct: 442 PLARLLDHGDISEQLAFKKRKKCKSFQWFVDNVAYDVLDKFPELPPNIHWGELRNTASGS 501

Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           CLD+MG + P+++ TS+CHG G++QL RLNA+GQLG GERC+DAD Q             
Sbjct: 502 CLDTMGHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIDADGQR------------ 549

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
                +K  FCRL             GTVDGPW++   T+ L H + K+C+++HP+T QL
Sbjct: 550 -----IKYAFCRL-------------GTVDGPWQYDEKTKTLLHRVYKKCIALHPQTQQL 591

Query: 253 AMLRCDENNSYQQWRFKEVRPDW 275
           +++ CD NN+YQQW F +V P W
Sbjct: 592 SLMPCDINNAYQQWSFHQVHPRW 614



 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 28/45 (62%), Positives = 37/45 (82%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG + P+++ TS+CHG G++QL RLNA+GQLG GERC+DAD Q I
Sbjct: 506 MGHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIDADGQRI 550


>gi|322798640|gb|EFZ20244.1| hypothetical protein SINV_10970 [Solenopsis invicta]
          Length = 580

 Score =  214 bits (544), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 98/204 (48%), Positives = 132/204 (64%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            PLA  LD GDISEQ+A +++  CK F W++ +VAYDV  KFPELPPN+ WGE +N+ T 
Sbjct: 407 EPLARLLDHGDISEQLAFKERKKCKSFQWYIDNVAYDVLDKFPELPPNIHWGELRNVATG 466

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLD+MG   P+++ TS+CHG G++QL RLNA+GQLG GERC++AD Q            
Sbjct: 467 SCLDTMGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIEADGQ------------ 514

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                GVK  FCRL             GT DGPW++   T+ L H + K+C+++HP+T Q
Sbjct: 515 -----GVKYAFCRL-------------GTADGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 556

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L+++ CD NN+YQQW F +V P W
Sbjct: 557 LSLMPCDINNTYQQWSFHQVHPRW 580



 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 26/45 (57%), Positives = 36/45 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   P+++ TS+CHG G++QL RLNA+GQLG GERC++AD Q +
Sbjct: 472 MGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIEADGQGV 516


>gi|328781461|ref|XP_395266.4| PREDICTED: n-acetylgalactosaminyltransferase 7 [Apis mellifera]
          Length = 581

 Score =  213 bits (543), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 98/212 (46%), Positives = 134/212 (63%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA  LD GDISEQ+  + +  CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 400 YKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 459

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N  T  CLD+M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q    
Sbjct: 460 ELRNGATGTCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ---- 515

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        GVK +FCRL             GTVDGPW++   T+ L H + K+C+
Sbjct: 516 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 549

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
           ++HP+T QL+++ CD NN+YQQW F +V P W
Sbjct: 550 ALHPQTQQLSLMPCDINNTYQQWSFHQVHPQW 581



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 24/43 (55%), Positives = 33/43 (76%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
           M  + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ 515


>gi|158289457|ref|XP_311182.4| AGAP000656-PA [Anopheles gambiae str. PEST]
 gi|157018524|gb|EAA06901.4| AGAP000656-PA [Anopheles gambiae str. PEST]
          Length = 598

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 100/212 (47%), Positives = 132/212 (62%), Gaps = 30/212 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA FLDMGDISEQ+AL+++L CK F W+M +VAYDV  K+P LP N+ WG
Sbjct: 417 YKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPMLPANVKWG 476

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N+G +KC+D++GR  PAVIG   CHG G +QL RLN  GQLG GERC++A   E   
Sbjct: 477 ELQNVGKEKCVDALGRQPPAVIGLQQCHGQGHNQLIRLNGAGQLGVGERCIEAYNSE--- 533

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                         +KL FCRL             GTVDGPW++   +  L H   K+C+
Sbjct: 534 --------------IKLAFCRL-------------GTVDGPWQYDEQSGTLLHRTHKKCM 566

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
              P+T QLA++ CD NN+YQ W+FKE++P W
Sbjct: 567 GYQPQTRQLALMPCDINNAYQSWKFKEIQPHW 598



 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 25/40 (62%), Positives = 29/40 (72%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           +GR  PAVIG   CHG G +QL RLN  GQLG GERC++A
Sbjct: 490 LGRQPPAVIGLQQCHGQGHNQLIRLNGAGQLGVGERCIEA 529


>gi|321473823|gb|EFX84789.1| hypothetical protein DAPPUDRAFT_209135 [Daphnia pulex]
          Length = 521

 Score =  211 bits (538), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 96/210 (45%), Positives = 136/210 (64%), Gaps = 30/210 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      P A FLDMG+I++Q+ ++ +L+CK F+WFM+ VAYDV  K+PELP NL WG
Sbjct: 341 YKEFFYTREPTARFLDMGNITQQLEMKKRLNCKSFAWFMEEVAYDVLDKYPELPANLHWG 400

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N   ++CLD+MG   P+++G S+CHG G++QLFRLN  GQLG GERCV+AD Q    
Sbjct: 401 ELRNTAARQCLDTMGHQPPSLMGISHCHGFGNNQLFRLNKAGQLGVGERCVNADSQ---- 456

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                        GVKL+ CRL             G+V+GPW +  +++ L H +LKRCL
Sbjct: 457 -------------GVKLVVCRL-------------GSVEGPWEYDETSKTLLHKVLKRCL 490

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           +VHP+ N L +L CD  N+Y +W F+E++P
Sbjct: 491 AVHPQNNHLTLLPCDTANNYHRWEFREIQP 520



 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 27/45 (60%), Positives = 35/45 (77%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
           MG   P+++G S+CHG G++QLFRLN  GQLG GERCV+AD Q +
Sbjct: 414 MGHQPPSLMGISHCHGFGNNQLFRLNKAGQLGVGERCVNADSQGV 458


>gi|443298648|gb|AGC81884.1| N-acetylgalactosaminyltransferase, partial [Bombyx mori]
          Length = 499

 Score =  210 bits (534), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 103/204 (50%), Positives = 125/204 (61%), Gaps = 30/204 (14%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
            P+A FLDMGDISEQVAL+++L CK F WFM++VAYDVY KFP+LP N+ WG  KN  T 
Sbjct: 326 EPMARFLDMGDISEQVALKERLKCKSFGWFMENVAYDVYDKFPKLPKNVHWGMVKNKATG 385

Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
            CLD+MG+ APA IGTS CHG G+SQLFRLN  GQLG GERCV+ D              
Sbjct: 386 ACLDTMGKAAPAYIGTSSCHGMGNSQLFRLNEAGQLGVGERCVETDGD------------ 433

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                 VK   CRL             GTVDGPW ++    QL H +   CL++ P + Q
Sbjct: 434 -----NVKQAICRL-------------GTVDGPWSYNEERHQLVHRLHGHCLTLQPHSGQ 475

Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
           L +  CD NN+YQQW  K+  P+W
Sbjct: 476 LGLAPCDPNNTYQQWTVKQKTPNW 499



 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 30/41 (73%), Positives = 33/41 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
           MG+ APA IGTS CHG G+SQLFRLN  GQLG GERCV+ D
Sbjct: 391 MGKAAPAYIGTSSCHGMGNSQLFRLNEAGQLGVGERCVETD 431


>gi|312383497|gb|EFR28562.1| hypothetical protein AND_03374 [Anopheles darlingi]
          Length = 874

 Score =  209 bits (533), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 98/210 (46%), Positives = 131/210 (62%), Gaps = 30/210 (14%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA +LDMGDISEQ+AL+++L CK F W+M +VAYDV  K+P LP NLFWG
Sbjct: 415 YKEYFYTREPLAQYLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPMLPANLFWG 474

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N G +KC+D++GR  PA+IG   CHG G +QL RLNA GQLG GERC++A       
Sbjct: 475 ELQNTGMEKCVDALGRQPPAIIGLQVCHGQGHNQLIRLNAAGQLGVGERCIEAYN----- 529

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                       A +KL FCRL             GTVDGPW++   T  L H   K+C+
Sbjct: 530 ------------ADIKLAFCRL-------------GTVDGPWQYDEHTGTLLHRTHKKCM 564

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
              P+  QLA++ CD NN+YQ W+F++++P
Sbjct: 565 GYQPQNRQLALMPCDINNAYQSWKFQKIQP 594



 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 25/40 (62%), Positives = 30/40 (75%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           +GR  PA+IG   CHG G +QL RLNA GQLG GERC++A
Sbjct: 488 LGRQPPAIIGLQVCHGQGHNQLIRLNAAGQLGVGERCIEA 527


>gi|357602062|gb|EHJ63261.1| putative n-acetylgalactosaminyltransferase [Danaus plexippus]
          Length = 499

 Score =  201 bits (511), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 97/203 (47%), Positives = 122/203 (60%), Gaps = 30/203 (14%)

Query: 73  PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
           P+A FLDMGDISEQVALRDKL+CK FSW+M++VAYDVY KFP+LP N+ WG  KN     
Sbjct: 327 PMARFLDMGDISEQVALRDKLNCKSFSWYMENVAYDVYDKFPKLPKNVHWGMVKNKAIGL 386

Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           CLD+MG+ AP+ IG   CHG+G++QL+RLN  GQLG GERC++AD               
Sbjct: 387 CLDTMGKAAPSYIGIQSCHGAGNNQLYRLNEAGQLGVGERCLEADTD------------- 433

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
                +K   CRL             GTVDGPWR+      L H +   CL++ P +  L
Sbjct: 434 ----SLKQTICRL-------------GTVDGPWRYDKERSHLIHRLHSYCLTLQPNSRTL 476

Query: 253 AMLRCDENNSYQQWRFKEVRPDW 275
            +  CD NN+YQQW   +  P W
Sbjct: 477 GLAPCDPNNTYQQWTITQKNPKW 499



 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 25/41 (60%), Positives = 33/41 (80%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
           MG+ AP+ IG   CHG+G++QL+RLN  GQLG GERC++AD
Sbjct: 391 MGKAAPSYIGIQSCHGAGNNQLYRLNEAGQLGVGERCLEAD 431


>gi|427797631|gb|JAA64267.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 641

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 85/213 (39%), Positives = 123/213 (57%), Gaps = 30/213 (14%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      PLA + D GD+ +Q+ALR+KL CK F WFMK+VAYDV   FP LP N
Sbjct: 457 WMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLLPRN 516

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L+WGE ++  T +CLD+MG   P+    + CHG+G +Q+FRLNAEGQLG GERC+DA   
Sbjct: 517 LYWGEIRHDATDQCLDAMGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSH 576

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
                             + +++C L             GTVDGPW +   T+ L H   
Sbjct: 577 -----------------SMDVVYCSL-------------GTVDGPWEYSVDTKHLYHKTH 606

Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
           K+CL+ +    ++ + +CD++N  Q W F+E++
Sbjct: 607 KKCLTYNKRNYKVHLEKCDDSNLSQHWTFREIQ 639



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/52 (50%), Positives = 33/52 (63%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSL 52
           MG   P+    + CHG+G +Q+FRLNAEGQLG GERC+DA    +     SL
Sbjct: 534 MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSHSMDVVYCSL 585


>gi|427797629|gb|JAA64266.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 641

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 85/213 (39%), Positives = 122/213 (57%), Gaps = 30/213 (14%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      PLA + D GD+ +Q+ALR+KL CK F WFMK+VAYDV   FP LP N
Sbjct: 457 WMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLLPRN 516

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L+WGE ++  T +CLD+MG   P+    + CHG+G +Q+FRLNAEGQLG GERC+DA   
Sbjct: 517 LYWGEIRHDATDQCLDAMGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSH 576

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
                             + +++C L             GTVDGPW +   T+ L H   
Sbjct: 577 -----------------SMDVVYCSL-------------GTVDGPWEYSVDTKHLYHKTH 606

Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
           K+CL+ +    ++ + +CD+ N  Q W F+E++
Sbjct: 607 KKCLTYNKRNYKVHLEKCDDGNLSQHWTFREIQ 639



 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/52 (50%), Positives = 33/52 (63%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSL 52
           MG   P+    + CHG+G +Q+FRLNAEGQLG GERC+DA    +     SL
Sbjct: 534 MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSHSMDVVYCSL 585


>gi|449664489|ref|XP_002168298.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Hydra
           magnipapillata]
          Length = 599

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 85/212 (40%), Positives = 121/212 (57%), Gaps = 24/212 (11%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+      PLA F+D GD+SEQ+AL+ +++CK F+WFM+ +AYDV  K+PE PPN  WG
Sbjct: 403 YKEFFYTREPLARFIDTGDVSEQMALKKRMNCKSFTWFMEEIAYDVLKKYPEPPPNAHWG 462

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +N+ T  C+D++ R+ P  IG S CH SG +QL+RLN  GQL  GE CV  +K    +
Sbjct: 463 EVRNIATNLCIDTLNRSPPYRIGLSGCHKSGGNQLWRLNTLGQLASGEWCVRYEKASNEI 522

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
           E  +L + +C                        SGT  GPW+F  S+    H  L  CL
Sbjct: 523 EN-ELSMEWC-----------------------PSGTPSGPWKFDASSGFFTHTGLNMCL 558

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
            V+  T +L M RCD +  + +W+ +E+RP W
Sbjct: 559 IVNSRTLKLNMARCDNSKEHHKWKIEEIRPHW 590



 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 27/42 (64%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 42
           + R+ P  IG S CH SG +QL+RLN  GQL  GE CV  +K
Sbjct: 476 LNRSPPYRIGLSGCHKSGGNQLWRLNTLGQLASGEWCVRYEK 517


>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
          Length = 640

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 124/235 (52%), Gaps = 36/235 (15%)

Query: 42  KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
           K VIST      +R   +W+  +++   +  P A     GDIS Q+ LR +LHCK F W+
Sbjct: 440 KPVISTNM----VRVIKTWMDEYEKYYYIREPSAKHRSPGDISAQLELRKRLHCKSFKWY 495

Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
           M+ VAYDV   +P LP N  WGEAKNL T KC+D+MGR  P ++G + CHG G +QL RL
Sbjct: 496 MEKVAYDVVYSYPFLPENHVWGEAKNLQTSKCIDTMGRAIPGIVGATPCHGYGGNQLIRL 555

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
           N +GQL  GE C+                              LG  L   +C  + GTV
Sbjct: 556 NKKGQLTQGEWCMTP----------------------------LGNQLQTGHC--AKGTV 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAML--RCDENNSYQQWRFKEVRPD 274
           DGP+++  ++ Q++  +   CL+     +   +   +CD +NS Q+W+++E+  D
Sbjct: 586 DGPFQYDEASEQIKMTLSDTCLTADSSMSSSTLSLKKCDPSNSLQKWKWREIFSD 640



 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 26/38 (68%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 38
           MGR  P ++G + CHG G +QL RLN +GQL  GE C+
Sbjct: 531 MGRAIPGIVGATPCHGYGGNQLIRLNKKGQLTQGEWCM 568


>gi|268555252|ref|XP_002635614.1| C. briggsae CBR-GLY-7 protein [Caenorhabditis briggsae]
          Length = 601

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 78/217 (35%), Positives = 116/217 (53%), Gaps = 31/217 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R   +W+  + +      P A  ++ GDIS Q+ALRDKL CK F W+M++VAYDV   +
Sbjct: 411 MRVVKTWMDDYSKYYLTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLQSY 470

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P LPPN  WGEA+N  T KCLD MG   P  +G S CHG G +QL RLN +GQ+  GE C
Sbjct: 471 PLLPPNDVWGEARNPATGKCLDRMG-GIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWC 529

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           + A+        +++    CV                        GTV+G W +  +T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGTVNGNWIYDRNTQQ 559

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           ++H   + C++V    + + +  C E+N  Q++ +KE
Sbjct: 560 IQHNAKRMCVTVSESGSDVILETCTEDNQRQKFVWKE 596



 Score = 44.7 bits (104), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 6   PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
           P  +G S CHG G +QL RLN +GQ+  GE C+ A+
Sbjct: 498 PGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTAN 533


>gi|17561826|ref|NP_503512.1| Protein GLY-7 [Caenorhabditis elegans]
 gi|51315810|sp|O61397.1|GALT7_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 7;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7
 gi|3047203|gb|AAC13677.1| GLY7 [Caenorhabditis elegans]
 gi|373219860|emb|CCD70652.1| Protein GLY-7 [Caenorhabditis elegans]
          Length = 601

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 77/217 (35%), Positives = 117/217 (53%), Gaps = 31/217 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R   +W+  + +      P A  ++ GDIS Q+ALRDKL CK F W+M++VAYDV   +
Sbjct: 411 MRVVKTWMDDYSKYYLTREPQATNVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSY 470

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P LPPN  WGEA+N  T KCLD MG   P  +G + CHG G +QL RLN +GQ+  GE C
Sbjct: 471 PMLPPNDVWGEARNPATGKCLDRMG-GIPGPMGATGCHGYGGNQLIRLNVQGQMAQGEWC 529

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           + A+        +++    CV                        GTV+G W +   T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGTVNGFWSYDRKTKQ 559

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           + H+  ++C++V    +++ +  C E+N  Q++ +KE
Sbjct: 560 IIHSQKRQCITVSESGSEVTLQTCTEDNERQKFVWKE 596



 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 24/36 (66%)

Query: 6   PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
           P  +G + CHG G +QL RLN +GQ+  GE C+ A+
Sbjct: 498 PGPMGATGCHGYGGNQLIRLNVQGQMAQGEWCLTAN 533


>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
          Length = 562

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/235 (35%), Positives = 119/235 (50%), Gaps = 38/235 (16%)

Query: 42  KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
           K VIST      LR   +W+  +++   +  P A     GDIS Q+ LR++L CK F W+
Sbjct: 364 KPVISTNM----LRVIKTWMDEYEKYYYIREPSAKHRLPGDISSQLKLRERLKCKSFEWY 419

Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
           M+ VAYDV   +P  P N  WGEAKN  T KC+D++G+T P ++G   CHG G +QL RL
Sbjct: 420 MEKVAYDVIVSYPLPPENHVWGEAKNHATGKCIDTIGQTIPGIVGAMPCHGYGGNQLIRL 479

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
           N EGQL  GE C+                    + G            NL   +   GTV
Sbjct: 480 NKEGQLTQGEWCITP------------------VGG------------NLVTKYCVKGTV 509

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
           DGP+ +   + Q+   I   CL+     +   L+M +CD  N +Q+W+++E+  D
Sbjct: 510 DGPFAYDEKSEQI--MIDDLCLTAAKSMSSPTLSMEKCDSGNQWQKWKWQEIYID 562



 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 26/38 (68%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 38
           +G+T P ++G   CHG G +QL RLN EGQL  GE C+
Sbjct: 455 IGQTIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCI 492


>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
          Length = 560

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/235 (35%), Positives = 119/235 (50%), Gaps = 38/235 (16%)

Query: 42  KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
           K VIST      LR   +W+  +++   +  P A     GDIS Q+ LR++L CK F W+
Sbjct: 362 KPVISTNM----LRVIKTWMDEYEKYYYIREPSAKHRLPGDISSQLKLRERLKCKSFEWY 417

Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
           M+ VAYDV   +P  P N  WGEAKN  T KC+D++G+T P ++G   CHG G +QL RL
Sbjct: 418 MEKVAYDVIVSYPLPPENHVWGEAKNHATGKCIDTIGQTIPGIVGAMPCHGYGGNQLIRL 477

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
           N EGQL  GE C+                    + G            NL   +   GTV
Sbjct: 478 NKEGQLTQGEWCITP------------------VGG------------NLVTKYCVKGTV 507

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
           DGP+ +   + Q+   I   CL+     +   L+M +CD  N +Q+W+++E+  D
Sbjct: 508 DGPFAYDEKSEQI--MIDDLCLTAAKSMSSPTLSMEKCDSGNQWQKWKWQEIYID 560



 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 20/38 (52%), Positives = 26/38 (68%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 38
           +G+T P ++G   CHG G +QL RLN EGQL  GE C+
Sbjct: 453 IGQTIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCI 490


>gi|402586829|gb|EJW80766.1| glycosyltransferase [Wuchereria bancrofti]
          Length = 409

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 83/235 (35%), Positives = 118/235 (50%), Gaps = 38/235 (16%)

Query: 42  KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
           K VIST      LR   +W+  + +   +  P A     G+IS Q+ LR+ L CK F W+
Sbjct: 211 KPVISTNM----LRVIKTWMDEYDKYYYIREPSAKHRLPGNISSQLKLRESLKCKSFKWY 266

Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
           M+ VAYDV   +P  P N  WGEAKN  T KC+D+MGR  P ++G + CHG G +QL RL
Sbjct: 267 MEKVAYDVIVSYPFPPENHVWGEAKNHATGKCIDTMGRPVPGIVGATPCHGYGGNQLIRL 326

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
           N +GQL  GE C+ A                  + G            NL       GTV
Sbjct: 327 NMKGQLAQGEWCITA------------------VHG------------NLITNHCIKGTV 356

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
           DGP+ ++  + Q+   I   CL+     +   L+M +CD  N +Q+W+++E+  D
Sbjct: 357 DGPFTYNEESEQI--IIRNLCLTAAKSISSSVLSMEKCDSENQWQKWKWREIYID 409



 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 27/40 (67%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR  P ++G + CHG G +QL RLN +GQL  GE C+ A
Sbjct: 302 MGRPVPGIVGATPCHGYGGNQLIRLNMKGQLAQGEWCITA 341


>gi|308506779|ref|XP_003115572.1| CRE-GLY-7 protein [Caenorhabditis remanei]
 gi|308256107|gb|EFP00060.1| CRE-GLY-7 protein [Caenorhabditis remanei]
          Length = 601

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/217 (34%), Positives = 116/217 (53%), Gaps = 31/217 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R   +W+  + +      P A  ++ GDIS Q+ALRDKL CK F W+M++VAYDV   +
Sbjct: 411 MRVVKTWMDDYSKYYLTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSY 470

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P LPPN  WGEA+N  T KCLD MG   P  +G S CHG G +QL RLN +GQ+  GE C
Sbjct: 471 PLLPPNDVWGEARNPATGKCLDRMG-GIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWC 529

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           + A+        +++    CV                        G+V G + +  +T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGSVSGNFVYDRNTQQ 559

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           ++H   + C+++    + +++  C E+N  Q++ +KE
Sbjct: 560 IQHNTKRMCITISEAGSDVSLETCTEDNQRQKFVWKE 596



 Score = 44.3 bits (103), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 18/36 (50%), Positives = 24/36 (66%)

Query: 6   PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
           P  +G S CHG G +QL RLN +GQ+  GE C+ A+
Sbjct: 498 PGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTAN 533


>gi|341881851|gb|EGT37786.1| hypothetical protein CAEBREN_30257 [Caenorhabditis brenneri]
 gi|341887866|gb|EGT43801.1| CBN-GLY-7 protein [Caenorhabditis brenneri]
          Length = 601

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 77/217 (35%), Positives = 115/217 (52%), Gaps = 31/217 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R   +W+  +++      P A  ++ GDIS Q+ALRDKL CK F W+M++VAYDV   +
Sbjct: 411 MRVVKTWMDDYEKYYLTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSY 470

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P LPPN  WG A+N  T KCLD MG   P  +G S CHG G +QL RLN +GQL  GE C
Sbjct: 471 PLLPPNDVWGGAQNPATGKCLDRMG-GIPGPLGASGCHGYGGNQLLRLNVQGQLAQGEWC 529

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           + A+        +++    CV                        G+V+G W +  +T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGSVNGNWVYDRNTQQ 559

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           + H   + C++V    + + +  C E+N  Q++ +KE
Sbjct: 560 ILHNQKRMCITVAESGSDVILDTCTEDNQRQKFVWKE 596



 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 24/36 (66%)

Query: 6   PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
           P  +G S CHG G +QL RLN +GQL  GE C+ A+
Sbjct: 498 PGPLGASGCHGYGGNQLLRLNVQGQLAQGEWCLTAN 533


>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 645

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 83/235 (35%), Positives = 117/235 (49%), Gaps = 38/235 (16%)

Query: 42  KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
           K VIST      LR   +W+  + +   +  P A     G+IS Q+ LR  L CK F W+
Sbjct: 447 KPVISTNM----LRVIKTWMDEYDKYYYIREPSARHRLPGNISSQLKLRKSLKCKSFKWY 502

Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
           M+ VAYDV   +P  P N  WGEAKN  T KC+D+MGR  P ++G + CHG G +QL RL
Sbjct: 503 MEKVAYDVVVSYPFPPENHVWGEAKNHATGKCIDTMGRPVPGIVGATPCHGYGGNQLIRL 562

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
           N +GQL  GE C+ A                  + G            NL       GTV
Sbjct: 563 NRKGQLAQGEWCITA------------------VHG------------NLITNHCIKGTV 592

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
           DGP+ ++  + Q+   I   CL+     +   L+M +CD  N +Q+W+++E+  D
Sbjct: 593 DGPFTYNEESEQI--IIRNLCLTAAKSISSSVLSMEKCDSENQWQKWKWREIYID 645



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 21/40 (52%), Positives = 27/40 (67%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR  P ++G + CHG G +QL RLN +GQL  GE C+ A
Sbjct: 538 MGRPVPGIVGATPCHGYGGNQLIRLNRKGQLAQGEWCITA 577


>gi|241651003|ref|XP_002411252.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215503882|gb|EEC13376.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 478

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 71/168 (42%), Positives = 93/168 (55%), Gaps = 30/168 (17%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P+A   D GD+S Q+ LR  L CK F WFMK+V+YDV   FP LP N
Sbjct: 340 WMDEYKEYFYTREPMARHYDPGDLSGQLELRQSLGCKGFDWFMKNVSYDVLKNFPLLPRN 399

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           + WGE + + T +CLD+M    P+ +  S CHG+G +Q+FRLNAEGQLG GERCVDA   
Sbjct: 400 IHWGEIRTMVTGQCLDTMNAHPPSTVSVSSCHGTGGNQIFRLNAEGQLGVGERCVDASSH 459

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
                             ++L+FC L             GTVDGPW +
Sbjct: 460 S-----------------MQLVFCSL-------------GTVDGPWEY 477



 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 25/40 (62%), Positives = 30/40 (75%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           M    P+ +  S CHG+G +Q+FRLNAEGQLG GERCVDA
Sbjct: 417 MNAHPPSTVSVSSCHGTGGNQIFRLNAEGQLGVGERCVDA 456


>gi|391336074|ref|XP_003742408.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Metaseiulus
           occidentalis]
          Length = 593

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/214 (36%), Positives = 116/214 (54%), Gaps = 35/214 (16%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W   +K+      P+A   D G++++Q+ LR +L C+ FSWFMK +AYDV   FP LP N
Sbjct: 412 WFDEYKEYFYTREPMARDYDAGNLTQQLELRKRLQCRPFSWFMKEIAYDVLKHFPYLPRN 471

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
             WG  K +    CLDSM    P  +G S CHG G +QLFRLN EGQLG GERCVDA+K+
Sbjct: 472 NRWGHIKPVSEDGCLDSMRAHPPTNVGVSGCHGGGGNQLFRLNEEGQLGVGERCVDANKE 531

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI- 238
                               L +C+L             GTV+GPW +  +T+Q++H   
Sbjct: 532 NMF-----------------LTYCKL-------------GTVNGPWTYDPNTKQMKHTRD 561

Query: 239 -LKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            + RCL +  + ++L + +C   +  Q++ ++E+
Sbjct: 562 GVIRCLQI--DRDRLFLAKCSAIDD-QKYDWEEI 592



 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 19/24 (79%), Positives = 22/24 (91%)

Query: 20  SQLFRLNAEGQLGHGERCVDADKQ 43
           +QLFRLN EGQLG GERCVDA+K+
Sbjct: 508 NQLFRLNEEGQLGVGERCVDANKE 531


>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
           [Strongylocentrotus purpuratus]
          Length = 606

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 63/209 (30%), Positives = 92/209 (44%), Gaps = 32/209 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W   +K+      P ++ +  G+I +Q+A R+K HCK F WFMK +A D+  K+P    N
Sbjct: 422 WFDDYKEYFYRSKPESLLVSTGNIEKQLAFREKFHCKSFGWFMKEIAPDIIEKYPLPHAN 481

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
            +WGE +      C+DSMG      +G SYCHG+G +QLFR+   GQL   ++C     +
Sbjct: 482 KYWGEIRTKKGSLCVDSMGSKDGGRVGMSYCHGAGGNQLFRVTENGQLRIHDQCAYDHYK 541

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
           E                 V+L  C               G   G W F  +T  L HA  
Sbjct: 542 E-----------------VRLRRC---------------GGSGGGWSFDEATGLLHHAGT 569

Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +CL      N++ + +C      Q W  
Sbjct: 570 SQCLEHKAGVNEIILTKCASIKPSQLWEI 598



 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 1/45 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV-DADKQV 44
           MG      +G SYCHG+G +QLFR+   GQL   ++C  D  K+V
Sbjct: 499 MGSKDGGRVGMSYCHGAGGNQLFRVTENGQLRIHDQCAYDHYKEV 543


>gi|395542397|ref|XP_003773119.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
           [Sarcophilus harrisii]
          Length = 797

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 95/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 606 IRVVEVWWDGYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 665

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG      +    CH  G +QLFR+N   QL   ++C
Sbjct: 666 PLPPKNIEWGEIRGTETAHCIDSMGHANGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 725

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +  +      +  K+ +T C L                 N FN        W++  +  +
Sbjct: 726 LTKEP-----DGTKIMITHCSL-----------------NEFNE-------WQYFKNLHR 756

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL   P  +Q+ +  CD N S Q+W    +
Sbjct: 757 FTHITSGKCLDRSPVLHQVFLSECDSNKSTQKWEMNNI 794


>gi|332030162|gb|EGI69956.1| N-acetylgalactosaminyltransferase 6 [Acromyrmex echinatior]
          Length = 603

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 105/220 (47%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  + +      P    LD GD+SEQ ALR KLHCK F+WFMK++A+D+   +P + P+
Sbjct: 394 WMDEYAEYLYKRRPHLRTLDPGDLSEQKALRKKLHCKPFNWFMKNIAFDLVEVYPPIEPD 453

Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER- 172
            F +GE +N+G T+ CLDS  R    ++    C          Q FRL     +   +R 
Sbjct: 454 DFAYGEIRNMGVTELCLDSKKRKRDELVVVDTCMKDDQKVSGEQEFRLTWHKDIRPKDRT 513

Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+D  K E+     K PV                   NL+ C    G  +  WR+    
Sbjct: 514 ECLDVSKGEE-----KAPV-------------------NLYPCHGKQG--NQLWRYDVEK 547

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           + L+H    RCL   P + ++ +  CD+++  Q+WR ++V
Sbjct: 548 QWLQHGYSSRCLDTDPSSKRVFVTICDKSSPTQKWRIEQV 587


>gi|395840004|ref|XP_003792860.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 2 [Otolemur
           garnettii]
          Length = 657

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 95/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNIDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L G K                         W++  +  +
Sbjct: 586 L-----TKGPDGSKIMITHCSLNGFK------------------------EWQYFKNLYR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|260789880|ref|XP_002589972.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
 gi|229275159|gb|EEN45983.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
          Length = 522

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 97/218 (44%), Gaps = 30/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K+      P       GDISE VA R++ HCK F WFM+ +AYD+   F
Sbjct: 332 MRVVEVWWDEYKEYFYASRPEIRNHPYGDISELVAFRERNHCKSFKWFMEEIAYDIVEHF 391

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N  WGE + + +  CLDSMG      +G   CHG G +Q+FRLN   ++   ++C
Sbjct: 392 PLPPKNKIWGEIRGIDSGLCLDSMGHKDGGEMGIGGCHGMGGNQMFRLNEANEMAQYDQC 451

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +                              +GT + L +C        G W F    + 
Sbjct: 452 IHV----------------------------VGTQVKLDHCDTEHTRYRG-WEFDEDKKM 482

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H I K CL      +Q+ M  CDE++  Q+W F +V
Sbjct: 483 LRHKIGK-CLDRSEILHQVFMSDCDESSKTQKWEFNQV 519


>gi|395840002|ref|XP_003792859.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 1 [Otolemur
           garnettii]
          Length = 657

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 95/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNIDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L G K                         W++  +  +
Sbjct: 586 L-----TKGPDGSKIMITHCSLNGFK------------------------EWQYFKNLYR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|354484375|ref|XP_003504364.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cricetulus
           griseus]
          Length = 784

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 593 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 652

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 653 PLPPKNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHKMGGNQLFRINEVNQLMQYDQC 712

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 713 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 743

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD N   Q+W    +
Sbjct: 744 FTHIPSGKCLDRSDVLHQVFISNCDSNKMTQKWEMNNI 781


>gi|403295730|ref|XP_003938783.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Saimiri boliviensis
           boliviensis]
          Length = 659

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 95/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 468 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 527

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 528 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 587

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 588 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 618

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H +  +CL      +Q+ +  CD + + Q+W    +
Sbjct: 619 FTHILSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 656


>gi|344235654|gb|EGV91757.1| N-acetylgalactosaminyltransferase 7 [Cricetulus griseus]
          Length = 607

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 416 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 475

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 476 PLPPKNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHKMGGNQLFRINEVNQLMQYDQC 535

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 536 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 566

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD N   Q+W    +
Sbjct: 567 FTHIPSGKCLDRSDVLHQVFISNCDSNKMTQKWEMNNI 604


>gi|345790686|ref|XP_543898.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Canis lupus
           familiaris]
          Length = 721

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 530 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 589

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 590 PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 649

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 650 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKSLHR 680

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 681 FTHIPSGKCLDRSDVLHQVFISNCDSSKMTQKWEMNNI 718


>gi|32425405|gb|AAH35303.1| GALNT7 protein, partial [Homo sapiens]
          Length = 495

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 304 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 363

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 364 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 423

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 424 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 454

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 455 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 492


>gi|13938655|gb|AAH07484.1| Galnt7 protein, partial [Mus musculus]
          Length = 362

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 171 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 230

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 231 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 290

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 291 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 321

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 322 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 359


>gi|426346015|ref|XP_004040686.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gorilla gorilla
           gorilla]
          Length = 650

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 459 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 518

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 519 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 578

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 579 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 609

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 610 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 647


>gi|348566877|ref|XP_003469228.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cavia
           porcellus]
          Length = 637

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 446 VRVVEVWWDEYKDYFYASRPESKALLYGDISELKKFREDHNCKSFKWFMEEIAYDITLHY 505

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMGRT   ++    CH  G +QLFR+N   QL   ++C
Sbjct: 506 PLPPKNVDWGEIRGFETAYCIDSMGRTNGGIVELGPCHRMGGNQLFRINEANQLMQYDQC 565

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 566 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 596

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD N   Q+W    +
Sbjct: 597 FTHIPSGKCLDRSEVLHQVFISDCDSNKMTQKWEMNNI 634



 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MGRT   ++    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 529 MGRTNGGIVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 588

Query: 61  LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
            + FK + R    P    LD  ++  QV + D
Sbjct: 589 -QYFKNLHRFTHIPSGKCLDRSEVLHQVFISD 619


>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
          Length = 540

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 67/220 (30%), Positives = 105/220 (47%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  + +      P    LD GD+SEQ ALR KLHCK F+WFMK++A+D+   +P + P+
Sbjct: 333 WMDEYAEYIYKRRPHLRTLDPGDLSEQKALRTKLHCKSFNWFMKNIAFDLVEVYPPIEPD 392

Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER- 172
            F +GE +N+G T+ CLDS  R    V+    C          Q FRL     +   +R 
Sbjct: 393 DFAFGEIRNMGATELCLDSKKRKRDEVVVMDICMKDDPKMSGEQEFRLTWHKDIRPKDRT 452

Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+D  + E+     K PV+                   L+ C    G  +  WR+    
Sbjct: 453 DCLDVSRGEE-----KAPVS-------------------LYPCHGKQG--NQLWRYDVEK 486

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           + L+H    RCL   P + ++ +  CD+++  Q+WR ++V
Sbjct: 487 QWLQHGYSSRCLDTDPGSKRVFVTACDKSSLTQKWRIEQV 526


>gi|355687724|gb|EHH26308.1| hypothetical protein EGK_16238, partial [Macaca mulatta]
          Length = 615

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 484 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 544 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 574

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 575 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 612


>gi|29612635|gb|AAH49907.1| Galnt7 protein, partial [Mus musculus]
          Length = 365

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 174 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 233

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 234 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 293

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 294 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 324

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 325 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 362


>gi|355778494|gb|EHH63530.1| hypothetical protein EGM_16517, partial [Macaca fascicularis]
          Length = 615

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 484 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 544 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 574

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 575 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 612


>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
           [Saccoglossus kowalevskii]
          Length = 634

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 2/125 (1%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAYDVYTK 112
           LR    W+  +K+      P       GDI+EQV  R + HC  DF WFM  VAYD+  +
Sbjct: 450 LRVAEVWMDEYKEYFYRMKPELRGKPYGDITEQVQFRQE-HCPHDFKWFMDEVAYDITER 508

Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           FP +  N+ WGE + +G+ KC+DSMGR+    +    CHG G SQL RLN  G+    E 
Sbjct: 509 FPLISKNIGWGEVRGVGSSKCVDSMGRSPSGKVALYGCHGYGGSQLLRLNEGGEFRVNEE 568

Query: 173 CVDAD 177
           C+  D
Sbjct: 569 CLYTD 573



 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 2/60 (3%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MGR+    +    CHG G SQL RLN  G+    E C+  D   +  ++     RK+  W
Sbjct: 533 MGRSPSGKVALYGCHGYGGSQLLRLNEGGEFRVNEECLYTDGSTVKLERCV--PRKQYGW 590


>gi|74139820|dbj|BAE31754.1| unnamed protein product [Mus musculus]
 gi|74191634|dbj|BAE30388.1| unnamed protein product [Mus musculus]
 gi|74198878|dbj|BAE30662.1| unnamed protein product [Mus musculus]
          Length = 546

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 355 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 414

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 415 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 474

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 475 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 505

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 506 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 543


>gi|402870854|ref|XP_003899414.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Papio anubis]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHK 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|332820787|ref|XP_003310650.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan troglodytes]
 gi|410227832|gb|JAA11135.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
 gi|410262380|gb|JAA19156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
 gi|410297750|gb|JAA27475.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
 gi|410332293|gb|JAA35093.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
           troglodytes]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|148696676|gb|EDL28623.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7, isoform CRA_a [Mus
           musculus]
          Length = 615

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 484 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 544 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 574

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 575 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 612


>gi|148696677|gb|EDL28624.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7, isoform CRA_b [Mus
           musculus]
          Length = 615

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 484 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 544 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 574

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 575 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 612


>gi|74212590|dbj|BAE31034.1| unnamed protein product [Mus musculus]
          Length = 298

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 107 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 166

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 167 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 226

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 227 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 257

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 258 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 295


>gi|269784707|ref|NP_653332.3| N-acetylgalactosaminyltransferase 7 isoform 1 [Mus musculus]
 gi|51315950|sp|Q80VA0.2|GALT7_MOUSE RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|13650041|gb|AAK37549.1|AF349573_1 UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 [Mus musculus]
 gi|30851602|gb|AAH52461.1| Galnt7 protein [Mus musculus]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 617 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 654


>gi|157502212|ref|NP_059119.2| N-acetylgalactosaminyltransferase 7 [Homo sapiens]
 gi|51315961|sp|Q86SF2.1|GALT7_HUMAN RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|28279289|gb|AAH46129.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
           sapiens]
 gi|28704077|gb|AAH47468.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
           sapiens]
 gi|119625166|gb|EAX04761.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
           sapiens]
 gi|193786832|dbj|BAG52155.1| unnamed protein product [Homo sapiens]
 gi|325464563|gb|ADZ16052.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7)
           [synthetic construct]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|383412007|gb|AFH29217.1| N-acetylgalactosaminyltransferase 7 [Macaca mulatta]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|441620192|ref|XP_003258074.2| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
           [Nomascus leucogenys]
          Length = 636

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 445 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 504

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 505 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 564

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 565 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 595

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD   + Q+W    +
Sbjct: 596 FTHIPSGKCLDRSEVLHQVFISNCDSGKTTQKWEMNNI 633


>gi|269784709|ref|NP_001161453.1| N-acetylgalactosaminyltransferase 7 isoform 2 [Mus musculus]
 gi|26331462|dbj|BAC29461.1| unnamed protein product [Mus musculus]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 617 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 654


>gi|26329091|dbj|BAC28284.1| unnamed protein product [Mus musculus]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 617 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 654


>gi|197101721|ref|NP_001124628.1| N-acetylgalactosaminyltransferase 7 [Pongo abelii]
 gi|75042656|sp|Q5RFJ6.1|GALT7_PONAB RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|55725190|emb|CAH89461.1| hypothetical protein [Pongo abelii]
          Length = 657

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|301753757|ref|XP_002912714.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Ailuropoda
           melanoleuca]
 gi|281338294|gb|EFB13878.1| hypothetical protein PANDA_000463 [Ailuropoda melanoleuca]
          Length = 657

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPFGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 617 FTHIPSGKCLDRSDVLHQVFISDCDSSKMTQKWEMNNI 654



 Score = 38.1 bits (87), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG+T    +    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 549 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 608

Query: 61  LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
            + FK + R    P    LD  D+  QV + D
Sbjct: 609 -QYFKSLHRFTHIPSGKCLDRSDVLHQVFISD 639


>gi|313220437|emb|CBY31290.1| unnamed protein product [Oikopleura dioica]
          Length = 618

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 103/223 (46%), Gaps = 38/223 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAYDVYTKF 113
           R   +W   + +   +  P A  LD GDI  QV  R K HC  DF+WFMK + Y V  ++
Sbjct: 425 RVIETWWDDWSKFFYVARPEAKTLDFGDIGPQVEFR-KNHCPYDFNWFMKEIGYGVALEY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N FWGE +     +CLDSMG T    + T YCH  G +QLFRLN   QL   ++C
Sbjct: 484 PTPPLNKFWGEIRARQGNQCLDSMGHTNGGSVETYYCHKQGGNQLFRLNEGLQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +                               G+ L L +C  S+G   G W +   T  
Sbjct: 544 L----------------------------FMSGSYLKLSHC--SAGDKAG-WDYDPKTGM 572

Query: 234 LEHAILKR---CLSVHPET--NQLAMLRCDENNSYQQWRFKEV 271
           + +   K+   C+S+  +T  +++  + CD N+  Q++ F E+
Sbjct: 573 ISYGEKKKQRFCISLISKTSISKVKFVECDVNDDNQKFDFNEI 615


>gi|397505872|ref|XP_003823466.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan paniscus]
          Length = 657

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKVMITHCDLNECK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|198419403|ref|XP_002128971.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
           polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
           intestinalis]
          Length = 631

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 95/220 (43%), Gaps = 32/220 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R   +W   +K       P  + +  GDIS QV  R + +CK F WFM+++AYD+   +
Sbjct: 441 IRVVETWWDEYKDYFYASRPETLNMPYGDISAQVKYRQEHNCKSFKWFMENIAYDIVEHY 500

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG-RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           P  P N  WGE + + T +CLDSMG +     + + YCH  G +QLFRL+   QL   ++
Sbjct: 501 PLPPKNKEWGEVRGVDTNQCLDSMGHKNGDGALESGYCHRMGGNQLFRLSEANQLAQYDQ 560

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           CV   +                             V+ + +C     T    W      +
Sbjct: 561 CVTIQR----------------------------GVVTVMHC---DTTQFREWDHDLINK 589

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
            + H    +CL      +++ +  CD N   Q+W   +++
Sbjct: 590 SVRHTTSGKCLERGELVHKVFLALCDANKPTQRWEINQIQ 629


>gi|355689625|gb|AER98895.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 [Mustela putorius
           furo]
          Length = 230

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 39  VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 98

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 99  PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRLGGNQLFRINEANQLMQYDQC 158

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  S  +
Sbjct: 159 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLYR 189

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 190 FTHIPSGKCLDRSDVLHQVFISDCDSSKMTQKWEMNNI 227



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 35/91 (38%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG+T    +    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 122 MGKTNGGFVELGPCHRLGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 181

Query: 61  LKVFKQIRSLGSPLAMFLDMGDISEQVALRD 91
                  R    P    LD  D+  QV + D
Sbjct: 182 QYFKSLYRFTHIPSGKCLDRSDVLHQVFISD 212


>gi|296195170|ref|XP_002745262.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Callithrix jacchus]
          Length = 657

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKIMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKMTQKWEMNNI 654


>gi|344288243|ref|XP_003415860.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Loxodonta
           africana]
          Length = 657

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVEWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ + +CD +   Q+W    +
Sbjct: 617 FTHTPSGKCLDRSEVLHQVFISKCDSSKMTQKWEMNNI 654


>gi|338722468|ref|XP_001915592.2| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Equus
           caballus]
          Length = 621

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 430 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 489

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMGRT    +    CH  G +QLFR+N   QL   ++C
Sbjct: 490 PLPPKNVDWGEIRGFETVYCIDSMGRTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 549

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 550 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 580

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 581 FTHIPSGKCLDRSEVLHQVFISNCDSSKMTQKWEINNI 618


>gi|410956565|ref|XP_003984911.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Felis catus]
          Length = 772

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 581 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 640

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 641 PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 700

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 701 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKNLHR 731

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 732 FTHIPSGKCLDRSDILHQVFISNCDSSKMTQKWEMNNI 769


>gi|313226887|emb|CBY22032.1| unnamed protein product [Oikopleura dioica]
          Length = 618

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 102/223 (45%), Gaps = 38/223 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAYDVYTKF 113
           R   +W   + +   +  P A  LD GDI  QV  R K HC  DF+WFMK + Y V  ++
Sbjct: 425 RVIETWWDDWSKFFYVARPEAKTLDFGDIGPQVEFR-KNHCPYDFNWFMKEIGYGVALEY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N FWGE +     +CLDSMG T    + T YCH  G +QLFRLN   QL   ++C
Sbjct: 484 PTPPLNKFWGEIRARQGNQCLDSMGHTNGGSVETYYCHKQGGNQLFRLNEGLQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +                               G+ L L +C  S+G   G W +   T  
Sbjct: 544 L----------------------------FMSGSYLKLSHC--SAGDKAG-WDYDPKTGM 572

Query: 234 LEHAILKR---CLSVHPET--NQLAMLRCDENNSYQQWRFKEV 271
           + +   K+   C+S+  +T  +++  + CD N   Q++ F E+
Sbjct: 573 ISYGEKKKQRFCISLISKTSISKVKFVECDVNGDNQKFDFNEI 615


>gi|6318186|emb|CAB60270.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 7 [Homo
           sapiens]
          Length = 657

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +C+ F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCQSFKWFMEEIAYDITSHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654


>gi|326918604|ref|XP_003205578.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Meleagris
           gallopavo]
          Length = 665

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P    L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 474 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 533

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 534 PLPPKNVEWGEIRGFETAYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 593

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 594 L-----TKGPDGSKIMITHCNLNEYK------------------------DWQYFKNLHR 624

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 625 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 662


>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
           saltator]
          Length = 605

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 101/212 (47%), Gaps = 40/212 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  ++      +P A  +D+GD+SE+V LR++L CK F W+++++       +PE P  
Sbjct: 418 WMDEWRDFYYAMNPGARNVDVGDVSERVKLRERLKCKSFRWYLENI-------YPESPMP 470

Query: 120 L---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
           L   + G+ KN+  Q CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 471 LDYYYLGDVKNVEAQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
              +  V++++                          C    G  +  W +++ T+ + H
Sbjct: 531 ASPQGPVKIVR--------------------------CHGMGG--NQAWVYNDETKMIRH 562

Query: 237 AILKRCLSV--HPETNQLAMLRCDENNSYQQW 266
                CLS     + +Q  + +CD NN  Q+W
Sbjct: 563 TNTGHCLSKPHSGDASQPVLTQCDPNNIGQKW 594



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530


>gi|363733313|ref|XP_420521.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gallus gallus]
          Length = 636

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P    L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 445 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 504

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 505 PLPPKNVEWGEIRGFETAYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 564

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 565 L-----TKGPDGSKIMITHCNLNEYK------------------------DWQYFKNLHR 595

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 596 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 633


>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
           echinatior]
          Length = 605

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 103/218 (47%), Gaps = 42/218 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  +D+GD+SE++ LR++L CK F W+++++       +P
Sbjct: 413 RVAEVWMDEWRDFYYAMNPGARNVDVGDVSERIKLRERLKCKSFRWYLENI-------YP 465

Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P  L   + G+ KN+ TQ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 466 ESPMPLDYYYLGDVKNIETQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 525

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   +  V++++                          C    G  +  W + + T
Sbjct: 526 MCLDAANPQGPVKIVR--------------------------CHGMGG--NQAWVYSDET 557

Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
           + ++H     CLS  P +    Q  + +CD NN  Q+W
Sbjct: 558 KMIKHTNTGYCLS-KPHSGDPAQPVLAQCDVNNIGQRW 594



 Score = 37.7 bits (86), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530


>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
           floridanus]
          Length = 608

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/217 (27%), Positives = 101/217 (46%), Gaps = 40/217 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  +D+GD+SE++ LR++L CK F W+++++       +P
Sbjct: 416 RVAEVWMDEWRDFYYAMNPGARNVDVGDVSERIKLRERLKCKSFRWYLENI-------YP 468

Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P  L   + G+ KN+  Q CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 469 ESPMPLDYYYLGDVKNVEMQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 528

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   +  V++++                          C    G  +  W ++N T
Sbjct: 529 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNNET 560

Query: 232 RQLEHAILKRCLSV--HPETNQLAMLRCDENNSYQQW 266
           + + H     CLS     + +Q  +  CD NN  Q+W
Sbjct: 561 KMIRHTNTGHCLSKPHSGDASQPVLAPCDANNIGQRW 597



 Score = 37.7 bits (86), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 494 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 533


>gi|12621080|ref|NP_075215.1| N-acetylgalactosaminyltransferase 7 [Rattus norvegicus]
 gi|51315737|sp|Q9R0C5.1|GALT7_RAT RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
           Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
           Short=pp-GaNTase 7; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 7; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 7
 gi|4092503|gb|AAC99426.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T6 [Rattus
           norvegicus]
 gi|149032267|gb|EDL87173.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7, isoform CRA_a
           [Rattus norvegicus]
          Length = 657

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+   +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD     Q+W    +
Sbjct: 617 FTHIASGKCLDRSEVLHQVFISSCDNGKMTQKWEMNNI 654


>gi|440908503|gb|ELR58512.1| N-acetylgalactosaminyltransferase 7, partial [Bos grunniens mutus]
          Length = 615

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 484 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 544 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 574

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 575 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 612



 Score = 37.4 bits (85), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG+T    +    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 507 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 566

Query: 61  LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
            + FK +  L   P    LD  ++  QV + D
Sbjct: 567 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 597


>gi|359067894|ref|XP_002689501.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
          Length = 617

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 426 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 485

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 486 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 545

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 546 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 576

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 577 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 614



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG+T    +    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 509 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 568

Query: 61  LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
            + FK +  L   P    LD  ++  QV + D
Sbjct: 569 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 599


>gi|291241093|ref|XP_002740445.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7-like [Saccoglossus
           kowalevskii]
          Length = 594

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 91/207 (43%), Gaps = 31/207 (14%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P +  L  GDIS+Q+  + K +CK+F WFMK VA D+   +P    N
Sbjct: 410 WMDDYKEFFYRSKPESQLLHFGDISKQLEFKTKHNCKNFDWFMKEVAPDLLAVYPVPAAN 469

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
             WGE K+   + C+D+MG      IG S CHG G +QLFR+  + +    E C+     
Sbjct: 470 QAWGEIKSNTNKVCVDTMGNREGGTIGISGCHGQGGNQLFRITEDHEFRIHELCL----- 524

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
                       + + + VKL  C               G     W F  S   +     
Sbjct: 525 ------------YEIYSEVKLRRC--------------DGKSKYSWFFDESKGWISLKDK 558

Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQW 266
             CL ++    +L+M +CD +N  Q+W
Sbjct: 559 NLCLELNSNLRRLSMKKCDISNGLQKW 585


>gi|119896052|ref|XP_602855.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
          Length = 772

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 581 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 640

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 641 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 700

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 701 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 731

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 732 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 769



 Score = 37.4 bits (85), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG+T    +    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 664 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 723

Query: 61  LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
            + FK +  L   P    LD  ++  QV + D
Sbjct: 724 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 754


>gi|426222421|ref|XP_004005390.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Ovis aries]
          Length = 865

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 674 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 733

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 734 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 793

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 794 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 824

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 825 LTHISSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 862


>gi|296484976|tpg|DAA27091.1| TPA: N-acetylgalactosaminyltransferase 7-like [Bos taurus]
          Length = 781

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 590 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 649

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 650 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 709

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 710 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 740

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 741 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 778



 Score = 37.4 bits (85), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG+T    +    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 673 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 732

Query: 61  LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
            + FK +  L   P    LD  ++  QV + D
Sbjct: 733 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 763


>gi|109076193|ref|XP_001085532.1| PREDICTED: n-acetylgalactosaminyltransferase 7 [Macaca mulatta]
          Length = 630

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 91/208 (43%), Gaps = 29/208 (13%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +P  P N+ WG
Sbjct: 449 YKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHYPLPPKNVDWG 508

Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
           E +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C+      K  
Sbjct: 509 EIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCL-----TKGA 563

Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
           +  K+ +T C L   K                         W++  +  +  H    +CL
Sbjct: 564 DGSKVMITHCNLNEFK------------------------EWQYFKNLHRFTHIPSGKCL 599

Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEV 271
                 +Q+ +  CD + + Q+W    +
Sbjct: 600 DRSEVLHQVFISNCDSSKTTQKWEMNNI 627


>gi|335301041|ref|XP_001926518.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Sus scrofa]
          Length = 712

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 521 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 580

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 581 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 640

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 641 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYLKNLHR 671

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H     CL      +Q+ +  CD +   Q+W    +
Sbjct: 672 LTHIPSGMCLDRSEVLHQVFISNCDSSKMTQKWEINNI 709



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 4/93 (4%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD--ADKQVISTQKSSLELRKRS 58
           MG+T    +    CH  G +QLFR+N   QL   ++C+   AD   +     +L   K  
Sbjct: 604 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKEW 663

Query: 59  SWLKVFKQIRSLGSPLAMFLDMGDISEQVALRD 91
            +LK   ++  +  P  M LD  ++  QV + +
Sbjct: 664 QYLKNLHRLTHI--PSGMCLDRSEVLHQVFISN 694


>gi|327268630|ref|XP_003219099.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Anolis
           carolinensis]
          Length = 654

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 90/218 (41%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P    L  GDI++    R+   CK F WFM+ +AYD+ + +
Sbjct: 463 VRVVEVWWDEYKDYFYASRPETKALAYGDITDLKKFREDHRCKSFKWFMEEIAYDITSHY 522

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 523 PLPPKNVEWGEIRGFETSYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 582

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 583 LT-----KGPDGTKVVITHCNLNEYK------------------------EWQYFKNLHR 613

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD N + Q W    +
Sbjct: 614 FTHIPSGKCLDRSEVLHQVFLSECDSNKATQSWEMNNI 651


>gi|417411949|gb|JAA52393.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 615

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +C+ F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCRSFKWFMEEIAYDITSHY 483

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMGRT    +    CH  G +QLFR+N   QL   ++C
Sbjct: 484 PLPPKNVDWGEIRGFETVYCIDSMGRTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 544 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKNLHR 574

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 575 FTHIPSGKCLDRSDVLHQVFLSSCDSSKMAQKWEMNNI 612



 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 2/90 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MGRT    +    CH  G +QLFR+N   QL   ++C+           +   L +   W
Sbjct: 507 MGRTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKIMITHCNLNEFKEW 566

Query: 61  LKVFKQI-RSLGSPLAMFLDMGDISEQVAL 89
            + FK + R    P    LD  D+  QV L
Sbjct: 567 -QYFKNLHRFTHIPSGKCLDRSDVLHQVFL 595


>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
          Length = 615

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 63/220 (28%), Positives = 100/220 (45%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +        P    +D GD++EQ ALRDKL CK F WFM+++A+D+   +P + P+
Sbjct: 399 WMDEYADFIYRRRPHLRAMDPGDLTEQKALRDKLKCKSFKWFMENIAFDLVEVYPPIEPD 458

Query: 120 LF-WGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLGHGER- 172
            F +GE +N+G    CLD+ G+     I   YC          Q F+L     +    R 
Sbjct: 459 DFAYGEMRNIGVPNLCLDAKGKGKDEEIAVDYCQKDTPKIKGEQEFQLTWHKDIRPNRRT 518

Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+D  + +      K PVT                   L+ C    G  +  WR++   
Sbjct: 519 ECLDVSRGDD-----KSPVT-------------------LYPCHGKQG--NQLWRYNVEK 552

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           + L+H    RCL + P + ++ +  CD  +S Q+WR + V
Sbjct: 553 QWLQHGYGSRCLDMDPASKKVYVSNCDPTSSTQKWRIEHV 592


>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
          Length = 602

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 65/220 (29%), Positives = 105/220 (47%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  + +      P    LD GD+SEQ ALR KLHCK F+WF++++A+D+   +P + P+
Sbjct: 393 WMDEYAEYIYKRRPHLRALDPGDLSEQKALRVKLHCKPFNWFIENIAFDLVEVYPPIEPD 452

Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER- 172
            F +GE +N+G T+ CLDS  R    +I    C          Q FRL     +   +R 
Sbjct: 453 DFAYGEIRNMGATELCLDSKKRKRDELIVVDTCVKDDPKVSGEQEFRLTWHKDIRPKDRT 512

Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+D  + E+     K PV+                   L+ C    G  +  WR+    
Sbjct: 513 DCLDVSRGEE-----KAPVS-------------------LYPCHGKQG--NQLWRYDIEK 546

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           + L+H    RCL   P + ++ +  CD+++  Q+WR ++V
Sbjct: 547 QWLQHGYSSRCLDTDPGSKRVFVTTCDKSSPTQKWRIEQV 586


>gi|449270894|gb|EMC81540.1| N-acetylgalactosaminyltransferase 7, partial [Columba livia]
          Length = 613

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P    L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 422 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 481

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 482 PLPPKNVDWGEIRGFETSYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 541

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C                      N +   D  W++  +  +
Sbjct: 542 L-----TKGPDGSKIMITHC----------------------NQNEYKD--WQYFKNLHR 572

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 573 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 610


>gi|449500526|ref|XP_002187477.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Taeniopygia
           guttata]
          Length = 828

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P    L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 637 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 696

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 697 PLPPKNVDWGEIRGFETSYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 756

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C                   N F         W+F  +  +
Sbjct: 757 L-----TKGPDGSKIMITHCNQ-----------------NEFKE-------WQFFKNLHR 787

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD + + Q+W    +
Sbjct: 788 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 825


>gi|47575716|ref|NP_001001200.1| polypeptide N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
           tropicalis]
 gi|45501097|gb|AAH67317.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
           tropicalis]
          Length = 653

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 88/218 (40%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P    L  GDI      R+  +CK F WFM+ +AYD+   F
Sbjct: 462 IRVVEVWWDEYKDYFYASRPETKALPYGDIGPLKKFREDHNCKSFKWFMEEIAYDITKYF 521

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T   ++    CH  G +QLFR+N   QL   ++C
Sbjct: 522 PLPPKNVEWGEIRGFETNYCIDSMGHTNGGIVELGGCHRMGGNQLFRINEANQLMQYDQC 581

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
                      + K P    V+    L  C L   L               W++  +  +
Sbjct: 582 -----------LYKGPDGTQVM----LTHCNLKEFLE--------------WQYFKNMHR 612

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD N   Q+W    +
Sbjct: 613 FLHISTGKCLDRSEVLHQVFLSDCDANKPTQKWEMNNI 650



 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG T   ++    CH  G +QLFR+N   QL   ++C+           +   L++   W
Sbjct: 545 MGHTNGGIVELGGCHRMGGNQLFRINEANQLMQYDQCLYKGPDGTQVMLTHCNLKEFLEW 604

Query: 61  LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
            + FK + R L       LD  ++  QV L D
Sbjct: 605 -QYFKNMHRFLHISTGKCLDRSEVLHQVFLSD 635


>gi|431918328|gb|ELK17555.1| N-acetylgalactosaminyltransferase 7 [Pteropus alecto]
          Length = 717

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 526 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 585

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG+T         CH  G +QLFR+N   QL   ++C
Sbjct: 586 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFAELGPCHRMGGNQLFRINEANQLMQYDQC 645

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 646 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKNLNR 676

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD +   Q+W    +
Sbjct: 677 FTHIPSGKCLDRSEVLHQVFLSNCDSSKMTQKWEMNNI 714


>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Apis mellifera]
          Length = 606

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 104/221 (47%), Gaps = 42/221 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  + +GD+SE++ LR++L CK F W+++++       +P
Sbjct: 414 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 466

Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P  L   + G+ +N+ TQ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 467 ESPMPLDYYYLGDVQNVDTQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 526

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   +  V++++                          C    G  +  W ++  T
Sbjct: 527 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 558

Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFK 269
           + ++H  +  CLS  P +N   Q  +  CD +N  Q+W  +
Sbjct: 559 KMIKHTNIGHCLS-KPRSNDAMQPVLAPCDPHNIGQKWTMR 598



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531


>gi|51490866|emb|CAD44538.1| polypeptide N-acetylgalactosaminyltransferase 7 [Homo sapiens]
          Length = 166

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 86/192 (44%), Gaps = 29/192 (15%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT 140
           GDISE    R+  +C+ F WFM+ +AYD+ + +P  P N+ WGE +   T  C+DSMG+T
Sbjct: 2   GDISELKKFREDHNCQSFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKT 61

Query: 141 APAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKL 200
               +    CH  G +QLFR+N   QL   ++C+      K  +  K+ +T C L   K 
Sbjct: 62  NGGFVELGPCHRMGGNQLFRINEANQLMQYDQCL-----TKGADGSKVMITHCNLNEFK- 115

Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
                                   W++  +  +  H    +CL      +Q+ +  CD +
Sbjct: 116 -----------------------EWQYFKNLHRFTHIPSGKCLDRSEVLHQVFISNCDSS 152

Query: 261 NSYQQWRFKEVR 272
            + Q+W    + 
Sbjct: 153 KTTQKWEMNNIH 164


>gi|387019377|gb|AFJ51806.1| n-acetylgalactosaminyltransferase 7-like [Crotalus adamanteus]
          Length = 658

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 90/218 (41%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   FK       P    L  GDIS+    R+   CK F WFM+ +AYD+ + +
Sbjct: 467 VRVVEVWWDEFKDYFYASRPETKALAYGDISDLKNFREDHKCKSFKWFMEEIAYDITSHY 526

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +      C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 527 PLPPRNVEWGEIRGFEAAYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 586

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 587 LI-----KGPDGSKVMITHCNLNEYK------------------------EWQYFKNLYR 617

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +Q+ +  CD N + Q+W    +
Sbjct: 618 FTHISSGKCLDRSEVLHQVFISDCDSNKASQKWEMNNI 655


>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Megachile rotundata]
          Length = 605

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 102/218 (46%), Gaps = 42/218 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  + +GD+SE++ LR++L CK F W+++++       +P
Sbjct: 413 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 465

Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P  L   + G+ +N+ TQ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 466 ESPMPLDYYYLGDVQNIDTQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 525

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   +  V++++                          C    G  +  W ++  T
Sbjct: 526 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 557

Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
           + ++H     CLS  P +N   Q  +  CD +N  Q+W
Sbjct: 558 KMIKHTNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKW 594



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530


>gi|351701091|gb|EHB04010.1| N-acetylgalactosaminyltransferase 7, partial [Heterocephalus
           glaber]
          Length = 616

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 92/219 (42%), Gaps = 31/219 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM  +AYD+ T  
Sbjct: 425 VRVVEVWWDEYKDYFYASRPESKALLYGDISELKKFREDHNCKSFKWFMDEIAYDI-TSH 483

Query: 114 PELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           P LPP N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++
Sbjct: 484 PPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQ 543

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C+      K  +  K+ +T C L   K                         W++  +  
Sbjct: 544 CL-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLH 574

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +  H    +CL      +Q+ +  CD     Q+W    +
Sbjct: 575 RFTHIPSGKCLDRSEVLHQVFISNCDSGKMTQKWEMNNI 613


>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Bombus impatiens]
          Length = 606

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 101/213 (47%), Gaps = 42/213 (19%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           WL  +K      +P A  + +GD+SE++ LR++L CK F W+++++       +PE P  
Sbjct: 419 WLDQWKYFYYNINPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YPESPMP 471

Query: 120 L---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
           L   + G+ +N+ TQ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 472 LDYYYLGDVQNVETQSCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
              +  V++++                          C    G  +  W ++  T+ ++H
Sbjct: 532 ASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEETKMIKH 563

Query: 237 AILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
                CLS  P +N   Q  +  CD +N  Q+W
Sbjct: 564 TNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKW 595



 Score = 37.7 bits (86), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531


>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Bombus impatiens]
          Length = 606

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 102/218 (46%), Gaps = 42/218 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  + +GD+SE++ LR++L CK F W+++++       +P
Sbjct: 414 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 466

Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P  L   + G+ +N+ TQ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 467 ESPMPLDYYYLGDVQNVETQSCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 526

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   +  V++++                          C    G  +  W ++  T
Sbjct: 527 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 558

Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
           + ++H     CLS  P +N   Q  +  CD +N  Q+W
Sbjct: 559 KMIKHTNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKW 595



 Score = 37.7 bits (86), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531


>gi|345307492|ref|XP_001507110.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like
           [Ornithorhynchus anatinus]
          Length = 873

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 63/121 (52%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P    L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 439 VRVVEVWWDDYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 498

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T   V+    CH  G +QLFR+N   QL   ++C
Sbjct: 499 PLPPKNVEWGEIRGFDTAYCIDSMGHTNGGVVELGPCHRMGGNQLFRINEANQLMQYDQC 558

Query: 174 V 174
           +
Sbjct: 559 L 559


>gi|444727227|gb|ELW67729.1| N-acetylgalactosaminyltransferase 7 [Tupaia chinensis]
          Length = 606

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 65/121 (53%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 274 IRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 333

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE + L T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 334 PLPPRNIDWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEVNQLMQYDQC 393

Query: 174 V 174
           +
Sbjct: 394 L 394


>gi|47191417|emb|CAG14579.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 174

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 53/199 (26%), Positives = 84/199 (42%), Gaps = 29/199 (14%)

Query: 73  PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
           P ++ L  GDISE    R++  CK F WFM+ +AYD+   +P  P N+ WGE + L T  
Sbjct: 2   PESLTLAYGDISELKRFREEHRCKSFKWFMEEIAYDIPLHYPMPPKNVDWGEIRGLDTSY 61

Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           C+DSMG T    +    CH  G +QLFR+N   QL   ++C+                  
Sbjct: 62  CIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQCLTRGTDN------------ 109

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
              +GV +  C                     W++     +  H    +CL      +++
Sbjct: 110 ---SGVIITHC--------------DQNQHTEWKYFKDLHRFTHVTTGKCLDRSDLLHKV 152

Query: 253 AMLRCDENNSYQQWRFKEV 271
            +  CD + + Q+W    +
Sbjct: 153 FISDCDTSKTTQKWEMNNI 171


>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Apis florea]
          Length = 605

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 102/221 (46%), Gaps = 42/221 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  + +GD+SE++ LR +L CK F W+++++       +P
Sbjct: 413 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRQRLKCKSFRWYLENI-------YP 465

Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P  L   + G+ +N+ TQ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 466 ESPMPLDYYYLGDVQNVDTQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 525

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   +  V++++                          C    G  +  W ++  T
Sbjct: 526 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 557

Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFK 269
           + ++H     CLS  P +N   Q  +  CD +N  Q+W  +
Sbjct: 558 KMIKHTNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKWTMR 597



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530


>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Bombus terrestris]
 gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Bombus terrestris]
          Length = 606

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 102/218 (46%), Gaps = 42/218 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  + +GD+SE++ LR++L CK F W+++++       +P
Sbjct: 414 RVAEVWMDEWRDFYYAMNPGARSVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 466

Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P  L   + G+ +N+ TQ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 467 ESPMPLDYFYLGDVQNVETQSCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 526

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   +  V++++                          C    G  +  W ++  T
Sbjct: 527 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 558

Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
           + ++H     CLS  P +N   Q  +  CD +N  Q+W
Sbjct: 559 KMIKHTNTGHCLS-KPRSNDAMQPVLASCDLHNIGQKW 595



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531


>gi|410927898|ref|XP_003977377.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
           7-like [Takifugu rubripes]
          Length = 664

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 93/239 (38%), Gaps = 29/239 (12%)

Query: 33  HGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDK 92
           HG +       V S+      +R    W   +K       P  + L  GDISE    R++
Sbjct: 452 HGWQGNPPPAHVGSSPTLKNYVRVVEVWWDEYKDYFYASRPETLTLAYGDISELKKFREE 511

Query: 93  LHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHG 152
             CK F WFM+ +AYD+   +P  P N+ WGE + L +  C+DSMG T    +    CH 
Sbjct: 512 HRCKSFKWFMEEIAYDIPLHYPLPPKNVEWGEIRGLDSSYCIDSMGHTNGGSVEIGPCHR 571

Query: 153 SGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLF 212
            G +QLFR+N   QL   ++C+        V      +T C                   
Sbjct: 572 MGGNQLFRINDANQLMQYDQCLTRGADNSAVI-----ITHC------------------- 607

Query: 213 NCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
                       W++     +  H    +CL      +++ +  CD + + Q+W    +
Sbjct: 608 -----DQNQHNEWKYFKDLHRFTHVTTGKCLDRSDLLHKVFISDCDTSKTTQKWEMNNI 661


>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 615

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 61/215 (28%), Positives = 103/215 (47%), Gaps = 41/215 (19%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           W+  +K      +P A  +++GD+SE++ALR+KL CK F W+++++       +PE  +P
Sbjct: 427 WMDEWKHFYYAINPGASNVEVGDVSERLALREKLKCKSFRWYLENI-------YPESQMP 479

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE KN+ +Q+CLD+M R +   +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 480 LDYYYLGEIKNVDSQQCLDTMSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDA 539

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
                            ++  V LI            C    G  +  W + +    ++H
Sbjct: 540 SN---------------IVGPVSLI-----------RCHGLEG--NQAWVYDSKEMTIKH 571

Query: 237 AILKRCLSVHPETNQLAML--RCDENNSYQQWRFK 269
               +CL      +Q A +   CD + S QQW  K
Sbjct: 572 KTTDQCLEHSMSADQYAAILNECDGSRS-QQWTMK 605



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 23/44 (52%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQV 44
           M R +   +G SYCHG G +Q+F      Q+   + C+DA   V
Sbjct: 500 MSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIV 543


>gi|390348396|ref|XP_787966.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
           [Strongylocentrotus purpuratus]
          Length = 403

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 54/127 (42%), Positives = 64/127 (50%), Gaps = 4/127 (3%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHC-KDFSWFMKHVAYDVYTK 112
           +R    WL  FK+      P       GDI EQ+  R K HC K F WFM+ VA+D   K
Sbjct: 203 MRVAEVWLDEFKEFFYTSRPDLRGKPYGDIGEQIRFR-KHHCPKSFKWFMEEVAFDSLEK 261

Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGR--TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           FP   PN  WGE K+  T  C+DSMG   TA    G  YCHG G +Q FRL   GQ+   
Sbjct: 262 FPPPQPNQAWGEIKSDHTGMCVDSMGHQATAGGEAGVYYCHGMGGNQRFRLTGPGQIMFN 321

Query: 171 ERCVDAD 177
           + C   D
Sbjct: 322 DYCFYVD 328


>gi|66472462|ref|NP_001018477.1| N-acetylgalactosaminyltransferase 7 [Danio rerio]
 gi|63100869|gb|AAH95642.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 7 [Danio rerio]
          Length = 652

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 55/218 (25%), Positives = 89/218 (40%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P  + L  GDIS     R++  CK F WFM+ +AYD+   +
Sbjct: 461 VRVVEVWWDDYKDYFYASRPETLTLAYGDISTLKKFREEHRCKSFKWFMEEIAYDIPLHY 520

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 521 PLPPKNVEWGEIRGYETSYCIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQC 580

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +                               G+ + + +C  +  T    W++     +
Sbjct: 581 LTKGGD--------------------------GSAVIITHCNLNEHT---EWKYFKDLHR 611

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +CL      +++ +  CD + S Q+W    +
Sbjct: 612 FTHVPTGKCLDRSDVLHKVYIADCDNSKSSQKWEMNNI 649



 Score = 38.9 bits (89), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 2/92 (2%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
           MG T    +    CH  G +QLFR+N   QL   ++C+       +   +   L + + W
Sbjct: 544 MGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQCLTKGGDGSAVIITHCNLNEHTEW 603

Query: 61  LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
            K FK + R    P    LD  D+  +V + D
Sbjct: 604 -KYFKDLHRFTHVPTGKCLDRSDVLHKVYIAD 634


>gi|432847870|ref|XP_004066191.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oryzias
           latipes]
          Length = 653

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 86/218 (39%), Gaps = 30/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P  + L  GDISE    R++  CK F WFM+ +AYD+   +
Sbjct: 463 VRVVEVWWDEYKDFFYASRPETLTLAYGDISELKRFREEHRCKSFKWFMEEIAYDIPVHY 522

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 523 PLPPKNVEWGEIRGFETSYCIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQC 582

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +                     A V +  C                     W++     +
Sbjct: 583 LTRGDN----------------AAVIITHCDRNQYTE--------------WKYFKDLHR 612

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             H    +C+      +++ +  CD + + Q+W    +
Sbjct: 613 FTHMPTGKCMDRSDLLHKVFISDCDTSKTTQKWEMNNI 650


>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 605

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 59/215 (27%), Positives = 101/215 (46%), Gaps = 38/215 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  +++GDI+ ++ LR+ L CK F W++ ++       +P
Sbjct: 410 RVAEVWMDEWRDFYYAMNPGAKKIEVGDITSRLKLREDLKCKSFRWYLTNI-------YP 462

Query: 115 E--LPPNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P +  F G+ KN+ T++CLD+MGR +   +G SYCHG G +Q+F      Q+   +
Sbjct: 463 ESTMPLDYYFLGDIKNVETEQCLDTMGRKSGENVGMSYCHGYGGNQVFSYTKRHQITADD 522

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA                 V   VKL+            C    G  +  W+++N T
Sbjct: 523 NCLDAAS---------------VRGPVKLV-----------RCHGMGGNQE--WKYYNDT 554

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
            +L+H    +CL    + +    LR  + +  QQW
Sbjct: 555 FELKHVHTGQCLDKPSDRDSTPQLRSCDGSRGQQW 589



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR +   +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 488 MGRKSGENVGMSYCHGYGGNQVFSYTKRHQITADDNCLDA 527


>gi|350597048|ref|XP_003361960.2| PREDICTED: hypothetical protein LOC100155807 [Sus scrofa]
          Length = 1191

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 5/146 (3%)

Query: 54   LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
            +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 1019 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 1078

Query: 114  PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            P  P N+ WGE +   T  C+DSMG+T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 1079 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 1138

Query: 174  VDADKQEKLVEMLKLPVTFCVLAGVK 199
            +      K  +  K+ +T C L   K
Sbjct: 1139 LT-----KGADGSKVMITHCNLNEFK 1159


>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 2 [Acyrthosiphon pisum]
          Length = 615

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/220 (27%), Positives = 104/220 (47%), Gaps = 41/220 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  +++GD+SE++ALR+KL CK F W+++++       +P
Sbjct: 422 RVAEVWMDEWRDFYYAMNPGASNVEVGDVSERLALREKLKCKSFRWYLENI-------YP 474

Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P + ++ GE KN+ +Q+CLD+M R +   +G SYCHG G +Q+F      Q+   +
Sbjct: 475 ESQMPLDYYYLGEIKNVDSQQCLDTMSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDD 534

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA                 ++  V LI            C    G  +  W + +  
Sbjct: 535 NCLDASN---------------IVGPVSLI-----------RCHGLEG--NQAWVYDSKE 566

Query: 232 RQLEHAILKRCLSVHPETNQLAML--RCDENNSYQQWRFK 269
             ++H    +CL      +Q A +   CD + S QQW  K
Sbjct: 567 MTIKHKTTDQCLEHSMSADQYAAILNECDGSRS-QQWTMK 605



 Score = 37.4 bits (85), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 23/44 (52%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQV 44
           M R +   +G SYCHG G +Q+F      Q+   + C+DA   V
Sbjct: 500 MSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIV 543


>gi|348538240|ref|XP_003456600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oreochromis
           niloticus]
          Length = 649

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 54/220 (24%), Positives = 89/220 (40%), Gaps = 33/220 (15%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P  + L  GDIS+    R++  CK F WFM+ +AYD+   +
Sbjct: 458 VRVVEVWWDDYKDYFYASRPETLTLAYGDISDLKRFREEHRCKSFKWFMEEIAYDIPLHY 517

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   T  C+DSMG T    +    CH  G +QLFR+N   QL   ++C
Sbjct: 518 PLPPKNVEWGEIRGFDTSYCIDSMGHTNGGNVELGPCHRMGGNQLFRINEANQLMQYDQC 577

Query: 174 --VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
             +  DK                 + + +  C     +               W++    
Sbjct: 578 LTIGGDK-----------------SAIIITHCDTNQHME--------------WKYFKDL 606

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            +  H    +CL      +++ +  CD + + Q+W    +
Sbjct: 607 HRFTHVPTGKCLDRSDLLHKVFVSDCDTSKTTQKWEMNNI 646


>gi|443727149|gb|ELU14019.1| hypothetical protein CAPTEDRAFT_197005 [Capitella teleta]
          Length = 613

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/225 (26%), Positives = 101/225 (44%), Gaps = 35/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    +  G+I+EQ+A+R KL+CK F WFM+ +A+D+  K+P
Sbjct: 399 RVAEVWMDEYAEFLYKRRPQYRSIQPGNITEQLAIRKKLNCKPFKWFMEEIAFDLPKKYP 458

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGH 169
            + PP +  GE +N+G   C+D+  +      G   C     G G  Q  ++     +  
Sbjct: 459 PIEPPAVAEGEMRNVGANLCVDTRFKGQGETFGLEKCAKDEPGIGGEQRLQITWHKDIRP 518

Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW-R 226
           G+R  C D         + K PV                    L+NC    G     W +
Sbjct: 519 GKRSFCFDVS-----TSVEKAPVI-------------------LYNCHGMKGNQ---WFK 551

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +   T Q+ H I + CL    E  ++ M  CD++++ Q+W+F++V
Sbjct: 552 YDTETSQVFHPISRHCLDCDAERKEIFMSSCDKDSNTQKWKFEKV 596


>gi|148237032|ref|NP_001084848.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Xenopus
           laevis]
 gi|47124654|gb|AAH70527.1| MGC78803 protein [Xenopus laevis]
          Length = 653

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 57/218 (26%), Positives = 91/218 (41%), Gaps = 29/218 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   ++       P    L  GDIS     R+  +CK F WFM+ +AYD+   +
Sbjct: 462 VRVVEVWWDEYRDYFYASRPETKALAYGDISALKKFREDHNCKSFKWFMEEIAYDIPNYY 521

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE +   +  C+DSMG T   +     CH  G +QLFR+N   QL   ++C
Sbjct: 522 PLPPRNVDWGEIRGFESGYCIDSMGHTNGGLAELGGCHRMGGNQLFRINEANQLMQYDQC 581

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +      K  +  K+ +T C L   K                         W++  +  +
Sbjct: 582 LT-----KGTDGSKVILTHCNLNEYK------------------------EWQYFKNMHR 612

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L +    +CL      +Q+ +  CD N + Q+W    +
Sbjct: 613 LLYIPSGKCLDRSEVLHQVFLSDCDSNKTTQKWEMNNI 650


>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           isoform 3 [Bombus impatiens]
          Length = 607

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 55/193 (28%), Positives = 93/193 (48%), Gaps = 42/193 (21%)

Query: 80  MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNL---FWGEAKNLGTQKCLDS 136
           +GD+SE++ LR++L CK F W+++++       +PE P  L   + G+ +N+ TQ CLD+
Sbjct: 440 VGDVSERIKLRERLKCKSFRWYLENI-------YPESPMPLDYYYLGDVQNVETQSCLDT 492

Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
           MGR     +G SYCHG G +Q+F      Q+   + C+DA   +  V++++         
Sbjct: 493 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVKIVR--------- 543

Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN---QLA 253
                            C    G  +  W ++  T+ ++H     CLS  P +N   Q  
Sbjct: 544 -----------------CHGMGG--NQAWVYNEETKMIKHTNTGHCLS-KPRSNDAMQPV 583

Query: 254 MLRCDENNSYQQW 266
           +  CD +N  Q+W
Sbjct: 584 LAPCDPHNIGQKW 596



 Score = 37.7 bits (86), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 493 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 532


>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
          Length = 639

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 64/221 (28%), Positives = 102/221 (46%), Gaps = 43/221 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K      +P A  +++GD+S + +LR KL CK F W++++V       +P
Sbjct: 442 RVAEVWMDRWKDFFYEMNPGARSVEVGDVSSRRSLRKKLQCKSFRWYLENV-------YP 494

Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P + F+ GE +N  TQ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 495 ESQMPLDYFFLGEIRNAETQTCLDTMGRKGGENVGISYCHGLGGNQVFAYTKRQQIMSDD 554

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA   + +V++++                          C    G  +  W +    
Sbjct: 555 NCLDATGTDGIVKLIR--------------------------CHGMGG--NQAWLYEAQD 586

Query: 232 RQLEHAILKRCLSVHPETNQL---AMLRCDENNSYQQWRFK 269
             + H    RCLS  PE+  +    + RCD  +S QQW  K
Sbjct: 587 GLIRHVNSGRCLS-KPESRDVTLPVLRRCD-GSSGQQWIMK 625



 Score = 38.1 bits (87), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 520 MGRKGGENVGISYCHGLGGNQVFAYTKRQQIMSDDNCLDA 559


>gi|390345015|ref|XP_787987.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 2
           [Strongylocentrotus purpuratus]
 gi|390345017|ref|XP_003726244.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 1
           [Strongylocentrotus purpuratus]
          Length = 670

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 73/228 (32%), Positives = 108/228 (47%), Gaps = 36/228 (15%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    WL  +K       PL      G+ISEQV  R +   K F WFM+ +A+D   ++
Sbjct: 470 MRVAEVWLDEYKDSFYATKPLLEGKPCGNISEQVEYRKQNCPKSFKWFMEEIAFDNLQEY 529

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG-RTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           P    N  WGE +++ ++ CLD+ G +  P  A +G   CHG G +QLFR+N  GQL   
Sbjct: 530 PPPAKNKIWGEIRSVDSELCLDTNGAQNDPTHATMGMEGCHGMGGNQLFRINTRGQL--- 586

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLA---GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
                      +VE       FC+ A     KLIF +          F S  T+   W +
Sbjct: 587 -----------VVED-----GFCLYARPNTEKLIFAKCTAE----KSFTSETTM---WDY 623

Query: 228 HNSTRQLEH-AILKRCLSVHPETNQLAMLRCDENNSYQQWRF--KEVR 272
            +S RQ+ H    KRCL  + ET+++ +  C      QQ++F  +E+R
Sbjct: 624 -DSDRQIIHLERRKRCLDHNQETSKVFVTPCQHGEISQQFKFYKEEIR 670


>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
          Length = 588

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 95/214 (44%), Gaps = 34/214 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+   + +P  M    GD+S++  LR+ L CK F W++ +V  D   + P     
Sbjct: 401 WMDDYKKFFYIVNPTVMKHKYGDVSDRKTLRNDLQCKSFQWYLDNVYPD--AQIPRRYKV 458

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE KN G   CLD+MGR     +G   CHG G +Q+F    + ++   + C+D    
Sbjct: 459 L--GEIKNTGANICLDTMGRKENKKVGCYSCHGQGGNQVFSFTMDNEIRIDDLCLDVANS 516

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
           +  V M+K                          C +  G  +  W ++  T QL H   
Sbjct: 517 KGPVMMVK--------------------------CHHQKG--NQYWEYNIKTNQLIHTNS 548

Query: 240 KRCLS--VHPETNQLAMLRCDENNSYQQWRFKEV 271
           K+CL+  +    N+  M RCD +   Q+W F+ +
Sbjct: 549 KQCLTKPIARSANEPRMERCDGSIDTQKWSFRNM 582


>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
          Length = 626

 Score = 90.9 bits (224), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 96/199 (48%), Gaps = 44/199 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
           D GDIS +  LR+ L CK F W++ ++       +PEL     ++  GE +N   Q  +C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEVRNSAVQPARC 502

Query: 134 LDSM-GR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD M GR      +GT  CHG G +Q + L+ +G++   E CVD                
Sbjct: 503 LDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDESCVDY--------------- 547

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                         G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  +
Sbjct: 548 -------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAK 592

Query: 252 LAMLRCDENNSYQQWRFKE 270
           L M+ C  ++ YQ W+FKE
Sbjct: 593 LEMVACQYDDPYQHWKFKE 611


>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
 gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           Short=pp-GaNTase 5; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 5; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
          Length = 626

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 96/199 (48%), Gaps = 44/199 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
           D GDIS +  LR+ L CK F W++ ++       +PEL     ++  GE +N   Q  +C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEVRNSAVQPARC 502

Query: 134 LDSM-GR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD M GR      +GT  CHG G +Q + L+ +G++   E CVD                
Sbjct: 503 LDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDESCVDY--------------- 547

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                         G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  +
Sbjct: 548 -------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAK 592

Query: 252 LAMLRCDENNSYQQWRFKE 270
           L M+ C  ++ YQ W+FKE
Sbjct: 593 LEMVACQYDDPYQHWKFKE 611


>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
          Length = 630

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 96/199 (48%), Gaps = 44/199 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
           D GD+S +  LR  L CK F W++ ++       FPEL     ++  GE +N   Q  +C
Sbjct: 454 DFGDVSARKKLRSDLGCKSFKWYLDNI-------FPELFVPGESVAKGEVRNSAVQPARC 506

Query: 134 LDSM-GRTAPA-VIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD M GR      +GT  CHG G +Q + L+ +G++   E CVD                
Sbjct: 507 LDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDESCVDY--------------- 551

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                         G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  +
Sbjct: 552 -------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTRDGAK 596

Query: 252 LAMLRCDENNSYQQWRFKE 270
           L M++C  ++ YQ W+FKE
Sbjct: 597 LEMVQCQYDDPYQHWKFKE 615


>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
 gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
          Length = 596

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 63/211 (29%), Positives = 97/211 (45%), Gaps = 39/211 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+     SP A     GD+SE+ ALR++L CK F W+++++       +PE  +P
Sbjct: 400 WLDEWKEFYYQMSPGARKASAGDVSERRALRERLKCKSFRWYLENI-------YPESQMP 452

Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            +  F GE +N+ T  CLD+MGR +   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 453 LDYYFLGEIRNVKTHNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 512

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
                                       LG V NL  C    G  +  W + +  + ++H
Sbjct: 513 SNA-------------------------LGPV-NLVRCHGMGGNQE--WIYDDEEKTIKH 544

Query: 237 AILKRCLSVHPETN-QLAMLRCDENNSYQQW 266
                CL+   E +    +LR    +  QQW
Sbjct: 545 VNSGNCLTRASEDDPSTPLLRPCNYSEGQQW 575



 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR +   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 473 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 512


>gi|405950576|gb|EKC18555.1| Putative polypeptide N-acetylgalactosaminyltransferase 10
           [Crassostrea gigas]
          Length = 526

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 64/232 (27%), Positives = 100/232 (43%), Gaps = 33/232 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    +D GD+SEQ A+RDKLHCK F WFM+ VA+D+   +P
Sbjct: 318 RVAEVWMDEYAEYLYKRRPHYRNIDPGDVSEQKAIRDKLHCKPFKWFMEEVAFDLPKFYP 377

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS---SQLFRLNAEGQLGHG 170
            + PP    GE +N     CLD+  +          C   G     Q F       +  G
Sbjct: 378 PVEPPPFASGEVRNKAANMCLDTRYKGQNERFDLQPCLKDGKGGGEQQFEFTWHKDIRPG 437

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
           +R V  D  + +    K PV                    LFNC    G  +  ++++  
Sbjct: 438 KRTVCFDVSQSIK---KAPVI-------------------LFNCHGMGG--NQRFKYNID 473

Query: 231 TRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK-----EVRPDWSS 277
           T+Q+ H I  +CL    + +++ M  CD+ +  Q W+ +     EVR +W +
Sbjct: 474 TQQIYHVISNQCLDCDRDRHEIFMNPCDKTSETQMWKVEHVHEDEVRKEWDA 525


>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 632

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/133 (35%), Positives = 74/133 (55%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  M +D GD+S +  LRD LHCK FSW+++++  D  ++ P
Sbjct: 390 RLAEVWMDDFKDFFYIISPGVMRVDYGDVSSRKGLRDALHCKPFSWYLENIYPD--SQIP 447

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +C+D+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 448 RRYYSL--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCL 505

Query: 175 DADKQEKLVEMLK 187
           D  +    V MLK
Sbjct: 506 DVSRLNGPVLMLK 518


>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Monodelphis domestica]
          Length = 559

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 49/134 (36%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Sarcophilus harrisii]
          Length = 559

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 49/134 (36%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
          Length = 499

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      ++P +
Sbjct: 313 WMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRH 367

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 368 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 427

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 428 LNGPVTMLK 436


>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Pongo abelii]
          Length = 612

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      ++P +
Sbjct: 373 WMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRH 427

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 428 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 487

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 488 LNGPVTMLK 496


>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
 gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
 gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
 gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
          Length = 559

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Callithrix jacchus]
 gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
           boliviensis boliviensis]
          Length = 559

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
 gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Nomascus leucogenys]
 gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Pan troglodytes]
 gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
           paniscus]
 gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
           gorilla gorilla]
 gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
           sapiens]
 gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
           CRA_a [Homo sapiens]
 gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
           CRA_a [Homo sapiens]
 gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [synthetic
           construct]
 gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
           fascicularis]
 gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
 gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
 gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
 gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
           troglodytes]
          Length = 559

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
           anubis]
          Length = 559

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|90078941|dbj|BAE89150.1| unnamed protein product [Macaca fascicularis]
          Length = 311

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      ++P +
Sbjct: 125 WMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRH 179

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 180 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 239

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 240 LNGPVTMLK 248


>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
          Length = 559

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 565

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/216 (25%), Positives = 94/216 (43%), Gaps = 38/216 (17%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE---- 115
           W+  +K      SP +   + GD++ +  LRDKL CK F W+++++       +PE    
Sbjct: 379 WMDDYKTFFYKISPGSKKSEYGDVTNRKQLRDKLQCKSFKWYLENI-------YPESQFM 431

Query: 116 LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
           +  N+  GE +N+ T++CLD+MGR     +G   CHG G +Q+F    + +L H + C+D
Sbjct: 432 MDYNMI-GEIRNMETKQCLDNMGRKENNKVGIYACHGQGGNQIFAWTKKKELKHDDLCLD 490

Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLE 235
           A +Q    ++++L                         C N  G  +  W F+     + 
Sbjct: 491 ASRQSGFNDIMQL------------------------RCHNQGGNQE--WSFNRELNVIT 524

Query: 236 HAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           HA    CL           LR  +    QQW   ++
Sbjct: 525 HATSGLCLDKADTGKDTPTLRVCDKRLSQQWELVDI 560



 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 16/43 (37%), Positives = 24/43 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
           MGR     +G   CHG G +Q+F    + +L H + C+DA +Q
Sbjct: 452 MGRKENNKVGIYACHGQGGNQIFAWTKKKELKHDDLCLDASRQ 494


>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 3 [Anolis carolinensis]
          Length = 498

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 73/134 (54%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD++ + ALRD L CK FSW++++V  D      
Sbjct: 294 RLAEVWMDEFKDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPD-----S 348

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P   F  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C
Sbjct: 349 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 408

Query: 174 VDADKQEKLVEMLK 187
           +D  +    V MLK
Sbjct: 409 LDVSRLNGPVIMLK 422


>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Cavia porcellus]
          Length = 559

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRIGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|340386578|ref|XP_003391785.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like,
           partial [Amphimedon queenslandica]
          Length = 237

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 95/217 (43%), Gaps = 33/217 (15%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K+      P       GDISE++ LR KLHCK F W+M+    +VY++ 
Sbjct: 41  MRLAEVWMDEYKEHYYAKRPSIRSRSYGDISERLELRKKLHCKSFRWYME----NVYSEM 96

Query: 114 PELPPNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           P    NL + G   N  ++ C D+MG      +    CHG G +Q F  +  G+L   + 
Sbjct: 97  PLPLENLRYGGYVMNPQSKLCFDTMGNREYGKVSLFTCHGIGGNQEFGFSDSGELIFDDD 156

Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+D   +E+                        G  + + NC    G  +  W ++N T
Sbjct: 157 LCLDISSKER------------------------GGKVGILNCHGLGG--NQKWEYNNVT 190

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           R L+H + K C+   P  N L M  C E+   Q+W F
Sbjct: 191 RHLQHVVTKLCVEKGP-GNILVMNNCMESFPSQEWIF 226


>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Anolis carolinensis]
          Length = 556

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 98/220 (44%), Gaps = 37/220 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD++ + ALRD L CK FSW++++V  D      
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPD-----S 421

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P   F  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C
Sbjct: 422 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 481

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +D  +    V MLK                          C +  G  +  W +      
Sbjct: 482 LDVSRLNGPVIMLK--------------------------CHHMRG--NQLWEYDAERLT 513

Query: 234 LEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
           L H    +CL    E +++   M  C E+ S QQW  + +
Sbjct: 514 LRHLNSNQCLDEPAEDDKMVPTMKDCSESRS-QQWLLRNM 552


>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
 gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
          Length = 600

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 74/131 (56%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+     SP A     GD+SE+ ALR+KL CK F W+++++       +PE  +P
Sbjct: 402 WLDEWKEFYYQMSPGARKASAGDVSERRALREKLKCKSFRWYLENI-------YPESQMP 454

Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            +  F GE +N  +Q CLD+MGR +   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 455 LDYYFLGEIRNEESQNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 514

Query: 177 DKQEKLVEMLK 187
                 V +++
Sbjct: 515 SNALGPVNLVR 525



 Score = 40.0 bits (92), Expect = 1.00,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR +   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 475 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 514


>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
 gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
          Length = 624

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 94/197 (47%), Gaps = 42/197 (21%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
           D GDIS +  LR+ L CK F W++ ++       +PEL     ++  GE +N  T +CLD
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGELRNAQTSQCLD 502

Query: 136 S-MGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
           S +G        T Y CH  G +Q + L+ +G++   E CVD                  
Sbjct: 503 SAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESCVDY----------------- 545

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA 253
                       G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  +L 
Sbjct: 546 -----------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAKLE 592

Query: 254 MLRCDENNSYQQWRFKE 270
           M+ C  ++ YQ W+FKE
Sbjct: 593 MVACQYDDPYQHWKFKE 609


>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Anolis carolinensis]
          Length = 557

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 98/220 (44%), Gaps = 37/220 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD++ + ALRD L CK FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P   F  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C
Sbjct: 423 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           +D  +    V MLK                          C +  G  +  W +      
Sbjct: 483 LDVSRLNGPVIMLK--------------------------CHHMRG--NQLWEYDAERLT 514

Query: 234 LEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
           L H    +CL    E +++   M  C E+ S QQW  + +
Sbjct: 515 LRHLNSNQCLDEPAEDDKMVPTMKDCSESRS-QQWLLRNM 553


>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
          Length = 624

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/197 (28%), Positives = 94/197 (47%), Gaps = 42/197 (21%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
           D GDIS +  LR+ L CK F W++ ++       +PEL     ++  GE +N  T +CLD
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGELRNAQTSQCLD 502

Query: 136 S-MGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
           S +G        T Y CH  G +Q + L+ +G++   E CVD                  
Sbjct: 503 SAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESCVDY----------------- 545

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA 253
                       G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  +L 
Sbjct: 546 -----------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAKLE 592

Query: 254 MLRCDENNSYQQWRFKE 270
           M+ C  ++ YQ W+FKE
Sbjct: 593 MVACQYDDPYQHWKFKE 609


>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
           N-acetylgalactosaminyltransferase 10-like [Metaseiulus
           occidentalis]
          Length = 590

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 64/221 (28%), Positives = 95/221 (42%), Gaps = 33/221 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P    L  GD+S+Q ALR KL CK F WFM+ VA+D   ++P
Sbjct: 385 RVAEVWMDEYKEFLYNRRPHYRTLGYGDVSKQKALRKKLKCKPFKWFMETVAFDQPLRYP 444

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLGH 169
            + PP+  WG  +N+G  KCLD+  +          C  S       Q F L     L  
Sbjct: 445 PVEPPDFAWGAIRNVGADKCLDTKFKEQGKRFSLETCISSNGDVSGEQNFVLTWHKDLRP 504

Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
            +R  C D    EK     K PV                    L+ C    G  +  +++
Sbjct: 505 AKRNVCFDVSSGEK-----KAPVV-------------------LWTCHGMHG--NQLFKY 538

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           + +T+QL H I  +C+   P   ++ M  CD  ++ Q+W F
Sbjct: 539 NVNTKQLLHPITAQCIDCDPGNLEIFMNPCDVESNTQKWLF 579


>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           2 [Ornithorhynchus anatinus]
          Length = 558

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS ++ LR KL CK FSW++++V  D      ++P +
Sbjct: 372 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPD-----SQIPRH 426

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 427 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 486

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 487 LNGPVTMLK 495


>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Anolis carolinensis]
          Length = 559

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Ornithorhynchus anatinus]
          Length = 559

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS ++ LR KL CK FSW++++V  D      ++P +
Sbjct: 373 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPD-----SQIPRH 427

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 428 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 487

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 488 LNGPVTMLK 496


>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
           Acetylgalactosaminyltransferase-T1
          Length = 472

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      ++P +
Sbjct: 286 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----SQIPRH 340

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 341 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 400

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 401 LNGPVTMLK 409


>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
           [Taeniopygia guttata]
          Length = 559

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
 gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Meleagris gallopavo]
 gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
          Length = 559

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
           (Silurana) tropicalis]
 gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
           [Xenopus (Silurana) tropicalis]
          Length = 498

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+SE+ ALR+ L C  FSW+++ V  D      
Sbjct: 294 RLAEVWMDDFKDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPD-----S 348

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P   F  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C
Sbjct: 349 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 408

Query: 174 VDADKQEKLVEMLK 187
           +D  +    V MLK
Sbjct: 409 LDVSRLNGPVIMLK 422


>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
          Length = 632

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 96/202 (47%), Gaps = 47/202 (23%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ---- 131
           D GD+S++ ALR++L CK F W++ ++       +PEL     ++  GE +N G +    
Sbjct: 456 DYGDVSDRKALRERLKCKSFKWYLDNI-------YPELFVPGDSVAKGEVRNYGYKEGGG 508

Query: 132 --KCLDS-MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKL 188
             +CLDS +G      +    CHG G +Q + L+ +G++   E C+D             
Sbjct: 509 APQCLDSVVGEDVHKDVTPYPCHGQGGNQYWMLSKDGEIRRDESCID------------- 555

Query: 189 PVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPE 248
                  AG  ++         +F C    G  +  WR+++   QL H +  +CL +  +
Sbjct: 556 ------YAGANVM---------IFPCHGMKGNQE--WRYNHKLHQLMHVVSDKCLEMSRD 598

Query: 249 TNQLAMLRCDENNSYQQWRFKE 270
            ++L +  CD +N YQQW F+E
Sbjct: 599 GSKLLVNTCDSSNPYQQWAFQE 620


>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
           glaber]
          Length = 559

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
           adamanteus]
          Length = 556

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 98/214 (45%), Gaps = 35/214 (16%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S + ALR+ L CK FSW++++V  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLEYVYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCMDVSRL 487

Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
              V MLK                          C +  G  +  W +      L H   
Sbjct: 488 NGPVIMLK--------------------------CHHMRG--NQLWEYDAERLTLRHVNS 519

Query: 240 KRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
            +CL    E +++   M  C E+ S QQW  + +
Sbjct: 520 NQCLDEPSEDDKMVPTMKDCSESRS-QQWLLRNM 552


>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mus musculus]
          Length = 559

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|148664577|gb|EDK96993.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
           musculus]
 gi|148664578|gb|EDK96994.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
           musculus]
          Length = 400

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      ++P +
Sbjct: 214 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----SQIPRH 268

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 269 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 328

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 329 LNGPVTMLK 337


>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           1 [Canis lupus familiaris]
 gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Ailuropoda melanoleuca]
 gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
          Length = 559

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
 gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
 gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
 gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mus musculus]
          Length = 559

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Meleagris gallopavo]
          Length = 556

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S + ALR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVDTNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVTMLK 495


>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
           gallus]
          Length = 556

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S + ALR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVDTNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVTMLK 495


>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
          Length = 591

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      ++P +
Sbjct: 405 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----SQIPRH 459

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 460 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 519

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 520 LNGPVTMLK 528


>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
          Length = 483

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+SE+ ALR+ L C  FSW+++ V  D      ++P  
Sbjct: 299 WMDDFKDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPD-----SQIPRR 353

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  +
Sbjct: 354 YFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSR 413

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 414 LNGPVIMLK 422


>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
           [Xenopus laevis]
 gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
          Length = 556

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+SE+ ALR+ L C  FSW+++ V  D      
Sbjct: 367 RLAEVWMDDFKDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPD-----S 421

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P   F  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C
Sbjct: 422 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 481

Query: 174 VDADKQEKLVEMLK 187
           +D  +    V MLK
Sbjct: 482 LDVSRLNGPVIMLK 495


>gi|16769916|gb|AAL29177.1| SD10722p [Drosophila melanogaster]
          Length = 666

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 64/224 (28%), Positives = 106/224 (47%), Gaps = 36/224 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K  + S G  L   +D GD++EQ A+R KL+CK F WFMK VA+D+   +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCKSFKWFMKEVAFDLMKTY 509

Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
           P + PP+   G  +N+G Q  CLD++GR     +G   C  +      +Q + L+   + 
Sbjct: 510 PPVDPPSYAMGALQNVGNQNLCLDTLGRKKHNKMGMYACADNIKTPQRTQFWELSWKRDL 569

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +L   + C+D       +     PV                    L++C +  G  +  W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603

Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +    +QL+H    +RCL + P + ++   +CD +N +QQW F
Sbjct: 604 YYDYRHKQLKHGTEGRRCLELLPFSQEVVANKCDTDNRFQQWNF 647


>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Danio rerio]
          Length = 556

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 46/128 (35%), Positives = 74/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S + ALR+ L CK FSW++++V  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVRVDYGDVSSRKALRESLKCKPFSWYLENVYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +C+D+MGR     +G   CHG G +Q+F   A+ ++   + C+DA + 
Sbjct: 430 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDASRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVVMLK 495


>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
           melanogaster]
 gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
           melanogaster]
          Length = 630

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 72/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+      P A     GD+S++ ALRD+L CK F W++++V       +PE  +P
Sbjct: 444 WLDDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 496

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 557 SSSNGPVNMVR 567



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556


>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
 gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
          Length = 541

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 72/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+      P A     GD+S++ ALRD+L CK F W++++V       +PE  +P
Sbjct: 355 WLDDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 407

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 408 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 467

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 468 SSSNGPVNMVR 478



 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 428 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 467


>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
           caballus]
          Length = 453

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      ++P +
Sbjct: 267 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----SQIPRH 321

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 322 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 381

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 382 LNGPVTMLK 390


>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
 gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
          Length = 675

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 72/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+      P A     GD+S++ ALRD+L CK F W++++V       +PE  +P
Sbjct: 489 WLDDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 541

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 542 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 602 SSSNGPVNMVR 612



 Score = 37.4 bits (85), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 562 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601


>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Oreochromis niloticus]
          Length = 557

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDI+ + ALR KL CK FSW+++++  D      ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDITSRTALRQKLQCKPFSWYLENIYPD-----SQIPRH 425

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            +  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 486 LNGPVMMLK 494


>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
          Length = 553

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCRPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 559

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GD++ ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDVASRIGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Takifugu rubripes]
          Length = 556

 Score = 86.7 bits (213), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  M +D GD+S +  LRD L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDDFKDFFYIISPGVMRVDYGDVSSRKGLRDALRCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +C+D+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVLMLK 495


>gi|395823173|ref|XP_003804166.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 1 [Otolemur garnettii]
          Length = 539

 Score = 86.7 bits (213), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 348 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 402

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 403 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 462

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 463 LDVSKLNGPVTMLK 476


>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Equus caballus]
          Length = 559

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Monodelphis domestica]
          Length = 556

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S + ALR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   AE ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKDNEKVGMFNCHGMGGNQVFSYTAEKEIRTDDFCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 SGPVIMLK 495


>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
           mutus]
          Length = 564

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 373 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 427

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 428 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 487

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 488 LDVSKLNGPVTMLK 501


>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
           catus]
          Length = 559

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
           [Bos taurus]
          Length = 519

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      ++P +
Sbjct: 333 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----SQIPRH 387

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 388 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 447

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 448 LNGPVTMLK 456


>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
          Length = 559

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDI+ ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDIASRLGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Loxodonta africana]
          Length = 559

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Oryzias latipes]
          Length = 557

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS + +LR KL CK FSW+++++  D      ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDISTRTSLRQKLQCKPFSWYLENIYPD-----SQIPRH 425

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            +  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 486 LNGPVMMLK 494


>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Sus scrofa]
          Length = 559

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
           taurus]
 gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
           taurus]
 gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
          Length = 559

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
           aries]
          Length = 559

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Metaseiulus occidentalis]
          Length = 514

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 96/202 (47%), Gaps = 43/202 (21%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLGT-QKCL 134
           D GDIS+++ALRDKL C  F W++  V       +PEL  P +    GE +NLG+   CL
Sbjct: 346 DYGDISDRIALRDKLKCHSFKWYIDEV-------YPELFVPGDAIGSGEMRNLGSGGMCL 398

Query: 135 DS-MGRTA-PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           DS  G+++    +G   CHG G +Q +  +  G++   E C+D                 
Sbjct: 399 DSPAGKSSLHKAVGLYPCHGQGGNQYWLYSKNGEIRRDEACLD----------------- 441

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
              AG  +I         L+ C  S G  +  W +   T+ L H   ++CL +     +L
Sbjct: 442 --YAGTDVI---------LYPCHGSKG--NQYWIYDQQTKLLRHGSSRKCLEMSLSKTEL 488

Query: 253 AMLRCDENNSYQQWRFKEVRPD 274
            +  CDE+N+ Q+W+F + R D
Sbjct: 489 LVRGCDEDNNRQKWKFGKYRED 510


>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
           laevis]
 gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
          Length = 559

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDI+ +V LR KL CK FSW++++V  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDIATRVGLRHKLQCKPFSWYLENVYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + +  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTASKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVIMLK 496


>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
          Length = 591

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 96/224 (42%), Gaps = 33/224 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    +D GD+++Q ALR+KLHCK F WFM+ VA+D+  K+P
Sbjct: 379 RVAEVWMDEYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYP 438

Query: 115 ELPPNLF-WGEAKNLGTQK-CLDSMGRTAPAVIGTSYC---HGSGSSQLFRLNAEGQLGH 169
            + P  F  GE +NL   + C+DS  +    VIG + C         Q F L     L  
Sbjct: 439 PIEPGDFGVGEIRNLAAPELCVDSGHKDRDQVIGLAECVKGTNKNGEQNFALTWHKDLRV 498

Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
             +  C+D        +++                        L+ C  S G  +  WR+
Sbjct: 499 KGKTLCLDVSDPNDKADIV------------------------LYPCHGSQG--NQYWRY 532

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
               +   H    RCL   P   +L + +CD+ +  Q+WRF+ V
Sbjct: 533 DVEKQWFLHGGNPRCLDCDPGQRRLYVTKCDDQSKTQKWRFENV 576


>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
          Length = 613

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 96/224 (42%), Gaps = 33/224 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    +D GD+++Q ALR+KLHCK F WFM+ VA+D+  K+P
Sbjct: 401 RVAEVWMDEYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYP 460

Query: 115 ELPPNLF-WGEAKNLGTQK-CLDSMGRTAPAVIGTSYC---HGSGSSQLFRLNAEGQLGH 169
            + P  F  GE +NL   + C+DS  +    VIG + C         Q F L     L  
Sbjct: 461 PIEPGDFGVGEIRNLAAPELCVDSGHKDRDQVIGLAECVKGTNKNGEQNFALTWHKDLRV 520

Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
             +  C+D        +++                        L+ C  S G  +  WR+
Sbjct: 521 KGKTLCLDVSDPNDKADIV------------------------LYPCHGSQG--NQYWRY 554

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
               +   H    RCL   P   +L + +CD+ +  Q+WRF+ V
Sbjct: 555 DVEKQWFLHGGNPRCLDCDPGQRRLYVTKCDDQSKTQKWRFENV 598


>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
 gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
          Length = 623

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/200 (28%), Positives = 96/200 (48%), Gaps = 49/200 (24%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLG--TQKC 133
           D GDIS +  LR+ L CK F W++ ++       +PEL     ++  GE +N G   ++C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEMRNAGGKNRQC 502

Query: 134 LD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV 190
           +D   S G+T    +G   CH  G +Q + L+ +G++   E CVD               
Sbjct: 503 IDYKPSGGKT----VGMYQCHNQGGNQYWMLSKDGEIRRDESCVDY-------------- 544

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
                          G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  
Sbjct: 545 --------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGA 588

Query: 251 QLAMLRCDENNSYQQWRFKE 270
           +L M+ C  ++ YQ W+FKE
Sbjct: 589 KLEMVACQYDDPYQHWKFKE 608


>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
          Length = 623

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 56/200 (28%), Positives = 96/200 (48%), Gaps = 49/200 (24%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLG--TQKC 133
           D GDIS +  LR+ L CK F W++ ++       +PEL     ++  GE +N G   ++C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEMRNAGGKNRQC 502

Query: 134 LD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV 190
           +D   S G+T    +G   CH  G +Q + L+ +G++   E CVD               
Sbjct: 503 IDYKPSGGKT----VGMYQCHNQGGNQYWMLSKDGEIRRDESCVDY-------------- 544

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
                          G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  
Sbjct: 545 --------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGA 588

Query: 251 QLAMLRCDENNSYQQWRFKE 270
           +L M+ C  ++ YQ W+FKE
Sbjct: 589 KLEMVACQYDDPYQHWKFKE 608


>gi|74215848|dbj|BAE28617.1| unnamed protein product [Mus musculus]
          Length = 330

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D G+IS ++ LR KL CK FSW+++++  D      ++P +
Sbjct: 144 WMDEFKNFFYIISPGVTKVDYGNISSRLGLRRKLQCKPFSWYLENIYPD-----SQIPRH 198

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 199 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 258

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 259 LNGPVTMLK 267


>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Takifugu rubripes]
          Length = 557

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 69/129 (53%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDI+ + ALR KL CK FSW+++ +  D      ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDIATRTALRQKLQCKPFSWYLESIYPD-----SQIPRH 425

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            +  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 486 LNGPVMMLK 494


>gi|383863685|ref|XP_003707310.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Megachile
           rotundata]
          Length = 610

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 99/220 (45%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  + +      P    LD G++++Q  LR +LHCK F WFM++VA+D+   +P + P+
Sbjct: 401 WMDEYAEYIYRRRPHLRSLDPGNLTKQKNLRARLHCKPFKWFMENVAFDLVDVYPPIEPD 460

Query: 120 LFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLGHGER- 172
            F  GE +N+G  + CLD+  R    ++    C    S     Q F+L     +   +R 
Sbjct: 461 DFASGEIRNMGVPELCLDAKKRKKDELVVVDICMKDNSKIIGEQEFKLTWHKDIRPKDRT 520

Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+D  + +      K PV+                   L+ C    G  +  WR++   
Sbjct: 521 ECLDVSRGDA-----KAPVS-------------------LYPCHGGQG--NQLWRYNVEK 554

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           + L H    RCL   P + ++ +  CD  +  Q+WR ++V
Sbjct: 555 QWLMHGYSSRCLDTDPASKKVFVTNCDSTSPTQKWRIEKV 594


>gi|149698080|ref|XP_001498934.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Equus
           caballus]
          Length = 601

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKG-- 538

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +  W  H   R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWMFEHI 587


>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Oryzias latipes]
          Length = 557

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  M +D GD+S + ALR+ L CK F+W+++++  D  ++ P    +
Sbjct: 373 WMDEFKDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPD--SQIPRRYYS 430

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +C+D+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 431 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 488

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 489 NGPVVMLK 496


>gi|24656262|ref|NP_647749.2| polypeptide GalNAc transferase 6, isoform A [Drosophila
           melanogaster]
 gi|24656265|ref|NP_728779.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
           melanogaster]
 gi|442629817|ref|NP_001261342.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
           melanogaster]
 gi|51315873|sp|Q6WV16.2|GALT6_DROME RecName: Full=N-acetylgalactosaminyltransferase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
 gi|7292281|gb|AAF47689.1| polypeptide GalNAc transferase 6, isoform A [Drosophila
           melanogaster]
 gi|7292282|gb|AAF47690.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
           melanogaster]
 gi|440215219|gb|AGB94037.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
           melanogaster]
          Length = 666

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 106/224 (47%), Gaps = 36/224 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K  + S G  L   +D GD++EQ A+R KL+CK F WFM+ VA+D+   +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCKSFKWFMEEVAFDLMKTY 509

Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
           P + PP+   G  +N+G Q  CLD++GR     +G   C  +      +Q + L+   + 
Sbjct: 510 PPVDPPSYAMGALQNVGNQNLCLDTLGRKKHNKMGMYACADNIKTPQRTQFWELSWKRDL 569

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +L   + C+D       +     PV                    L++C +  G  +  W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603

Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +    +QL+H    +RCL + P + ++   +CD +N +QQW F
Sbjct: 604 YYDYRHKQLKHGTEGRRCLELLPFSQEVVANKCDTDNRFQQWNF 647


>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Oryzias latipes]
          Length = 556

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  M +D GD+S + ALR+ L CK F+W+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +C+D+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVVMLK 495


>gi|34042986|gb|AAQ56703.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 666

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 106/224 (47%), Gaps = 36/224 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K  + S G  L   +D GD++EQ A+R KL+CK F WFM+ VA+D+   +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCKSFKWFMEEVAFDLMKTY 509

Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
           P + PP+   G  +N+G Q  CLD++GR     +G   C  +      +Q + L+   + 
Sbjct: 510 PPVDPPSYAMGALQNVGNQNLCLDTLGRKKHNKMGMYACADNIKTPQRTQFWELSWKRDL 569

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +L   + C+D       +     PV                    L++C +  G  +  W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603

Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +    +QL+H    +RCL + P + ++   +CD +N +QQW F
Sbjct: 604 YYDYRHKQLKHGTEGRRCLELLPFSQEVVANKCDTDNRFQQWNF 647


>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
           2 [Danio rerio]
          Length = 557

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP    +D GDIS + +LR +L CK FSW++++V  D      ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDISTRTSLRQRLQCKPFSWYLENVYPD-----SQIPRH 425

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            +  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+D  K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485

Query: 179 QEKLVEMLK 187
               V MLK
Sbjct: 486 LNGPVMMLK 494


>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 3 [Oryzias latipes]
          Length = 558

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  M +D GD+S + ALR+ L CK F+W+++++  D  ++ P    +
Sbjct: 374 WMDEFKDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPD--SQIPRRYYS 431

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +C+D+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 432 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 489

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 490 NGPVVMLK 497


>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
           spiralis]
 gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
           spiralis]
          Length = 670

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/215 (28%), Positives = 94/215 (43%), Gaps = 35/215 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P AMF  +G+IS+++ALR +L CK F W++K++       +P
Sbjct: 383 RTAEVWMDEYKEFYYKMTPSAMFAPLGNISDRIALRKRLGCKSFKWYLKNI-------YP 435

Query: 115 E--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +PP  +  G  KN     CLD+MGR A        CH SG +Q+  +      G   
Sbjct: 436 ESNIPPTYYSIGYIKNEKNDLCLDTMGRKASGSPALLTCHNSGGNQVLFMKVWSYTGTLN 495

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
             + AD+              C+ A  K       + + L  C N    +   W +  +T
Sbjct: 496 --IRADE-------------LCLQASRK-----ADSPIFLQQCNNDESQI---WDYDPNT 532

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
            Q  H   + CL+  P  N + M  C + N  QQW
Sbjct: 533 YQFRHRQSRACLT--PLNNMIVMAECYKENLLQQW 565


>gi|157113705|ref|XP_001652065.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108877647|gb|EAT41872.1| AAEL006558-PA [Aedes aegypti]
          Length = 368

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 74/136 (54%), Gaps = 10/136 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    WL  +K+     SP A     GD+SE+  LR++L CK F W+++++       +P
Sbjct: 168 RVAEVWLDEWKEFYYQMSPGARKASAGDVSERKELRERLKCKSFRWYLENI-------YP 220

Query: 115 E--LPPNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P +  F GE +N+ T  CLD+MGR +   IG+SYCHG G +Q+F      Q+   +
Sbjct: 221 ESQMPLDYYFLGEIRNVETGNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDD 280

Query: 172 RCVDADKQEKLVEMLK 187
            C+DA      V +++
Sbjct: 281 NCLDASNALGPVNLVR 296



 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR +   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 246 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDA 285


>gi|345790655|ref|XP_543189.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Canis lupus
           familiaris]
          Length = 601

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKG-- 538

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +  W  H   R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
 gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
          Length = 632

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 72/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++ ALRD+L CK F W++++V       +PE  +P
Sbjct: 446 WLDEWRDFYYAMSTGARKASAGDVSDRKALRDRLQCKSFRWYLENV-------YPESLMP 498

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 499 LDYYYLGEIRNAETETCLDTMGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 558

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 559 ASSSGPVNMVR 569



 Score = 38.1 bits (87), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 519 MGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 558


>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Ornithorhynchus anatinus]
          Length = 556

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S + ALR  L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKALRQNLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
          Length = 592

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 39/211 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+     SP A     GD+SE+  LR++L CK F W+++++       +PE  +P
Sbjct: 397 WLDEWKEFYYQMSPGARKASAGDVSERKELRERLKCKSFRWYLENI-------YPESQMP 449

Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            +  F GE +N+ T  CLD+MGR +   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 450 LDYYFLGEIRNVETGNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDA 509

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
                                       LG V NL  C    G  +  W +    + ++H
Sbjct: 510 SNA-------------------------LGPV-NLVRCHGMGGNQE--WVYDEEDKTIKH 541

Query: 237 AILKRCLSVHPETN-QLAMLRCDENNSYQQW 266
                CL+   + +    +LR  +    QQW
Sbjct: 542 VNSGNCLTRPTQDDPSTPLLRPCKYTKGQQW 572



 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR +   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 470 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDA 509


>gi|380024969|ref|XP_003696257.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
           florea]
          Length = 598

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    LD G++  Q  LR KLHCK F WFM+++A+D+   +P
Sbjct: 384 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 443

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
            + P+ F  GE +N+G  + CLDS  R    ++    C     +    Q F+L     + 
Sbjct: 444 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 503

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C+D  K                  G K         ++L+ C    G  +  WR
Sbjct: 504 PKDRTECLDVSK-----------------GGAK-------ASVSLYPCHGGQG--NQLWR 537

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   + L H    RCL   P + ++ + +CD ++  Q+WR ++V
Sbjct: 538 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 582


>gi|296195172|ref|XP_002745263.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Callithrix jacchus]
          Length = 601

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNIAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H++   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHSVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|403295707|ref|XP_003938772.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Saimiri boliviensis boliviensis]
          Length = 601

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNIAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H++   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 FWGYR---KDRTLFHSVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|328781649|ref|XP_003250010.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
           mellifera]
          Length = 598

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    LD G++  Q  LR KLHCK F WFM+++A+D+   +P
Sbjct: 384 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 443

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
            + P+ F  GE +N+G  + CLDS  R    ++    C     +    Q F+L     + 
Sbjct: 444 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 503

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C+D  K                  G K         ++L+ C    G  +  WR
Sbjct: 504 PKDRTECLDVSK-----------------GGTK-------ASVSLYPCHGGQG--NQLWR 537

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   + L H    RCL   P + ++ + +CD ++  Q+WR ++V
Sbjct: 538 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 582


>gi|380024971|ref|XP_003696258.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 3 [Apis
           florea]
          Length = 590

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    LD G++  Q  LR KLHCK F WFM+++A+D+   +P
Sbjct: 376 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 435

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
            + P+ F  GE +N+G  + CLDS  R    ++    C     +    Q F+L     + 
Sbjct: 436 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 495

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C+D  K                  G K         ++L+ C    G  +  WR
Sbjct: 496 PKDRTECLDVSK-----------------GGAK-------ASVSLYPCHGGQG--NQLWR 529

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   + L H    RCL   P + ++ + +CD ++  Q+WR ++V
Sbjct: 530 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 574


>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Oreochromis niloticus]
          Length = 556

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  M ++ GD+S + ALR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDDFKDFFYIISPGVMRVEYGDVSSRKALREALKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +C+D+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCMDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVVMLK 495


>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
           AltName: Full=Polypeptide GalNAc transferase 1;
           Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 1;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 1; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 1
           soluble form
 gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
          Length = 559

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 71/133 (53%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL C+ FSW+++++  D  ++ P
Sbjct: 368 RLAEVWMDEFKTFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD--SQIP 425

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C+
Sbjct: 426 RHYSSL--GEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCL 483

Query: 175 DADKQEKLVEMLK 187
           D  K    V MLK
Sbjct: 484 DVSKLNGPVTMLK 496


>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 559

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +   T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRKEETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|328781647|ref|XP_003250009.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
           mellifera]
          Length = 611

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    LD G++  Q  LR KLHCK F WFM+++A+D+   +P
Sbjct: 397 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 456

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
            + P+ F  GE +N+G  + CLDS  R    ++    C     +    Q F+L     + 
Sbjct: 457 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 516

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C+D  K                  G K         ++L+ C    G  +  WR
Sbjct: 517 PKDRTECLDVSK-----------------GGTK-------ASVSLYPCHGGQG--NQLWR 550

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   + L H    RCL   P + ++ + +CD ++  Q+WR ++V
Sbjct: 551 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 595


>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
 gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
          Length = 561

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 424

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 482

Query: 175 DADKQEKLVEMLK 187
           D  +    V MLK
Sbjct: 483 DVSRLNGPVIMLK 495


>gi|194222233|ref|XP_001490001.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Equus
           caballus]
          Length = 539

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S + +LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 355 WMDEFKDFFYIISPGVVKVDYGDVSVRKSLRENLKCKPFSWYLENIYPD--SQIPRRYYS 412

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 413 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 470

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 471 NGPVIMLK 478


>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Saimiri boliviensis boliviensis]
          Length = 556

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLQCKPFSWYLENIYPD--SQIP 424

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 482

Query: 175 DADKQEKLVEMLK 187
           D  +    V MLK
Sbjct: 483 DVSRLNGPVIMLK 495


>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
 gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
          Length = 675

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 72/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+      P A     GD+S++ ALR++L CK F W++++V       +PE  +P
Sbjct: 489 WLDDWKEFYYSFYPGARKASAGDVSDRKALRERLKCKSFRWYLENV-------YPESLMP 541

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 542 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 602 SSSNGPVNMVR 612



 Score = 37.4 bits (85), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 562 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601


>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
          Length = 559

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/134 (35%), Positives = 70/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS +V LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T + LD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQFLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496


>gi|380024967|ref|XP_003696256.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
           florea]
          Length = 611

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    LD G++  Q  LR KLHCK F WFM+++A+D+   +P
Sbjct: 397 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 456

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
            + P+ F  GE +N+G  + CLDS  R    ++    C     +    Q F+L     + 
Sbjct: 457 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 516

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C+D  K                  G K         ++L+ C    G  +  WR
Sbjct: 517 PKDRTECLDVSK-----------------GGAK-------ASVSLYPCHGGQG--NQLWR 550

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   + L H    RCL   P + ++ + +CD ++  Q+WR ++V
Sbjct: 551 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 595


>gi|340713833|ref|XP_003395440.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like [Bombus
           terrestris]
          Length = 610

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    L+ G++ EQ  LR +LHCK F WFM+++A+D+   +P
Sbjct: 396 RVAEVWMDEYAEYIYTRRPHLRSLNPGNLKEQRDLRARLHCKPFKWFMENIAFDLVDVYP 455

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLG 168
            + P+ F  GE +N+G  + CLDS  R    ++    C          Q F+L     + 
Sbjct: 456 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDTCIKDNPKIIGEQEFKLTWHKDIR 515

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C+D  +        K PVT                   L+ C    G  +  WR
Sbjct: 516 PKDRTECLDVSRGGA-----KAPVT-------------------LYPCHGGQG--NQLWR 549

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   + L H    RCL   P + ++   +CD +++ Q+WR ++V
Sbjct: 550 YNVEKQWLMHGYTSRCLDTDPASKKVFAAKCDSSSATQKWRIEKV 594


>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
           partial [Papio anubis]
          Length = 444

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 306 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 363

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 364 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 421

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 422 NGPVIMLK 429


>gi|350409603|ref|XP_003488790.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Bombus
           impatiens]
          Length = 610

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    L+ G++ EQ  LR +LHCK F WFM+++A+D+   +P
Sbjct: 396 RVAEVWMDEYAEYIYTRRPHLRSLNPGNLKEQRDLRARLHCKPFKWFMENIAFDLVDVYP 455

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLG 168
            + P+ F  GE +N+G  + CLDS  R    ++    C          Q F+L     + 
Sbjct: 456 PVEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDTCIKDNPKIIGEQEFKLTWHKDIR 515

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C+D  +        K PVT                   L+ C    G  +  WR
Sbjct: 516 PKDRTECLDVSRGGA-----KAPVT-------------------LYPCHGGQG--NQLWR 549

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   + L H    RCL   P + ++   +CD +++ Q+WR ++V
Sbjct: 550 YNVEKQWLMHGYTSRCLDTDPASKKVFAAKCDSSSATQKWRIEKV 594


>gi|344237432|gb|EGV93535.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Cricetulus
           griseus]
          Length = 413

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D G+IS ++ LR KL CK FSW+++++  D      
Sbjct: 222 RLAEVWMDEFKNFFYIISPGFTKVDYGEISSRLGLRHKLQCKPFSWYLENIYPD-----S 276

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+ T +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 277 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 336

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 337 LDVSKLNGPVTMLK 350


>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
          Length = 516

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 327 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 384

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 385 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 442

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 443 SGPVIMLK 450


>gi|426337430|ref|XP_004032709.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like,
           partial [Gorilla gorilla gorilla]
          Length = 231

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 47  WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 104

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 105 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 162

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 163 NGPVIMLK 170


>gi|410956556|ref|XP_003984908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Felis catus]
          Length = 601

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF L     +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTLGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCVDCNPAEKKIFMARCDPLSETQQWVFEHL 587


>gi|348566779|ref|XP_003469179.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like [Cavia porcellus]
          Length = 601

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCIDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 FWGYR---KDRTLFHPVSNSCMDSNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Cricetulus griseus]
          Length = 497

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 313 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 370

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 371 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 428

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 429 NGPVIMLK 436


>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
          Length = 519

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 97/219 (44%), Gaps = 35/219 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 330 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 387

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 388 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 445

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
           D  +    V MLK                          C +  G  +  W +      L
Sbjct: 446 DVSRLNGPVIMLK--------------------------CHHMRG--NQLWEYDAERLTL 477

Query: 235 EHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
            H    +CL    E +++   M  C  N S QQW  + +
Sbjct: 478 RHVNSNQCLDEPSEEDKMVPTMQDCSGNRS-QQWLLRNM 515


>gi|397506054|ref|XP_003823551.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
           partial [Pan paniscus]
          Length = 518

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 304 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 363

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 364 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 423

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 424 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 457

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 458 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 504


>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Pan troglodytes]
 gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Pongo abelii]
 gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Pongo abelii]
 gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
           paniscus]
 gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
           sapiens]
 gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           isoform CRA_a [Homo sapiens]
 gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           isoform CRA_a [Homo sapiens]
 gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
          Length = 556

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Callithrix jacchus]
          Length = 561

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPD--SQIP 424

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 482

Query: 175 DADKQEKLVEMLK 187
           D  +    V MLK
Sbjct: 483 DVSRLNGPVIMLK 495


>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Nomascus leucogenys]
          Length = 557

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 425

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 426 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 483

Query: 175 DADKQEKLVEMLK 187
           D  +    V MLK
Sbjct: 484 DVSRLNGPVIMLK 496


>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
 gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
           taurus]
 gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
          Length = 556

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Nomascus leucogenys]
          Length = 556

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
 gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
          Length = 488

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 57/217 (26%), Positives = 94/217 (43%), Gaps = 40/217 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD--VYTKFPELP 117
           W+  FK    + SP    +D GD++ +  LRDKL+CK F W+++++  D  + T +  L 
Sbjct: 299 WMDNFKDFFYIISPGVTKVDYGDVTGRKELRDKLNCKPFKWYLENIYPDSQIPTSYHSL- 357

Query: 118 PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 177
                GE +N+ + +C+D+M R     +G   CHG G +Q+F    E +L   + C+D  
Sbjct: 358 -----GEIRNVDSNQCIDNMARKENEKVGIFSCHGMGGNQVFSYTKEKELRTDDLCLDVS 412

Query: 178 KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
           K                           G  + LF C +  G  +  W +    + L+H 
Sbjct: 413 KP--------------------------GGPVMLFKCHHLGG--NQLWEYDEKIQLLKHV 444

Query: 238 ILKRCLSVHPET---NQLAMLRCDENNSYQQWRFKEV 271
              +CL   P +    + +M  CD   S Q W+   V
Sbjct: 445 NSNQCLD-KPHSGSPQEPSMQPCDPGKSSQHWQLTNV 480


>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
           aries]
          Length = 556

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|402870849|ref|XP_003899412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 2 [Papio anubis]
          Length = 584

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 370 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 429

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 430 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 489

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 490 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 523

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 524 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 570


>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Otolemur garnettii]
          Length = 556

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|426346013|ref|XP_004040685.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
           partial [Gorilla gorilla gorilla]
          Length = 555

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 341 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 400

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 401 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 460

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 461 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 494

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 495 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 541


>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Homo sapiens]
 gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
          Length = 556

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Oryctolagus cuniculus]
          Length = 557

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 425

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 426 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 483

Query: 175 DADKQEKLVEMLK 187
           D  +    V MLK
Sbjct: 484 DVSRLNGPVIMLK 496


>gi|109076173|ref|XP_001084905.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 isoform 2
           [Macaca mulatta]
          Length = 584

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 370 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 429

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 430 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 489

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 490 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 523

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 524 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 570


>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
           norvegicus]
 gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
 gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
 gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
 gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a
           [Rattus norvegicus]
          Length = 556

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 SGPVIMLK 495


>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
          Length = 516

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 332 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 389

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 390 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 447

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 448 NGPVIMLK 455


>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 2 [Ailuropoda melanoleuca]
          Length = 557

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 425

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+
Sbjct: 426 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 483

Query: 175 DADKQEKLVEMLK 187
           D  +    V MLK
Sbjct: 484 DVSRLSGPVIMLK 496


>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
           catus]
          Length = 546

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 362 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 419

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 420 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 477

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 478 SGPVIMLK 485


>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           isoform 1 [Ailuropoda melanoleuca]
          Length = 556

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 SGPVIMLK 495


>gi|351699506|gb|EHB02425.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
           glaber]
          Length = 246

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 70/134 (52%), Gaps = 6/134 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GD+S ++ LR KL CK FSW+++++  D      
Sbjct: 56  RLAEVWMDEFKNFFYIISPGVTKVDYGDMSSRLGLRHKLQCKPFSWYLENIYPD-----S 110

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           ++P + F  GE +N+   +CLD+M R     +G   CHG G +Q+F   A  ++   + C
Sbjct: 111 QIPCHYFSLGEIRNVEANQCLDNMARKENEKVGIFNCHGMGRNQVFSYTANKEIRTDDLC 170

Query: 174 VDADKQEKLVEMLK 187
           +D  K    V MLK
Sbjct: 171 LDVSKLNGPVTMLK 184


>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
           AltName: Full=Polypeptide GalNAc transferase 13;
           Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 13;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 13
 gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 [Mus musculus]
 gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
 gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
 gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
 gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
           musculus]
          Length = 556

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 SGPVIMLK 495


>gi|402870847|ref|XP_003899411.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 1 [Papio anubis]
          Length = 601

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|332217746|ref|XP_003258022.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Nomascus leucogenys]
          Length = 601

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           2 [Otolemur garnettii]
          Length = 558

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 374 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 431

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 432 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 489

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 490 NGPVIMLK 497


>gi|86475571|emb|CAF25036.1| pp-GalNAc-transferase 17 [Homo sapiens]
          Length = 584

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 370 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 429

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 430 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 489

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 490 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 523

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 524 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 570


>gi|194018457|ref|NP_001030017.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
 gi|296434516|sp|Q49A17.2|GLTL6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase-like 6;
           AltName: Full=Polypeptide GalNAc transferase 17;
           Short=GalNAc-T17; Short=pp-GaNTase 17; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 17;
           AltName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 17; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 17
 gi|311103007|gb|ADP69004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 20 [Homo sapiens]
          Length = 601

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
          Length = 556

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFKCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 SGPVIMLK 495


>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           3 [Canis lupus familiaris]
          Length = 556

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 SGPVIMLK 495


>gi|109076171|ref|XP_001084788.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 isoform 1
           [Macaca mulatta]
 gi|355687723|gb|EHH26307.1| hypothetical protein EGK_16237 [Macaca mulatta]
          Length = 601

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|195020976|ref|XP_001985304.1| GH16989 [Drosophila grimshawi]
 gi|193898786|gb|EDV97652.1| GH16989 [Drosophila grimshawi]
          Length = 682

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 62/219 (28%), Positives = 104/219 (47%), Gaps = 36/219 (16%)

Query: 60  WLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-P 117
           W+  +K  + + G  +   +D GD++ Q A+R KL CK F WFM++VA+D+   +P   P
Sbjct: 470 WMDEYKNYLYANGEGIYERVDAGDLTAQKAIRTKLKCKSFKWFMENVAFDLMKSYPPFDP 529

Query: 118 PNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHG 170
           P+   G  +NLG Q KC+D++GR+    +G  +C         SQ + L+   + +L   
Sbjct: 530 PDYASGAIQNLGDQTKCVDTLGRSRHNRLGIYHCADDLVRPQRSQYWSLSWKRDLRLRRK 589

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
           + C+D                         I+     V  L++C    G  +  W +   
Sbjct: 590 KDCMDVQ-----------------------IWGENAPVW-LWDCHGQQG--NQYWYYDYQ 623

Query: 231 TRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           T+ + H    KRC+ + P T +L + +CD +NSY +W F
Sbjct: 624 TKLIMHGKEGKRCMELLPSTEELVVNKCDVSNSYMRWSF 662


>gi|114596861|ref|XP_001155128.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 isoform 1 [Pan
           troglodytes]
          Length = 601

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
          Length = 594

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 73/131 (55%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  + +     +P A     GD+SE+  LR++L CK F W+++++       +PE  +P
Sbjct: 452 WLDGWSEFYYNINPGARKASAGDVSERRELRERLKCKSFRWYLENI-------YPESQMP 504

Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            +  F GE +N+ +Q CLD+MGR A   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 505 LDYYFLGEIRNVESQNCLDTMGRKANEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 564

Query: 177 DKQEKLVEMLK 187
                 V +++
Sbjct: 565 SNALGPVNLVR 575



 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 24/40 (60%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR A   IG+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 525 MGRKANEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 564


>gi|297264099|ref|XP_002798960.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Macaca mulatta]
          Length = 375

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 191 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 248

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 249 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 306

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 307 NGPVIMLK 314


>gi|291385920|ref|XP_002709516.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Oryctolagus cuniculus]
          Length = 601

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCVDCNPAEKKIFMARCDPFSETQQWIFEHI 587


>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
           1 [Callithrix jacchus]
          Length = 556

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPD--SQIPRRYYS 429

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C+D  + 
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487

Query: 180 EKLVEMLK 187
              V MLK
Sbjct: 488 NGPVIMLK 495


>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
           melanogaster]
 gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
 gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
           Short=pp-GaNTase 5; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 5; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5
 gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
           melanogaster]
 gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
          Length = 630

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 71/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++ ALRD+L CK F W++++V       +PE  +P
Sbjct: 444 WLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 496

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 557 SSSNGPVNMVR 567



 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556


>gi|440908502|gb|ELR58511.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial [Bos
           grunniens mutus]
          Length = 254

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 89/218 (40%), Gaps = 18/218 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GD+S Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 40  RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFKWFMAAVAWDVPKYYP 99

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            + PP   WGE +N+    C+DS        +    C   GS + +        G  E  
Sbjct: 100 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWRE-- 157

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
            D    E L         FC  A          + + L++C    G  +  W  H   R 
Sbjct: 158 -DIRPGEPLHTR-----KFCFDA------VSHSSPVTLYDCHGMKG--NQLWG-HRKDRT 202

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 203 LFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWVFEHI 240


>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 617

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 46/136 (33%), Positives = 72/136 (52%), Gaps = 10/136 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    WL  ++      S  A     GD+S++ ALRD+L CK F W++++V       +P
Sbjct: 426 RVAEVWLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YP 478

Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   +
Sbjct: 479 ESLMPLDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDD 538

Query: 172 RCVDADKQEKLVEMLK 187
            C+DA      V M++
Sbjct: 539 LCLDASSSNGPVNMVR 554



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 504 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 543


>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
 gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
          Length = 630

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 71/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++ ALRD+L CK F W++++V       +PE  +P
Sbjct: 444 WLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 496

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 557 SSSNGPVNMVR 567



 Score = 37.4 bits (85), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556


>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
          Length = 536

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 71/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++ ALRD+L CK F W++++V       +PE  +P
Sbjct: 350 WLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 402

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 403 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 462

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 463 SSSNGPVNMVR 473



 Score = 37.4 bits (85), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 423 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 462


>gi|431918329|gb|ELK17556.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Pteropus
           alecto]
          Length = 371

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 157 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKFYP 216

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 217 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 276

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 277 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 310

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 311 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 357


>gi|395840006|ref|XP_003792861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 1 [Otolemur garnettii]
          Length = 601

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHLVSNSCVDCNPAEKKIFMARCDPLSETQQWVFEHI 587


>gi|395840008|ref|XP_003792862.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           isoform 2 [Otolemur garnettii]
          Length = 600

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 386 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 445

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 446 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 505

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 506 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 539

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 540 LWGYR---KDRTLFHLVSNSCVDCNPAEKKIFMARCDPLSETQQWVFEHI 586


>gi|300796651|ref|NP_001178227.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
          Length = 601

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GD+S Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAVSHSS-------PVT-------------------LYDCHGMKG-- 538

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +  W  H   R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWVFEHI 587


>gi|426220611|ref|XP_004004508.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Ovis aries]
          Length = 601

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GD+S Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFRWFMAAVAWDVPRHYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAVSHSS-------PVT-------------------LYDCHGMKG-- 538

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +  W  H   R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWVFEHI 587


>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
 gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
          Length = 633

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 72/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++ ALR++L CK F W++++V       +PE  +P
Sbjct: 447 WLDEWRDFYYAMSTGARKASAGDVSDRKALRERLQCKSFRWYLENV-------YPESLMP 499

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 500 LDYYYLGEIRNAETETCLDTMGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 559

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 560 ASSNGPVNMVR 570



 Score = 38.1 bits (87), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 23/40 (57%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G+SYCHG G +Q+F      Q+   + C+DA
Sbjct: 520 MGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 559


>gi|351699379|gb|EHB02298.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
           [Heterocephalus glaber]
          Length = 522

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 88/218 (40%), Gaps = 15/218 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 305 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 364

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            + PP   WGE +N+    C+DS        +    C   GS + +      +L      
Sbjct: 365 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQACELFTFGWR 424

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
            D    E L         FC  A          + + L++C    G     +R     R 
Sbjct: 425 EDIRPGEPLHTR-----KFCFDA------ISHNSPVTLYDCHGMKGNQLWGYR---KDRT 470

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 471 LFHPVSNSCMDSNPAEKKIFMARCDPLSETQQWIFEHI 508


>gi|344288241|ref|XP_003415859.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Loxodonta africana]
          Length = 601

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 87/218 (39%), Gaps = 18/218 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            + PP   WGE +N+    C+DS        +    C   GS + +        G  E  
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWRE-- 504

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
            D    E L         FC  A          +   L++C    G     +R     R 
Sbjct: 505 -DIRPGEPLHTR-----KFCFDA------ISHNSPATLYDCHGMKGNQLWGYR---KDRT 549

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 550 LFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587


>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
            [Saccoglossus kowalevskii]
          Length = 1082

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/221 (25%), Positives = 90/221 (40%), Gaps = 38/221 (17%)

Query: 54   LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
            +R    W+  +K+      P A     GD+SE++ L+ +L CKDF W++ +V       +
Sbjct: 892  IRVAEVWMDDYKRFFYATRPDAQGKTYGDLSERLKLKSRLKCKDFKWYLDNV-------Y 944

Query: 114  PELP-PN---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
            PEL  PN     WGE +N  +  CLD++ R A   +G   CHG G +Q+F     G++ H
Sbjct: 945  PELSVPNENAYAWGECQNAASNVCLDTLMREAGQPVGLYICHGGGGNQVFSYTKLGEVRH 1004

Query: 170  GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
             E C+D   +                        ++G       C    G      R H 
Sbjct: 1005 EELCLDVSTK------------------------KVGETPVFEQCHALGGNQMWEHRKHG 1040

Query: 230  STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
              R     +   CL    + + L M+ C+   + Q W F +
Sbjct: 1041 FIRHKSSGL---CLDRSGDNDGLKMMSCNSKKTTQIWEFSQ 1078


>gi|354484373|ref|XP_003504363.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like, partial [Cricetulus griseus]
          Length = 555

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 87/230 (37%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 341 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 400

Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + P    WGE +NL    C+DS        +    C   GS       QLF       +
Sbjct: 401 PVEPLPAAWGEIRNLAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 460

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 461 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 494

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 495 LWGYR---KDRTLFHPVTNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 541


>gi|334331052|ref|XP_001372346.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
           partial [Monodelphis domestica]
          Length = 573

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 87/218 (39%), Gaps = 18/218 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR  L CKDF WFM  VA+DV   +P
Sbjct: 359 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 418

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            + PP   WGE +N+    C+DS        +    C   GS + +        G  E  
Sbjct: 419 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWRE-- 476

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
            D    E L         FC  A          + + L++C    G     +R     + 
Sbjct: 477 -DVRPGEPLHTR-----KFCFDA------ISHSSPVTLYDCHGMKGNQHWGYR---KDKT 521

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 522 LFHPVSNSCIDCNPSEKKIFMNRCDPLSETQQWIFEHI 559


>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Nasonia vitripennis]
          Length = 635

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 57/224 (25%), Positives = 97/224 (43%), Gaps = 47/224 (20%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  + +GD+SE+V LR++L CK F W+++++       +P
Sbjct: 436 RVAEVWMDEWRDFYYAMNPGARNVPVGDVSERVKLREQLKCKSFRWYLENI-------YP 488

Query: 115 ELPPNL---FWGEAKNLGTQK-------CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAE 164
           E P  L   + G+ KN            CLD+MGR     +G SYCHG G +Q+F     
Sbjct: 489 ESPMPLDYYYLGDIKNADPNNPEKVQNYCLDTMGRRTGENVGMSYCHGLGGNQIFAYTKR 548

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
            Q+   + C+DA   +  V++++                          C    G  +  
Sbjct: 549 QQIMSDDMCLDAASPQGPVKIVR--------------------------CHGMGG--NQA 580

Query: 225 WRFHNSTRQLEHAILKRCLSV--HPETNQLAMLRCDENNSYQQW 266
           W ++  T+ ++H   K CLS     +  Q  +  CD  +  Q+W
Sbjct: 581 WIYNEETKMIKHVNTKHCLSKPRPSDPTQPVLAPCDSKSVGQKW 624



 Score = 38.1 bits (87), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 521 MGRRTGENVGMSYCHGLGGNQIFAYTKRQQIMSDDMCLDA 560


>gi|395542399|ref|XP_003773120.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
           [Sarcophilus harrisii]
          Length = 436

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 61/218 (27%), Positives = 87/218 (39%), Gaps = 18/218 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR  L CKDF WFM  VA+DV   +P
Sbjct: 222 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 281

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            + PP   WGE +N+    C+DS        +    C   GS + +        G  E  
Sbjct: 282 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWRE-- 339

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
            D    E L         FC  A          + + L++C    G     +R     + 
Sbjct: 340 -DVRPGEPLHTR-----KFCFDA------ISHSSPVTLYDCHGMKGNQHWGYR---KDKT 384

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 385 LFHPVSNSCIDCNPSEKKIFMNRCDPLSETQQWIFEHI 422


>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
 gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
          Length = 682

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/220 (28%), Positives = 100/220 (45%), Gaps = 39/220 (17%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    WL  +K+I          +D GD+SE+  LR+ L C  F W++ +V  D++   
Sbjct: 481 IRLAEVWLDDYKEI-YYNRINHKLVDFGDVSERKKLREHLKCHSFKWYLDNVFPDLF--- 536

Query: 114 PELPPN-LFWGEAKNLGTQK-CLD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGH 169
             LP   +  GE +NLG QK C+D  +GR A       Y CH  G +Q + L+  G++  
Sbjct: 537 --LPSEAIASGEIRNLGNQKYCVDHDVGRNAVNDSVIPYPCHLQGGNQFWMLSKSGEIRR 594

Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
            E C+D   +         PVT                    + C  S G  +  W +++
Sbjct: 595 DEYCIDYTGRGS-------PVT--------------------YECHGSKG--NQLWDYNH 625

Query: 230 STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            T +L H + + CL++  +   L M  CD+ N  Q+WRF+
Sbjct: 626 ETGRLYHPVSRYCLTLSGDDTMLVMRICDDKNERQRWRFQ 665


>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
 gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
          Length = 621

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++ +LRD+L CK F W++++V       +PE  +P
Sbjct: 435 WLDEWRDFYYAMSTGARKASAGDVSDRKSLRDRLQCKSFRWYLENV-------YPESLMP 487

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 488 LDYYYLGEIRNSETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 547

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 548 ASSNGPVNMVR 558



 Score = 37.4 bits (85), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 508 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 547


>gi|194749276|ref|XP_001957065.1| GF24250 [Drosophila ananassae]
 gi|190624347|gb|EDV39871.1| GF24250 [Drosophila ananassae]
          Length = 662

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 102/222 (45%), Gaps = 32/222 (14%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K  + S G  +   +D GD++ Q A+R KL CK F WFM+ VA+D+   +
Sbjct: 446 RVAEVWMDEYKNYLYSHGDGIYERVDAGDLTAQKAIRTKLKCKSFRWFMEEVAFDLMKNY 505

Query: 114 PEL-PPNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQL 167
           P + PPN   G  +++G  Q CLD+MGR     +G   C         SQ + L+ +  L
Sbjct: 506 PPVDPPNYAMGAIQSVGNPQLCLDTMGRKKHNRMGMFACADDLKVPQKSQFWELSWKRDL 565

Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
                     +Q +  E L + +                  + L++C    G     + +
Sbjct: 566 ----------RQRRKKECLDVQI------------WEANAPVWLWDCHGQGGNQYWYYDY 603

Query: 228 HNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           HN  + L+H +  +RCL + P ++++ +  CD  N Y QW F
Sbjct: 604 HN--KLLKHGMEGRRCLELLPFSHEVVVNNCDPKNHYMQWNF 643


>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
 gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
          Length = 677

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+      P A     GD+S++  LR++L CK F W++++V       +PE  +P
Sbjct: 491 WLDDWKEFYYSFYPGARKASAGDVSDRKNLRERLKCKSFRWYLENV-------YPESLMP 543

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 544 LDYYYLGEIRNSETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 603

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 604 SSSNGPVNMVR 614



 Score = 37.4 bits (85), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 564 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 603


>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
          Length = 626

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 99/217 (45%), Gaps = 41/217 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +     +P A+ + +GD+SE+ ALR++L CK F W+++++       +P
Sbjct: 432 RVAEVWMDEWGEFYYAMNPGALNVPVGDVSERKALRERLKCKSFRWYLENI-------YP 484

Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P + ++ GE +N  T  CLD++G  A   +G  YCHG G +Q+F      Q+   +
Sbjct: 485 ESQMPLDYYYLGEIRNAETSNCLDTLGGKAGQPLGMGYCHGMGGNQVFAYTKRKQIMSDD 544

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C+DA      +++++                          C    G  +  W +   +
Sbjct: 545 NCLDAAHPRGPIKLIR--------------------------CHGMRGNQE--WTYDTKS 576

Query: 232 RQLEHAILKRCLSVHPETNQL--AMLRCDENNSYQQW 266
           R ++H     CL   PE+  +   +LR  + +  QQW
Sbjct: 577 RTIKHTNTGMCLD-KPESTDVWKPVLRSCDRSRGQQW 612


>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like isoform 2 [Xenopus (Silurana) tropicalis]
          Length = 601

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 88/226 (38%), Gaps = 34/226 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  + +      P    L  GDIS Q  LR  L CKDF W+M  VA+DV   +P
Sbjct: 387 RVAETWMDEYAEYIYQRRPEYRHLSTGDISSQKELRKHLKCKDFKWYMSEVAWDVPKFYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+ +  C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPASWGEIRNVASNLCIDSKHGATGTELRLDTCVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL--GTVLNLFNCFNSSGTVDGPW 225
             GE                       L   K  F  +   + + L++C    G     +
Sbjct: 507 RPGE----------------------PLHTRKFCFDSISHSSPVTLYDCHGMKGNQQWSY 544

Query: 226 RFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           R     + L H +   C+  +P   +L M RCD  +  QQW F+ V
Sbjct: 545 R---KDKTLFHPVTNSCIDCNPAEKKLFMNRCDPLSETQQWIFEHV 587


>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like isoform 1 [Xenopus (Silurana) tropicalis]
          Length = 600

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 96/251 (38%), Gaps = 34/251 (13%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVAL 89
           ++GH  R     K    T  +    R   +W+  + +      P    L  GDIS Q  L
Sbjct: 361 RVGHIYRKYVPYKVPTGTSLARNLKRVAETWMDEYAEYIYQRRPEYRHLSTGDISSQKEL 420

Query: 90  RDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTS 148
           R  L CKDF W+M  VA+DV   +P + PP   WGE +N+ +  C+DS        +   
Sbjct: 421 RKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVASNLCIDSKHGATGTELRLD 480

Query: 149 YCHGSGS------SQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIF 202
            C   GS       QLF       +  GE                       L   K  F
Sbjct: 481 TCVKDGSERTWSHEQLFTFGWREDIRPGE----------------------PLHTRKFCF 518

Query: 203 CRL--GTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
             +   + + L++C    G     +R     + L H +   C+  +P   +L M RCD  
Sbjct: 519 DSISHSSPVTLYDCHGMKGNQQWSYR---KDKTLFHPVTNSCIDCNPAEKKLFMNRCDPL 575

Query: 261 NSYQQWRFKEV 271
           +  QQW F+ V
Sbjct: 576 SETQQWIFEHV 586


>gi|118090108|ref|XP_420520.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Gallus gallus]
          Length = 601

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/230 (27%), Positives = 87/230 (37%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR  L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     + L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWSYR---KDKTLFHPVSSSCIDCNPSEKKIFMNRCDPLSETQQWVFEHI 587


>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 580

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 71/133 (53%), Gaps = 12/133 (9%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  F+      SP     + GD+S++  LRD+L C  F W+++++       +PE    
Sbjct: 393 WMDDFRHFYYRISPGVRKTEFGDVSQRKKLRDRLKCHTFEWYLENI-------YPESQFR 445

Query: 120 LFW---GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD- 175
           L +   GE +N+ T KCLD+MGR     +G   CHG G +Q+F L  + ++ H + C+D 
Sbjct: 446 LDFKTIGEIRNIETHKCLDNMGRKENEKVGIFSCHGQGGNQIFALTKQNEIKHDDLCLDA 505

Query: 176 -ADKQEKLVEMLK 187
            A+   K V M+K
Sbjct: 506 SANSHYKDVVMIK 518



 Score = 38.1 bits (87), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 23/40 (57%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G   CHG G +Q+F L  + ++ H + C+DA
Sbjct: 466 MGRKENEKVGIFSCHGQGGNQIFALTKQNEIKHDDLCLDA 505


>gi|449685123|ref|XP_002167708.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like,
           partial [Hydra magnipapillata]
          Length = 411

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/232 (27%), Positives = 107/232 (46%), Gaps = 35/232 (15%)

Query: 42  KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
           K V  T   +L  R    W+  +K+      PL    + G+I++++ LR KL CK F W+
Sbjct: 196 KGVTETLSKNLN-RLAEVWMDEYKEYYYRSRPLFRGKEYGNITQRLELRQKLQCKSFKWY 254

Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAV---IGTSYCHGSGSSQL 158
           M+++  D+  + P+L P    GE +N  +  C+DSMG     V   +G   CHG G +Q 
Sbjct: 255 MENIYSDM--EIPDLYPPAE-GEIRNGASNLCIDSMGVVKENVKHQVGLYPCHGEGGAQH 311

Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
           F+L+ +G++      +  DK             FC+   V       G  +  F C    
Sbjct: 312 FQLSLKGEI------IFQDK-------------FCLDVAV----ASPGAFIEFFKCHKQR 348

Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLS--VHPETNQLAMLRCDENNSYQQWRF 268
           G  +  W+ +  + ++ H + ++CL   V    ++  M RCD   S Q+W+F
Sbjct: 349 G--NQLWQHNIDSGEIVHFVTQQCLDLGVSDHKDKAVMNRCDGRAS-QKWKF 397


>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 612

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 101/220 (45%), Gaps = 35/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  +K+   +  P    L+ GD++ Q  LR +L+CK F WFM++VA+D  +K+P + PP
Sbjct: 412 WMDEYKEYLYMRRPHYRNLEPGDLTAQKELRKRLNCKSFKWFMENVAFDQPSKYPAIEPP 471

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
           +  WGE ++  +  C+D+  +              G ++ F L         E+C+   +
Sbjct: 472 DYAWGEIRHEKSSLCIDTQFK--------------GQNERFSL---------EKCIRDHR 508

Query: 179 QEKLVEMLKL-------PVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            +   + L L       P    V   V     R   V  L++C    G  +  +++  +T
Sbjct: 509 DQSGEQHLVLTWHKDIRPQKRTVCFDVSSSEPRAPVV--LWSCHGMHG--NQLFKYDTTT 564

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +Q+ H I   CL    E  ++ M  CD N+  Q+W F+ V
Sbjct: 565 KQISHPITASCLDCDAERREIFMNSCDPNSVSQRWLFEHV 604


>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
 gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
          Length = 630

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 70/131 (53%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++  LRD+L CK F W++++V       +PE  +P
Sbjct: 444 WLDEWRDFYYAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENV-------YPESLMP 496

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 557 ASSNGPVNMVR 567



 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556


>gi|209364560|ref|NP_001129228.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Rattus
           norvegicus]
          Length = 601

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/230 (27%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + P    WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPLPAAWGEIRNVAANLCVDSKHGAMGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C+DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCLDAISHSS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---EDRTLFHPVSNSCMDCNPAEKKVFMARCDPLSETQQWMFEHI 587


>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
 gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
          Length = 630

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 70/131 (53%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  ++      S  A     GD+S++  LRD+L CK F W++++V       +PE  +P
Sbjct: 444 WLDEWRDFYYAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENV-------YPESLMP 496

Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + ++ GE +N  T+ CLD+MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556

Query: 177 DKQEKLVEMLK 187
                 V M++
Sbjct: 557 ASSNGPVNMVR 567



 Score = 37.4 bits (85), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 22/40 (55%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
           MGR     +G SYCHG G +Q+F      Q+   + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556


>gi|324520154|gb|ADY47570.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Ascaris suum]
          Length = 286

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 80/149 (53%), Gaps = 10/149 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A  +D+GD++++ ALR+ L C+ F W+++++       +P
Sbjct: 94  RTAEVWMDEYKEFFYKMVPAARSVDVGDLADRKALRENLQCRSFRWYLENI-------YP 146

Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P P  F   G+ KN  T KC+D++GR+A    G + CHG G +Q + L ++G++   E
Sbjct: 147 EAPIPRGFKSIGQIKNPSTTKCVDTLGRSAGEAAGVTVCHGIGGNQAWSLTSDGEVRSDE 206

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKL 200
            C+ AD+     + + + +  C    V +
Sbjct: 207 TCLAADRAADKAKKIDVKLEKCSTTSVNV 235


>gi|224049734|ref|XP_002187605.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
           1 [Taeniopygia guttata]
 gi|449500484|ref|XP_004176221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
           2 [Taeniopygia guttata]
          Length = 601

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/228 (28%), Positives = 86/228 (37%), Gaps = 42/228 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR  L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
              +R     + L H +   C+  +P   ++ M RCD  +  QQW F+
Sbjct: 541 HWSYR---KDKTLFHPVSSSCIDCNPAEKKIFMNRCDPLSETQQWIFE 585


>gi|158300139|ref|XP_320141.4| AGAP012414-PA [Anopheles gambiae str. PEST]
 gi|157013013|gb|EAA00190.4| AGAP012414-PA [Anopheles gambiae str. PEST]
          Length = 596

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 91/198 (45%), Gaps = 34/198 (17%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
           D+GDIS Q+A+R+KL CK F WFM  VA+D+  K+P + PP+   G  +++     C+D+
Sbjct: 418 DVGDISRQLAIREKLQCKPFKWFMTQVAFDLIEKYPPIEPPDFANGAIQSVANAALCVDT 477

Query: 137 MGRTAPAVIGTSYCHGSGS----SQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
           +       IG   C         +Q F+L  + + ++  GE C D  +     ++L    
Sbjct: 478 LNHGEKQTIGLYSCAEDKKQPQPNQFFQLSWHRDLRIKFGELCWDVSESVPNAKIL---- 533

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
                               L++C    G  +  WR+   T+ L+     RCL ++P   
Sbjct: 534 --------------------LYHCHGGQG--NQLWRYEPETQMLKQGKNNRCLDMNPNNR 571

Query: 251 QLAMLRCDENNSYQQWRF 268
           ++ +  C+  N  Q+WR+
Sbjct: 572 EVFVNPCEPTNPRQKWRW 589


>gi|449270895|gb|EMC81541.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
           [Columba livia]
          Length = 440

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/228 (28%), Positives = 86/228 (37%), Gaps = 42/228 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR  L CKDF WFM  VA+DV   +P
Sbjct: 226 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 285

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 286 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 345

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 346 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 379

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
              +R     + L H +   C+  +P   ++ M RCD  +  QQW F+
Sbjct: 380 HWSYR---KDKTLFHPVSGSCIDCNPAEKKIFMNRCDPLSETQQWIFE 424


>gi|324507788|gb|ADY43296.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Ascaris suum]
          Length = 580

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 111/223 (49%), Gaps = 40/223 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    WL  +K +     P A F+D GDISE++ L+++LHCK+F+W++K V  ++  K P
Sbjct: 388 RAAEVWLDDYKMLYLKQVPSARFVDFGDISERLKLKEQLHCKNFTWYLKEVYPEL--KIP 445

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           E    L+    K  G   C+DS+G+ TA + +G   CHG+G +Q +              
Sbjct: 446 EREDGLYL-TFKQAGL--CIDSLGKQTAHSPVGVYSCHGTGGNQEWVF------------ 490

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
              DKQ+     LK P T   ++   +       V++L  C     T DG W  + S  +
Sbjct: 491 ---DKQKG---TLKNPFTKLCMSDSDI------GVVSLQKC----ETADGKWTMNESEGR 534

Query: 234 LEH-----AILKRC-LSVHPETNQLAMLRCDENNSYQQWRFKE 270
           L++     A+++R  +++  +   L ++ CD ++  Q+W F++
Sbjct: 535 LKYKDSCVALIRRTPVNMQKDETVLMLMPCDNSDHRQRWVFEK 577


>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
          Length = 622

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 57/199 (28%), Positives = 90/199 (45%), Gaps = 44/199 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNL--GTQKC 133
           D GD+S + ALR+KL CK F W++ +V       +PEL  P +    GE +N   G++ C
Sbjct: 449 DFGDVSSRKALREKLQCKSFKWYLDNV-------YPELFVPGDAIGKGEIRNRGGGSKNC 501

Query: 134 LD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD  S GR      G  +CH  G +Q + L+ +G++   E C+D                
Sbjct: 502 LDWASHGRQRSVNAGLYWCHKKGGNQYWMLSKDGEIRRDESCID---------------- 545

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
               AGV ++         ++ C    G  +  W++     Q  H +  +CL +  +  +
Sbjct: 546 ---YAGVDVM---------VYPCHGMKGNQE--WKYLLYQSQFLHVVTNKCLEMSRDGTR 591

Query: 252 LAMLRCDENNSYQQWRFKE 270
           L +  CD  N YQQW  +E
Sbjct: 592 LMVNVCDRTNPYQQWLIQE 610


>gi|194865210|ref|XP_001971316.1| GG14889 [Drosophila erecta]
 gi|190653099|gb|EDV50342.1| GG14889 [Drosophila erecta]
          Length = 666

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 103/224 (45%), Gaps = 36/224 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K  + S G  +   +D GD++EQ A+R KL CK F WFM+ VA+D+   +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGVYESVDPGDLTEQKAIRTKLKCKSFKWFMEAVAFDLMKTY 509

Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
           P + PP    G  +N+G Q  CLD+MG+     +G   C         SQ + L+   + 
Sbjct: 510 PPVDPPAYAMGALQNVGNQNLCLDTMGKKKHNRMGMYSCASDIKVPQRSQFWELSWKRDL 569

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +L   + C+D       +     PV                    L++C +  G  +  W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603

Query: 226 RFHNSTRQLEHAI-LKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +    + L+H    +RCL + P + ++ + +CD +N +QQW F
Sbjct: 604 YYDYHKKMLKHGEGSRRCLELLPFSQEVVVNKCDPDNRFQQWNF 647


>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Hydra magnipapillata]
          Length = 641

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 57/213 (26%), Positives = 98/213 (46%), Gaps = 33/213 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           WL  +K++     P +   D GDIS+++ALR+KL+CK F W+++++  DV  + P+L P 
Sbjct: 444 WLDEYKELYYQKRPQSKGKDYGDISQRLALRNKLNCKSFKWYIENIYPDV--QLPDLYPP 501

Query: 120 LFWGEAKNLGTQKCLDSMGR---TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER-CVD 175
              GE KN  +  CLDSMG         +G   CHG G +Q F  + +G++   E  C+D
Sbjct: 502 AR-GEIKNPASSYCLDSMGDMKGNNVKKLGIFPCHGQGGNQNFVFSRKGEIVFDEEYCLD 560

Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLE 235
                                       + G ++++  C N  G  +  W    +T ++ 
Sbjct: 561 VSS------------------------SKPGVLIDIMKCHNFGG--NQQWIHKINTGEIM 594

Query: 236 HAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           H   ++C+     TN+   ++     + Q+W F
Sbjct: 595 HYPTRQCIDRGSNTNKSPQMQPCNGQTSQKWVF 627


>gi|195492881|ref|XP_002094181.1| GE20340 [Drosophila yakuba]
 gi|194180282|gb|EDW93893.1| GE20340 [Drosophila yakuba]
          Length = 666

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 101/224 (45%), Gaps = 36/224 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K  + S G  L   +D GD++EQ A+R KL CK F WFM+ VA+D+   +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLKCKSFKWFMEEVAFDLMKTY 509

Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
           P + PP+   G  +N+G Q  CLD+MGR     +G   C         +Q + L+   + 
Sbjct: 510 PPVDPPSYARGALQNVGNQNLCLDTMGRKKHNKMGMYACANDIKIPQRTQFWELSWKRDL 569

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +L   + C+D       +     PV                    L++C    G  +  W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHMQGG--NQYW 603

Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +    + L H    +RCL + P + ++   +CD +N +QQW F
Sbjct: 604 YYDYRKKLLLHGKEGRRCLELLPFSQEVVANKCDPDNRFQQWNF 647


>gi|71297071|gb|AAH47551.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
          Length = 601

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 64/230 (27%), Positives = 87/230 (37%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +    ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNHAEKKIFMARCDPLSETQQWIFEHI 587


>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
 gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
          Length = 513

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 88/215 (40%), Gaps = 31/215 (14%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K +     P A   D GDIS+++ LR +L CK F W++ ++  DV  + PEL P 
Sbjct: 325 WMDEYKDLYYNKKPQAKNSDYGDISKRLELRKRLKCKSFKWYINNIYPDV--QMPELDPP 382

Query: 120 LFWGEAKNLGTQKCLDSMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 177
              GE +N  + +CLDS+G      A +G   CHG G +Q+ +      +   E C D  
Sbjct: 383 AR-GEVRNPSSNQCLDSLGAKPEHNARVGIYTCHGQGGNQVSKYMPRELIFEEENCFDVS 441

Query: 178 KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
           K                           G  + L  C    G  +  W+       L H 
Sbjct: 442 KTHP------------------------GAPVELMKCHGMRGNQE--WKHDREKGTLMHF 475

Query: 238 ILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
             ++CL     ++Q A++   +    Q+W F   R
Sbjct: 476 TTQQCLDRGSPSDQYAVMNPCDGRESQRWVFSHYR 510


>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
 gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
          Length = 504

 Score = 80.1 bits (196), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 66/124 (53%), Gaps = 15/124 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      PLA  +  GDI +++ LR +L CK F W+++HV         
Sbjct: 317 RAAEVWMDEYKKYYYAAVPLATNIPFGDIDDRLQLRKELQCKPFRWYLEHVY-------- 368

Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
              P L   E +N G+ +    CLDS+G  A AV+G   CHG+G +Q + LN +G+L H 
Sbjct: 369 ---PQLGIPERRNNGSIRQGVYCLDSLGNVAGAVVGLYSCHGNGGNQNWILNRKGELKHH 425

Query: 171 ERCV 174
           + C+
Sbjct: 426 DLCL 429



 Score = 41.6 bits (96), Expect = 0.43,   Method: Compositional matrix adjust.
 Identities = 20/55 (36%), Positives = 34/55 (61%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELR 55
           +G  A AV+G   CHG+G +Q + LN +G+L H + C+   K  IS + +S+ ++
Sbjct: 392 LGNVAGAVVGLYSCHGNGGNQNWILNRKGELKHHDLCLTLVKFTISARYNSVLMK 446


>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Metaseiulus occidentalis]
          Length = 641

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 65/231 (28%), Positives = 101/231 (43%), Gaps = 45/231 (19%)

Query: 42  KQVISTQKSSLELRKRSSWLKVFKQ--IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
           K V   +K+S+ L K   W+  + Q     +G  L    D GDISE+V LR  L+CK F 
Sbjct: 441 KGVNVLKKNSVRLAK--VWMDEYAQYYFDRIGPDLG---DYGDISERVELRRSLNCKSFD 495

Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK--CLDSMGRTAPAVIGTSYCHGSGSSQ 157
           W++K++  D++            GE +N G ++  CLDS   T  A +    CHG G +Q
Sbjct: 496 WYVKNIYPDLFIPGDAAAS----GEVRNSGFERKWCLDSAA-TVHATVSVYPCHGQGGNQ 550

Query: 158 LFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNS 217
            +  +  G++   E C+D                              G  + L++C  S
Sbjct: 551 YWLFSKTGEIRRDELCLDYS----------------------------GGDVVLYSCHGS 582

Query: 218 SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            G  +  WR+   T+QL H   ++CL+++     + M  C    S QQW F
Sbjct: 583 KG--NQYWRYDPMTKQLRHGSSEKCLAINEYKTGVTMETCGP-TSRQQWNF 630


>gi|261244898|ref|NP_778197.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Mus musculus]
 gi|311103009|gb|ADP69005.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 20 [Mus musculus]
          Length = 601

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 88/230 (38%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 446

Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + P    WGE +++    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPLPAAWGEIRSVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C+DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCLDAISHSS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     R L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 LWGYR---EDRTLFHPVSNSCMDCNPSEKKIFMARCDPLSETQQWIFEHI 587


>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           1 [Ciona intestinalis]
          Length = 573

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/216 (26%), Positives = 96/216 (44%), Gaps = 38/216 (17%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + +P  +  + GDISE+V LR+KL CK F W+++++  D      ++P  
Sbjct: 386 WMDSFKNFFYIITPGVLKQEYGDISERVRLREKLQCKSFKWYLENIYPD-----SQIPGE 440

Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
            +  GE +N     CLD+MGR     +G   CH  G +Q++      +L   + C+DA K
Sbjct: 441 YYSLGEIRNEEGGLCLDTMGRKENDKVGIFNCHEMGGNQVWAYTGNQELRCDDICLDASK 500

Query: 179 QEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI 238
                                     +G  + +  C +  G  +  W +   TR L+H  
Sbjct: 501 --------------------------VGGPIMMVKCHHMRG--NQLWEYDEDTRLLKHTN 532

Query: 239 LKRCLSVHP--ETNQLAMLR-CDENNSYQQWRFKEV 271
             +CL   P  +++QL +L  CD     Q+W  K +
Sbjct: 533 THQCLD-RPVDQSSQLPLLTDCDVTKRSQRWLLKNM 567


>gi|170056941|ref|XP_001864259.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167876546|gb|EDS39929.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 606

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 97/198 (48%), Gaps = 34/198 (17%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
           D GD+S+Q+A+R+KL CK F WF++HVA+D+  K+P + PP+   G  +++ +   C+D+
Sbjct: 415 DAGDLSKQLAIREKLQCKPFKWFIEHVAFDLIEKYPPVEPPDFANGAIQSVSSPNLCVDT 474

Query: 137 MGRTAPAVIGTSYCHGSG----SSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
           +       IG   C        ++Q F+L  + + ++ +GE C D  +            
Sbjct: 475 LSHGEKETIGLFGCAADKAQPQANQFFQLSWHRDLRIKYGELCWDVSES----------- 523

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
               +A  K++         L++C    G  +  WR+   ++Q+      RCL +  +  
Sbjct: 524 ----IANAKIL---------LYHCHGGQG--NQLWRYDLDSQQIRQGKNNRCLDMDADKR 568

Query: 251 QLAMLRCDENNSYQQWRF 268
            + +  C E+N  Q+W++
Sbjct: 569 AVFVNPCSESNPNQRWKW 586


>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
 gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
          Length = 545

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 69/131 (52%), Gaps = 12/131 (9%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           LR    WL  +K +    +P       GD+SE+  +R++L CK F WF++++       F
Sbjct: 354 LRVVEVWLDSYKDLFYALNPHMQGEPYGDVSERKRIRERLQCKSFDWFLENI-------F 406

Query: 114 PELP-PNLFW---GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
           PELP P+      GE KNLG  KC+D+MG  AP   G   CHG G +Q+F    +  + +
Sbjct: 407 PELPIPDKNVQGRGELKNLGGNKCMDTMGEHAPYT-GLYSCHGMGGNQVFSYTWKNVISY 465

Query: 170 GERCVDADKQE 180
            ERC+   + +
Sbjct: 466 QERCLAVSRNK 476


>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
 gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
          Length = 561

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 64/225 (28%), Positives = 97/225 (43%), Gaps = 37/225 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P     DMGDIS Q+ LR++L CK F WFMK VA D+   +P
Sbjct: 347 RVAEVWMDEYKEHLYKRRPHLRKTDMGDISGQLQLRERLKCKPFKWFMKVVAPDIILHYP 406

Query: 115 ELPPNL-----FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLN---AEGQ 166
            + P        W +A NL    C+DS              HG G +++ RL+     G 
Sbjct: 407 PVEPEPAASGEIWNKASNL----CIDSK-------------HGGGQAEV-RLDQCVKGGG 448

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
           + +GE+         +    +   TFC  A +K      G  L LF C    G     W 
Sbjct: 449 IMNGEQNFHMSWHNDIRPKGR---TFCFDAQMK------GGTLILFACHQMLGNQH--WL 497

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++   +QL H +   CL  + ++  L +  C+  N  QQW +++ 
Sbjct: 498 YYEEKQQLLHKVTGSCLEGNSDSKSLFLSECNHKNPMQQWSWEKT 542


>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
           [Tribolium castaneum]
 gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
          Length = 602

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 76/136 (55%), Gaps = 10/136 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  ++      +P A  + +GD+S +  LR++L CK F W++++V       +P
Sbjct: 405 RVAEVWMDEWRDFYYAMNPGARSVPVGDVSARRELRERLKCKSFRWYLENV-------YP 457

Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P   ++ G+ +N+ T+ CLD+MGR +   +G +YCH  G +Q+F      Q+   +
Sbjct: 458 ESQMPLEYYYLGDIRNVETKNCLDTMGRKSGENLGMTYCHNLGGNQVFAYTKRQQIMSDD 517

Query: 172 RCVDADKQEKLVEMLK 187
            C+DA  ++  V++++
Sbjct: 518 NCLDASNKKGPVKLVR 533


>gi|125977364|ref|XP_001352715.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
 gi|54641464|gb|EAL30214.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
          Length = 676

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 96/200 (48%), Gaps = 35/200 (17%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLD 135
           +D GD++EQ+A+R KL CK F WFM+ VA+D+   +P + PP    G  +N+G ++ C+D
Sbjct: 483 IDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFALGAIQNVGDKRLCID 542

Query: 136 SMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLP 189
           +MGR     +G   C         +Q + L+   + +L   + C+D       +  +  P
Sbjct: 543 TMGRRKHKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKECLDVQ-----IWTVNAP 597

Query: 190 VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPE 248
           V                    L++C    G  +  W +   T+ ++H    +RCL + P 
Sbjct: 598 VW-------------------LWDCHLQGG--NQYWSYDYHTQTIKHGRDGRRCLELLPF 636

Query: 249 TNQLAMLRCDENNSYQQWRF 268
             +L + +C+ +N++ +W F
Sbjct: 637 AQELVVNKCNTSNTFMKWNF 656


>gi|195167889|ref|XP_002024765.1| GL22638 [Drosophila persimilis]
 gi|194108170|gb|EDW30213.1| GL22638 [Drosophila persimilis]
          Length = 676

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 96/200 (48%), Gaps = 35/200 (17%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLD 135
           +D GD++EQ+A+R KL CK F WFM+ VA+D+   +P + PP    G  +N+G ++ C+D
Sbjct: 483 IDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFALGAIQNVGDKRLCID 542

Query: 136 SMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLP 189
           +MGR     +G   C         +Q + L+   + +L   + C+D       +  +  P
Sbjct: 543 TMGRRKHKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKECLDVQ-----IWTVNAP 597

Query: 190 VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPE 248
           V                    L++C    G  +  W +   T+ ++H    +RCL + P 
Sbjct: 598 VW-------------------LWDCHLQGG--NQYWSYDYHTQTIKHGRDGRRCLELLPF 636

Query: 249 TNQLAMLRCDENNSYQQWRF 268
             +L + +C+ +N++ +W F
Sbjct: 637 AQELVVNKCNTSNTFMKWNF 656


>gi|344268426|ref|XP_003406061.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
           [Loxodonta africana]
          Length = 560

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 75/138 (54%), Gaps = 9/138 (6%)

Query: 55  RKRSSWLKV-----FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDV 109
           +++  W +V      K    + SP  + +D GD+S +  LR+ L CK FSW+++++  D 
Sbjct: 366 KQQEDWQEVWDGMNLKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD- 424

Query: 110 YTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
            ++ P    +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++  
Sbjct: 425 -SQIPRRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRT 481

Query: 170 GERCVDADKQEKLVEMLK 187
            + C+D  +    V MLK
Sbjct: 482 DDLCLDVSRLNGPVIMLK 499


>gi|51490860|emb|CAD44535.1| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
          Length = 170

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/108 (37%), Positives = 61/108 (56%), Gaps = 6/108 (5%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQKCLDSMGR 139
           GDIS +V LR KL CK FSW+++++  D      ++P + F  GE +N+ T +CLD+M R
Sbjct: 5   GDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRHYFSLGEIRNVETNQCLDNMAR 59

Query: 140 TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
                +G   CHG G +Q+F   A  ++   + C+D  K    V MLK
Sbjct: 60  KENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLK 107


>gi|47225457|emb|CAG11940.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 534

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 6/111 (5%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQKCLDS 136
           +D GDIS +  LR KL CK FSW+++++  D      ++P + +  GE +N+ T +CLD+
Sbjct: 366 VDYGDISSRTTLRQKLQCKPFSWYLENIYPD-----SQIPRHYYSLGEIRNVETNQCLDN 420

Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
           M R     +G   CHG G +Q+F   A  ++   + C+D  K    V MLK
Sbjct: 421 MARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMMLK 471


>gi|170039452|ref|XP_001847548.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167863025|gb|EDS26408.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 606

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 96/198 (48%), Gaps = 34/198 (17%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
           D GD+S+Q+A+R+KL CK F WF++HVA+D+  K+P + PP+   G  ++  +   C+D+
Sbjct: 415 DAGDLSKQLAIREKLQCKPFKWFIEHVAFDLIEKYPPVEPPDFANGAIQSASSPNLCVDT 474

Query: 137 MGRTAPAVIGTSYCHGSG----SSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
           +       IG   C        ++Q F+L  + + ++ +GE C D  +            
Sbjct: 475 LSHGEKETIGLFGCAADKAQPQANQFFQLSWHRDLRIKYGELCWDVSES----------- 523

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
               +A  K++         L++C    G  +  WR+   ++Q+      RCL +  +  
Sbjct: 524 ----IANAKIL---------LYHCHGGQG--NQLWRYDLDSQQIRQGKNNRCLDMDADKR 568

Query: 251 QLAMLRCDENNSYQQWRF 268
            + +  C E+N  Q+W++
Sbjct: 569 AVFVNPCSESNPNQRWKW 586


>gi|389610347|dbj|BAM18785.1| polypeptide GalNAc transferase 2 [Papilio xuthus]
          Length = 256

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 72/135 (53%), Gaps = 11/135 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K++     PLA  +D GDISE+V LR+KLHCK F W++K+V       +P
Sbjct: 79  RAAEVWMDDYKELYYRSQPLAKQVDFGDISERVRLREKLHCKPFRWYLKNV-------YP 131

Query: 115 ELP-PNLFWGEAKNLGTQ--KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           EL  P L      ++  Q  +CLD+MG      +G   CH +G +Q +R +  G + H  
Sbjct: 132 ELQVPTLENSGDGHVIRQGPRCLDTMGHLVDGTLGMYPCHNTGGNQEWRFD-NGLIRHHS 190

Query: 172 RCVDADKQEKLVEML 186
            C+    ++++  +L
Sbjct: 191 LCISLSPEDRVTAVL 205


>gi|327278031|ref|XP_003223766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like [Anolis carolinensis]
          Length = 602

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 86/230 (37%), Gaps = 42/230 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GD+S Q  LR  L CKDF WFM  VA+DV   FP
Sbjct: 387 RVAETWMDEFAEYVYQRRPEYRHLSTGDLSAQKELRKHLKCKDFKWFMAAVAWDVPKYFP 446

Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + P    WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 447 PVEPLPASWGEIRNVAANLCVDSKHGGTGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506

Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             GE       C DA            PVT                   L++C    G  
Sbjct: 507 RPGEPLHTRKFCFDAVSHSS-------PVT-------------------LYDCHGMKGNQ 540

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              +R     + L H +   C+  +P   ++ M RCD  +  QQW F+ +
Sbjct: 541 HWSYR---KDKTLFHPVSSSCIDCNPAEKKIFMNRCDPLSETQQWIFEHI 587


>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
          Length = 604

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 102/229 (44%), Gaps = 41/229 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K++     P    +  GDIS+++ LR KL+CK F W++ +V  D+    P
Sbjct: 405 RVAEVWMDEYKELYYGVRPHHRAIGTGDISDRLELRKKLNCKPFKWYLDNVFPDMMVPLP 464

Query: 115 ELPPNLFW-GEAKNLGTQKCLDSM-GRTAPAVIGTSYC-HGSGSSQLFRLNAE-GQL--- 167
           E   NL   G  KN  T  CLDS+  R      G   C +G   +Q+F    + G++   
Sbjct: 465 E---NLLGKGAVKNAATNMCLDSLSSREVDMKAGLYPCANGKSENQMFYFTTKYGEIRRE 521

Query: 168 -GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
              G RC+D                    AG K      G+ L+++ C    G  +  W+
Sbjct: 522 GTFGARCLD-------------------FAGGK-----PGSTLSMYGCHLMKGNQE--WK 555

Query: 227 FHNSTRQLEHAILKRCL--SVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
              S +Q+ HA  K CL  +V+    +L +  CD NN+ Q+W F E  P
Sbjct: 556 --RSGKQIVHAASKLCLEAAVNGNDRKLIVNTCDSNNANQEWTFTEYDP 602


>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Bombus impatiens]
          Length = 602

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 86/199 (43%), Gaps = 42/199 (21%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
           GD+SE+ ALR KL CK F W++ +V  +++     +      GE +NLG     CLDS  
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492

Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
           R A    PA  G   CH  G +Q + L+  G++   E C+D                   
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQYWMLSKTGEIRRDESCLDYS----------------- 533

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
                      GT + L+ C  S G     W +++ T+Q+ H    +CL++     +L M
Sbjct: 534 -----------GTDVILYPCHGSKGNQQ--WIYNHQTKQIRHGSSDKCLAITESKQRLIM 580

Query: 255 LRCDENNSYQQWRFKEVRP 273
             C    + Q+W F+   P
Sbjct: 581 EECSAAAARQRWSFENYDP 599


>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Bombus terrestris]
 gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Bombus terrestris]
          Length = 602

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 86/199 (43%), Gaps = 42/199 (21%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
           GD+SE+ ALR KL CK F W++ +V  +++     +      GE +NLG     CLDS  
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492

Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
           R A    PA  G   CH  G +Q + L+  G++   E C+D                   
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQYWMLSKTGEIRRDESCLDYS----------------- 533

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
                      GT + L+ C  S G     W +++ T+Q+ H    +CL++     +L M
Sbjct: 534 -----------GTDVILYPCHGSKGNQQ--WIYNHQTKQIRHGSSDKCLAITESKQRLIM 580

Query: 255 LRCDENNSYQQWRFKEVRP 273
             C    + Q+W F+   P
Sbjct: 581 EECSAAAARQRWSFENYDP 599


>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
 gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
          Length = 685

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 33/211 (15%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
           D GD+S +  LR  L CK F W++ ++       +PEL     ++  GE +N   Q  +C
Sbjct: 474 DFGDVSARKKLRSDLGCKSFKWYLDNI-------YPELFVPGESVAKGEVRNSAVQPARC 526

Query: 134 LDSM-GR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD M GR      +GT  CHG G +QL   NA+       +C+D+   +++      P  
Sbjct: 527 LDCMVGRHEKNRPVGTYQCHGQGGNQL--RNAQTS-----QCLDSAVGDEVENKAITPYP 579

Query: 192 FCVLAGVKL-IFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAIL 239
                G +  +  + G +    +C + +GT            +  WR+++ T +L+HA+ 
Sbjct: 580 CHEQGGNQYWMLSKDGEIRRDESCVDYAGTDVMVFPCHGMKGNQEWRYNHDTGRLQHAVS 639

Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           ++CL +  +  +L M+ C  ++ YQ W+FKE
Sbjct: 640 QKCLGMTKDGAKLEMVACHYDDPYQHWKFKE 670


>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
 gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
          Length = 481

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 46/117 (39%), Positives = 62/117 (52%), Gaps = 3/117 (2%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K I     P       GDIS+++ LR KL CK F W++K+V  DV  ++P++   
Sbjct: 332 WMDNYKDIFYNRFPNLRQHSYGDISKRIELRKKLKCKSFDWYLKNVFTDV--QYPDMI-F 388

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
           L  GE +N  T  CLDSMG    A IG   CHG G +QL       +L   E C+DA
Sbjct: 389 LAKGELRNPSTGYCLDSMGNKEYADIGIYPCHGQGGNQLLTYTIRKELEMDEVCLDA 445


>gi|355689619|gb|AER98893.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mustela putorius
           furo]
          Length = 256

 Score = 77.8 bits (190), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 64/224 (28%), Positives = 103/224 (45%), Gaps = 38/224 (16%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC +FSWF+ ++  +
Sbjct: 63  QVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCHNFSWFLNNIYPE 122

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F+G  +NLGT +CLD +G       P ++ T  CHG G +Q F    +
Sbjct: 123 MFV--PDLKPT-FYGAIRNLGTNQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 176

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
             L H                  +    C+ AG       LG     F   NS    D  
Sbjct: 177 RDLRH-----------------NIAKQLCLHAGAST----LGLRSCHFTGKNSQVPKDEE 215

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           W      R + +     CL+   E  + AM  C+ ++ +QQW F
Sbjct: 216 WEL-TQDRLIRNLGSGTCLT--SEDKKPAMAPCNPSDPHQQWLF 256


>gi|358341053|dbj|GAA48824.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
          Length = 424

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 63/218 (28%), Positives = 97/218 (44%), Gaps = 34/218 (15%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+   K      SP A+  D GDISE+ ALR +L CK F W++++V  DV + F
Sbjct: 222 MRTALVWMDDHKLFYLKSSPDAVHTDYGDISERQALRKRLRCKSFRWYLENV--DVESVF 279

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P     +  GE ++  +  CLD++G+     +G S CHG G +QLF    +G++     C
Sbjct: 280 PVDFHGI--GEIRHESSGLCLDTLGQKQHGPVGLSSCHGQGGNQLFVWTTKGEIQAEVGC 337

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           V          + K     C+         RL T               GP  F    R+
Sbjct: 338 VSPTDDGDTPLLFKP----CL---------RLDT---------------GPQLFDYQDRK 369

Query: 234 LEHAILKRCLSV-HPETNQLAMLRCDENNSYQQWRFKE 270
           L H    +CL+V + ++ QL +  C++  S+  W   E
Sbjct: 370 LIHRATAKCLTVSNLKSAQLTLRPCEDTPSF-HWILPE 406


>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Metaseiulus occidentalis]
          Length = 583

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 92/213 (43%), Gaps = 42/213 (19%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K      +P+A  +D GD S +  LR  L CK F W+++++       +PE  +P
Sbjct: 393 WLDEWKDFYFAINPVAKKVDRGDTSGRHKLRQDLQCKSFRWYLENI-------YPESHMP 445

Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + +  GE KN     CLD+ G+ +  V+    CHG G +Q+F      Q+   + C+DA
Sbjct: 446 LDYYHLGEIKNADGNLCLDTYGKKSGDVLYMGKCHGLGGNQVFAYTKRQQIMADDSCLDA 505

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
                 V+                          LF C N  G  +  W +  +T+Q+ H
Sbjct: 506 SSPSGPVK--------------------------LFRCHNMGG--NQMWTYDGNTKQIRH 537

Query: 237 AILKRCLSVHPETNQ--LAMLR-CDENNSYQQW 266
               RCL   P  N+  +A L+ C+     Q+W
Sbjct: 538 INTNRCLD-KPNANRSNVATLKECNSGKESQKW 569


>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
          Length = 586

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 87/195 (44%), Gaps = 38/195 (19%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
           D GDI+ + ALR+KL CK F W++ ++  +++     +      GE +NLG   + CLDS
Sbjct: 419 DFGDITSRKALREKLGCKSFKWYLDNIYPELFIPGEAVAS----GEIRNLGIGGKTCLDS 474

Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
             R +     +G   CH  G +Q +  +  G++   E C+D   QE +            
Sbjct: 475 PARRSDLHKPVGLYPCHRQGGNQFWMYSKSGEIRRDEACLDYSGQEVI------------ 522

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
                           L+ C  S G  +  W ++ +++ L H    +CL+++   N+L M
Sbjct: 523 ----------------LYPCHGSKG--NQFWDYNANSKLLRHGSSDKCLAINEAKNKLLM 564

Query: 255 LRCDENNSYQQWRFK 269
             CDE  + Q W  +
Sbjct: 565 EPCDEEATRQHWSLE 579


>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
          Length = 671

 Score = 76.6 bits (187), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 55/202 (27%), Positives = 90/202 (44%), Gaps = 44/202 (21%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNL--GTQKC 133
           D GD+S +  LR++LHCK F W++ ++       +PEL  P + +  GE +N+  G + C
Sbjct: 504 DFGDVSSRKKLREELHCKPFRWYLDNI-------YPELFVPGDAVASGEVRNMGYGNRTC 556

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD+    R     +G   CH  G +Q + L+  G++   E C+D                
Sbjct: 557 LDAPAGKRNLRKAVGLYPCHNQGGNQYWMLSKTGEIRRDEACLD---------------- 600

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
               AG  ++         L+ C  S G  +  W + + T  L H    RCL+++   N+
Sbjct: 601 ---YAGDDVV---------LYPCHGSRG--NQYWNYSDDTHLLRHGSSDRCLAINEAKNK 646

Query: 252 LAMLRCDENNSYQQWRFKEVRP 273
           L M  C+     Q+W F+   P
Sbjct: 647 LIMQDCNPLVEAQRWSFQNYDP 668


>gi|357629476|gb|EHJ78219.1| hypothetical protein KGM_03405 [Danaus plexippus]
          Length = 353

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 67/134 (50%), Gaps = 8/134 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K++     PLA  +D GDISE+V++R +LHCK F W+++HV       +P
Sbjct: 179 RAAEVWMDDYKELYYRSQPLAKQVDFGDISERVSIRQRLHCKPFRWYLEHV-------YP 231

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           EL    F          +CLD+MG      +    CH +G +Q +  +  G + H   C+
Sbjct: 232 ELRVPTFGNSIAIKQGPRCLDTMGHQVDGTVAMYPCHNTGGNQEWSFD-NGLIRHQSLCL 290

Query: 175 DADKQEKLVEMLKL 188
              +++ +  +L +
Sbjct: 291 GLSQEDSVTVVLAV 304


>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 675

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 54/215 (25%), Positives = 95/215 (44%), Gaps = 26/215 (12%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD + Q  LR +L+CK F WFM  VA+D+   +P + PP
Sbjct: 469 WMDEYAEYIYQRRPEYRHLSAGDTAAQKDLRSQLNCKSFKWFMTKVAWDLSKHYPPVEPP 528

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG--HGERCVDA 176
              WGE +N+G+  CL++    + + +    C          L   G++G  HG+     
Sbjct: 529 AAAWGEIRNVGSSMCLETKHFVSGSPVWMESC----------LKGRGEVGWNHGQVFTFG 578

Query: 177 DKQEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
            +++  V   M    V F  ++          + + L++C    G  +  WR+    + L
Sbjct: 579 WREDIRVGDPMHTKKVCFDAVSN--------NSPVTLYDCHGMKG--NQLWRYRKD-KSL 627

Query: 235 EHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            H +   C+   P+  ++ M  CD ++  QQW F+
Sbjct: 628 YHPVSNSCVDSSPKEGRVFMNTCDPSSPNQQWVFE 662


>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Oreochromis niloticus]
          Length = 587

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 52/215 (24%), Positives = 95/215 (44%), Gaps = 22/215 (10%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR++L+CK+F WFM  VA+D+   +P + PP
Sbjct: 379 WMDEYAEYIYQRRPEYRHLSAGDMTVQKELRNRLNCKNFKWFMSEVAWDLPKHYPPVEPP 438

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
              WGE +N+G+  C++S    + + I    C         +   +    HG+      +
Sbjct: 439 AAAWGEIRNVGSSMCMESKHFVSGSPIRLENC--------VKGRGDVSWSHGQVFTFGWR 490

Query: 179 QEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
           ++  V   M    V F  ++          + + L++C    G  +  WR+    + L H
Sbjct: 491 EDIRVGDPMHTKKVCFDAISH--------NSPVTLYDCHGMKG--NQLWRYRKD-KSLYH 539

Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            +   C+   P   ++ M  CD ++  QQW F++ 
Sbjct: 540 PVSNSCIDSSPNERRVFMNTCDPSSPTQQWLFEKT 574


>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
          Length = 624

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 67/131 (51%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           W+  +K      +P       GD+SE+ ALR+KLHCK F W++++V       +PE  +P
Sbjct: 432 WMDEYKDFFYKINPGVRSTSYGDVSERKALREKLHCKSFKWYLQNV-------YPESQMP 484

Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
                 GE +N  T +C+DSMGR +   +G   CHG G +Q+F    +  L   + C+D 
Sbjct: 485 VEYHALGEIRNKATGQCIDSMGRKSGEKVGMVQCHGMGGNQIFSYTKKQALQTDDVCLDV 544

Query: 177 DKQEKLVEMLK 187
                 V++ +
Sbjct: 545 SSLHGPVKLFQ 555


>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
          Length = 669

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 15/124 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      PLA  +  GDI +++ LR++L CK F W++++V         
Sbjct: 480 RAAEVWMDEYKRYYYAAVPLATNIPFGDIEDRLRLREELQCKPFRWYLENVY-------- 531

Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
              P L   E +N G+ +    CLDS+G  A A++G   CHG+G +Q + LN +G++ H 
Sbjct: 532 ---PQLSVPERRNNGSIRQGAFCLDSLGNVAGAIVGLYSCHGNGGNQNWILNRKGEVKHH 588

Query: 171 ERCV 174
           + C+
Sbjct: 589 DLCL 592


>gi|405973911|gb|EKC38600.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Crassostrea gigas]
          Length = 581

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/217 (27%), Positives = 88/217 (40%), Gaps = 36/217 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A  ++ GDISE++ LR +L CK F WF++HV       +P
Sbjct: 392 RAAEVWMDNYKEFYYAAVPSAKMVNFGDISERMDLRKRLSCKPFKWFLEHV-------YP 444

Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P +             C+D++G  A  ++G   CH +G +Q F L  EG + H + 
Sbjct: 445 ELKVPGHQDQAFGSIQQDNNCMDTLGNFADGILGIFPCHFAGGNQEFSLTKEGFIRHLDL 504

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C                        V L     GTV+ LF C    G     W+  +   
Sbjct: 505 C------------------------VTLTGSMPGTVVKLFQC--QEGNTLQMWQRTSRDT 538

Query: 233 QLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            L+      CL S   +   L +  C E +  Q+W F
Sbjct: 539 MLKSHNYDLCLDSQEVQMKGLIVNACQEGSPTQKWSF 575


>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
          Length = 584

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 66/123 (53%), Gaps = 10/123 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K I     P A  +D+GD++E+  LR+ L CK F W+++       T +P
Sbjct: 398 RTAEVWMDEYKDIFYSMVPAARNVDVGDLTERKILRENLQCKSFRWYLE-------TIYP 450

Query: 115 E--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P + F  G+ +N+G  +CLD+ GR+A        CHG G +QL+    +G++   E
Sbjct: 451 ESPIPIDFFSLGQVQNMGVMECLDTAGRSAGDSPAMLPCHGQGGNQLWTYTGKGEIRSDE 510

Query: 172 RCV 174
            C+
Sbjct: 511 LCL 513


>gi|350593489|ref|XP_003133459.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Sus scrofa]
          Length = 321

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/110 (35%), Positives = 64/110 (58%), Gaps = 4/110 (3%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSM 137
           +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +L  GE +N+ T +CLD+M
Sbjct: 155 VDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYSL--GEIRNVETNQCLDNM 210

Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
           GR     +G   CHG G +Q+F   A+ ++   + C+D  +    V MLK
Sbjct: 211 GRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLK 260


>gi|312372346|gb|EFR20327.1| hypothetical protein AND_20267 [Anopheles darlingi]
          Length = 616

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/201 (26%), Positives = 92/201 (45%), Gaps = 37/201 (18%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
           D+GDIS+Q+A+R+KL CK F WFM HVA+D+  K+P + PP+   G  ++      C+D+
Sbjct: 422 DVGDISKQIAIREKLQCKPFKWFMTHVAFDLMEKYPPIEPPDFANGAIQSAARPALCVDT 481

Query: 137 MGRTAPAVIGTSYC----HGSGSSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
           +       IG   C         +Q F+L  + + ++  GE C D  +     ++L    
Sbjct: 482 LNHGEKQTIGLYSCAEDKRQPQPNQFFQLSWHRDLRIKFGELCWDVSESVPNAKIL---- 537

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHP--- 247
                               L++C    G  +  WR+   ++ L+     RCL + P   
Sbjct: 538 --------------------LYHCHGGQG--NQLWRYDPDSQMLKQGKNNRCLDMDPSLG 575

Query: 248 ETNQLAMLRCDENNSYQQWRF 268
           +  Q+ +  C+  N +Q+W +
Sbjct: 576 DGRQVFVNPCNLTNPHQRWHW 596


>gi|209875919|ref|XP_002139402.1| glycosyl transferase  [Cryptosporidium muris RN66]
 gi|209555008|gb|EEA05053.1| glycosyl transferase, group 2 family protein [Cryptosporidium muris
           RN66]
          Length = 743

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 65/230 (28%), Positives = 105/230 (45%), Gaps = 41/230 (17%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           LR   +WL  F       SP    +D+GD +E + L+ +L CK F WF+ +VA + Y K 
Sbjct: 540 LRTARAWLDEFYIFTKALSP-NKSIDLGDFNEVIQLKQRLQCKPFKWFLDNVAPETYLK- 597

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
            ++    + GE KN     C+D+M ++ + + IG  +CHG   +Q F             
Sbjct: 598 -DINEVQYLGEIKNPNLNMCIDTMSKSESGSKIGVFFCHGEKGTQGF------------- 643

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
            +     EK+  M +   +FC+              L + NC      +   W+++ +T 
Sbjct: 644 -IFLKGSEKIRLMSR--ESFCI-----------SDRLTMENC----DEITTRWKYNENTL 685

Query: 233 QLEHAILK---RCLSVHPETNQLAMLRCDENNSY--QQWRF-KEVRPDWS 276
            L H  L    +CL+V P +N+   L+  + N Y  QQW F K VR ++S
Sbjct: 686 ALTHKGLDNSTKCLTVLPNSNKDLNLQLLDCNYYKEQQWIFEKFVRSEYS 735


>gi|339239855|ref|XP_003375853.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
 gi|316975462|gb|EFV58902.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
           spiralis]
          Length = 625

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 91/200 (45%), Gaps = 44/200 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGTQK--C 133
           + GD+S+++ALR +L+C  F W++K+V       +PEL  P N +  GE + +G  K  C
Sbjct: 459 EFGDVSDRIALRQRLNCSSFEWYIKNV-------YPELFVPGNSIAKGEIRCMGQNKRHC 511

Query: 134 LD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD + GR       + Y CHG G +Q + L+  G++   E CVD                
Sbjct: 512 LDFASGRKEHNKPISMYPCHGEGGNQYWMLSPTGEIRRDESCVD---------------- 555

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
               AG K+          L  C    G  +  W+++   + L H I   CL ++ +   
Sbjct: 556 ---YAGQKVF---------LSGCHGLKGNQE--WKYNFKDKTLTHVISGLCLEMNHDATG 601

Query: 252 LAMLRCDENNSYQQWRFKEV 271
           + M  CD  N  QQW F E+
Sbjct: 602 VIMASCDSQNLNQQWTFGEI 621


>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
 gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
           malayi]
          Length = 584

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 66/123 (53%), Gaps = 10/123 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K I     P A  +D+GD++E+  LR+ L CK F W+++       T +P
Sbjct: 398 RTAEVWMDEYKDIFYGMVPAAKNVDVGDLTERKILRENLQCKSFRWYLE-------TIYP 450

Query: 115 E--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E  +P + F  G+ +N+G  +CLD+ GR+A        CHG G +QL+    +G++   E
Sbjct: 451 ESPIPIDFFSLGQVQNMGVMECLDTAGRSAGDSPAMLPCHGKGGNQLWTYTGKGEIRSDE 510

Query: 172 RCV 174
            C+
Sbjct: 511 LCL 513


>gi|326428492|gb|EGD74062.1| polypeptide N-acetylgalactosaminyltransferase 1 [Salpingoeca sp.
           ATCC 50818]
          Length = 531

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 71/125 (56%), Gaps = 8/125 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMF--LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           LR    WL  +K  +  G   + F  +D GD+S Q+ALR +L CK FSW+++ V   ++ 
Sbjct: 348 LRIVEVWLDEYKP-QFYGHRASHFTGVDFGDVSAQIALRSRLGCKPFSWYVQTVYPQLHV 406

Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
             PE   NL WGE +  GT  CLD+MG      +GT  CHG G +Q + L A+G+L HG 
Sbjct: 407 --PEHH-NLAWGELR-CGT-VCLDTMGHGDGEEVGTFGCHGLGGNQYWVLTAQGELMHGV 461

Query: 172 RCVDA 176
            C+ A
Sbjct: 462 LCLAA 466



 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 20/53 (37%), Positives = 29/53 (54%)

Query: 1   MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLE 53
           MG      +GT  CHG G +Q + L A+G+L HG  C+ A    +S  K +L+
Sbjct: 427 MGHGDGEEVGTFGCHGLGGNQYWVLTAQGELMHGVLCLAATPTGVSLAKCALD 479


>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           17 [Ciona intestinalis]
          Length = 870

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 91/218 (41%), Gaps = 33/218 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    +  GD+S+Q ALR+ L C+ F WFMK VA D+   +P
Sbjct: 659 RVAEVWMDEYAEYFYKKRPHVRGIHPGDLSKQKALRELLECRSFDWFMKEVAPDIIKHYP 718

Query: 115 E-LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSS-QLFRLNAEGQLGHGER 172
             +P    WG   N G+++CLD + +   A +    C   G++ Q F L  +  +  G  
Sbjct: 719 PVMPEPAAWGMLSNEGSKRCLDGLYKKEGAPLSLMPCREEGTADQSFILTWKEDIRPG-- 776

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLN----LFNCFNSSGTVDGPWRFH 228
               DK  K                    FC  G  LN    L+ C    G  +  W++ 
Sbjct: 777 -TSMDKARK--------------------FCLDGQGLNSPVVLWQCHGQYG--NQLWKYK 813

Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           + T  L H I   C +   E  ++ +L C  N+  Q+W
Sbjct: 814 DDT--LIHGITGHCATSDAEAKKVILLPCKANDINQRW 849


>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           2, partial [Ciona intestinalis]
          Length = 582

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/219 (26%), Positives = 95/219 (43%), Gaps = 41/219 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A  +  G+I  ++ +R +  CK F W++++V       +P
Sbjct: 393 RAAEVWMDDYKEYYYAAVPSAKLIPFGNIENRLQIRVRNQCKPFKWYLENV-------YP 445

Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G  K  G  KC+D++G      IG   CH SG +Q F +N E Q+ H 
Sbjct: 446 ELRVPSKESVAFGSIKQ-GVNKCIDTLGHVQEGSIGLYECHDSGGNQEFSMNKEMQIRHQ 504

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
           + C  A +                         R G+++ L +C + + T+    +F  S
Sbjct: 505 DLCFTAGEG-----------------------AREGSIIKLRHC-DENNTMQ---KFEQS 537

Query: 231 TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
             QL       C+ S H     L + +CD + S QQWRF
Sbjct: 538 KSQLRLYQSSYCIDSRHERDMGLILSKCDHSRS-QQWRF 575


>gi|195114158|ref|XP_002001634.1| GI15842 [Drosophila mojavensis]
 gi|193912209|gb|EDW11076.1| GI15842 [Drosophila mojavensis]
          Length = 628

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 38/120 (31%), Positives = 64/120 (53%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KL CK F W+++HV  D+ T  P
Sbjct: 444 RAAEVWMDEYKQHYYNAVPLAKNIPFGNIDDRLALKEKLQCKPFKWYLEHVYPDLQTPDP 503

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +  +  G++ H E C+
Sbjct: 504 QDV-----GQFRQDATE-CLDTMGHIVDGTVGLFPCHNTGGNQEWTFSKRGEIKHDELCL 557


>gi|167534921|ref|XP_001749135.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772288|gb|EDQ85941.1| predicted protein [Monosiga brevicollis MX1]
          Length = 576

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/229 (27%), Positives = 103/229 (44%), Gaps = 48/229 (20%)

Query: 53  ELRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD-V 109
           +LR  + W+  +  I   +LG        MGDIS+ + LRD+L CK+F W++K+V  + +
Sbjct: 385 KLRTAAVWMDDYAFIVHEALGIKPEQLEQMGDISDMLELRDRLQCKNFDWYLKNVYPESI 444

Query: 110 YTKFPELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEGQL- 167
            T   ++      G  KN  + +CLD++ ++ A + +G   CHG GS +   L+   ++ 
Sbjct: 445 ITDRADIKA---LGMIKNPASNQCLDTLQKSWADSPVGVYGCHGQGSQRFLTLSKTHEIR 501

Query: 168 --GHGERCVDADKQEKLVEMLKLPVTFCVLA-GVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
              + E C+ +          +L +T+C  +   K  F   G V N  N           
Sbjct: 502 PVANLELCLTS----------RLTITWCEASRDAKFEFTPAGLVQNKHN----------- 540

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
                           RCL V  ET +LA   CDEN++ Q+W F +  P
Sbjct: 541 ---------------GRCLRVD-ETPKLAFADCDENDARQKWIFPDESP 573


>gi|6624102|gb|AAF19246.1|AC009227_1 similar to polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]; similar to JC4223 (PID:g2135942), partial
           [Homo sapiens]
          Length = 166

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/109 (35%), Positives = 63/109 (57%), Gaps = 4/109 (3%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
           D GD+S +  LR+ L CK FSW+++++  D  ++ P    +L  GE +N+ T +CLD+MG
Sbjct: 1   DYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYSL--GEIRNVETNQCLDNMG 56

Query: 139 RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
           R     +G   CHG G +Q+F   A+ ++   + C+D  +    V MLK
Sbjct: 57  RKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLK 105


>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Callithrix jacchus]
          Length = 622

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/124 (37%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  FK+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSFKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|345488662|ref|XP_003425959.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Nasonia vitripennis]
          Length = 572

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/215 (28%), Positives = 92/215 (42%), Gaps = 34/215 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ KLHCK FSW++KHV  ++     
Sbjct: 386 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKHVYPELIIPTS 445

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E  P    G +   GT  CLDSMG      +G   CH +G +Q + +  +G + H + C 
Sbjct: 446 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGMTNDGLIKHHDLC- 499

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
                      L LPV               GT L +  C    G+ +  WR H     +
Sbjct: 500 -----------LTLPVY------------AKGTSLLMQIC---DGSENQKWR-HLEGGLI 532

Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            HA +  C+ S +     +   +CD N   Q+W  
Sbjct: 533 RHARIPVCIDSRYHAQRGVTAEKCDSNAESQRWHL 567


>gi|170589103|ref|XP_001899313.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158593526|gb|EDP32121.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 636

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 60/251 (23%), Positives = 98/251 (39%), Gaps = 35/251 (13%)

Query: 30  QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
           ++GH  RC  +      I    S    R    W+  + +      P  + +D GD+SEQ 
Sbjct: 399 RVGHIYRCKYIPFSNPGIGDFISRNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSEQK 458

Query: 88  ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
           A+R +L CK F WFMK +A+D    +P + P +   GE +N+   KC+D+  + A    G
Sbjct: 459 AIRKRLKCKSFDWFMKKIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTDFKNAEKKFG 518

Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKL 200
              C     +     N      H  R      C D       V + K P+          
Sbjct: 519 LRKCVSDDPTNHGEQNLRLTFWHDVRLQTRTMCFDVS-----VSVNKAPIV--------- 564

Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
                     LF+C    G  +  +++    +Q+ H I   C+    E  ++ M  CD  
Sbjct: 565 ----------LFSCHGMQG--NQHFKYLPDKQQMYHPISDLCMDCDVEQGEIFMNPCDSE 612

Query: 261 NSYQQWRFKEV 271
              Q+W + ++
Sbjct: 613 RRSQKWTWTKI 623


>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
 gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
          Length = 584

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 96/220 (43%), Gaps = 46/220 (20%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K I     P A  +D+GD++E+  LR+ L CK F W+++       T +P
Sbjct: 398 RTAEVWMDEYKDIFYKMVPAAKNVDIGDLTERKVLRENLQCKSFRWYLE-------TIYP 450

Query: 115 ELPPN---LFWGEAKNLGTQKCLDSMGRTA---PAVIGTSYCHGSGSSQLFRLNAEGQLG 168
           E P     L  G+ +N+G   CLD+ GR+A   PA++    CHG G +QL+    +G++ 
Sbjct: 451 ESPIPIDFLSLGQIQNMGVVGCLDTAGRSAGDSPAILP---CHGKGGNQLWAYTGKGEIR 507

Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
             E C                + F V  G+ +  C    +L+              + + 
Sbjct: 508 ADELC----------------LAFTV-KGISMEKCADWVLLSKVT-----------FEYV 539

Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             T  L+H     CL  +  T QL+   C  N+  Q+WR 
Sbjct: 540 KKTLMLKHRETGLCLQANESTLQLST--CLHNDMNQKWRL 577


>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Saimiri boliviensis boliviensis]
 gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Saimiri boliviensis boliviensis]
          Length = 622

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 13/135 (9%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLKLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG---ERCVDADK 178
           L H    + C+ A K
Sbjct: 544 LRHNIAKQLCLHASK 558


>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
 gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
          Length = 670

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/195 (28%), Positives = 87/195 (44%), Gaps = 44/195 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLGT--QKC 133
           D GD+SE+  LR+ L CK F W++ +V       +PEL  P +    GE +NLG   + C
Sbjct: 503 DFGDVSERKKLREDLQCKSFKWYLDNV-------YPELFIPGDAVANGEMRNLGYGGRTC 555

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LDS    R     +G   CH  G +Q +  +  G++   + C+D                
Sbjct: 556 LDSPSGKRYLKKPVGLYPCHRQGGNQYWMFSKTGEIRRDQACLD---------------- 599

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
               AG  +I         LF C  S G  +  W +H +T+ L H    +CL+++   ++
Sbjct: 600 ---YAGKDVI---------LFGCHGSKG--NQFWTYHENTKLLHHGSSGKCLAINESKDK 645

Query: 252 LAMLRCDENNSYQQW 266
           L M  CD ++  Q W
Sbjct: 646 LIMEECDASHLRQHW 660


>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Rhipicephalus pulchellus]
          Length = 621

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/212 (26%), Positives = 90/212 (42%), Gaps = 41/212 (19%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K      +P A  +D GD+S +  LR KL C  F W+++++       +PE  +P
Sbjct: 435 WLDEWKDFYFAINPAAKNVDKGDLSYRKQLRTKLKCNTFRWYLENI-------YPESHMP 487

Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + +  GE K+  T  CLD+ GR +   +  S CHG G +Q+F      Q+   + C+DA
Sbjct: 488 LDYYHLGEIKHADTSDCLDTFGRKSGENVAVSKCHGMGGNQVFAYTKRQQIMSDDNCLDA 547

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
                 V++L+                          C    G  +  W ++   +  +H
Sbjct: 548 SSPRGPVKLLR--------------------------CHGMGG--NQLWIYNKEEQSFKH 579

Query: 237 AILKRCLSVHPETNQ--LAMLRCDENNSYQQW 266
               RCL   PE     L +LR  +  S Q+W
Sbjct: 580 VNTARCLD-QPEAKDPSLPVLRECDGRSSQRW 610


>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
          Length = 613

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/222 (25%), Positives = 94/222 (42%), Gaps = 36/222 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ     +P     D GD+++Q ALR++L CK F WF++ VA D+  ++P
Sbjct: 400 RVAEVWMDEYKQFLYERNPQFDQTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLVVRYP 459

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
              P  F  G  ++    K CLDSM   A   IG   C  + +    +Q F L    + +
Sbjct: 460 LRDPKPFASGRVQSAANPKLCLDSMNHKAKEPIGVFSCAANRTYPQNNQFFTLTYYRDIR 519

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
           +   ++C+DA                             G+ + LFNC  S G  +  W+
Sbjct: 520 VSSVDKCLDASSD--------------------------GSEVILFNCHESQG--NQLWQ 551

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           +   T+ + H    R   +     ++ + +CD     Q+W +
Sbjct: 552 YDTETQMIRHGKPTRNQCLDLVERKVVVSKCDHRKKTQRWEW 593


>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 573

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 98/219 (44%), Gaps = 20/219 (9%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    +D GD++ Q A+R++L+CK F WF++++A+D+  K+P
Sbjct: 370 RVAEVWMDEYAEYLYKRRPHYRNIDPGDLTVQKAVRERLNCKPFKWFIENIAFDLPLKYP 429

Query: 115 EL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
            + PP+L  GE +++     C+D+  +      G   C  +  S+  R      L   E 
Sbjct: 430 PIEPPDLAEGEIRSIADPGLCVDTERKEPEDTFGLKPCEKNFKSKNTRTEQYFILTWHED 489

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
                           P    V   V  I  +    +NL+ C    G  +  W + ++  
Sbjct: 490 I--------------RPKGRNVCWDVSSIDNKAS--VNLYKCHGMKG--NQYWHYDSNEN 531

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H    RCL+ + E+  L + +CD+N   Q+W+ +++
Sbjct: 532 WLVHGTKNRCLTANVESQTLYVTKCDKNIENQKWKIEKI 570


>gi|324506451|gb|ADY42754.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Ascaris suum]
          Length = 618

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 92/194 (47%), Gaps = 31/194 (15%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSM 137
           D GD+++Q A+R++L CK F WFMK +A+D    +P + P +   GE +N+  Q C+D+ 
Sbjct: 431 DEGDLTKQKAVRERLKCKSFDWFMKEIAFDQDKFYPAVEPEDSANGELRNVKAQTCVDTQ 490

Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA- 196
            R              G S+ F L          +C+  D      + ++L  +F V   
Sbjct: 491 FR--------------GGSERFGLR---------KCISDDPSGGGEQNMRLTFSFDVRPK 527

Query: 197 GVKLIFCRLGTV----LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
           G  L F    +V    + LF+C    G  +  +++  +++Q+ H +  +C+    E  ++
Sbjct: 528 GRTLCFDVSSSVRKAPVVLFSCHGMQG--NQLFKYLPNSKQMYHPVSNQCMDCDAERGEI 585

Query: 253 AMLRCDENNSYQQW 266
            M +CDE    QQW
Sbjct: 586 FMDKCDEGKPTQQW 599


>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 561

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 95/217 (43%), Gaps = 39/217 (17%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE---L 116
           W+  +K       P A   + GDI ++  LR+KL C  F W+++++       FP+   L
Sbjct: 371 WMDEYKSFYYKTVPNARNAEYGDIEDRKQLREKLQCNSFRWYLENI-------FPDSQFL 423

Query: 117 PPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
             N F + E +N+ T++CLD+MG+   +    S CHG G  Q++  +   +L H + C+D
Sbjct: 424 LDNYFRFCEVRNMETKQCLDNMGQKEKSKAALSRCHGQGGHQIYAWSKLNELKHDDLCLD 483

Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLE 235
           A            P  F  +   +              C +  GT +  WR++   + + 
Sbjct: 484 ASA----------PSGFKDVEQSR--------------CNSHGGTQE--WRYNEELKSIM 517

Query: 236 HAILKRCLS-VHPETNQLAMLRCDENNSYQQWRFKEV 271
           H +   CL     ++    + +CD N   Q+W   E+
Sbjct: 518 HVVSGLCLDKADADSTTPTLHQCD-NRLSQKWELVEI 553


>gi|194761420|ref|XP_001962927.1| GF15680 [Drosophila ananassae]
 gi|190616624|gb|EDV32148.1| GF15680 [Drosophila ananassae]
          Length = 630

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 65/120 (54%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 446 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 505

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +  GT+ CLD+MG      +G   CH +G +Q +  +  G++ H + C+
Sbjct: 506 QE-----IGQFRQDGTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFSKRGEIKHDDLCL 559


>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Nomascus leucogenys]
          Length = 622

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
          Length = 603

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 409 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 468

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 469 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 524

Query: 167 LGHG 170
           L H 
Sbjct: 525 LRHN 528


>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
           gorilla gorilla]
          Length = 622

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Pan paniscus]
 gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Pan paniscus]
          Length = 622

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
 gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
           N-acetylgalactosaminyltransferase 6 [Homo sapiens]
          Length = 622

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
 gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
 gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
 gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
           [synthetic construct]
 gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
           [synthetic construct]
          Length = 622

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Pongo abelii]
 gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           3 [Pongo abelii]
          Length = 622

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Nasonia vitripennis]
          Length = 610

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 47/197 (23%), Positives = 83/197 (42%), Gaps = 36/197 (18%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
           + GD+S++ ALR  L CK F W++ ++  +++     +      GE ++L ++ C+DS G
Sbjct: 445 NYGDVSDRKALRKNLGCKSFKWYLDNIYPELFIPGEAVAS----GEIRHLASRLCIDSPG 500

Query: 139 RTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
                   +G   CH  G +Q + L+  G++   E C+D                     
Sbjct: 501 NPEDLHQAVGFYECHNQGGNQYWMLSKTGEIRRDESCLDYS------------------- 541

Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLR 256
                    GT + L+ C  S G     W ++  T+Q+ H    +CL++     +L M  
Sbjct: 542 ---------GTDVILYPCHGSKGNQQ--WTYNTQTKQIRHESSGKCLAITESKQRLLMEE 590

Query: 257 CDENNSYQQWRFKEVRP 273
           C      Q+W F+   P
Sbjct: 591 CSPAAPRQRWAFENYDP 607


>gi|195129477|ref|XP_002009182.1| GI11401 [Drosophila mojavensis]
 gi|193920791|gb|EDW19658.1| GI11401 [Drosophila mojavensis]
          Length = 673

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 57/219 (26%), Positives = 100/219 (45%), Gaps = 36/219 (16%)

Query: 60  WLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-P 117
           W+  +KQ    G+  +   +D GD++ Q A+R KL CK F WFM++VA+D+   +P + P
Sbjct: 459 WMDEYKQYLYNGADGVYERIDAGDLTAQKAIRTKLKCKSFKWFMENVAFDLIKNYPPIDP 518

Query: 118 PNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHG 170
           P+   G  +N+G    C+D++ +     +G   C  +      +Q + L+   + +L   
Sbjct: 519 PDYASGAIQNVGDPTLCVDTLSKPRHNRVGIYSCARNLVKPQRTQYWSLSWKRDLRLHRK 578

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
           + C+D       +     PV                    L++C    G  +  W +   
Sbjct: 579 KDCLDVQ-----IWDANAPVW-------------------LWDCHGQQG--NQYWYYDYR 612

Query: 231 TRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           T+ ++H    KRC+ + P+T +L +  CDE N   QW F
Sbjct: 613 TKLIKHGKEGKRCMELLPDTEELVVNACDEKNRLMQWNF 651


>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
          Length = 592

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 64/229 (27%), Positives = 99/229 (43%), Gaps = 19/229 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 424

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG-QLGHG-ER 172
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+  L      LG G E 
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEE 482

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLI--------FCRLGTVLNLFNCFNSSGTVDGP 224
           C        LV    +          K I          RL   + +  C +  G  +  
Sbjct: 483 CCSNHPLYGLVYTPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRG--NQL 540

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
           W +      L HA   +CL    E +++   M  C  + S QQW  + +
Sbjct: 541 WEYDAERLTLRHANSNQCLDEPSEEDKMVPTMQDCSGSRS-QQWLLRNM 588


>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
           musculus]
          Length = 594

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 64/229 (27%), Positives = 99/229 (43%), Gaps = 19/229 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P
Sbjct: 369 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 426

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG-QLGHG-ER 172
               +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+  L      LG G E 
Sbjct: 427 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEE 484

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLI--------FCRLGTVLNLFNCFNSSGTVDGP 224
           C        LV    +          K I          RL   + +  C +  G  +  
Sbjct: 485 CCSNHPLYGLVYTPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRG--NQL 542

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
           W +      L HA   +CL    E +++   M  C  + S QQW  + +
Sbjct: 543 WEYDAERLTLRHANSNQCLDEPSEEDKMVPTMQDCSGSRS-QQWLLRNM 590


>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Hydra magnipapillata]
          Length = 548

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 69/136 (50%), Gaps = 8/136 (5%)

Query: 55  RKRSSWLKVFKQIR-SLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K++      P    +  GDISE+V LR KL CK F W++ +V  D+    
Sbjct: 398 RLAEVWMDEYKELYYRRKPPEDKLVKYGDISERVELRKKLGCKSFKWYIDNVIPDMIGAD 457

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAP-AVIGTSYCHGSGSSQLFRLNAEGQLGHG-E 171
           P  P +   GE +N+ +  CLDSMG     A I    CH  G +Q F L+  G++ H  E
Sbjct: 458 PNPPAH---GEVRNVASNMCLDSMGNKGNRAQIKVFPCHRLGGNQFFVLSKRGEIIHNDE 514

Query: 172 RCVD--ADKQEKLVEM 185
            C+D   + +E  V+M
Sbjct: 515 SCLDYSLENEENKVDM 530


>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
          Length = 495

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 88/211 (41%), Gaps = 39/211 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           W+  ++      +P     D GD+S ++ LR+KL CK F W+++++       +PE  +P
Sbjct: 314 WMDEWRSFYYKINPGVKQTDYGDLSPRIQLREKLECKSFRWYLQNI-------YPESQMP 366

Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + +  GE +N  T +CLDSMGR A   +G   CHG G +Q+F  + +      + C+D 
Sbjct: 367 LDYYSLGEIRNKETNQCLDSMGRKAGEKVGIVGCHGMGGNQIFSYSKKKAFQTDDLCLD- 425

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
                 V  L  PV                    L+ C    G  +  W     T  + H
Sbjct: 426 ------VSALTGPV-------------------KLYQCHGLGG--NQLWEHDQETLVIRH 458

Query: 237 AILKRCLS-VHPETNQLAMLRCDENNSYQQW 266
               +CL    P+      L     +S QQW
Sbjct: 459 VSSNQCLDKPTPQDRDTPALGACSGSSSQQW 489


>gi|195435185|ref|XP_002065582.1| GK14594 [Drosophila willistoni]
 gi|194161667|gb|EDW76568.1| GK14594 [Drosophila willistoni]
          Length = 635

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 65/120 (54%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 451 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEP 510

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +  GT+ CLD+MG      +G   CH +G +Q +  +  G++ H + C+
Sbjct: 511 QE-----IGQFRQDGTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCL 564


>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Cavia porcellus]
          Length = 622

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSWF+ ++  +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKIAQEKSFGDISERLQLRERLHCHNFSWFLSNIYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLG  +CLD     R    +I  S CHG G +Q F    + +
Sbjct: 488 MFV--PDLSPT-FYGAIKNLGINQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRE 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|198437817|ref|XP_002130165.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
           polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
           intestinalis]
          Length = 647

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 51/213 (23%), Positives = 89/213 (41%), Gaps = 21/213 (9%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W   + +      P    +D GD+S  ++ + KL C+ F WFMK +AYD+   +P
Sbjct: 429 RVIETWWDDYSKYFYTRRPEVKSIDTGDLSVPLSYKKKLQCRSFDWFMKEIAYDIPKHYP 488

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            + PP+   G   N+GT+ CLD     + + +  S C   G    + L     +  G   
Sbjct: 489 MVEPPSGASGHISNIGTKLCLDGKFGGSGSPLTLSNCEQHGGELNWWLTWLEDIRPGGNT 548

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
            D  ++                  V L      +V++ + C +  G  +  W++    +Q
Sbjct: 549 PDTARK------------------VCLDCVSRDSVVSFWECHHMRG--NQLWKYIFDNKQ 588

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           L H++ + CL   P   ++    C+ NN  Q W
Sbjct: 589 LYHSVSRSCLEASPGEGKVYARECNHNNLNQLW 621


>gi|321455342|gb|EFX66478.1| hypothetical protein DAPPUDRAFT_302681 [Daphnia pulex]
          Length = 613

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 92/223 (41%), Gaps = 34/223 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P    L++GD+S Q  +R++L CK F WFM ++A+D+  K+P
Sbjct: 399 RVAEVWMDEYKEYLYKRRPQYRNLEVGDLSSQTEIRERLQCKSFKWFMTNIAFDLPKKYP 458

Query: 115 EL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYC--HGSGSS--QLFRLNAEGQLG 168
            + PP+   GE ++      C+D+  +          C   GSG S  Q F L     + 
Sbjct: 459 PIEPPDFASGEIRSKADSTLCVDTQFKKENERFNLEQCVKDGSGRSGEQQFALTWHKDIR 518

Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
             +R  C D    +    +L                        L+ C  S G  +  WR
Sbjct: 519 PHKRTMCWDVSSLDAQAPVL------------------------LWGCHGSQG--NQLWR 552

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           +   T QL H    RCL   P   +L +  C+  +  Q+W F+
Sbjct: 553 YDPETLQLIHGGNPRCLDCDPGRKELFVATCNSESPTQRWTFE 595


>gi|195377912|ref|XP_002047731.1| GJ13596 [Drosophila virilis]
 gi|194154889|gb|EDW70073.1| GJ13596 [Drosophila virilis]
          Length = 675

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 57/224 (25%), Positives = 102/224 (45%), Gaps = 36/224 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K  + + G  +   +D GD++ Q A+R KL CK F WFM++VA+D+   +
Sbjct: 459 RVAEVWMDEYKNYLYNHGDGIYDNVDPGDLTAQKAIRTKLKCKSFKWFMENVAFDLMKSY 518

Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCH----GSGSSQLFRLN--AEG 165
           P + PP+   G  +N+G    C+D++GR     +G   C         +Q + L+   + 
Sbjct: 519 PPVDPPDYASGAIQNVGDNTLCIDTLGRVRHNRVGMYRCAIDLVKPQRTQYWSLSWKRDL 578

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +L   + C+D    +        PV                    L++C    G  +  W
Sbjct: 579 RLRRKKDCLDVQIWDA-----NAPVW-------------------LWDCHGQQG--NQYW 612

Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +   T+ ++H    KRC+ + P + +L +  C+E+N Y +W F
Sbjct: 613 FYDYHTKMIKHGKEGKRCMELLPFSEELVVNNCNESNEYMRWNF 656


>gi|332020473|gb|EGI60888.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Acromyrmex
           echinatior]
          Length = 442

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/215 (27%), Positives = 91/215 (42%), Gaps = 34/215 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ KLHCK FSW++K+V  ++     
Sbjct: 256 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTS 315

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E  P    G +   GT  CLDSMG      +G   CH +G +Q + L  +G + H + C 
Sbjct: 316 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 369

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
                      L LPV               GT L +  C    G+ +  WR H     +
Sbjct: 370 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HMEGGLI 402

Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            H  +  C+ S +     +   +CD N   Q+W  
Sbjct: 403 RHTRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 437


>gi|241622516|ref|XP_002407424.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
 gi|215500988|gb|EEC10482.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
          Length = 471

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 98/223 (43%), Gaps = 31/223 (13%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P    LD GD++ Q ALR +L+CK F WFM+ VA+D  +K+P
Sbjct: 266 RVAEVWMDEYKEHLYHRRPHYRHLDPGDLTAQKALRKRLNCKSFKWFMEQVAFDQPSKYP 325

Query: 115 E-LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL-----NAEGQLG 168
             L P   W + +N  +  C+D+  +              G ++ F L     +  G+ G
Sbjct: 326 ALLTPVAHWPQVRNEESGLCIDTQFK--------------GQNERFSLAPCLKDQRGRSG 371

Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
             +  +   K    V   K  V F V +        +   + L++C    G  +  W++ 
Sbjct: 372 EQQLVLTWHKD---VRPAKRSVCFDVSSS------DVHAPVMLWSCHGMHG--NQLWKYD 420

Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
             T+ L H I   CL    +++++ M  CD +   Q+W F+ V
Sbjct: 421 TVTKHLFHPITANCLDCDAKSHEVFMSICDSDVRTQRWLFEHV 463


>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 460

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+     +P A  +D GD+S +  LR KL C  F W+++++       +PE  +P
Sbjct: 274 WLDEWKEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENI-------YPESHMP 326

Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + +  GE K+  +  CLD+ GR +   +  S CHG G +Q+F      Q+   + C+DA
Sbjct: 327 LDYYHLGEIKHADSPVCLDTFGRKSGENVAVSTCHGQGGNQVFAYTKRQQIMSDDNCLDA 386

Query: 177 DKQEKLVEMLK 187
                 V++L+
Sbjct: 387 SSPRGPVKLLR 397


>gi|10438640|dbj|BAB15297.1| unnamed protein product [Homo sapiens]
 gi|193787354|dbj|BAG52560.1| unnamed protein product [Homo sapiens]
          Length = 187

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/113 (37%), Positives = 64/113 (56%), Gaps = 6/113 (5%)

Query: 59  SWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPP 118
           S+ K+F +     + +A     GDISE++ LR++LHC +FSW++ +V  +++   P+L P
Sbjct: 3   SYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFV--PDLTP 60

Query: 119 NLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
             F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  L H
Sbjct: 61  T-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRDLRH 111


>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
 gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
 gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
           troglodytes]
          Length = 622

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLMPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Takifugu rubripes]
          Length = 600

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/215 (25%), Positives = 95/215 (44%), Gaps = 26/215 (12%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR +L CK+F WFM +VA+D+   +P + PP
Sbjct: 392 WMDEYAEYVYQRRPEYRHLSAGDMTPQKELRSRLGCKNFKWFMSNVAWDLPKHYPPVEPP 451

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG--HGERCVDA 176
              WGE +N+G+  C++     + + I    C          + + G++G  HG+     
Sbjct: 452 AAAWGEIQNVGSGLCMEIKHFVSGSPIRLENC----------VKSRGEVGWSHGQVLTFG 501

Query: 177 DKQEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
            +++  V   M    V F  ++          + + L++C    G  +  WR+    R L
Sbjct: 502 WREDIRVGDPMHTRKVCFDAVSH--------NSPVTLYDCHGMKG--NQLWRYRKDKR-L 550

Query: 235 EHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            H +   C+   P   ++ M  CD  +  QQW F+
Sbjct: 551 YHPVSNSCIDSSPGERRVFMNTCDPASLSQQWLFE 585


>gi|351697576|gb|EHB00495.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Heterocephalus
           glaber]
          Length = 622

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 70/124 (56%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC++FSWF+ ++  +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKIAQEKSFGDISERLQLREQLHCRNFSWFLHNIYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLG  +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGINQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|109096689|ref|XP_001083664.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
           mulatta]
          Length = 641

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC  FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|350583955|ref|XP_003126192.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Sus scrofa]
          Length = 384

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 69/124 (55%), Gaps = 12/124 (9%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSWF+ ++  +
Sbjct: 190 QVRLAEVWMDSYKEIFYRRNLQAAKMAQEKSFGDISERLQLRERLHCHNFSWFLDNIYPE 249

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
           ++   P+L P  F+G  KNLG   CLD   S     P ++ T  CHG G +Q F    + 
Sbjct: 250 MFV--PDLKPT-FYGAIKNLGMDHCLDVGESNNGGKPLIMYT--CHGLGGNQYFEYTTQR 304

Query: 166 QLGH 169
            L H
Sbjct: 305 DLRH 308


>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
          Length = 650

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 85/195 (43%), Gaps = 44/195 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
           D GD+S++  LR+ L CK F W++ ++       +PEL     ++  GE +NLG   + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD+    +     +GT  CH  G +Q + L+  G++   + C+D                
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 580

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                         G  + LF C    G  +  W +  +T+QL H    +CL++    ++
Sbjct: 581 -------------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDK 625

Query: 252 LAMLRCDENNSYQQW 266
           L M  C  + S QQW
Sbjct: 626 LLMEECSASLSRQQW 640


>gi|355564239|gb|EHH20739.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
 gi|355762987|gb|EHH62101.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
           fascicularis]
 gi|380809242|gb|AFE76496.1| polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
          Length = 622

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC  FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
 gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
          Length = 650

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 85/195 (43%), Gaps = 44/195 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
           D GD+S++  LR+ L CK F W++ ++       +PEL     ++  GE +NLG   + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD+    +     +GT  CH  G +Q + L+  G++   + C+D                
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 580

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                         G  + LF C    G  +  W +  +T+QL H    +CL++    ++
Sbjct: 581 -------------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDK 625

Query: 252 LAMLRCDENNSYQQW 266
           L M  C  + S QQW
Sbjct: 626 LLMEECSASLSRQQW 640


>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
 gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
          Length = 585

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/227 (25%), Positives = 99/227 (43%), Gaps = 38/227 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + Q     +P     D GD+S Q  LR++L CK F WF++ VA D+  ++P
Sbjct: 371 RVAEVWMDEYSQFLYERNPQFAKTDPGDLSAQRELRERLQCKPFKWFLEVVAPDLLVRYP 430

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
              P  F  G  +++   + CLDS+   A   IG   C  + +    +Q F L+   + +
Sbjct: 431 PRDPQPFASGRVQSVANPRLCLDSLNHQAKEPIGLYACAFNKTHPQNNQFFTLSYHRDIR 490

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
           +   ++C+DA K                          L   + LF+C  S G  +  WR
Sbjct: 491 VRSNDKCLDAAK--------------------------LNDEIVLFSCHESQG--NQMWR 522

Query: 227 FHNSTRQLEHAILK--RCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           +   ++ + H      RC+     TN+L +  CD++ + Q+W +  V
Sbjct: 523 YDYESKMIIHGKDHHGRCMEADLHTNKLFVRSCDKDKASQKWNWGYV 569


>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
 gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
 gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 9; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9
 gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
 gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
          Length = 650

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 85/195 (43%), Gaps = 44/195 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
           D GD+S++  LR+ L CK F W++ ++       +PEL     ++  GE +NLG   + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD+    +     +GT  CH  G +Q + L+  G++   + C+D                
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 580

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                         G  + LF C    G  +  W +  +T+QL H    +CL++    ++
Sbjct: 581 -------------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDK 625

Query: 252 LAMLRCDENNSYQQW 266
           L M  C  + S QQW
Sbjct: 626 LLMEECSASLSRQQW 640


>gi|402886019|ref|XP_003906439.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Papio anubis]
 gi|402886021|ref|XP_003906440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Papio anubis]
          Length = 622

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC  FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
 gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
 gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
 gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
          Length = 647

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 84/195 (43%), Gaps = 41/195 (21%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
           D GD+S++  LR+ L CK F W++ ++       +PEL     ++  GE  N+    CLD
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIANVPNGMCLD 535

Query: 136 SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
           +  ++      + Y CHG G +Q + L+  G++   + C+D                   
Sbjct: 536 AKEKSEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDY------------------ 577

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
                      G  + LF C    G  +  W +  +T+QL H    +CL++    ++L M
Sbjct: 578 ----------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLLM 625

Query: 255 LRCDENNSYQQWRFK 269
             C  + S QQW  +
Sbjct: 626 EECSASLSRQQWTLE 640


>gi|350409232|ref|XP_003488663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Bombus impatiens]
          Length = 571

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/215 (27%), Positives = 94/215 (43%), Gaps = 34/215 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ KLHCK FSW++K+V  ++     
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTS 444

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E  P    G +   GT  CLDSMG      +G   CH +G +Q + L  +G + H + C 
Sbjct: 445 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 498

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
                      L LPV           + + GT L +  C    G+ +  WR H     +
Sbjct: 499 -----------LTLPV-----------YAK-GTTLLMQIC---DGSENQKWR-HLEGGLI 531

Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            H+ +  C+ S +     +   +CD N   Q+W  
Sbjct: 532 RHSRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 566


>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Meleagris gallopavo]
          Length = 562

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/222 (26%), Positives = 95/222 (42%), Gaps = 39/222 (17%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR+ L+CK F WFM  VA+D+   +P + PP
Sbjct: 352 WMDEYAEYIYQRRPEYRHLSAGDVTAQKELRNNLNCKSFKWFMSEVAWDLPKFYPPVEPP 411

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
              WGE +N+GT  C+D+              HGS  S L RL         E CV  D+
Sbjct: 412 AAAWGEIRNVGTGLCVDTK-------------HGSLGSPL-RL---------ESCV-KDR 447

Query: 179 QEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDGPWRF 227
            E     +++  TF     ++         FC       + + L++C    G  +  WR+
Sbjct: 448 GEAAWNNVQVTXTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQLWRY 505

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               + L H +   C+       ++ M  C+ ++  QQW F+
Sbjct: 506 RKD-KTLYHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 546


>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 490

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 10/120 (8%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K      SP     D GDIS+++ LR  L CK F W+++++       +PE   P
Sbjct: 312 WLDEYKNFYYRISPSVAKTDPGDISDRLNLRKSLSCKSFKWYLENI-------YPESSWP 364

Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            N  F GE +N     CLD+M + A   +G   CHG G +Q++      +L H + C+D 
Sbjct: 365 VNYQFMGEVRNTEAHVCLDTMMKEAGNKVGLYGCHGQGGNQIWAFTKNNELRHDDLCLDV 424


>gi|195454523|ref|XP_002074278.1| GK18434 [Drosophila willistoni]
 gi|194170363|gb|EDW85264.1| GK18434 [Drosophila willistoni]
          Length = 646

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/229 (24%), Positives = 99/229 (43%), Gaps = 38/229 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMF-LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K+      P     +D GD+S Q+A+R++L CK F W+MK +A D   K+
Sbjct: 432 RVAEVWMDKYKEYVYTRDPETYEKIDAGDLSRQLAIRERLQCKSFDWYMKEIAPDFLIKY 491

Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNA--EG 165
           P + PP    G  +++     CLDS+G+T+   +G   C  +     ++Q F L +  E 
Sbjct: 492 PPIEPPKFASGAVQSVAYPLFCLDSLGQTSNNAVGLYSCAANLTFPQANQNFGLTSFREM 551

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +LG    C+D                                 + ++NC N  G  +  W
Sbjct: 552 RLGEEGDCLDVQDNHP------------------------NATIWMWNCHNQGG--NQFW 585

Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLR--CDENNSYQQWRFKEV 271
            +  + + + H     RC+    + ++ ++    C ENN  Q+W+F  V
Sbjct: 586 YYDRNHQWIVHGKNGHRCMEAFVDDSERSVYTNDCQENNPRQRWKFANV 634


>gi|161077160|ref|NP_001097343.1| CG30463, isoform E [Drosophila melanogaster]
 gi|157400368|gb|ABV53824.1| CG30463, isoform E [Drosophila melanogaster]
          Length = 264

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 41/192 (21%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
           D GD+S++  LR+ L CK F W++ ++       +PEL     ++  GE  N+    CLD
Sbjct: 100 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIANVPNGMCLD 152

Query: 136 SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
           +  ++      + Y CHG G +Q + L+  G++   + C+D                   
Sbjct: 153 AKEKSEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDY------------------ 194

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
                      G  + LF C    G  +  W +  +T+QL H    +CL++    ++L M
Sbjct: 195 ----------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLLM 242

Query: 255 LRCDENNSYQQW 266
             C  + S QQW
Sbjct: 243 EECSASLSRQQW 254


>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
          Length = 647

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 86/198 (43%), Gaps = 44/198 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNL--GTQKC 133
           D GD+SE+  LR+ L CK F W++ ++       FPEL      +  GE +N+  G + C
Sbjct: 480 DYGDVSERKQLRENLGCKPFRWYLDNI-------FPELFIPGEAVASGEVRNMGYGNRTC 532

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD+ G  +     +G   CH  G +Q + L+  G++   E C+D                
Sbjct: 533 LDAPGGKKNLRKPVGLYPCHNQGGNQYWMLSKTGEIRRDEACLD---------------- 576

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
               AG  +I         L+ C  S G  +  W +   +  L H    +CL+++    +
Sbjct: 577 ---YAGQDVI---------LYPCHGSKG--NQYWNYSPDSHLLRHGSSDKCLAINESKQK 622

Query: 252 LAMLRCDENNSYQQWRFK 269
           L M  CD     Q+W+F+
Sbjct: 623 LIMADCDSTVDAQKWQFQ 640


>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
          Length = 600

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/218 (23%), Positives = 85/218 (38%), Gaps = 18/218 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  + +      P    L  GD++ Q  LR  L CKDF W+M  VA+D+   +P
Sbjct: 386 RVAETWMDEYTEYIYQRRPEYRHLSTGDLTAQKELRKHLKCKDFKWYMNTVAWDLPKYYP 445

Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            + P    WGE +N  +  C+DS   +    +    C   G+ + +        G  E  
Sbjct: 446 PVEPLPAAWGEIRNAASGLCVDSKHGSTGTELRLDNCLKEGAERTWAHEQIFTFGWREDI 505

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
              D              FC  A          + + L++C    G     +R     + 
Sbjct: 506 RPGDPLHTR--------KFCFDA------ISQNSPITLYDCHGMKGNQHWSYR---KDKS 548

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           L H +   C+   P   ++ M +CD  +  QQW F++V
Sbjct: 549 LYHLVSSGCMDCSPNDKRIFMNKCDPKSETQQWIFQKV 586


>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
           sapiens]
          Length = 622

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE+  LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERPQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|402585386|gb|EJW79326.1| polypeptide N-acetylgalactosaminyltransferase 10 [Wuchereria
           bancrofti]
          Length = 288

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/251 (23%), Positives = 98/251 (39%), Gaps = 35/251 (13%)

Query: 30  QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
           ++GH  RC  +      I    S    R    W+  + +      P  + +D GD+SEQ 
Sbjct: 51  RVGHIYRCKYIPFSNPGIGDFISRNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSEQK 110

Query: 88  ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
           A+R +L CK F WFMK +A+D    +P + P +   GE +N+   KC+D+  + A    G
Sbjct: 111 AVRKRLKCKSFDWFMKEIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTDFKNAEKKFG 170

Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKL 200
              C     +     N      H  R      C D       V + K P+          
Sbjct: 171 LRKCVSDDPANHGEQNLRLTFWHDVRLQTRTMCFDVS-----VSVNKAPIV--------- 216

Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
                     LF+C    G  +  +++    +Q+ H I   C+    E  ++ M  C+  
Sbjct: 217 ----------LFSCHGMQG--NQHFKYLPDKQQMYHPISDLCMDCDVEQGEIFMNPCNSE 264

Query: 261 NSYQQWRFKEV 271
              Q+W + ++
Sbjct: 265 RRSQKWTWTKI 275


>gi|324520233|gb|ADY47590.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
          Length = 267

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 55/221 (24%), Positives = 97/221 (43%), Gaps = 41/221 (18%)

Query: 54  LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +R    WL  +K+I    +   L  +   GD+SE+  LR++L C  F W++ +V  D++ 
Sbjct: 76  VRLAEVWLDEYKEIYYERINHKLGEY---GDVSERKRLRERLKCHSFKWYLDNVFPDLFI 132

Query: 112 KFPELPPNLFWGEAKNLGTQK-CLD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLG 168
               +      GE +N G  K C+D  +GR         Y CH  G +Q + L+ EG++ 
Sbjct: 133 PSKAIGK----GEIRNRGNPKFCVDHEVGRNVVNDAVIPYPCHLMGGNQFWLLSKEGEIR 188

Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
             E C+D   +  +V                            + C  S G  +  W ++
Sbjct: 189 RDEYCIDYPGRGSVV---------------------------TYECHGSKG--NQLWEYN 219

Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           + + ++ H I ++CL++  +  +L M  CD+ N  Q W F+
Sbjct: 220 HESGRIYHPISRKCLALSDDDKKLIMSACDKANDRQSWAFQ 260


>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
 gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
          Length = 644

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 89/198 (44%), Gaps = 44/198 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGT--QKC 133
           D GD+S++  LR+ L CK F W++ ++       +PEL  P + +  GE KNLG   + C
Sbjct: 477 DWGDVSDRKKLREDLQCKSFRWYLDNI-------YPELFIPGDAVAHGEIKNLGYGGRTC 529

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           +D+    +     +GT  CH  G +Q + L+  G++   + C+D                
Sbjct: 530 MDAPAGKKHLKKSVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 574

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
                         G  + L+ C  S G  +  W +  +T+QL H    +CL++    ++
Sbjct: 575 -------------AGKDVTLYACHGSKG--NQFWTYRENTKQLHHGTSGKCLAISEAKDK 619

Query: 252 LAMLRCDENNSYQQWRFK 269
           L M  C+ + + QQW  +
Sbjct: 620 LLMEECNSSLARQQWMLE 637


>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
 gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
          Length = 658

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 91/195 (46%), Gaps = 44/195 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGT--QKC 133
           D GD+S++  LR+ L CK F W++ ++       +PEL  P + +  GE +NLG   + C
Sbjct: 491 DWGDVSDRKKLREDLQCKSFKWYLDNI-------YPELFIPGDAVAHGEIRNLGYGGRTC 543

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LDS    +     +G   CH  G +Q + L+  G++   + C+D                
Sbjct: 544 LDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDDYCLD---------------- 587

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
               AG ++I         L++C    G  +  W +  +T+QL H    +CLS+  + +Q
Sbjct: 588 ---YAGKEVI---------LYSCHGGKG--NQFWTYRENTKQLHHGTSGKCLSISEKKDQ 633

Query: 252 LAMLRCDENNSYQQW 266
           L M  C ++ + Q+W
Sbjct: 634 LLMEECSQSLTRQEW 648


>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 573

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 69/126 (54%), Gaps = 11/126 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    WL  + +   + +P A+ +D GD++++  LR +L+CK F W+++H+       +
Sbjct: 378 VRTALVWLDQYSRFYFMLNPSALSVDYGDVTKRKKLRQQLNCKSFRWYLEHI-------Y 430

Query: 114 PE--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           PE  +P ++   GE ++   Q CLDS+G      +G ++CHG G +Q+F +   G +   
Sbjct: 431 PESSIPIDVIRLGEIRHKSGQ-CLDSLGHKLGETVGVTHCHGQGGNQVFAITESGTIRVH 489

Query: 171 ERCVDA 176
             C+D 
Sbjct: 490 AGCMDG 495


>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Canis lupus familiaris]
          Length = 622

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 102/224 (45%), Gaps = 38/224 (16%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC +FSWF+ ++  +
Sbjct: 428 QVRLAEVWMDNYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCHNFSWFLHNIYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P L+ G  +NLG  +CLD +G       P ++ T  CHG G +Q F    +
Sbjct: 488 MFV--PDLKPTLY-GAIRNLGINQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 541

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
             L H     +  KQ             C+ A        LG     F   NS    D  
Sbjct: 542 RDLRH-----NISKQ------------LCLHASAGT----LGLRSCHFTGKNSQVPKDEE 580

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           W F    + + +     CL+   E  + AM  C+  + +Q W F
Sbjct: 581 WEF-TQDQLIRNRASGTCLTS--EDKKPAMAACNPRDPHQHWVF 621


>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
          Length = 622

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 69/124 (55%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +      GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMTQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543

Query: 167 LGHG 170
           L H 
Sbjct: 544 LRHN 547


>gi|326670821|ref|XP_003199296.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Danio rerio]
          Length = 435

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 62/236 (26%), Positives = 100/236 (42%), Gaps = 45/236 (19%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +     + +A     GD+S +V LR++L CK FSW++
Sbjct: 234 QVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKEHSFGDVSRRVDLRERLQCKSFSWYL 293

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLF 159
           K+V  +V+   P+L P L +G  +N+G + CLD   S     P ++    CHG G +Q F
Sbjct: 294 KNVYPEVF--MPDLNP-LQFGAIRNMGKEACLDVGESNEGGKPLIMYP--CHGMGGNQYF 348

Query: 160 RLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC-F 215
             +   ++ H    E C+D                              G ++ L  C +
Sbjct: 349 EYSTHHEIRHNIQKELCLDGTD---------------------------GAMVRLEECQY 381

Query: 216 NSSGTVDGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
               T+  P   W           A  K+CLS  PE   L  + C+ ++ YQ W F
Sbjct: 382 KGHNTITSPQQKWELREDQLFYNKAS-KQCLSARPENPSL--VPCNPDDKYQLWLF 434


>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
 gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
          Length = 658

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/195 (27%), Positives = 91/195 (46%), Gaps = 44/195 (22%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGT--QKC 133
           D GD+S++  LR+ L CK F W++ ++       +PEL  P + +  GE +NLG   + C
Sbjct: 491 DWGDVSDRKKLREDLQCKSFKWYLDNI-------YPELFIPGDAVAHGEIRNLGYGGRTC 543

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LDS    +     +G   CH  G +Q + L+  G++   + C+D                
Sbjct: 544 LDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDDYCLD---------------- 587

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
               AG ++I         L++C    G  +  W +  +T+QL H    +CLS+  + +Q
Sbjct: 588 ---YAGKEVI---------LYSCHGGKG--NQFWTYRENTKQLHHGTSGKCLSISEKKDQ 633

Query: 252 LAMLRCDENNSYQQW 266
           L M  C ++ + Q+W
Sbjct: 634 LLMEECSQSLTRQEW 648


>gi|307183874|gb|EFN70488.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Camponotus
           floridanus]
          Length = 451

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/215 (27%), Positives = 91/215 (42%), Gaps = 34/215 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ +LHCK FSW++K+V  ++     
Sbjct: 265 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTS 324

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E  P    G +   GT  CLDSMG      +G   CH +G +Q + L  +G + H + C 
Sbjct: 325 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHNTGGNQEWGLTKDGLIKHHDLC- 378

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
                      L LPV               GT L +  C    G+ +  WR H     +
Sbjct: 379 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HLEGGLI 411

Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            H  +  C+ S +     +   +CD N   Q+W  
Sbjct: 412 RHTRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 446


>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Saccoglossus kowalevskii]
          Length = 524

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 100/220 (45%), Gaps = 42/220 (19%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K I     P     D GD++E+  LRD+L C DF W+++++       +
Sbjct: 325 MRVAEVWMDEYKDIFYALKPQLKGEDYGDVTERKELRDRLQCHDFKWYLQNI-------Y 377

Query: 114 PELP-PNL---FWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG 168
           PELP P+L     GE +NLG    C+D+MG  A   +    CHG G +Q+F  + + ++ 
Sbjct: 378 PELPIPDLKVQARGELRNLGKIGYCMDTMGANA---MCAHPCHGIGHNQMFSFSWQQKIW 434

Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
            GE C+ +   ++              A ++ IF          N  NS   +   W+ H
Sbjct: 435 FGELCLTSSPYQR--------------ASIQQIFA---------NQCNSPRVIH--WQ-H 468

Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
               ++      +CL +  +   L +  C EN++ Q W++
Sbjct: 469 TVNGKMMDTKTGKCLDIGTDQKFLILKTC-ENSATQNWKW 507


>gi|307214182|gb|EFN89299.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Harpegnathos
           saltator]
          Length = 442

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/215 (27%), Positives = 92/215 (42%), Gaps = 34/215 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ +LHCK FSW++K+V  ++     
Sbjct: 256 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTS 315

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E  P    G +   GT  CLDSMG      +G   CH +G +Q + L  +G + H + C 
Sbjct: 316 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 369

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
                      L LPV               GT L +  C    G+ +  WR H     +
Sbjct: 370 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HLEGGLI 402

Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            H+ +  C+ S +     +   +CD N   Q+W  
Sbjct: 403 RHSRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 437


>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
 gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
          Length = 499

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 87/192 (45%), Gaps = 33/192 (17%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN--LFWGEAKNLGTQKCLD 135
           +D+GDIS++VALR +L CK F W++ ++  D+  K   LPP   L+  + +N  +  CLD
Sbjct: 338 VDIGDISDRVALRKRLGCKSFKWYLDNIYPDMTNK---LPPKSYLYSHQIRNKESSLCLD 394

Query: 136 SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ-LGHGERCVDADKQEKLVEMLKLPVTFCV 194
           ++G      +G   CHG G +Q F L    + L + ++C+D+   +              
Sbjct: 395 TLGEKNIKRVGLYTCHGMGGNQFFTLTKSNEILFNDDKCLDSPNGDP------------- 441

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
                      G+ + +  C    G  +  W+ +     + H   + CL +  +   + +
Sbjct: 442 -----------GSYVEMITCHGLKGNQE--WKHNKRMGTIVHVWTENCLDISQDGQFIIV 488

Query: 255 LRCDENNSYQQW 266
             CD  +S Q+W
Sbjct: 489 NPCD-GSSTQRW 499


>gi|167523942|ref|XP_001746307.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775069|gb|EDQ88694.1| predicted protein [Monosiga brevicollis MX1]
          Length = 2376

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 10/124 (8%)

Query: 54   LRKRSSWLKVFKQI-RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
            LR    W+  + +  +S     A  +D+GDISE+  LR+ LHCK F W++ +V       
Sbjct: 1275 LRLAEVWMDEYAEFFKSRKGSAARKIDIGDISERQKLREDLHCKPFKWYLDNV------- 1327

Query: 113  FPEL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
            +PEL  P     GE +      CLDS G++    +    CHG G +QL+ L+  G+L H 
Sbjct: 1328 YPELRVPDPNPVGEGQVQSGGFCLDSAGKSVGHAVALYRCHGLGGNQLWTLSHNGELAHE 1387

Query: 171  ERCV 174
            + CV
Sbjct: 1388 DACV 1391


>gi|432112638|gb|ELK35354.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Myotis davidii]
          Length = 416

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/122 (33%), Positives = 68/122 (55%), Gaps = 8/122 (6%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC++FSWF+ ++  +
Sbjct: 222 QVRLAEVWMDSYKEIFYRRNMEAAKMAQEKTFGDISERLQLREQLHCRNFSWFLHNIYPE 281

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQL 167
           ++   P+L P  F+G  KNLG  +CLD   +         Y CHG G +Q F    +  L
Sbjct: 282 LF--IPDLKPT-FYGAIKNLGINQCLDVGEKNHGGKPLIMYACHGLGGNQYFEYTTQRDL 338

Query: 168 GH 169
            H
Sbjct: 339 RH 340


>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
          Length = 614

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 70/126 (55%), Gaps = 14/126 (11%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +KQI   R+L  + +A     GDISE++ LR+ LHC++FSWF+ +V  +
Sbjct: 420 QVRLAEVWMDSYKQIFYRRNLQAAKMAQEKSFGDISERLKLRELLHCRNFSWFLHNVYPE 479

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F+G  KNLG  +CLD +G       P ++    CHG G +Q F    +
Sbjct: 480 MFV--PDLKPT-FYGAIKNLGINQCLD-VGENNHGGKPLIMYA--CHGLGGNQYFEYTTQ 533

Query: 165 GQLGHG 170
             L H 
Sbjct: 534 RDLRHN 539


>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
           catus]
          Length = 622

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 38/224 (16%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ L+++LHC++FSWF+ ++  +
Sbjct: 428 QVRLAEVWMDSYKEIFYRRNLQAAKMAQEKSFGDISERLQLKERLHCRNFSWFLHNIYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F+G  +NLG  +CLD +G       P ++ T  CHG G +Q F    +
Sbjct: 488 MFV--PDLKPT-FYGAIRNLGVDQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 541

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
             L H     +  KQ             C+ A        LG     F   NS    D  
Sbjct: 542 RDLRH-----NIAKQ------------LCLHASAGT----LGLRSCHFTGQNSQVPKDEE 580

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           W      + + ++    CL+   E  + AM  C+ ++ +Q W F
Sbjct: 581 WEL-TQDQLIRNSGSGTCLTS--ENKKPAMASCNPSDPHQHWLF 621


>gi|383847543|ref|XP_003699412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Megachile rotundata]
          Length = 571

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 94/217 (43%), Gaps = 38/217 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ KLHCK FSW++K+V       +P
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNV-------YP 437

Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P    G   +L     CLDSMG      +G   CH +G +Q + L  +G + H + 
Sbjct: 438 ELVIPTSEGGPGGSLKQGPACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDL 497

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C            L LPV           + + GT L +  C    G+ +  WR H    
Sbjct: 498 C------------LTLPV-----------YAK-GTTLLMQIC---DGSENQKWR-HLEGG 529

Query: 233 QLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            + H+ +  C+ S +     +   +CD N   Q+W  
Sbjct: 530 LIRHSRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 566


>gi|322785490|gb|EFZ12159.1| hypothetical protein SINV_06585 [Solenopsis invicta]
          Length = 466

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 59/215 (27%), Positives = 91/215 (42%), Gaps = 34/215 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ +LHCK FSW++K+V  ++     
Sbjct: 280 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTS 339

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E  P    G +   GT  CLDSMG      +G   CH +G +Q + L  +G + H + C 
Sbjct: 340 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 393

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
                      L LPV               GT L +  C    G+ +  WR H     +
Sbjct: 394 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HLEGGLI 426

Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            H  +  C+ S +     +   +CD N   Q+W  
Sbjct: 427 RHTRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 461


>gi|449679600|ref|XP_004209371.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Hydra magnipapillata]
          Length = 565

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 59/225 (26%), Positives = 100/225 (44%), Gaps = 38/225 (16%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD-VYTKFPELPP 118
           W+  FK+    G+P+    + G+ISE++ +R++  CK F W++ +VA D V T++    P
Sbjct: 358 WMDEFKEFLYRGNPMVRSQNAGNISERIKVRERNKCKSFKWYLLNVANDTVRTRY---EP 414

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL-GHGERCVDAD 177
           +   GE +N  T+ CLD+ G  A   I  S C    S+Q+FR     +L  + E C+DA 
Sbjct: 415 DRASGEIENTHTKLCLDTYGANAGRKIKLSKCGQRNSNQIFRWTYIYELHQYPEECLDA- 473

Query: 178 KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
              +  +M  + +  C   G    F                        +   T Q++H+
Sbjct: 474 ---RYADMDNVYIEKCHEMGGNQKFL-----------------------YDKDTNQIKHS 507

Query: 238 ILKRCLSVHPETN--QLAMLRCDENNSYQQWRFKEVR----PDWS 276
               CLSV  E +     +  CD  +  Q W+ +  +    P+W+
Sbjct: 508 GTGLCLSVPNEDSPTHPVIEACDLKSVRQLWKIQLSKDSTIPEWA 552


>gi|339249613|ref|XP_003373794.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
           spiralis]
 gi|316970007|gb|EFV54023.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
           spiralis]
          Length = 587

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 53/223 (23%), Positives = 93/223 (41%), Gaps = 30/223 (13%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+   +  P    LD GDIS+Q  +R +L CK F WFM+ +A+D   ++P
Sbjct: 386 RVAEVWMDEYKEYVYMRRPHYRKLDPGDISKQKEIRKRLGCKPFKWFMQVIAFDQSVRYP 445

Query: 115 ELP-PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCH-----GSGSSQLFRLNAEGQLG 168
            +   ++  GE +++ +  C+         V+G + C       +   Q F+     ++ 
Sbjct: 446 PVEMVDVREGEIRSIQSGMCVTVQYVVEHGVVGMADCKKGHDDAAVGEQFFKYTTRKEIK 505

Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
              R +  D  E  V   K PV                    L+ C +  G  +  WR++
Sbjct: 506 FRNRRLCFDVPENKV---KAPVI-------------------LYPCHDMQG--NQAWRYN 541

Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           + T Q+ H +   CL       ++ M  C+ N   Q+W F  +
Sbjct: 542 SKTMQIVHPVTNMCLDADFSRREVFMQSCNLNLLSQRWSFGAI 584


>gi|195471053|ref|XP_002087820.1| GE14879 [Drosophila yakuba]
 gi|194173921|gb|EDW87532.1| GE14879 [Drosophila yakuba]
          Length = 634

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 450 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 509

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +     G++ H + C+
Sbjct: 510 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 563


>gi|194855488|ref|XP_001968556.1| GG24441 [Drosophila erecta]
 gi|190660423|gb|EDV57615.1| GG24441 [Drosophila erecta]
          Length = 631

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 447 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 506

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +     G++ H + C+
Sbjct: 507 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 560


>gi|62484229|ref|NP_608773.2| polypeptide GalNAc transferase 2, isoform A [Drosophila
           melanogaster]
 gi|320594323|ref|NP_995625.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
           melanogaster]
 gi|195576320|ref|XP_002078024.1| GD22759 [Drosophila simulans]
 gi|51315875|sp|Q6WV19.2|GALT2_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
           Short=pp-GaNTase 2; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 2; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 2
 gi|61678274|gb|AAF51113.3| polypeptide GalNAc transferase 2, isoform A [Drosophila
           melanogaster]
 gi|194190033|gb|EDX03609.1| GD22759 [Drosophila simulans]
 gi|318068299|gb|AAS64620.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
           melanogaster]
          Length = 633

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 449 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 508

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +     G++ H + C+
Sbjct: 509 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 562


>gi|34042922|gb|AAQ56700.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 615

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 431 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 490

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +     G++ H + C+
Sbjct: 491 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 544


>gi|195342262|ref|XP_002037720.1| GM18147 [Drosophila sechellia]
 gi|194132570|gb|EDW54138.1| GM18147 [Drosophila sechellia]
          Length = 606

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 422 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 481

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +     G++ H + C+
Sbjct: 482 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 535


>gi|33589464|gb|AAQ22499.1| RE02655p [Drosophila melanogaster]
          Length = 633

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 449 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 508

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +     G++ H + C+
Sbjct: 509 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 562


>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Strongylocentrotus purpuratus]
          Length = 749

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 40/114 (35%), Positives = 62/114 (54%), Gaps = 6/114 (5%)

Query: 73  PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
           P  M  + GD+S+++ LR++L C DF W++ +V   +  K P+       G+ +N  T  
Sbjct: 579 PQLMGQEYGDVSDRIKLREELKCHDFQWYLDNVYPAL--KVPDTKVRAR-GDVRNAATSM 635

Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEML 186
           CLDSMG+    V+G   CHG G++Q F L  + QL H  +C+   K    + ML
Sbjct: 636 CLDSMGK---GVLGMFPCHGEGNNQAFTLTWDDQLKHKNKCLPKIKTPGQLRML 686


>gi|405975887|gb|EKC40420.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 653

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 46/140 (32%), Positives = 65/140 (46%), Gaps = 9/140 (6%)

Query: 54  LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +R    W+  +K I    L   L    D GD+SE+  LR++L CK F W++K +  D+  
Sbjct: 491 VRTAEVWMDEYKHIYYERLNYDLG---DYGDVSERKDLRNRLGCKSFGWYLKTMLPDM-- 545

Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           K PE    L+ GE +N+    CLD+MG TA        CH  G +Q FR    G +    
Sbjct: 546 KLPETA--LYSGEVRNMEKGMCLDTMGTTAGNKFQAIPCHHQGGNQFFRFTVNGHIERDS 603

Query: 172 RCVDADKQEKLVEMLKLPVT 191
            C+       L  + K  VT
Sbjct: 604 ACLSDQDGSLLYVLCKRNVT 623


>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Oryctolagus cuniculus]
          Length = 622

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 45/123 (36%), Positives = 69/123 (56%), Gaps = 10/123 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++LHC +FSWF+ +V  +
Sbjct: 428 QVRLAEVWMDNYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWFLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLG  +CLD     R    +I  S CHG G +Q F    +  
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGLGQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQKD 543

Query: 167 LGH 169
           L H
Sbjct: 544 LRH 546


>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
 gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
          Length = 650

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 50/193 (25%), Positives = 84/193 (43%), Gaps = 44/193 (22%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKCLD 135
           GD+S++  LR+ L CK F W++ ++       +PEL     ++  GE +NLG   + CLD
Sbjct: 485 GDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTCLD 537

Query: 136 SMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
           +    +     +GT  CH  G +Q + L+  G++   + C+D                  
Sbjct: 538 APAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY----------------- 580

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA 253
                       G  + LF C    G  +  W +  +T+QL H    +CL++    ++L 
Sbjct: 581 -----------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLL 627

Query: 254 MLRCDENNSYQQW 266
           M  C  + S QQW
Sbjct: 628 MEECSASLSRQQW 640


>gi|431918071|gb|ELK17299.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Pteropus alecto]
          Length = 582

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 55/217 (25%), Positives = 91/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  F +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 376 WMDEFAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 435

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      +++    C  G G +     Q+F       +  G  
Sbjct: 436 AAAWGEIRNVGTGLCTDTKHGAVGSLLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 493

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    R
Sbjct: 494 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-R 532

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  CD ++  QQW F+
Sbjct: 533 TLYHPVSSSCMDCSESDHRIFMNACDPSSPTQQWLFE 569


>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oreochromis niloticus]
          Length = 600

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 90/213 (42%), Gaps = 22/213 (10%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR +L+CK F WFM  VA+D+   +P + PP
Sbjct: 392 WMDEYAEYVYQRRPEYRHLSAGDMTAQKELRTRLNCKSFKWFMNEVAWDLPKHYPPVEPP 451

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG--HGERCVDA 176
              WGE +N+G+  C++     + + I    C          +   G++G  HG+     
Sbjct: 452 AAAWGEIQNVGSGMCMEVKHFVSGSPIRLENC----------VKGRGEVGWSHGQVLTFG 501

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
            +++  V         C  A          + + L++C    G  +  WR+    + L H
Sbjct: 502 WREDIRVGDPMHTRKLCFDA------VSHSSPVTLYDCHGMKG--NQLWRYRQD-KSLYH 552

Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            +   C+   P   ++ M  CD  +  QQW F+
Sbjct: 553 PVSNSCIDSSPIERRVFMNTCDPASLSQQWLFE 585


>gi|195386226|ref|XP_002051805.1| GJ10330 [Drosophila virilis]
 gi|194148262|gb|EDW63960.1| GJ10330 [Drosophila virilis]
          Length = 631

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 447 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEP 506

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T +CLD+MG      +G   CH +G +Q +  +  G++ H + C+
Sbjct: 507 QEV-----GQFRQ-DTTECLDTMGHVIDGTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCL 560


>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
 gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
          Length = 684

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 57/213 (26%), Positives = 97/213 (45%), Gaps = 40/213 (18%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
           D GD+S++  LR+ L CK F W++ ++       +PEL     ++  GE +NLG   + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD+    +     +GT  CH  G +Q+        + HG  C+DA  +EK  E   + + 
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQI------ANMQHG-MCLDA--KEKSEEETPVSIY 586

Query: 192 FCVLAGVK--LIFCRLGTV-------------LNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
            C   G     +  + G +             + LF C    G  +  W +  +T+QL H
Sbjct: 587 ECHGQGGNQYWMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKG--NQFWTYRENTKQLHH 644

Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               +CL++    ++L M  C+++ S QQW  +
Sbjct: 645 GTSGKCLAISESKDKLLMEECNQSLSRQQWTLE 677


>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
           caballus]
          Length = 622

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 101/224 (45%), Gaps = 38/224 (16%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC +FSWF++++  +
Sbjct: 428 QVRLAEVWMDGYKEIFYRRNMQAAKMAQEKSFGDISERLQLRERLHCHNFSWFLQNIYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F+G  KNLG   CLD +G       P ++ T  CHG G +Q F    +
Sbjct: 488 MFV--PDLKPT-FYGAIKNLGIDHCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 541

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
             L H                  +    C+ A        LG     F   NS    D  
Sbjct: 542 RDLRHN-----------------IAKQLCLHASAGT----LGLRSCHFTGKNSQVPKDEE 580

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           W      + + ++    CL+   E  + AM  C+ ++ +Q W F
Sbjct: 581 WEL-TQDQLIRNSGSGTCLTS--EDKKPAMASCNPSDPHQHWLF 621


>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
 gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
          Length = 693

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 92/207 (44%), Gaps = 35/207 (16%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLGT--QKC 133
           D GD+SE+  LR+ L CK F W++ ++       +PEL  P +    GE +NLG   + C
Sbjct: 493 DYGDVSERKKLREDLQCKSFKWYLDNI-------YPELFIPGDAVANGEIRNLGYGGRTC 545

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV- 190
           LDS    R     +G   CH  G +Q+  +N        + CVDA K      +   P  
Sbjct: 546 LDSPTGKRNMKKPVGLYPCHKQGGNQIKSINT-------DMCVDAPKTGDESPVGVYPCH 598

Query: 191 -----TFCVLAGVKLI------FCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
                 + +L+    I          G  + LF C  S G  +  W +H +T+ L H   
Sbjct: 599 GQGGHQYWMLSKAGEIRRDQSCLDYAGKDVILFGCHGSKG--NQFWTYHENTKLLHHGSS 656

Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQW 266
            +CL+++   ++L M  CD ++  Q W
Sbjct: 657 GKCLAINEGKDKLIMEECDASHLRQHW 683


>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
 gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
          Length = 575

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 55/216 (25%), Positives = 90/216 (41%), Gaps = 31/216 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A     GDI E+V LR KL C+ F W++++V  ++  K P
Sbjct: 387 RAVEVWMDDYKRYYYAAVPYAKNTPYGDIEERVELRRKLRCRPFKWYVQNVYPEL--KLP 444

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
                  +GE K  G Q C+D++G      IG   CHG+G +Q++ L     L H   C+
Sbjct: 445 SDESTKSFGEIKQ-GNQ-CVDTLGHMRGQTIGLFECHGAGGNQMWSLTKSSLLKHETMCL 502

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
             +  +                 V+L+ C     +               W +  +T +L
Sbjct: 503 GVNDGK-------------ATEPVQLLDCDENNSMQH-------------WEYEKATSRL 536

Query: 235 EHAILKRCLSVHP-ETNQLAMLRCDENNSYQQWRFK 269
            H     CLS    +T+ L + +C+ +   Q W F+
Sbjct: 537 RHKPTSLCLSSDKHKTSGLTLEQCNGSAFSQHWAFE 572


>gi|198474621|ref|XP_001356764.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
 gi|198138471|gb|EAL33829.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
          Length = 639

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 455 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 514

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +  +  G++ H + C+
Sbjct: 515 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCL 568


>gi|195148230|ref|XP_002015077.1| GL19517 [Drosophila persimilis]
 gi|194107030|gb|EDW29073.1| GL19517 [Drosophila persimilis]
          Length = 638

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 454 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 513

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +  +  G++ H + C+
Sbjct: 514 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCL 567


>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Ailuropoda melanoleuca]
          Length = 622

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 14/126 (11%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC++FSWF+ ++  +
Sbjct: 428 QVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFLTNIYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F+G  +NLG  +CLD +G       P ++ T  CHG G +Q F     
Sbjct: 488 MFV--PDLKPT-FYGAIRNLGINQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTR 541

Query: 165 GQLGHG 170
             L H 
Sbjct: 542 RDLRHN 547


>gi|395834931|ref|XP_003790440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           1 [Otolemur garnettii]
 gi|395834933|ref|XP_003790441.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
           2 [Otolemur garnettii]
          Length = 622

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 68/126 (53%), Gaps = 14/126 (11%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K I        + +A     GDISE++ LR++LHC++FSWF+ +V  +
Sbjct: 428 QVRLAEVWMDSYKMIFYRRNQQAAKMAQEKSFGDISERLQLRERLHCRNFSWFLNNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F+G  KNLG  +CLD +G       P ++ +  CHG G +Q F    +
Sbjct: 488 MFV--PDLMPT-FYGAIKNLGINQCLD-VGENNHGEKPLIMYS--CHGLGGNQYFEYTTQ 541

Query: 165 GQLGHG 170
             L H 
Sbjct: 542 RDLRHN 547


>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
          Length = 621

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 14/126 (11%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC++FSWF+ ++  +
Sbjct: 428 QVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFLTNIYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F+G  +NLG  +CLD +G       P ++ T  CHG G +Q F     
Sbjct: 488 MFV--PDLKPT-FYGAIRNLGINQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTR 541

Query: 165 GQLGHG 170
             L H 
Sbjct: 542 RDLRHN 547


>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Takifugu rubripes]
          Length = 552

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 50/213 (23%), Positives = 93/213 (43%), Gaps = 22/213 (10%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR +L+CK F WFM  VA+D+   +P + PP
Sbjct: 344 WMDEYAEYIYQRRPEYRHLAAGDMAVQKDLRSQLNCKSFKWFMTKVAWDLPKHYPPVEPP 403

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
              WGE +N+ +  CL++    + + +    C         +   EG   HG+      +
Sbjct: 404 AAAWGEIRNVASGMCLETKHFASGSPVRMESC--------LKGRGEGGWSHGQVFTFGWR 455

Query: 179 QEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
           ++  V   M    V F  ++          + + L++C    G  +  W +    + L H
Sbjct: 456 EDIRVGDPMHTKKVCFDAVSN--------NSPVTLYDCHGMKG--NQFWHYRED-KSLYH 504

Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            +   C+  +P+  ++ M  C+ ++  QQW F+
Sbjct: 505 PVSNSCVDSNPKEGRVFMNTCNPSSPSQQWAFE 537


>gi|157107416|ref|XP_001649767.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884053|gb|EAT48278.1| AAEL000654-PA [Aedes aegypti]
          Length = 607

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 48/201 (23%), Positives = 93/201 (46%), Gaps = 34/201 (16%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
           D GD+S+Q+A+R+KL CK F WFM++VA+D+  K+P + PP+   G  +++     C+D+
Sbjct: 416 DAGDLSKQLAIREKLQCKPFKWFMENVAFDLVEKYPPIEPPDFANGAIQSVANPALCVDT 475

Query: 137 MGRTAPAVIGTSYCHGSG----SSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
           +       +G   C        ++Q F+L  + + ++  GE C D  +     ++L    
Sbjct: 476 LNHGEKQTVGLFSCASDKVQPQANQYFQLSWHRDLRIKFGEMCWDVSESVPNAKIL---- 531

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
                               L++C    G  +  W +   ++ L+     RCL +     
Sbjct: 532 --------------------LYHCHGGQG--NQLWSYDVESQMLKQGKNNRCLDMDSSRR 569

Query: 251 QLAMLRCDENNSYQQWRFKEV 271
           ++ +  C ++N  Q+W++  V
Sbjct: 570 EVFVNPCSDDNPNQKWKWGYV 590


>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
 gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 3; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
 gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
 gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
          Length = 612

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 46/162 (28%), Positives = 78/162 (48%), Gaps = 13/162 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  ++ GD+SE+  LR+ L CK F W+++++       +P
Sbjct: 422 RTAEVWMDEYKAFFYKMVPAARNVEAGDVSERKKLRETLQCKSFKWYLENI-------YP 474

Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P P  F   G   N  T+KC+D+ G+      G   CHG+G +Q + L  +G++   +
Sbjct: 475 EAPLPADFRSLGAIVNRFTEKCVDTNGKKDGQAPGIQACHGAGGNQAWSLTGKGEIRSDD 534

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFC---RLGTVLN 210
            C+ +    ++   LKL         VK +F    + GT+L+
Sbjct: 535 LCLSSGHVYQIGSELKLERCSVSKINVKHVFVFDDQAGTLLH 576


>gi|34528626|dbj|BAC85542.1| unnamed protein product [Homo sapiens]
          Length = 213

 Score = 70.9 bits (172), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/98 (36%), Positives = 57/98 (58%), Gaps = 4/98 (4%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  FK    + SP  + +D GD+S +  LR+ L CK FSW+++++  D  ++ P    +
Sbjct: 99  WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 156

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
           L  GE +N+ T +CLD+MGR     +G   CHG G +Q
Sbjct: 157 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQ 192


>gi|224613432|gb|ACN60295.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Salmo salar]
          Length = 185

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 95/212 (44%), Gaps = 36/212 (16%)

Query: 63  VFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW 122
           ++ +  S  + +A     GDISE+V LR++L CK+F+W++  +  +V+   P+L P  F 
Sbjct: 3   IYYRRNSNAAKMAKEKGFGDISERVNLRERLQCKNFTWYLNTIYPEVFV--PDLTPTKF- 59

Query: 123 GEAKNLGTQKCLDSMGRT---APAVIGTSYCHGSGSSQLFRLNAEGQLGHG---ERCVDA 176
           G  KN G Q CLD   +     P ++ T  CH  G +Q F   +  +L H    + C+ A
Sbjct: 60  GAIKNSGAQSCLDVGEKNEGGKPLIMYT--CHNMGGNQYFEYTSHKELRHNIGKQLCLQA 117

Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
           + Q   V+     +  C L G       LGT          S      W F      L++
Sbjct: 118 NPQPDQVK-----IELCTLKG-------LGT----------SVAPQQEWIF-TEENLLKN 154

Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            + ++CL +  +  Q+ M  C   +  Q W F
Sbjct: 155 PLSRKCLLL--KGGQIVMDDCKAADLNQHWAF 184


>gi|326434666|gb|EGD80236.1| polypeptide N-acetylgalactosaminyltransferase 13 [Salpingoeca sp.
           ATCC 50818]
          Length = 641

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 55/218 (25%), Positives = 91/218 (41%), Gaps = 37/218 (16%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +KQ      P    +D+GD++++ ALR++L CK F W++KHV  D++   
Sbjct: 453 MRLAEVWMDDYKQFFYDTKPKRENIDIGDLTKRKALRERLKCKPFKWYLKHVLPDLFVPD 512

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ--LFRLNAEGQLGHGE 171
            E   ++    A   G   CLD MG  A    G   CHG G +Q  ++ +N E       
Sbjct: 513 SE---HVLHKGALRAGNGLCLDKMGHRAGGQAGVFSCHGEGGNQGWMYTVNDE------- 562

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
                         ++   + C    + +   +    ++L  C    G  +  W++ N+ 
Sbjct: 563 --------------IRTADSLC----LDVYSSKFPAPIHLQRCHQKQG--NQAWKYENNV 602

Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
               H     CLS     N   +L +  C E +S QQW
Sbjct: 603 --FRHVASNGCLSSVVSGNGQHELKLDTCKEGSSGQQW 638


>gi|312065523|ref|XP_003135832.1| glycosyl transferase [Loa loa]
 gi|307769015|gb|EFO28249.1| glycosyl transferase [Loa loa]
          Length = 614

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 60/251 (23%), Positives = 101/251 (40%), Gaps = 35/251 (13%)

Query: 30  QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
           ++GH  RC  V      I    S    R    W+  + +      P  + +D GD+S+Q 
Sbjct: 378 RVGHIYRCKYVPFPDPGIGDFISKNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSKQK 437

Query: 88  ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
           A+R +L CK F WFMK +A+D    +P + P +   GE +N+   KC+D+  + A    G
Sbjct: 438 AVRKRLKCKSFDWFMKEIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTHFKNAEEKFG 497

Query: 147 TSYCHGSGSS----QLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKL 200
              C   G +    Q  RL    + +L     C D       V   + P+          
Sbjct: 498 LRKCVSDGPANDGEQNLRLTFWRDVRLHTRTMCFDVS-----VSGNRAPIV--------- 543

Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
                     LF+C    G  +  +++    +Q+ H +   C+    E  ++ M  CD  
Sbjct: 544 ----------LFSCHGMQG--NQHFKYLPDKQQMYHPVSDLCMDCDVERGEIFMNPCDSK 591

Query: 261 NSYQQWRFKEV 271
              Q+W + ++
Sbjct: 592 RWSQKWTWAKI 602


>gi|148696679|gb|EDL28626.1| mCG6366 [Mus musculus]
          Length = 431

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 61/245 (24%), Positives = 89/245 (36%), Gaps = 57/245 (23%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 202 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 261

Query: 115 ELPP----------------NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS--- 155
            + P                ++F  + +++    C+DS        +    C   GS   
Sbjct: 262 PVEPLPAAWGLSGIVTQPVLDVFLIQIRSVAANLCVDSKHGATGTELRLDVCVKDGSERT 321

Query: 156 ---SQLFRLNAEGQLGHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLG 206
               QLF       +  GE       C+DA            PVT               
Sbjct: 322 WSHEQLFTFGWREDIRPGEPLHTRKFCLDAISHSS-------PVT--------------- 359

Query: 207 TVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
               L++C    G     +R     R L H +   C+  +P   ++ M RCD  +  QQW
Sbjct: 360 ----LYDCHGMKGNQLWGYR---EDRTLFHPVSNSCMDCNPSEKKIFMARCDPLSETQQW 412

Query: 267 RFKEV 271
            F+ +
Sbjct: 413 IFEHI 417


>gi|312370886|gb|EFR19191.1| hypothetical protein AND_22918 [Anopheles darlingi]
          Length = 1204

 Score = 70.9 bits (172), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 54/200 (27%), Positives = 85/200 (42%), Gaps = 47/200 (23%)

Query: 79   DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNL--------FWGEAKNLGT 130
            D GD++E+  LR  LHCK F W+++       T FPEL P L        F  EA ++  
Sbjct: 1029 DFGDVTERKQLRRALHCKSFRWYLE-------TVFPELAPALDKRPGHGRFENEALSMEG 1081

Query: 131  QKCLDSMGRTAPAVIGTSYCH-GSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLP 189
            Q       +++  +     C  GS + Q +  N  G+L +  RC+D D            
Sbjct: 1082 QPKHCLTAQSSAGLPTMEPCQAGSDARQHWLHNLFGELSNENRCLDYD------------ 1129

Query: 190  VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILK-RCLSVHPE 248
                            G+ L ++ C  + G  +  WR++ +T Q EH     +CL+V P 
Sbjct: 1130 ----------------GSALRVYACHKARGNQE--WRYNGTTYQFEHCKHSAKCLAVEPT 1171

Query: 249  TNQLAMLRCDENNSYQQWRF 268
            T +L +  CD     Q+W +
Sbjct: 1172 TKRLRIDGCDAAQESQRWHY 1191



 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 46/173 (26%), Positives = 71/173 (41%), Gaps = 53/173 (30%)

Query: 80  MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPPN--------LFWGEAKNLGT 130
            G+++E+  LR +L CK F W+++       T FPE   P+         F  +AK   T
Sbjct: 407 FGNLTERHELRQRLACKPFRWYLE-------TVFPEQFDPSKAVARGEIRFADDAK--AT 457

Query: 131 QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV 190
             CLD      P+++    CHG G  QL+ L A+G++   + C+D D             
Sbjct: 458 PLCLD-----WPSLLSLVTCHGYGGHQLWYLTAKGEVTREDHCLDYD------------- 499

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
                          G +L++  C    G  +  W +   TRQL+     RCL
Sbjct: 500 ---------------GELLSVVRCHGLGG--NQRWVWLPDTRQLKKLTYDRCL 535



 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 1/66 (1%)

Query: 6   PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQK-SSLELRKRSSWLKVF 64
           P+++    CHG G  QL+ L A+G++   + C+D D +++S  +   L   +R  WL   
Sbjct: 464 PSLLSLVTCHGYGGHQLWYLTAKGEVTREDHCLDYDGELLSVVRCHGLGGNQRWVWLPDT 523

Query: 65  KQIRSL 70
           +Q++ L
Sbjct: 524 RQLKKL 529


>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 2 [Bombus impatiens]
          Length = 637

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 92/211 (43%), Gaps = 31/211 (14%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
           GD+SE+ ALR KL CK F W++ +V  +++     +      GE +NLG     CLDS  
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492

Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD-ADKQEKLVEMLKLPVTFC 193
           R A    PA  G   CH  G +Q+  L +         C+D A K E L + + L     
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSAGKPEDLHQPVGLYPCHR 543

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
                  +  + G +    +C + SGT            +  W +++ T+Q+ H    +C
Sbjct: 544 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNHQTKQIRHGSSDKC 603

Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           L++     +L M  C    + Q+W F+   P
Sbjct: 604 LAITESKQRLIMEECSAAAARQRWSFENYDP 634


>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 3 [Bombus terrestris]
          Length = 637

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 92/211 (43%), Gaps = 31/211 (14%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
           GD+SE+ ALR KL CK F W++ +V  +++     +      GE +NLG     CLDS  
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492

Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD-ADKQEKLVEMLKLPVTFC 193
           R A    PA  G   CH  G +Q+  L +         C+D A K E L + + L     
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSAGKPEDLHQPVGLYPCHR 543

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
                  +  + G +    +C + SGT            +  W +++ T+Q+ H    +C
Sbjct: 544 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNHQTKQIRHGSSDKC 603

Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           L++     +L M  C    + Q+W F+   P
Sbjct: 604 LAITESKQRLIMEECSAAAARQRWSFENYDP 634


>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
           ricinus]
          Length = 582

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 10/131 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K+     +P A  +D GD+S +  LR KL C  F W+++++       +PE  +P
Sbjct: 396 WLDEWKEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENI-------YPESHMP 448

Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
            + +  GE K+  +  CLD+ GR +   +  S CHG   +Q+F      Q+   + C+DA
Sbjct: 449 LDYYHLGEIKHADSPVCLDTFGRKSGENVAVSTCHGXXXNQVFAYTKRQQIMSDDNCLDA 508

Query: 177 DKQEKLVEMLK 187
                 V++L+
Sbjct: 509 SSPRGPVKLLR 519


>gi|242024227|ref|XP_002432530.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212517982|gb|EEB19792.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 603

 Score = 70.5 bits (171), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 93/217 (42%), Gaps = 37/217 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      PLA  +  G+I +++ L+ KLHCK F W++++V       +P
Sbjct: 415 RAAEVWMDDYKKYYYAAVPLAKSIPFGNIDDRLELKRKLHCKSFKWYLENV-------YP 467

Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P +           Q CLD++G +    +G   CH +G +Q + +  +  + H + 
Sbjct: 468 ELSIPHSTSPAFGSIRQRQLCLDTLGHSIEQTVGLYVCHDTGGNQEWGMEDDSYIKHHDL 527

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C            L +P     + G  L+  RL        C ++    +  WR  N+  
Sbjct: 528 C------------LTIP---NYVPGA-LVLMRL--------CEDAD---NQKWRLVNNGG 560

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRF 268
            ++H     CL     T + L + +CD  +  QQW F
Sbjct: 561 LVKHETYSLCLDSRDATTRGLIVEKCDSRSPTQQWNF 597


>gi|66507571|ref|XP_394527.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Apis mellifera]
 gi|380015445|ref|XP_003691712.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Apis florea]
          Length = 571

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 61/217 (28%), Positives = 93/217 (42%), Gaps = 38/217 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ KLHCK FSW++K+V       +P
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNV-------YP 437

Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P    G   +L     CLDSMG      +G   CH +G +Q + L  +G + H   
Sbjct: 438 ELVIPTSEGGPGGSLKQGSACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIRHHGL 497

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C            L LPV           + + GT L +  C    G+ +  WR H    
Sbjct: 498 C------------LTLPV-----------YAK-GTTLLMQIC---DGSENQKWR-HLEGG 529

Query: 233 QLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
            + H+ +  C+ S +     +   +CD N   Q+W  
Sbjct: 530 LIRHSRIPVCVDSRYHAQKGITAEKCDSNAETQRWHL 566


>gi|410910894|ref|XP_003968925.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Takifugu rubripes]
          Length = 577

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 65/133 (48%), Gaps = 12/133 (9%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K+I    +P A     GD++E+  LR+KL CK F WF+ +V  D++   
Sbjct: 375 VRAAEVWMDEYKEIYYHRNPHARLEAYGDVTERRKLREKLGCKGFRWFLDNVYPDLH--I 432

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  KN G T  C D       AV+G       CHG G +Q F  + EG+L 
Sbjct: 433 PEDKPGRF-GMLKNRGKTNHCFDYNPTDENAVVGERVIVYPCHGMGQNQFFEYSMEGELR 491

Query: 169 HGER----CVDAD 177
           +  R    C+  D
Sbjct: 492 YNTRSPAGCIVGD 504


>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 637

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 95/221 (42%), Gaps = 46/221 (20%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
           WL  +K       P    +D GD+SE++ALR++L CKDF W+++++       +PE  +P
Sbjct: 449 WLDKYKDFVYAIMPELKNVDAGDVSERLALRERLQCKDFRWYLQNI-------YPESSMP 501

Query: 118 PNLFWGEAKNLGTQKCLDSMGRTAPAVI----GTSYCHGSGSSQLFRLNAEGQLGHGERC 173
            +     A       C DS+G  +   +    G   CH  G +Q+   +  G+L   + C
Sbjct: 502 VDFHHVGALRNQDHGCADSLGYDSENGVNQNAGIFPCHNQGGNQIVVFSKSGELKFDDLC 561

Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
           ++  K                 + VKL  C             + G     W ++  ++Q
Sbjct: 562 MEGSKN----------------SAVKLQKC-------------TEGNQKQVWEYNKESKQ 592

Query: 234 LEHAILKRCLSVHP--ETNQLAMLRCDE--NNSYQQWRFKE 270
           ++H    +CL+V P  +  +L +  C +  + + QQW  ++
Sbjct: 593 MKHLYSSQCLTVEPTKDDGKLRLSSCADGGHQTNQQWHLEQ 633


>gi|340712798|ref|XP_003394942.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Bombus terrestris]
          Length = 571

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 40/120 (33%), Positives = 62/120 (51%), Gaps = 5/120 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++ L+ KLHCK FSW++K+V  ++     
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTS 444

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E  P    G +   GT  CLDSMG      +G   CH +G +Q + L  +G + H + C+
Sbjct: 445 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLCL 499


>gi|195032291|ref|XP_001988471.1| GH11183 [Drosophila grimshawi]
 gi|193904471|gb|EDW03338.1| GH11183 [Drosophila grimshawi]
          Length = 640

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I +++AL++KLHCK F W++++V  D+    P
Sbjct: 456 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 515

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           +       G+ +   T+ CLD+MG      +G   CH +G +Q +  +  G++ H + C+
Sbjct: 516 QEV-----GQFRQDMTE-CLDTMGHLVDGTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCL 569


>gi|118404262|ref|NP_001072444.1| polypeptide N-acetylgalactosaminyltransferase 10 [Xenopus
           (Silurana) tropicalis]
 gi|113197915|gb|AAI21701.1| GalNAc transferase 10 [Xenopus (Silurana) tropicalis]
          Length = 603

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 56/226 (24%), Positives = 92/226 (40%), Gaps = 44/226 (19%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP--ELP 117
           W+  + +      P    L +GD++ Q  LR KL C+DF WFM  +A+D+   +P  ELP
Sbjct: 394 WMDEYAEYIYQRRPDYRHLSVGDVAAQKELRRKLQCRDFKWFMNEIAWDLPKYYPPVELP 453

Query: 118 PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS------GSSQLFRLNAEGQLGHG- 170
           P   WGE +N+    C+     T+   I    C+        G+ Q+F L     +  G 
Sbjct: 454 PAA-WGEIRNVRMDMCVGVKHVTSGTEIRLEACNKGRNSDTWGARQVFTLGWREDIRPGI 512

Query: 171 -----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
                + C DA  Q         PVT                   LF+C    G  +  W
Sbjct: 513 PQHTMKSCFDAVSQTS-------PVT-------------------LFDCHGMKG--NQLW 544

Query: 226 RFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
           ++    + + H     C+  +    ++ M +C+ ++  QQW F+ +
Sbjct: 545 KYRRD-KTVYHPTSNSCMDCNEGEYKIFMNKCNPSSPTQQWTFEHI 589


>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 576

 Score = 70.1 bits (170), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 91/239 (38%), Gaps = 84/239 (35%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLG------ 129
           D GD+S + ALR KL CK F W++ +V       +PEL  P +    GE +N G      
Sbjct: 363 DFGDVSSRKALRKKLQCKSFKWYLDNV-------YPELFVPGDAIGKGEIRNKGEVAGDV 415

Query: 130 TQKCLD--------------------------------------SMGRTAPAVIGTSYCH 151
            Q CLD                                      S GR   A +G  +CH
Sbjct: 416 VQHCLDSEVGEDIQKVVIAYPCHKSGGNQIRNRGGRSKNCLDWASHGRQRSANVGLYWCH 475

Query: 152 GSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNL 211
             G +Q + L+ +G++   E C+D                    AG  ++         +
Sbjct: 476 KKGGNQYWMLSKDGEIRRDESCID-------------------YAGADVM---------V 507

Query: 212 FNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           + C    G  +  W++     Q+ H I  +CL +  + ++L M  CD+ N YQ W  +E
Sbjct: 508 YPCHGMKGNQE--WKYLPDESQILHVITSKCLEMSRDGSKLMMNVCDQANPYQHWLIQE 564


>gi|269115411|gb|ACZ26277.1| N-acetyl galactosaminyl transferase-like protein [Mayetiola
           destructor]
          Length = 638

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 53/202 (26%), Positives = 91/202 (45%), Gaps = 35/202 (17%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK--CLD 135
           +D GD+++Q A+R+KL CK F WFMK +A+D+  K+P + P  F   A    T    C+D
Sbjct: 445 IDAGDLTKQKAIREKLKCKSFDWFMKEIAFDLPKKYPPIEPADFASGAIQSITNPNICID 504

Query: 136 SMGRTAPAVIGTSYCHGS----GSSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLP 189
           +M R     +G   C  +     ++Q F L  + + ++     C D      +      P
Sbjct: 505 TMNRGKGQDVGLFSCADNLQHPQANQFFVLTWHKDIRMFGSTMCWDLPNSYPMS-----P 559

Query: 190 VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET 249
           + F +                   C    G  +  W+++  T+Q+ H     C+  + +T
Sbjct: 560 ILFHI-------------------CHGQQG--NQFWKYNLKTKQITHD-KNVCVECNAKT 597

Query: 250 NQLAMLRCDENNSYQQWRFKEV 271
            +L + +CD NN  Q W+F +V
Sbjct: 598 MKLYVNKCDPNNPNQLWKFGKV 619


>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 12 [Taeniopygia
           guttata]
          Length = 594

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 94/226 (41%), Gaps = 34/226 (15%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +KQ+    +P A     GD++E+  LR+KL CKDF WF+++V  +++   
Sbjct: 392 VRAAEVWMDEYKQLYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLENVYPELHV-- 449

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  KN G +  C D        + G       CHG G +Q F   +  ++ 
Sbjct: 450 PEDRPGFF-GMLKNRGMENFCFDYNPTNEHQITGQRVILYPCHGMGQNQFFEYTSHNEIR 508

Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF- 227
           +  R  +              V   V +G           L ++ C  ++ +V    +F 
Sbjct: 509 YNTRQPE--------------VCAAVDSGTDY--------LTMYLCQENAHSVPENQKFI 546

Query: 228 HNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWRFKE 270
                 L H   ++C+         N   +LR   N+ YQ+W FKE
Sbjct: 547 FREDGALFHPQTEKCIQAEANAYNGNPAPLLRPCSNSDYQKWFFKE 592


>gi|354505207|ref|XP_003514663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like,
           partial [Cricetulus griseus]
          Length = 233

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 69/123 (56%), Gaps = 10/123 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +    + GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 39  QVRLAEVWMDSYKKIFYRRNLQAAKIVQENNFGDISERLQLREQLHCHNFSWYLHNVYPE 98

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 99  MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 154

Query: 167 LGH 169
           L H
Sbjct: 155 LRH 157


>gi|195327857|ref|XP_002030634.1| GM25555 [Drosophila sechellia]
 gi|194119577|gb|EDW41620.1| GM25555 [Drosophila sechellia]
          Length = 590

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 108/257 (42%), Gaps = 43/257 (16%)

Query: 21  QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
            LFR +   Q+ +  +  +++K++IS        R    WL  +K       P    + +
Sbjct: 350 HLFR-DGNFQVRYTNKDKNSEKKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD--- 135
           G+++EQ AL+++LHCK F WF+ ++A D    +P L P  +  G  +++ + K CLD   
Sbjct: 405 GNLAEQRALKNRLHCKPFKWFLDNLATDFLHLYPILDPAEYASGVLQSIASPKLCLDRKD 464

Query: 136 -SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
            S G+   A   + +   S   Q + L    +L  G  C++                   
Sbjct: 465 PSHGQPKLAPCSSDHVFPS-PQQYWSLTNHRELRSGFYCLEVRNH--------------- 508

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQ 251
                      G  ++++ C   SG  +  W F + T Q+   +    + CL   PE N 
Sbjct: 509 -----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQQQNSRHCLEAQPELNA 555

Query: 252 LAMLRCDENNSYQQWRF 268
           +    CD  N  QQW+F
Sbjct: 556 VTSSVCDPKNHKQQWKF 572


>gi|313226886|emb|CBY22031.1| unnamed protein product [Oikopleura dioica]
          Length = 685

 Score = 69.3 bits (168), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 48/124 (38%), Positives = 62/124 (50%), Gaps = 17/124 (13%)

Query: 58  SSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAY---DVYTKF 113
             W K F   R    P    L  GDISEQV  R   HC  +F+W M+ +AY   D+Y   
Sbjct: 496 DDWTKYFYWNR----PELQKLPYGDISEQVKFRHD-HCPYNFTWMMEEIAYGITDMYDAP 550

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGR---TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
            EL   L WGE +  G+  C+DSMG      PA     YCH  G +QLFR++A G +   
Sbjct: 551 NEL---LAWGEIRGKGSNICVDSMGHKDNDGPAE--AWYCHKQGGNQLFRIHAGGFIAQN 605

Query: 171 ERCV 174
            +C+
Sbjct: 606 LQCI 609


>gi|21358257|ref|NP_648800.1| polypeptide GalNAc transferase 8 [Drosophila melanogaster]
 gi|51316119|sp|Q9VUT6.1|GALT8_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 8;
           Short=pp-GaNTase 8; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 8; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 8
 gi|7294236|gb|AAF49587.1| polypeptide GalNAc transferase 8 [Drosophila melanogaster]
 gi|17945053|gb|AAL48588.1| RE06471p [Drosophila melanogaster]
 gi|220947814|gb|ACL86450.1| pgant8-PA [synthetic construct]
          Length = 590

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 62/257 (24%), Positives = 107/257 (41%), Gaps = 43/257 (16%)

Query: 21  QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
            LFR +   Q+ +  +  +++K++IS        R    WL  +K       P    + +
Sbjct: 350 HLFR-DGNFQIRYTNKDKNSEKKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD--- 135
           G+++EQ  L+++LHCK F WF+ ++A D    +P L P  +  G  +++ + K CLD   
Sbjct: 405 GNLAEQRDLKNRLHCKPFKWFLDNLATDFLNLYPILDPAEYASGVLQSISSPKLCLDRKD 464

Query: 136 -SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
            S G+   A   + +   S   Q + L    +L  G  C++                   
Sbjct: 465 PSHGQPKLAPCSSDHVFPS-PEQYWSLTNHRELRSGFYCLEVRNH--------------- 508

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQ 251
                      G  ++++ C   SG  +  W F + T Q+   +    + CL   PE N 
Sbjct: 509 -----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQQQNFRHCLEAQPELNA 555

Query: 252 LAMLRCDENNSYQQWRF 268
           +    CD  N  QQW+F
Sbjct: 556 VTSSVCDPKNHKQQWKF 572


>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
           castaneum]
 gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
          Length = 564

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 66/136 (48%), Gaps = 13/136 (9%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       PLA  +  GDISE++ LR  L CK F W+++HV       +P
Sbjct: 378 RAAEVWMDDYKHFYYAAVPLAKNIPFGDISERLELRRNLQCKPFKWYLQHV-------YP 430

Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL   +    + ++G  +    CLD+MG      +    CH +G +Q + L + G + H 
Sbjct: 431 ELA--IPQATSAHVGELRQGMYCLDTMGHLIDGTVALYQCHHTGGNQEWGLTSGGLIKHH 488

Query: 171 ERCVDADKQEKLVEML 186
           + C+  D   K V+++
Sbjct: 489 DLCLTLDDYMKGVQVV 504


>gi|357625888|gb|EHJ76177.1| hypothetical protein KGM_07902 [Danaus plexippus]
          Length = 535

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 88/222 (39%), Gaps = 32/222 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + Q      P  + +D GDIS+Q ALR+KL CK F WFM  +A+D+  K+P
Sbjct: 320 RVAEVWMDEYAQYLYKRRPHYLKIDTGDISKQKALREKLQCKPFKWFMTQIAFDLTAKYP 379

Query: 115 ELPPNLFWGEAKNLGTQK--CLDSMGRTAPAVIGTSYCHGSGSS-QLFRLNAEGQLGHGE 171
            + P  F        T    C+D+        +    C  S S+ Q F L+    +    
Sbjct: 380 PVEPKPFAEGRIRPATYPHLCVDAHHGNQMDKLHLKSCTASTSAEQNFMLSWHKDIKSKT 439

Query: 172 R--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
           R  C D         +L             L  C LG    L             WR+H 
Sbjct: 440 RNMCWDLPDSSPRSPIL-------------LYSCHLGGGNQL-------------WRYHP 473

Query: 230 STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            +R+L+H     CL     T  + + +C ++ + Q+W   +V
Sbjct: 474 ESRRLKHGTNDNCLDFEISTRSVFIKQCSDSET-QEWIIDKV 514


>gi|344255628|gb|EGW11732.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Cricetulus
           griseus]
          Length = 386

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 69/123 (56%), Gaps = 10/123 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +    + GDISE++ LR++LHC +FSW++ +V  +
Sbjct: 42  QVRLAEVWMDSYKKIFYRRNLQAAKIVQENNFGDISERLQLREQLHCHNFSWYLHNVYPE 101

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 102 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 157

Query: 167 LGH 169
           L H
Sbjct: 158 LRH 160


>gi|195429102|ref|XP_002062603.1| GK16570 [Drosophila willistoni]
 gi|194158688|gb|EDW73589.1| GK16570 [Drosophila willistoni]
          Length = 679

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 98/220 (44%), Gaps = 38/220 (17%)

Query: 60  WLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-P 117
           W+  +K+ +   G  +   +D GD++ Q A+R KL CK F WFM+ VA+D+   +P + P
Sbjct: 464 WMDEYKKYLYDHGDGIYDRVDAGDLTAQKAIRTKLKCKSFKWFMEEVAFDLMKSYPPIDP 523

Query: 118 PNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER 172
           P+   G  +N+G    C+D+ G      +G   C         +Q F+L+ +  L    +
Sbjct: 524 PDYASGAIQNVGDSSLCVDTHGLRKHNRMGVYSCAEDLQKPQRNQFFQLSWKRDLRQRRK 583

Query: 173 --CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
             C+D       +     PV                    L++C    G     + F++ 
Sbjct: 584 KDCLDVQ-----IWDANAPVW-------------------LWDCHGQQGNQ---YWFYDY 616

Query: 231 TRQL-EHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +QL  H    +RCL + P +  L + +C E+NSY +W F
Sbjct: 617 RKQLIRHGKEGRRCLELLPFSKTLVVNQCSESNSYMKWNF 656


>gi|332019618|gb|EGI60096.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
           [Acromyrmex echinatior]
          Length = 566

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 92/214 (42%), Gaps = 33/214 (15%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
           + GDISE+ ALR KL CK F W++ +V  +++     +      GE +NLG     CLDS
Sbjct: 364 NYGDISERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 419

Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF-- 192
             R A      G   CH  G +Q+ ++ +         C+D+  +   +E L  PV    
Sbjct: 420 PARKADLHKPCGLYPCHRQGGNQIRQVTS-------GMCIDSSGK---IEDLHQPVGMYP 469

Query: 193 CVLAGVK--LIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAIL 239
           C   G     +  + G +    +C + SG+            +  W ++  T  + H   
Sbjct: 470 CHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWIYNPQTNHIRHGSS 529

Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
            +CL++     QL M  C  N+  Q+W F+   P
Sbjct: 530 DKCLAITESKQQLVMEECSSNSLRQRWSFENYDP 563


>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
 gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
          Length = 536

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 62/130 (47%), Gaps = 9/130 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K+      P    +  G+I+E++ LR KL CK F W++ HV  DV    
Sbjct: 341 VRTAEVWMDGYKEFVYQRQPYMRNIHFGNITERLELRKKLQCKSFKWYLDHVFTDV---- 396

Query: 114 PELPPN---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
             + PN   +  G+ +N  ++ CL+++GR   A +G S C   G + +  L    +L   
Sbjct: 397 --ILPNESAIAKGKVRNPESEMCLNTLGRPKHAFLGLSPCAHEGKTMIISLTVLNELAMD 454

Query: 171 ERCVDADKQE 180
           E C D    +
Sbjct: 455 EVCFDVSDHQ 464


>gi|326922813|ref|XP_003207639.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3-like [Meleagris
           gallopavo]
          Length = 632

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 106/235 (45%), Gaps = 44/235 (18%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GDIS+++ LR +L CK+F+W++
Sbjct: 429 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLNLRQRLQCKNFTWYL 488

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            +V  +VY   P+L P LF G  KN+G   CLD +G       P ++ +  CHG G +Q 
Sbjct: 489 NNVYPEVYV--PDLNP-LFSGYLKNIGNHMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 542

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFN 213
           F  +A  ++ H    E C+ A K          PV    C   G K+             
Sbjct: 543 FEYSAHHEIRHNIQKELCLHASKG---------PVQLRECSYKGQKIF------------ 581

Query: 214 CFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
                   +  W+ H   + L  A L  CL+ + E    +++ C+ ++ +Q+W F
Sbjct: 582 -----AFGEEQWQ-HQKDQTLYSAALHMCLTGNGE--HPSLVSCNPSDPFQKWIF 628


>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Taeniopygia guttata]
          Length = 555

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 95/222 (42%), Gaps = 40/222 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR+ L+CK F WFM  VA+D+   +P + PP
Sbjct: 346 WMDEYAEFIYQRRPEYRHLSAGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPP 405

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
              WGE +N+GT  C+D+              HG+  S L RL         E CV  D+
Sbjct: 406 AAAWGEIRNVGTGLCVDTK-------------HGALGSPL-RL---------ENCV-KDR 441

Query: 179 QEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDGPWRF 227
            E     +++  TF     ++         FC       + + L++C    G  +  WR+
Sbjct: 442 GEAAWNNVQV-FTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQLWRY 498

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               + L H +   C+       ++ M  C+ ++  QQW F+
Sbjct: 499 RKD-KTLYHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 539


>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 624

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 62/228 (27%), Positives = 98/228 (42%), Gaps = 46/228 (20%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC++FSW++ ++  +
Sbjct: 430 QVRLAEVWMDEYKEIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPE 489

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLG  +CLD     R    +I    CH  G +Q F    +  
Sbjct: 490 MFV--PDLKPT-FYGAIKNLGIDQCLDVGENNRGGKPLIMYP-CHSLGGNQYFEYTTQRD 545

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
           L H                  +    C+ A        LG     F   NS    D  W 
Sbjct: 546 LRHN-----------------IAKQLCLHASAGT----LGLRGCQFTVKNSQVPKDEEW- 583

Query: 227 FHNSTRQLEHAILKR------CLSVHPETNQLAMLRCDENNSYQQWRF 268
                 +L   +L R      CL+   +  + AM  CD N+  Q W F
Sbjct: 584 ------ELTQDLLIRNSGSGTCLTA--KDKKPAMASCDPNDPRQHWLF 623


>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
          Length = 622

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 100/226 (44%), Gaps = 42/226 (18%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A   + GD+SE++ LR++LHC +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 488 MFV--PDLNPT-FSGAIKNLGTSQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 543

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
           L H                           G +L     G+ L L NC     NS    D
Sbjct: 544 LRHN-------------------------IGKQLCLHASGSTLGLRNCQFIGKNSEVPKD 578

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             W      + + +     CL+   +  + AM  C+ ++ YQ W F
Sbjct: 579 EEWEL-TQDQLIRNLGSGTCLTS--KDKKPAMAPCNPSDPYQLWLF 621


>gi|157128332|ref|XP_001661405.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108872614|gb|EAT36839.1| AAEL011095-PA [Aedes aegypti]
          Length = 573

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 15/124 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  GDI E++ L+++L CK F W++ +V         
Sbjct: 388 RAAEVWMDDYKQYYYAAVPLAKNIPFGDIEERMELKERLQCKPFKWYLANVY-------- 439

Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
              P L   E +  G+ +    C+D++G     ++G   CH SG +Q + +  +GQ+ H 
Sbjct: 440 ---PQLTIPEQQTKGSLRQGPYCMDTLGHLVDGIVGLYQCHDSGGNQDWAITKKGQIKHL 496

Query: 171 ERCV 174
           + C+
Sbjct: 497 DLCL 500


>gi|432092278|gb|ELK24901.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Myotis
           davidii]
          Length = 363

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 54/124 (43%), Gaps = 7/124 (5%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 193 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 252

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 253 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 312

Query: 168 GHGE 171
             GE
Sbjct: 313 RPGE 316


>gi|194749861|ref|XP_001957354.1| GF24092 [Drosophila ananassae]
 gi|190624636|gb|EDV40160.1| GF24092 [Drosophila ananassae]
          Length = 591

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/240 (25%), Positives = 105/240 (43%), Gaps = 44/240 (18%)

Query: 39  DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDF 98
           +++K+VIS        R    WL  +K       P    + +G ++E+ AL+ +LHCK F
Sbjct: 368 ESEKKVISRNYR----RVAEVWLDDYKDKLFEKMPHLTVIQVGSLTEEKALKKRLHCKPF 423

Query: 99  SWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQK-CLD-SMGRTAPAVIGTSYC---HG 152
            WF+  +  D    +P + P  F +G  ++L + K CLD S G+  P     S C   H 
Sbjct: 424 KWFLDQLGKDFLNLYPVVEPVDFAFGVLQSLASPKLCLDRSEGK--PGQPKLSECPDDHT 481

Query: 153 SGSSQL-FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNL 211
              S+L + L    +L  G  C++   Q+                         G +  +
Sbjct: 482 FPESELKWTLTNHRELRSGPVCLEKRNQQ-------------------------GDIY-V 515

Query: 212 FNCFNSSGTVDGPWRFHNSTRQLEHAIL---KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           F C    G  +  W  + +++Q+ H  +   +RCL + P+   +   +CD NN+ Q+W+F
Sbjct: 516 FQCHGQMG--NQFWSLNTTSQQVVHGQMGSSRRCLEIQPDAKGVTTRKCDSNNAKQRWKF 573


>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 6-like [Hydra
           magnipapillata]
          Length = 591

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 94/219 (42%), Gaps = 37/219 (16%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK+I             GD+SE+  LR+ L CK F W+++++  ++    
Sbjct: 391 IRVAEVWMDEFKEIFYSFRANLKPEQGGDVSERKKLREDLKCKSFKWYLQNIIPEL---- 446

Query: 114 PELPPNLFW--GEAKNLGTQKCLDSMGRTAP-AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
            E+P    +  G+ KNLGT  CLD++ +       G   CH  G++Q F    + ++ H 
Sbjct: 447 -EIPDKYPYGRGDVKNLGTLSCLDTLAQNNQGGKPGLYPCHKMGTNQYFIFTKKFEIWHD 505

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
             C+D    +                        L   + L+ C    G  +  W+ H  
Sbjct: 506 GLCLDLSDSD------------------------LNAKVKLWPCHKQGG--NQKWK-HTK 538

Query: 231 TRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           +  + H   K+CL    + +Q+ +  CD NN+ Q+W F+
Sbjct: 539 SGLIMHESRKKCL--EGQGDQILIRACDTNNANQRWLFE 575


>gi|170046214|ref|XP_001850669.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
           quinquefasciatus]
 gi|167869055|gb|EDS32438.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
           quinquefasciatus]
          Length = 576

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 62/124 (50%), Gaps = 15/124 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ      PLA  +  G+I E++ L+++L CK+F W++ +V         
Sbjct: 391 RAAEVWMDDYKQYYYAAVPLAKNIPFGNIDERLQLKEQLECKNFKWYLDNVY-------- 442

Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
              P L   E +  G+ +    C+D++G     ++G  +CH SG +Q + +   GQ+ H 
Sbjct: 443 ---PQLTIPEQQTKGSLRQGPYCIDTLGHLVDGIVGLYHCHNSGGNQDWAITKSGQIKHL 499

Query: 171 ERCV 174
           + C+
Sbjct: 500 DLCL 503


>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 1 [Mustela putorius
           furo]
          Length = 461

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 54/99 (54%), Gaps = 6/99 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK    + SP    +D GDIS ++ LR KL CK FSW+++++  D      
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----S 422

Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHG 152
           ++P + F  GE +N+ T +CLD+M R     +G   CHG
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHG 461


>gi|149032012|gb|EDL86924.1| rCG50623 [Rattus norvegicus]
          Length = 431

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 100/226 (44%), Gaps = 42/226 (18%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A   + GD+SE++ LR++LHC +FSW++ +V  +
Sbjct: 237 QVRLAEVWMDDYKKIFYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPE 296

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 297 MFV--PDLNPT-FSGAIKNLGTSQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 352

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
           L H                           G +L     G+ L L NC     NS    D
Sbjct: 353 LRHN-------------------------IGKQLCLHASGSTLGLRNCQFIGKNSEVPKD 387

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             W      + + +     CL+   +  + AM  C+ ++ YQ W F
Sbjct: 388 EEWEL-TQDQLIRNLGSGTCLT--SKDKKPAMAPCNPSDPYQLWLF 430


>gi|444727225|gb|ELW67727.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Tupaia
           chinensis]
          Length = 406

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 51/106 (48%), Gaps = 1/106 (0%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 232 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 291

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
            + PP+  WGE +N+    C+DS        +    C   GS + +
Sbjct: 292 PVEPPSAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTW 337


>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
           gallus]
          Length = 611

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 59/222 (26%), Positives = 95/222 (42%), Gaps = 40/222 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR+ L+CK F WFM  VA+D+   +P + PP
Sbjct: 402 WMDEYAEYIYQRRPEYRHLSAGDVTAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPP 461

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
              WGE +N+GT  C+D+              HGS  S L RL         E CV  D+
Sbjct: 462 AAAWGEIRNVGTGLCVDTK-------------HGSLGSPL-RL---------ESCV-KDR 497

Query: 179 QEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDGPWRF 227
            E     +++  TF     ++         FC       + + L++C    G  +  WR+
Sbjct: 498 GEAAWNNVQV-FTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQLWRY 554

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               + L H +   C+       ++ M  C+ ++  QQW F+
Sbjct: 555 RKD-KTLYHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 595


>gi|380805795|gb|AFE74773.1| polypeptide N-acetylgalactosaminyltransferase-like 6, partial
           [Macaca mulatta]
          Length = 336

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 49/104 (47%), Gaps = 1/104 (0%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 233 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 292

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
            + PP   WGE +N+    C+DS        +    C   GS +
Sbjct: 293 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSER 336


>gi|194873027|ref|XP_001973126.1| GG15924 [Drosophila erecta]
 gi|190654909|gb|EDV52152.1| GG15924 [Drosophila erecta]
          Length = 589

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 65/258 (25%), Positives = 104/258 (40%), Gaps = 49/258 (18%)

Query: 29  GQLGHGERCVDADKQVISTQKSSLELRKRSS---------WLKVFKQIRSLGSPLAMFLD 79
            ++GH  R  D + QV  T K     RK  S         WL  +K       P    + 
Sbjct: 346 SRVGHLFR--DGNFQVRYTNKDKNSERKLISRNYRRVAEIWLDEYKDKLFANMPHLTVIP 403

Query: 80  MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD-- 135
           +G+++EQ  L+ +LHCK F WF+ ++A D    +P L P  +  G  +++ + K CLD  
Sbjct: 404 VGNLAEQRELKRRLHCKPFKWFLDNLAADFLNLYPILDPAEYASGVLQSISSPKLCLDRK 463

Query: 136 --SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
              +GR   A   + +   S   Q + L    +L  G  C++   Q              
Sbjct: 464 EPRLGRPQLAPCSSDHVFPS-PEQYWSLTNHRELRSGFYCLEVRNQ-------------- 508

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETN 250
                       G  +N++ C   SG  +  W F + T Q+   +    + CL   P +N
Sbjct: 509 ------------GANVNIYQCHGQSG--NQFWSFDSKTHQVISGQQKNSRHCLEAQPASN 554

Query: 251 QLAMLRCDENNSYQQWRF 268
            +    CD  N  Q+W+F
Sbjct: 555 SVTSSVCDPKNHRQKWKF 572


>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oryzias latipes]
          Length = 677

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 52/215 (24%), Positives = 92/215 (42%), Gaps = 22/215 (10%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFMK VA+D+   +P + PP
Sbjct: 469 WMDEYAEYVYQRRPEYRHLSAGDVAAQKELRSTLNCKSFKWFMKEVAWDLPKHYPPVEPP 528

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
              WGE ++  +  CL+S    +   I    C         +  A+   GHG+      +
Sbjct: 529 AAAWGEVRSAASGLCLESKHFVSGTPIRLESC--------VKGRADVSWGHGQVFTFGWR 580

Query: 179 QEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
           ++  V   M    V F  ++          + + L++C    G  +  WR+    + L H
Sbjct: 581 EDIRVGDPMHTKKVCFDAVSH--------HSPVTLYDCHGMRG--NQLWRYRKD-KSLYH 629

Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            +   C+  +    ++ M  C+  ++ QQW F++ 
Sbjct: 630 PVSNSCVDSNASERRVFMNVCNPASASQQWLFEKT 664


>gi|118093614|ref|XP_422023.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Gallus
           gallus]
          Length = 632

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 106/235 (45%), Gaps = 44/235 (18%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GDIS+++ LR +L CK+F+W++
Sbjct: 429 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYL 488

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            +V  +VY   P+L P LF G  KN+G   CLD +G       P ++ +  CHG G +Q 
Sbjct: 489 NNVYPEVYV--PDLNP-LFSGYLKNVGNHMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 542

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFN 213
           F  +A  ++ H    E C+ A K          PV    C   G K+             
Sbjct: 543 FEYSAHHEIRHNIQKELCLHASKG---------PVQLRECSYKGQKIF------------ 581

Query: 214 CFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
                   +  W+ H   + L  A L  CL+ + E    +++ C+ ++ +Q+W F
Sbjct: 582 -----AFGEEQWQ-HQKDQTLYSAALHMCLTGNGE--HPSLVSCNPSDPFQKWIF 628


>gi|260793003|ref|XP_002591503.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
 gi|229276709|gb|EEN47514.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
          Length = 618

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 83/197 (42%), Gaps = 40/197 (20%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT 140
           GD+S ++ L+DKLHCK F WFM+ +  D+Y   PE  P    G  +N  +  C DS G  
Sbjct: 451 GDVSARLDLKDKLHCKPFKWFMQTIMPDMYV--PEDRPGRS-GALRNSASNLCFDSEGAE 507

Query: 141 APAVIGTSY-CHGSGSSQLFRLNAEGQLGHG----ERCVDADKQEKLVEMLKLPVTFCVL 195
                 T + CHG G +Q F LN+  +  H     E CV+A   E  V ++       V 
Sbjct: 508 NAGKRPTMWGCHGMGGNQYFELNSREEFRHNTGGKEMCVEAQGGE-FVVLMHCTSGNNVP 566

Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ---- 251
           A  K      G+V N ++                           +CL   PE  Q    
Sbjct: 567 AQQKWQIREDGSVYNAYH--------------------------GKCLHFDPEERQEKKE 600

Query: 252 LAMLRCDENNSYQQWRF 268
           +  +RC   +S Q+WRF
Sbjct: 601 VKAVRCTGQDS-QRWRF 616


>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
 gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
          Length = 615

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 10/151 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  ++ GD++E+  LR+ L CK F W+++++       +P
Sbjct: 424 RTAEVWMDEYKAFFYKMVPAARNVEAGDVTERKKLRETLQCKSFKWYLENI-------YP 476

Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P P  F   G   N  T+KC+D+ G+      G   CHGSG +Q + L  +G++   +
Sbjct: 477 EAPLPADFKSLGAIVNRFTEKCIDTNGKKDGQSPGLQGCHGSGGNQAWSLTGKGEIRSDD 536

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIF 202
            C+ +    ++   LKL         +K +F
Sbjct: 537 LCLSSGHVYQIGSELKLERCSVSKINIKHVF 567


>gi|313231736|emb|CBY08849.1| unnamed protein product [Oikopleura dioica]
          Length = 603

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 97/230 (42%), Gaps = 39/230 (16%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    WL  +K+     +PL+     GDISE+V LR+ L CK F W+++++  D+    
Sbjct: 400 VRAVEVWLDDYKRHFYARNPLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDL--PV 457

Query: 114 PELPPNLFWGEAKNLGT-QKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
           PE  P  F G   N G+  +CLD           V+GT  CHG G +Q F  N++G L +
Sbjct: 458 PEDTPGQF-GALHNKGSPSRCLDYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRY 516

Query: 170 G---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW- 225
               E C+ A K +   E             +  + C         N  N +      W 
Sbjct: 517 TSQFELCI-AKKDDNSGE-------------IAAVMC---------NGKNVNPPARAIWI 553

Query: 226 ---RFHNSTRQLEHAILKRCLSVHPETN--QLAMLRCDENNSYQQWRFKE 270
              +    T QL++     C++    T   ++    C   + YQQW F+E
Sbjct: 554 KIPQNDGKTFQLKNKSNGLCIATEKTTGNPKIVFESCKSTSRYQQWYFQE 603


>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Anolis carolinensis]
          Length = 612

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 89/219 (40%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  VA+D+   +P + PP
Sbjct: 403 WMDEYAEYIYQRRPEYRHLSAGDVATQKELRSNLNCKSFRWFMNEVAWDLRKFYPPVEPP 462

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS------GSSQLFRLNAEGQLGHGER 172
              WGE  N+GT  C+D+      + +    C  S       + Q+F  +    +  G  
Sbjct: 463 AAAWGEIHNVGTSLCVDTKHGALGSPLKIETCVKSRGEAAWNNVQVFTFSWREDIRPG-- 520

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C    G  +  W++    +
Sbjct: 521 --DPQHTKK----------FCFDA------VSHNSPVTLYDCHGMKG--NQLWKYRKD-K 559

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H I   C+       ++ M  C+ ++  QQW F+ +
Sbjct: 560 TLYHPISGSCMDCTENDRKIFMNTCNPSSPTQQWIFEHI 598


>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Apis mellifera]
          Length = 635

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 92/211 (43%), Gaps = 31/211 (14%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
           GD+SE+ ALR +L CK F W++ +V  +++     +      GE +NLG     CLDS  
Sbjct: 435 GDVSERKALRKRLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 490

Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
           R A    PA  G   CH  G +Q+  L +         C+D+  K E L + + L     
Sbjct: 491 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSPGKPEDLHQPVGLYPCHR 541

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
                  +  + G +    +C + SGT            +  W ++  T+Q+ H    +C
Sbjct: 542 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNPQTKQIRHGSSDKC 601

Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           L++     +L M  C  + + Q+W F+   P
Sbjct: 602 LAITESKQRLIMEECSSSAARQRWSFENYDP 632


>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
           gallus]
          Length = 590

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 65/245 (26%), Positives = 100/245 (40%), Gaps = 46/245 (18%)

Query: 42  KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
           KQ   ++  +L   +R    W+  +K++    +P A     GD+SE+  LR+KL CKDF 
Sbjct: 374 KQAPYSRSKALANSVRAAEVWMDEYKELYYHRNPHARLEPYGDVSERRLLREKLKCKDFK 433

Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
           WF+++V  +++   PE  P  F+G  KN G    C D        + G       CHG G
Sbjct: 434 WFLENVYPELHV--PEDRPG-FFGMLKNRGMANFCFDYNPSNEHEITGHRVILYPCHGMG 490

Query: 155 SSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL------GTV 208
            +Q F   +  ++ +  R  +A                 V+AG   +   L         
Sbjct: 491 QNQFFEYTSHNEIRYNTRQPEA--------------CAAVIAGTDYLTMNLCQENIHRVP 536

Query: 209 LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQ 265
            N    F   GT+     FH  T        ++CL     +   N   +LR   N+ YQ+
Sbjct: 537 ENQKFAFREDGTL-----FHMQT--------QKCLQAERNSYNGNPAPVLRPCTNSDYQK 583

Query: 266 WRFKE 270
           W FKE
Sbjct: 584 WFFKE 588


>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
          Length = 603

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 97/230 (42%), Gaps = 39/230 (16%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    WL  +K+     +PL+     GDISE+V LR+ L CK F W+++++  D+    
Sbjct: 400 VRAVEVWLDDYKRHFYARNPLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDL--PV 457

Query: 114 PELPPNLFWGEAKNLGT-QKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
           PE  P  F G   N G+  +CLD           V+GT  CHG G +Q F  N++G L +
Sbjct: 458 PEDTPGQF-GALHNKGSPSRCLDYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRY 516

Query: 170 G---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW- 225
               E C+ A K +   E             +  + C         N  N +      W 
Sbjct: 517 TSQFELCI-AKKDDNSGE-------------IAAVMC---------NGKNVNPPARAIWI 553

Query: 226 ---RFHNSTRQLEHAILKRCLSVHPETN--QLAMLRCDENNSYQQWRFKE 270
              +    T QL++     C++    T   ++    C   + YQQW F+E
Sbjct: 554 KIPQDDGKTFQLKNKSNGLCIATEKTTGNPKIVFESCKNTSRYQQWYFQE 603


>gi|326917280|ref|XP_003204928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Meleagris gallopavo]
          Length = 528

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 100/245 (40%), Gaps = 46/245 (18%)

Query: 42  KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
           KQ   ++  +L   +R    W+  +K++    +P A     GD++E+  LR+KL CKDF 
Sbjct: 312 KQAPYSRSKALANSVRAAEVWMDEYKELYYHRNPHARLEPYGDVTERRLLREKLKCKDFK 371

Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
           WF+++V  +++   PE  P  F G  KN G    C D        + G       CHG G
Sbjct: 372 WFLENVYPELHV--PEDRPGFF-GMLKNRGMANFCFDYNPPNEHEITGHRVILYPCHGMG 428

Query: 155 SSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL------GTV 208
            +Q F   +  ++ +  R  +A                 V+AG + +   L         
Sbjct: 429 QNQFFEYTSHNEIRYNTRQPEA--------------CAAVIAGTEYLTMNLCQENIHHVP 474

Query: 209 LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQ 265
            N    F   GT+     FH  T        ++CL     +   N   +LR   N+ YQ+
Sbjct: 475 ENQKFAFREDGTL-----FHMQT--------QKCLQAESNSYNGNPAPVLRPCSNSDYQK 521

Query: 266 WRFKE 270
           W FKE
Sbjct: 522 WFFKE 526


>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Takifugu rubripes]
          Length = 618

 Score = 68.2 bits (165), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 80/150 (53%), Gaps = 13/150 (8%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           +VI+  +  L       + K+F +     + +A   + GDISE++ LR++LHCK+FSW++
Sbjct: 416 EVITRNQVRLAEVWMDDYKKIFYRRNKNAAKMAKENNYGDISERLNLRERLHCKNFSWYL 475

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLD----SMGRTAPAVIGTSYCHGSGSSQL 158
             V  + +   P+L P+ F G  KN G++ CLD    ++G   P ++ T  CH  G +Q 
Sbjct: 476 NTVYPEAFV--PDLTPDRF-GAIKNQGSKTCLDVGENNLG-GKPVMMYT--CHNMGGNQY 529

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEM 185
           F  ++  +L H    E C+ A    + V++
Sbjct: 530 FEYSSHKELRHNIGKELCLHAIPHPEAVKI 559


>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Megachile rotundata]
          Length = 638

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 93/213 (43%), Gaps = 31/213 (14%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
           + GD+S++ ALR KL CK F W++ +V  +++     +      GE +NLG     CLDS
Sbjct: 436 NYGDVSDRKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 491

Query: 137 MGRTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVT 191
             R A    PA  G   CH  G +Q+  L +         C+D+  K E L + + L   
Sbjct: 492 PARKADLHKPA--GLYPCHRQGGNQIRHLVS-------SICIDSPGKPEDLHQPVGLYPC 542

Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILK 240
                    +  + G +    +C + SGT            +  W ++  T+Q+ H    
Sbjct: 543 HRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNPQTKQIRHGSSD 602

Query: 241 RCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           +CL++     +L M  C  + + Q+W F+   P
Sbjct: 603 KCLAITESKQRLIMEECSPSAARQRWSFENYDP 635


>gi|62751482|ref|NP_001015534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
 gi|75057892|sp|Q5EA41.1|GALT6_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|59857821|gb|AAX08745.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
          Length = 622

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 69/124 (55%), Gaps = 12/124 (9%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++L+C +FSWF+ +V  +
Sbjct: 428 QVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
           ++   P+L P  F+G  KNLG   CLD   +     P ++ T  CHG G +Q F    + 
Sbjct: 488 MFV--PDLKPT-FFGALKNLGVDHCLDVGENNNGGKPLILYT--CHGLGGNQYFEYTTQR 542

Query: 166 QLGH 169
            L H
Sbjct: 543 DLRH 546


>gi|344251833|gb|EGW07937.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Cricetulus
           griseus]
          Length = 457

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 70/141 (49%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF+K V  +++   
Sbjct: 257 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKKLRDKLQCKDFRWFLKTVYPELHV-- 314

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G ++ CLD        V G       CHG G +Q F   ++ ++ 
Sbjct: 315 PEDRPGFF-GMLQNKGLREYCLDYNPPNENQVDGHQVLLYLCHGMGQNQFFEYTSQKEIR 373

Query: 169 HGER----CVDADKQEKLVEM 185
           +  R    CV  ++ + ++ M
Sbjct: 374 YNTRQPEACVAVEEGKDVLVM 394


>gi|148878418|gb|AAI46056.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Bos
           taurus]
 gi|296487792|tpg|DAA29905.1| TPA: polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
          Length = 622

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 69/124 (55%), Gaps = 12/124 (9%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++L+C +FSWF+ +V  +
Sbjct: 428 QVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
           ++   P+L P  F+G  KNLG   CLD   +     P ++ T  CHG G +Q F    + 
Sbjct: 488 MFV--PDLKPT-FFGALKNLGVDHCLDVGENNNGGKPLILYT--CHGLGGNQYFEYTTQR 542

Query: 166 QLGH 169
            L H
Sbjct: 543 DLRH 546


>gi|354475881|ref|XP_003500155.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Cricetulus griseus]
          Length = 559

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/141 (31%), Positives = 70/141 (49%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF+K V  +++   
Sbjct: 359 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKKLRDKLQCKDFRWFLKTVYPELHV-- 416

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G ++ CLD        V G       CHG G +Q F   ++ ++ 
Sbjct: 417 PEDRPGFF-GMLQNKGLREYCLDYNPPNENQVDGHQVLLYLCHGMGQNQFFEYTSQKEIR 475

Query: 169 HGER----CVDADKQEKLVEM 185
           +  R    CV  ++ + ++ M
Sbjct: 476 YNTRQPEACVAVEEGKDVLVM 496


>gi|313233395|emb|CBY24510.1| unnamed protein product [Oikopleura dioica]
          Length = 679

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 18/142 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A  +  GDIS++  +R++L CK FSWF+++V       +P
Sbjct: 489 RAAEVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENV-------YP 541

Query: 115 ELP-PN---LFWGEAK--NLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEG-Q 166
           EL  PN   + WG     N G ++C+   G T     +G   CHG G +Q F L  EG +
Sbjct: 542 ELRIPNKDAIGWGAVSQTNKGLEECI---GNTHGGGTLGMYRCHGDGGNQEFTLTKEGKE 598

Query: 167 LGHGERCVDADKQEKLVEMLKL 188
             H + C+  + +E +   +K 
Sbjct: 599 FRHNDLCIGYNAKEPVGNPVKF 620


>gi|195590539|ref|XP_002085003.1| GD14569 [Drosophila simulans]
 gi|194197012|gb|EDX10588.1| GD14569 [Drosophila simulans]
          Length = 590

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 63/257 (24%), Positives = 106/257 (41%), Gaps = 43/257 (16%)

Query: 21  QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
            LFR +   Q+ +  +  +++K++IS        R    WL  +K       P    + +
Sbjct: 350 HLFR-DGNFQVRYTNKDKNSEKKLISRNYR----RVAEIWLDEYKDKLYANMPHLTVIPV 404

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD--- 135
           G+++EQ AL+++LHCK F WF+ ++A D    +P L P  +  G  ++    K CLD   
Sbjct: 405 GNLAEQRALKNRLHCKPFKWFLDNLATDFLNLYPILDPAEYASGVLQSRALPKLCLDRKD 464

Query: 136 -SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
            S G+   A   + +   S   Q + L    +L  G  C++                   
Sbjct: 465 PSHGQPKLAPCSSDHVFPS-PEQYWSLTNHRELRSGFYCLEVRNH--------------- 508

Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQ 251
                      G  ++++ C   SG  +  W F + T Q+   +    + CL   PE N 
Sbjct: 509 -----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQEQNSRHCLEAQPELNA 555

Query: 252 LAMLRCDENNSYQQWRF 268
           +    CD  N  QQW+F
Sbjct: 556 VTSSVCDPKNHKQQWKF 572


>gi|339234661|ref|XP_003378885.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
 gi|316978493|gb|EFV61475.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
          Length = 1819

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 3/125 (2%)

Query: 55   RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
            R   +W+  +K+   L  P    +D GD+++Q  +R +L C  F WFMK+V +D    +P
Sbjct: 1613 RVAETWMDEYKEYLYLRMPRLRNVDPGDLTKQREVRQRLKCNGFDWFMKNVMFDQPKHYP 1672

Query: 115  EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH--GE 171
             + PP++  G  KN  T +C+D         + TS C      Q F L    ++    GE
Sbjct: 1673 LVEPPDVMSGRIKNKRTGRCMDVKNAGLQTELTTSQCKDYLKRQEFVLTWRSEIFQRLGE 1732

Query: 172  RCVDA 176
             C+D+
Sbjct: 1733 NCLDS 1737


>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 619

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/220 (25%), Positives = 91/220 (41%), Gaps = 36/220 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ     +P     + GD+++Q ALR+KL CK F WF++ VA D+  ++P
Sbjct: 406 RVAEVWMDEYKQFLYERNPQFDKTNPGDLTKQKALREKLKCKPFKWFLEEVAPDLLVRYP 465

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
              P  F  G  +++   K CLD++   A   IG   C  + +    +Q F L    + +
Sbjct: 466 LREPLPFASGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQNNQFFTLTYYRDIR 525

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
               E+C+DA      V                           LFNC  S G  +  WR
Sbjct: 526 AASVEKCLDASSDNAEV--------------------------ILFNCHESQG--NQLWR 557

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +   T+ + H    R   +     ++ + +CD     Q+W
Sbjct: 558 YDQETQMIMHGKPSRDQCLDLVGKKVVVSKCDNRKKTQRW 597


>gi|157107408|ref|XP_001649763.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108884049|gb|EAT48274.1| AAEL000646-PA [Aedes aegypti]
          Length = 582

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 94/222 (42%), Gaps = 40/222 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ     +P     D GD+++Q ALR++L CK F WF++ VA D+  K+P
Sbjct: 369 RVAEVWMDEYKQFLFERNPEYDKTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLLLKYP 428

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSG----SSQLFRLNA--EGQ 166
              P  F  G  ++L     CLDS+       IG   C  +     S+Q F L+   + +
Sbjct: 429 LRDPKPFASGRVQSLANPILCLDSLNHKEKEPIGVFSCAANKTHPQSNQFFTLSYFRDIR 488

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
           +   ++C+DA  +                          G+ + LFNC    G  +  W+
Sbjct: 489 VASVDKCLDAASE--------------------------GSEVRLFNCHEIQG--NQLWQ 520

Query: 227 FHNSTRQLEHAI--LKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +    + + H +    +CL +     ++ + +CD     Q W
Sbjct: 521 YDMKNQMIRHGMPSWNQCLDL--VKQKITISKCDPTKKTQHW 560


>gi|313246954|emb|CBY35800.1| unnamed protein product [Oikopleura dioica]
          Length = 696

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 18/142 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A  +  GDIS++  +R++L CK FSWF+++V       +P
Sbjct: 506 RAAEVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENV-------YP 558

Query: 115 ELP-PN---LFWGEAK--NLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEG-Q 166
           EL  PN   + WG     N G ++C+   G T     +G   CHG G +Q F L  EG +
Sbjct: 559 ELRIPNKDAIGWGAVSQTNKGLEECI---GNTHGGGTLGMYRCHGDGGNQEFTLTKEGKE 615

Query: 167 LGHGERCVDADKQEKLVEMLKL 188
             H + C+  + +E +   +K 
Sbjct: 616 FRHNDLCIGYNAKEPVGNPVKF 637


>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Taeniopygia guttata]
          Length = 631

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 62/235 (26%), Positives = 103/235 (43%), Gaps = 44/235 (18%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GDIS+++ LR +L CK+F+W++
Sbjct: 428 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRIDLRQRLQCKNFTWYL 487

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            +V  + Y   P+L P LF G  KN+G + CLD +G       P ++ +  CHG G +Q 
Sbjct: 488 SNVYPEAYV--PDLNP-LFSGYLKNIGNRMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 541

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFN 213
           F  +A  ++ H    E C+ A K          PV    C   G K              
Sbjct: 542 FEYSAHHEIRHNIQKELCLHASKG---------PVQLRECTYKGQKTF------------ 580

Query: 214 CFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
                   +  W  H   + L +  L  CL+ + E   LA   C+ ++ +Q+W F
Sbjct: 581 -----AVGEEQW-LHQKDQTLYNEALHMCLTGNGEHPSLA--SCNPSDPFQKWIF 627


>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
 gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
          Length = 619

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 102/226 (45%), Gaps = 40/226 (17%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K I   RS   + +A     GDIS+++ LR+ L CK+FSW++ +V  +
Sbjct: 423 QVRLAEVWMDDYKLIFYRRSQSAAKMAKEKGFGDISDRLKLREDLQCKNFSWYLSNVYPE 482

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
            +   P+L P  F G  KN G Q+CLD   S     P ++ T  CH  G +Q F   +  
Sbjct: 483 AFV--PDLSPVKF-GALKNRGAQQCLDVGESNNGGKPVIMYT--CHNMGGNQYFEYTSHK 537

Query: 166 QLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
           +L H    + C+ A    + V+     +  C L G                   SS   +
Sbjct: 538 ELRHNIGKQLCLQASPYPEPVK-----IELCTLKGK-----------------GSSLAPE 575

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             W      + L++    +CL  H + +++ M  C+ ++ +QQW F
Sbjct: 576 QEWLI-TEEQLLKNPSSGKCL--HKKGDKILMSPCNPDDHHQQWAF 618


>gi|328712307|ref|XP_001942933.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           10-like [Acyrthosiphon pisum]
          Length = 592

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 51/222 (22%), Positives = 95/222 (42%), Gaps = 32/222 (14%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +   L  P    ++ GDI++Q  +R+ L CK F WF+  VA+D+  K+P + PP
Sbjct: 395 WMDEYAEYLYLRRPHYRNINTGDITKQKKIRENLKCKPFKWFIHEVAFDLVEKYPPIEPP 454

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
           ++F G+ +             TAP +     C  + S     L         + C+D D 
Sbjct: 455 DVFKGKIRTF-----------TAPEL-----CLDATSENPLNL---------KECIDNDD 489

Query: 179 Q-EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
           + +K +   +  +       + +    L   ++L+ C N  G  +  W + +   +L   
Sbjct: 490 ENQKFILSWRNDIKTRNNMCLDISDSSLKAKISLYGCHNGGG--NQLWHYDHGKMKLVQG 547

Query: 238 ILKRCLSVHPETNQLAMLRCDENNSYQQW---RFKEVRPDWS 276
              RC+     T ++ + +C +NN  Q+W    F     +WS
Sbjct: 548 KNPRCMEHDISTRKVYVTKCKQNNDDQKWIIENFNSKSTNWS 589


>gi|395824312|ref|XP_003785413.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Otolemur garnettii]
          Length = 508

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 94/226 (41%), Gaps = 37/226 (16%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K++    +P A     GD++E+  LR+KL CKDF WF++ V  +++   
Sbjct: 308 VRAAEVWMDDYKELYYHRNPRARLEPFGDVTERRQLREKLQCKDFKWFLETVFPELHV-- 365

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL- 167
           PE  P  F G  +N G +K C D        V G       CHG G +Q F   ++ ++ 
Sbjct: 366 PEDRPGFF-GMLQNKGLKKYCFDYNPPDENQVAGRQVILYLCHGLGQNQFFEYTSQYEIR 424

Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
               H E CV  ++   ++ M                 C+     N        G++   
Sbjct: 425 YNTHHPEACVAVEEDSDILVMH---------------LCKEVAPENQKFILQEDGSL--- 466

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
             FH  +++   A  K         N   +LR   N+ YQ+W F+E
Sbjct: 467 --FHKQSKKCVQAAKKES-----SDNYEPLLRDCTNSDYQRWFFQE 505


>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Oreochromis niloticus]
          Length = 575

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 63/133 (47%), Gaps = 12/133 (9%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    WL  FK+I    +P A     GD++E+  LR+KL CK F W++ ++  D++   
Sbjct: 373 VRAAEVWLDEFKEIYYHRNPHARLEAFGDVTERRMLREKLGCKSFKWYLDNIYPDIHV-- 430

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           P   P +F G  KN G T  C D        V+G       CHG G +Q F  +  G++ 
Sbjct: 431 PHDRPGMF-GMLKNRGKTNYCFDYNPTDENVVVGQRVILYLCHGMGQNQFFEYSVNGEIC 489

Query: 169 HGER----CVDAD 177
           +  R    C+  D
Sbjct: 490 YNTREPAGCIAGD 502


>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Hydra magnipapillata]
          Length = 603

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 34/98 (34%), Positives = 56/98 (57%), Gaps = 11/98 (11%)

Query: 82  DISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW--GEAKNLGTQKCLDSM 137
           D++ +V LR KL CK F W++++V       +PEL  P + F   GE +N  +  CLD++
Sbjct: 446 DVTARVKLRSKLQCKSFQWYLENV-------YPELEIPDDKFLAAGEIRNPESGICLDTL 498

Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
           G+   A +G   CHG G +Q +  N +G++   + C+D
Sbjct: 499 GKQEGAPVGLYACHGQGGNQYYTYNNKGEIKAEDNCMD 536


>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
          Length = 621

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 27/207 (13%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
           D GDI+ + ALR+KL CK F W++ ++  +++     +      GE +NLG   + CLDS
Sbjct: 419 DFGDITSRKALREKLGCKSFKWYLDNIYPELFIPGEAVAS----GEIRNLGIGGKTCLDS 474

Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
             R +     +G   CH  G +Q+  L+        E C+D+  K E L   + L     
Sbjct: 475 PARRSDLHKPVGLYPCHRQGGNQISVLDR-------ELCIDSPCKPEDLHNPIGLWPCHK 527

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSG--TVDGP---------WRFHNSTRQLEHAILKRC 242
                  ++ + G +     C + SG   +  P         W ++ +++ L H    +C
Sbjct: 528 QGGNQFWMYSKSGEIRRDEACLDYSGQEVILYPCHGSKGNQFWDYNANSKLLRHGSSDKC 587

Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFK 269
           L+++   N+L M  CDE  + Q W  +
Sbjct: 588 LAINEAKNKLLMEPCDEEATRQHWSLE 614


>gi|345307494|ref|XP_001507147.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like [Ornithorhynchus anatinus]
          Length = 361

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 59/144 (40%), Gaps = 13/144 (9%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR  L CKDF WFM  VA+DV   +P
Sbjct: 142 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 201

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
            + PP   WGE +N+    C+DS        +    C   GS       QLF       +
Sbjct: 202 PVEPPPAAWGEIRNVVANLCVDSKHGATGTEVRLDVCVKDGSERTWSHEQLFTFGWREDI 261

Query: 168 GHGER------CVDADKQEKLVEM 185
             GE       C+DA      V +
Sbjct: 262 RPGEPLHTRKFCLDAISHSSPVTL 285


>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
          Length = 613

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 68/137 (49%), Gaps = 10/137 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  ++ GD++E+  LR+ L CK F W+++++       +P
Sbjct: 423 RTAEVWMDEYKAFFYKMVPAARNVEAGDVTERKKLRETLQCKSFKWYLENI-------YP 475

Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P P  F   G   N  T+KC+D+ G+      G   CHG+G +Q + L  +G++   +
Sbjct: 476 EAPLPADFRSLGAIVNRFTEKCVDTNGKKDGQPPGMQACHGAGGNQAWSLTGKGEIRSDD 535

Query: 172 RCVDADKQEKLVEMLKL 188
            C+ +    ++   LKL
Sbjct: 536 LCLSSGHVYQIGSELKL 552


>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 599

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 68/124 (54%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R++  + +A     GDISE++ LR++LHC++FSW++ ++  +
Sbjct: 420 QVRLAEVWMDEYKEIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPE 479

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLG  +CLD     R    +I    CH  G +Q F    +  
Sbjct: 480 MFV--PDLKPT-FYGAIKNLGIDQCLDVGENNRGGKPLIMYP-CHSLGGNQYFEYTTQRD 535

Query: 167 LGHG 170
           L H 
Sbjct: 536 LRHN 539


>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
          Length = 611

 Score = 67.8 bits (164), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 68/137 (49%), Gaps = 10/137 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  ++ GD++++  LR+ L CK F W+++++       +P
Sbjct: 423 RTAEVWMDEYKAFFYKMVPAAKNVEAGDVTDRKKLRETLQCKSFKWYLENI-------YP 475

Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
           E P P  F   G   N  T+KC+D+ G+      G   CHG+G +Q + L  +G++   +
Sbjct: 476 EAPLPADFRSLGSIVNRFTEKCIDTNGKKDGQAPGMQACHGAGGNQAWSLTGKGEIRSDD 535

Query: 172 RCVDADKQEKLVEMLKL 188
            C+ +    ++   LKL
Sbjct: 536 LCLSSGHVYQIGSELKL 552


>gi|313220438|emb|CBY31291.1| unnamed protein product [Oikopleura dioica]
          Length = 354

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 47/124 (37%), Positives = 61/124 (49%), Gaps = 17/124 (13%)

Query: 58  SSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAY---DVYTKF 113
             W K F   R    P    L  GDISEQV  R   HC  +F+W M+ +AY   D+Y   
Sbjct: 165 DDWTKYFYWNR----PELQKLPYGDISEQVKFRHD-HCPYNFTWMMEEIAYGITDMYDAP 219

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG---RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
            EL   L WGE +  G+  C+DSM       PA     YCH  G +QLFR++A G +   
Sbjct: 220 NEL---LAWGEIRGKGSNICVDSMAHKDNDGPA--EAWYCHKQGGNQLFRIHAGGFIAQN 274

Query: 171 ERCV 174
            +C+
Sbjct: 275 LQCI 278


>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Danaus plexippus]
          Length = 533

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 100/217 (46%), Gaps = 38/217 (17%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQK-CL 134
           D GDIS +  LR+KL CK F W++K++       +PEL     ++  GE +N+G ++ CL
Sbjct: 325 DYGDISGRKELREKLKCKSFDWYLKNI-------YPELFIPGESVAHGEIRNIGFERTCL 377

Query: 135 DSMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           DS  R +     +G   CH  G +Q+   ++       + CVD+       E +K PV  
Sbjct: 378 DSPTRKSDHHKPVGLYPCHRQGGNQIANPSS-------DMCVDSAAGP---EDMKKPVNP 427

Query: 193 CVLAGV----KLIFCRLGTVLNLFNCFNSSG--TVDGP---------WRFHNSTRQLEHA 237
               G       ++ + G +     C + SG   V  P         W +  +T+ ++H 
Sbjct: 428 WPCHGEYGNQYWMYSKNGEIRRDETCLDYSGHDVVLYPCHGAKGNQLWLYDPTTKLIKHG 487

Query: 238 ILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPD 274
             ++C+++  + +++ M  C+E  + QQW  +    D
Sbjct: 488 SSEKCMAISRKKDKIVMETCNERENRQQWNMENFNAD 524


>gi|167520406|ref|XP_001744542.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776873|gb|EDQ90491.1| predicted protein [Monosiga brevicollis MX1]
          Length = 702

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 70/123 (56%), Gaps = 5/123 (4%)

Query: 53  ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
           +LR   +W+  +KQI  + +     +D GD+S++ A+RD+L CK F W++ +V  D+Y  
Sbjct: 421 DLRVAEAWMDEYKQI-VMDTRGFEGVDPGDLSDRKAIRDRLQCKSFKWYLDNVHPDLYV- 478

Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
            P+L P +  G   +   ++C+D+M       +G   CHG  ++Q +RL  +G + +G  
Sbjct: 479 -PDLHPKMA-GVIADPQKRQCIDNMQHDRGGPVGVYGCHGL-TTQRWRLMEDGVIANGVS 535

Query: 173 CVD 175
           C++
Sbjct: 536 CLN 538


>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
 gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
          Length = 608

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 92/220 (41%), Gaps = 36/220 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +KQ     +P     + GD+++Q ALR++L CK F WF++ VA D+  ++P
Sbjct: 395 RVAEVWMDEYKQFLYERNPQFDKTNPGDLTKQKALRERLKCKPFKWFLEEVAPDLLVRYP 454

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
              P  F  G  +++   K CLD++   A   IG   C  + +    +Q F L    + +
Sbjct: 455 LREPLPFASGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQNNQFFTLTYYRDIR 514

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
               E+C+DA   +  V                           LFNC  S G  +  WR
Sbjct: 515 AASVEKCLDASSDDAEV--------------------------ILFNCHESQG--NQLWR 546

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +   T+ + H    R   +     ++ + +CD     Q+W
Sbjct: 547 YDQETQMIMHGKPSRDQCLDLVGKKVVVSKCDNRKKTQRW 586


>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
 gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
          Length = 454

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 8/118 (6%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD-VYTKFPELPP 118
           W+  +K       P    +D GDIS+++ALR +L CK F W++++VA D V TK      
Sbjct: 313 WMDEYKDWLYDKRPEIKSVDYGDISDRIALRKRLKCKSFKWYLENVANDTVKTKLCAC-- 370

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
                + +N G   CLDSMGR     +G + CH  G +Q F+     +L   E C D 
Sbjct: 371 ----FQVRNQGKNMCLDSMGRKD-GHVGLASCHNMGGNQAFQYTYIRELRTDETCFDV 423


>gi|313230492|emb|CBY18708.1| unnamed protein product [Oikopleura dioica]
          Length = 644

 Score = 67.4 bits (163), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 50/222 (22%), Positives = 91/222 (40%), Gaps = 32/222 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   + +      P A  +D GD++E  A+R++L CK F WFMK + YD+   F
Sbjct: 416 VRVVEVWWDEYSKYFYERKPNAKHVDPGDLTEVKAVRERLQCKSFDWFMKEIGYDLPKHF 475

Query: 114 PELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           P + P N   G   N+GT  C+D               H +G      L+    +  G+ 
Sbjct: 476 PLVEPSNAARGHITNVGTNLCVDG--------------HHAGQGDQLALSECASVREGQW 521

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCR--------LGTVLNLFNCFNSSGTVDGP 224
            +   +  +       P++   +   K    R        +G  +  + C    G  +  
Sbjct: 522 MLTWHEDIR-------PLSGSNVENKKASDARKVCWDGTGVGNPIGFWECHTQGG--NQL 572

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +++    +Q  H+  + C + H E  ++ +  CD++N  Q+W
Sbjct: 573 FKYVPDKKQFIHSPSRACATAHEEQKKVYLKACDDSNDNQKW 614


>gi|72000997|ref|NP_001024216.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
 gi|51316004|sp|Q8I136.2|GALT4_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           Short=pp-GaNTase 4; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 4; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|3047189|gb|AAC13670.1| GLY4 [Caenorhabditis elegans]
 gi|11064525|emb|CAC14394.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
          Length = 589

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 72/144 (50%), Gaps = 22/144 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K I     P A F++ GDI++++A+RD+L CK F W++++V       +P
Sbjct: 398 RAAEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLENV-------YP 450

Query: 115 EL------PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF---RLNAEG 165
           +L      P   F  +  NL    CLDSM R      G   CHG+G +Q +   +L    
Sbjct: 451 QLEIPRKTPGKSFQMKIGNL----CLDSMARKESEAPGLFGCHGTGGNQEWVFDQLTKTF 506

Query: 166 QLGHGERCVD--ADKQEKLVEMLK 187
           +    + C+D  ++ + K V M+K
Sbjct: 507 KNAISQLCLDFSSNTENKTVTMVK 530


>gi|350592356|ref|XP_003359108.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like, partial [Sus scrofa]
          Length = 209

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 50/106 (47%), Gaps = 1/106 (0%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GD+S Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 76  RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 135

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
            + PP   WGE +N+    C+DS        +    C   GS + +
Sbjct: 136 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDICVKDGSERTW 181


>gi|260836359|ref|XP_002613173.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
 gi|229298558|gb|EEN69182.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
          Length = 539

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 43/138 (31%), Positives = 69/138 (50%), Gaps = 14/138 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A  +  G+I  ++ LR KL CK F+W+++HV       +P
Sbjct: 350 RAAEVWMDEYKEYYYAAVPSARNVPFGNIQSRLELRKKLSCKPFAWYLEHV-------YP 402

Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  C+D++G  A   +G   CHGSG +Q + L  +  + H 
Sbjct: 403 ELRIPDKKDVAFG-ALQQGTL-CMDTLGHFADGTVGVYECHGSGGNQEWALTKDKSIRHS 460

Query: 171 ERCVDADKQEKLVEMLKL 188
           + C+    Q    E+LKL
Sbjct: 461 DLCLTVVNQNP-GELLKL 477


>gi|221130543|ref|XP_002162500.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Hydra magnipapillata]
          Length = 578

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 40/138 (28%), Positives = 69/138 (50%), Gaps = 13/138 (9%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A +   GDI +++ LR  L+CK F W+++++       +P
Sbjct: 389 RAAEVWMDDYKKYYFAAVPSARYSLFGDIRDRLQLRKDLNCKSFKWYLENI-------YP 441

Query: 115 ELP-PN---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL  P+   + +G+ K   ++ CLD+MG      IG   CHG G +Q +      Q+ H 
Sbjct: 442 ELKVPDDDVIKYGQIKYKVSEDCLDTMGHIKGEGIGLFPCHGQGGNQDWSWTKSNQIKHE 501

Query: 171 ERCVD--ADKQEKLVEML 186
             C+   + K E++V M+
Sbjct: 502 SLCLSGISKKSEEIVRMV 519


>gi|432934600|ref|XP_004081948.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3-like [Oryzias
           latipes]
          Length = 600

 Score = 67.4 bits (163), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 102/231 (44%), Gaps = 36/231 (15%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +     + +A     GDIS++  LR++L CK+FSW++
Sbjct: 400 QVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKEETFGDISKRKDLRERLQCKNFSWYL 459

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
           K++  +++   P+L P LF G  KN+G   CLD+            Y CHG G +Q F  
Sbjct: 460 KNIYPEIF--MPDLNPLLF-GSVKNVGKASCLDAGENNEGGKELIMYPCHGLGGNQYFEY 516

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC-FNSSGT 220
           +   ++ H       + Q++L          C+           G V+ L  C +    T
Sbjct: 517 STHREVRH-------NIQKEL----------CLHGA--------GGVVKLEECQYKGRNT 551

Query: 221 VDGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             G    W   ++ +         CL+ H +   LA  RC+ ++ YQQW F
Sbjct: 552 FVGAEQKWELKDN-QLFYFPGGNMCLTAHQDHPSLA--RCNPSDRYQQWSF 599


>gi|380021258|ref|XP_003694487.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like [Apis florea]
          Length = 537

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 92/211 (43%), Gaps = 31/211 (14%)

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
           GD+SE+ ALR +L CK F W++ +V  +++     +      GE +NLG     CLDS  
Sbjct: 337 GDVSERKALRKRLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 392

Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
           R A    PA  G   CH  G +Q+  L +         C+D+  K E L + + L     
Sbjct: 393 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSPGKPEDLHQPVGLYPCHR 443

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
                  +  + G +    +C + SGT            +  W ++  ++Q+ H    +C
Sbjct: 444 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNPQSKQIRHGSSDKC 503

Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           L++     +L M  C  + + Q+W F+   P
Sbjct: 504 LAITESKQRLIMEECSPSAARQRWSFENYDP 534


>gi|344271584|ref|XP_003407617.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 12-like [Loxodonta
           africana]
          Length = 576

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 57/229 (24%), Positives = 96/229 (41%), Gaps = 43/229 (18%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 376 VRAAEVWMDEYKELYYHRNPHARLEPFGDVTERRQLRAKLQCKDFRWFLETVYPELHV-- 433

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G ++ C D        + G      +CHG G +Q F   ++ ++ 
Sbjct: 434 PEDRPG-FFGMLQNKGLKEYCFDYNPPKDDQISGHQVLLYHCHGMGQNQFFEYTSQNEIR 492

Query: 169 HGER----CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
           +  R    CV A  +  ++ M                 C+     N    F   G+    
Sbjct: 493 YNTRQPEGCVSAPAESDILFME---------------LCKETAPENQKFIFREDGS---- 533

Query: 225 WRFHNSTRQLEHAILKRCLSVH---PETNQLAMLRCDENNSYQQWRFKE 270
                    L H + K+C+      P  + + +LR   N+  Q+W FKE
Sbjct: 534 ---------LFHVLSKKCVQAERKAPGDSVVPLLRDCTNSERQKWFFKE 573


>gi|296484977|tpg|DAA27092.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
          Length = 320

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 50/106 (47%), Gaps = 1/106 (0%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GD+S Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 211 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFKWFMAAVAWDVPKYYP 270

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
            + PP   WGE +N+    C+DS        +    C   GS + +
Sbjct: 271 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTW 316


>gi|395515411|ref|XP_003761898.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Sarcophilus harrisii]
          Length = 590

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 64/232 (27%), Positives = 90/232 (38%), Gaps = 49/232 (21%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK+I    +  A     GDI+E+  LRDKL CKDF WF+++V  +++   
Sbjct: 391 VRAAEVWMDEFKEIYYHRNMHARKEPYGDITERKELRDKLKCKDFRWFLENVYPELH--I 448

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G   N G    C D    +   + G       CHG G +Q F   +  +L 
Sbjct: 449 PEDRPGYF-GMLVNRGMADYCFDYNPPSESEITGNQVILYLCHGMGQNQFFEYTSHNELR 507

Query: 169 HGER----CVDADKQEKLVEMLKLPVTFC---VLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
           +  R    C   D     V    L +  C   V    K IF   GT+             
Sbjct: 508 YNTRQPEACAAVD-----VGTDHLTMHLCYKNVPENQKFIFKEDGTLF------------ 550

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWRFKE 270
                         H   K+C+     +   N   +LR   N+ YQ+W FKE
Sbjct: 551 --------------HVQSKKCVQAEANSSNGNPAPLLRTCTNSEYQKWFFKE 588


>gi|449275388|gb|EMC84260.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Columba livia]
          Length = 632

 Score = 67.0 bits (162), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 104/233 (44%), Gaps = 40/233 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GDIS+++ LR +L CK+F+W++
Sbjct: 429 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYL 488

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            +V  + Y   P+L P LF G  KN G + CLD +G       P ++ +  CHG G +Q 
Sbjct: 489 SNVYPEAYV--PDLNP-LFSGYLKNTGNRMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 542

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A  ++ H    E C+ A K                   V+L  C        +   
Sbjct: 543 FEYSAHHEIRHNIQKELCLHASK-----------------GPVQLRECH-------YKGQ 578

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
            +    +  W  H   + L +A L  CL+ + E   LA   C+ ++ +Q+W F
Sbjct: 579 KTFAVGEEQW-LHQKDQTLYNAALHMCLTGNGEHPSLA--SCNPSDPFQKWIF 628


>gi|307203928|gb|EFN82835.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
           [Harpegnathos saltator]
          Length = 482

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 88/211 (41%), Gaps = 27/211 (12%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
           + GD+SE+  LR KL CK F W++ +V  +++     +      GE +NLG     CLDS
Sbjct: 280 NYGDVSERKTLRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 335

Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
             R A      G   CH  G +Q+ R  A G       C+D+  K E L + + L     
Sbjct: 336 PARKADLHKPCGLYPCHRQGGNQI-RQVASGI------CIDSPGKSEDLHQPVGLYPCHR 388

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
                  +  + G +    +C + SG+            +  W ++  T  + H    +C
Sbjct: 389 QGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWIYNPQTNHIRHGSSDKC 448

Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           L++     QL M  C  N   Q+W F+   P
Sbjct: 449 LAITESKQQLVMEECSPNAPRQKWSFENYDP 479


>gi|432098843|gb|ELK28338.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Myotis davidii]
          Length = 428

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 91/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 222 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTQIAWDLPKFYPPVEPP 281

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +++GT  C D+    A + +    C  G G +     Q+F       +  G  
Sbjct: 282 AAAWGEIRSVGTGLCADTKHGAAGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 339

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  WR+    +
Sbjct: 340 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWRYRKD-K 378

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 379 SLYHPVSGSCMDCSESDHRIFMNICNPSSPTQQWLFEHT 417


>gi|47226381|emb|CAG09349.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 631

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 65/231 (28%), Positives = 97/231 (41%), Gaps = 10/231 (4%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVIS  +  L       + ++F +     + LA     GDISE++  R +L CK FSW++
Sbjct: 405 QVISRNQVRLAEVWMDDYKEIFYRRNQQAAQLARDKAFGDISERLDFRVRLRCKSFSWYL 464

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
           K++  + +   P+L P L +G  KN+G   CLD+            Y CHG G +Q F  
Sbjct: 465 KNIYPEAF--IPDLNP-LSFGSVKNVGKDSCLDAGENNEGGKKLIMYPCHGLGGNQYFEY 521

Query: 162 NAEGQLGHG---ERCVD-ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNS 217
           +   ++ H    E C+  A    +L E         V A  K     +  V  L    + 
Sbjct: 522 STHHEIRHNIQKELCLHGAAGAVRLEECQYKGRNTLVGAEQKWQLKDVSLVSLLLERSHF 581

Query: 218 SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           S      W F    R +    L +CLS   +   LA   C+  + YQ W F
Sbjct: 582 SFKTSICWLFCTQNRLVYIPALNKCLSARQDHPFLAA--CNPMDRYQLWVF 630


>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Amphimedon queenslandica]
          Length = 512

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 83/215 (38%), Gaps = 42/215 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A   DMG I  +VALR +L+C  F W+MK+V       +P
Sbjct: 333 RAAEVWMDNYKDYYYSARPSAKGRDMGSIKSRVALRKRLNCTTFDWYMKNV-------YP 385

Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSS-QLFRLNAEGQLGHGE 171
           EL  P +      K      CLD++G  A   +G   C  S    Q + +  +G + H  
Sbjct: 386 ELSVPSSTNNKHGKLKQNNLCLDTLGHQAGEPVGLQDCQQSRQGYQDWSIAMKGLIRHLN 445

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C++A  Q                  V L +C    + N              W  H   
Sbjct: 446 LCLEARGQI-----------------VHLQYCSKDHIQN--------------WE-HTKN 473

Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
             + H   K CLS     N+L +  CD+ ++ QQW
Sbjct: 474 NHIVHLSTKLCLSSRHSLNELTLETCDDKSNNQQW 508


>gi|393910975|gb|EJD76111.1| glycosyl transferase, variant [Loa loa]
          Length = 549

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 39/131 (29%), Positives = 61/131 (46%), Gaps = 3/131 (2%)

Query: 30  QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
           ++GH  RC  V      I    S    R    W+  + +      P  + +D GD+S+Q 
Sbjct: 378 RVGHIYRCKYVPFPDPGIGDFISKNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSKQK 437

Query: 88  ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
           A+R +L CK F WFMK +A+D    +P + P +   GE +N+   KC+D+  + A    G
Sbjct: 438 AVRKRLKCKSFDWFMKEIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTHFKNAEEKFG 497

Query: 147 TSYCHGSGSSQ 157
              C   G + 
Sbjct: 498 LRKCVSDGPAN 508


>gi|426226648|ref|XP_004007451.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 6 [Ovis aries]
          Length = 792

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 68/124 (54%), Gaps = 12/124 (9%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ LR++L+C++FSWF+ ++  +
Sbjct: 598 QVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERLNCRNFSWFLDNIYPE 657

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
           ++   P+L P  F G  KNLG   CLD   +     P ++    CHG G +Q F    + 
Sbjct: 658 MFV--PDLKPTFF-GALKNLGVDHCLDVGENNNGGKPLIL--YACHGLGGNQYFEYTTQR 712

Query: 166 QLGH 169
            L H
Sbjct: 713 DLRH 716


>gi|432882425|ref|XP_004074024.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Oryzias latipes]
          Length = 549

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 94/230 (40%), Gaps = 42/230 (18%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K+I    +P A     GD+SE++ LR+KL CK+F W++ +V  +++   
Sbjct: 347 VRAAEVWMDDYKEIYYHRNPHARLEAFGDVSERLKLREKLGCKNFQWYLDNVYPEIHV-- 404

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           P   P +F G  KN G    C D        ++G       CHG G +Q F  + +G++ 
Sbjct: 405 PHDRPGMF-GMLKNRGKVNYCFDYNPADENTIVGQRVILYPCHGMGQNQFFEYSVDGEIC 463

Query: 169 HGER----CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
           +  R    C+  D                           + T L +  C      V   
Sbjct: 464 YNTREPAGCIVGDS--------------------------VNTYLTMQLCKKRGQPVPME 497

Query: 225 WRF-HNSTRQLEHAILKRCLSVHPETNQ---LAMLRCDENNSYQQWRFKE 270
            +F       L HA+ ++C+     T+       L+   ++ YQQW F+E
Sbjct: 498 QKFVFRKDGSLYHALSQKCVQAMDRTDNGTPAPALQPCSDSVYQQWFFEE 547


>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
           norvegicus]
 gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
           norvegicus]
 gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
          Length = 603

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+   T  + +    C  G G     S Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCTDTKHGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590


>gi|308456556|ref|XP_003090710.1| hypothetical protein CRE_13795 [Caenorhabditis remanei]
 gi|308260992|gb|EFP04945.1| hypothetical protein CRE_13795 [Caenorhabditis remanei]
          Length = 487

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 22/143 (15%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K I     P A F++ GDIS+++A+RD+L CK F W++  V       +P
Sbjct: 180 RAAEVWMDEYKAIYLKNVPSARFVNYGDISDRLAIRDRLQCKSFKWYLDTV-------YP 232

Query: 115 EL-PPNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL--------NAE 164
           +L  P    G++  +     CLDSM R      G   CHG+G +Q +          NA 
Sbjct: 233 QLVVPKKTPGKSLQMKMGHLCLDSMARKEDDAPGLFACHGTGGNQEWVFDPLTKTFKNAI 292

Query: 165 GQLGHGERCVDADKQEKLVEMLK 187
            QL     C+D   ++K V M+K
Sbjct: 293 TQL-----CLDFAAEKKAVLMVK 310


>gi|148672125|gb|EDL04072.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
          Length = 436

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 42/226 (18%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +    + GDISE++ LR++L C +FSW++ +V  +
Sbjct: 242 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPE 301

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 302 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 357

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
           L H                           G +L     G+ L L +C     NS    D
Sbjct: 358 LRHN-------------------------IGKQLCLHASGSTLGLRSCQFVGKNSRVPKD 392

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             W      + + ++    CL+   +  + AM  C+  + YQ W F
Sbjct: 393 EEWEL-TQDQLIRNSGSGTCLT--SQDKKPAMAPCNPRDPYQLWLF 435


>gi|115533032|ref|NP_001041036.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
 gi|182676440|sp|O45947.3|GLT10_CAEEL RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
           10; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 10; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|3880991|emb|CAA16378.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
          Length = 684

 Score = 66.2 bits (160), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 89/220 (40%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P     D GD+     +R+KL CK F WFMK +A+D    +P + P 
Sbjct: 474 WMDDYKETLYKHRPGVGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 533

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
               GE +N+GT  C+D+  +      G   C       G  Q  RL     +    R  
Sbjct: 534 ASAEGEIRNVGTNFCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDIRPKGRKI 593

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C D         + K PV                    LF+C +  G  +  +++  + +
Sbjct: 594 CFDCS-----TSVDKAPVI-------------------LFDCHSMKG--NQLFKYRVAQK 627

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
           Q+ H I  +CL+        L M +CD ++  Q+W ++ V
Sbjct: 628 QIYHPISGQCLTADENGKGFLHMKKCDSSSDLQKWAWQTV 667


>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
           AltName: Full=Polypeptide GalNAc transferase 6;
           Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 6;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 6
 gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
 gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
 gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
 gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
 gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 [Mus musculus]
          Length = 622

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 42/226 (18%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +    + GDISE++ LR++L C +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 543

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
           L H                           G +L     G+ L L +C     NS    D
Sbjct: 544 LRHN-------------------------IGKQLCLHASGSTLGLRSCQFVGKNSRVPKD 578

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             W      + + ++    CL+   +  + AM  C+  + YQ W F
Sbjct: 579 EEWEL-TQDQLIRNSGSGTCLTS--QDKKPAMAPCNPRDPYQLWLF 621


>gi|307172175|gb|EFN63700.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
           [Camponotus floridanus]
          Length = 433

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 54/211 (25%), Positives = 88/211 (41%), Gaps = 27/211 (12%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
           + GD+SE+  LR KL CK F W++ ++  +++     +      GE +NLG     CLDS
Sbjct: 231 NYGDVSERKTLRKKLGCKSFKWYLDNIYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 286

Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
             R A      G   CH  G +Q+ ++ A G       C+D+  K E L + + L     
Sbjct: 287 PARKADLHKPCGLYPCHRQGGNQIRQI-ASGI------CIDSPGKSEDLHQPVGLYPCHR 339

Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
                  +  + G +    +C + SG+            +  W +   T  + H    +C
Sbjct: 340 QGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWIYSPQTNHIRHGSSDKC 399

Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           L++     QL M  C  N   Q+W F+   P
Sbjct: 400 LAITESKQQLVMEECSPNAPRQRWSFENYDP 430


>gi|15530299|gb|AAH13945.1| GALNT12 protein [Homo sapiens]
          Length = 272

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 72  VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 129

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F G  +N G T  C D        ++G       CHG G +Q F   ++ ++
Sbjct: 130 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 187


>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
          Length = 622

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 42/226 (18%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +    + GDISE++ LR++L C +FSW++ +V  +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 543

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
           L H                           G +L     G+ L L +C     NS    D
Sbjct: 544 LRHN-------------------------IGKQLCLHASGSTLGLRSCQFVGKNSRVPKD 578

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             W      + + ++    CL+   +  + AM  C+  + YQ W F
Sbjct: 579 EEWEL-TQDQLIRNSGSGTCLTS--QDKKPAMAPCNPRDPYQLWLF 621


>gi|10437274|dbj|BAB15027.1| unnamed protein product [Homo sapiens]
          Length = 284

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 84  VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 141

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F+G  +N G T  C D        ++G       CHG G +Q F   ++ ++
Sbjct: 142 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 199


>gi|115533034|ref|NP_001041037.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
 gi|87251651|emb|CAJ76949.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
          Length = 622

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 89/220 (40%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P     D GD+     +R+KL CK F WFMK +A+D    +P + P 
Sbjct: 412 WMDDYKETLYKHRPGVGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 471

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
               GE +N+GT  C+D+  +      G   C       G  Q  RL     +    R  
Sbjct: 472 ASAEGEIRNVGTNFCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDIRPKGRKI 531

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C D         + K PV                    LF+C +  G  +  +++  + +
Sbjct: 532 CFDCS-----TSVDKAPVI-------------------LFDCHSMKG--NQLFKYRVAQK 565

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
           Q+ H I  +CL+        L M +CD ++  Q+W ++ V
Sbjct: 566 QIYHPISGQCLTADENGKGFLHMKKCDSSSDLQKWAWQTV 605


>gi|62148924|dbj|BAD93346.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 581

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 381 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 438

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F G  +N G T  C D        ++G       CHG G +Q F   ++ ++
Sbjct: 439 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 496


>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
 gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
          Length = 693

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 57/229 (24%), Positives = 89/229 (38%), Gaps = 78/229 (34%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLG------ 129
           D GD+S++  LR+ L CK F W++        T +PEL  P N    GE +NLG      
Sbjct: 492 DFGDVSDRKKLREDLQCKSFQWYLD-------TIYPELFIPGNAVANGEIRNLGYGGRTC 544

Query: 130 -------------------------------TQKCLDSMGRTA-PAVIGTSYCHGSGSSQ 157
                                          T  CLD+M +    + +G   CHG G  Q
Sbjct: 545 LDSPSGKRNLKKAVGLYPCHKQGGNQIRNINTNMCLDAMLKNEDESPVGVYECHGQGGHQ 604

Query: 158 LFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNS 217
            + L+  G++   + C+D                    AG  +I         LF C  S
Sbjct: 605 YWMLSKAGEIRRDQACLD-------------------YAGKDVI---------LFGCHGS 636

Query: 218 SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
            G  +  W++  +T+ L H    +CL+++   ++L M  CD ++  Q W
Sbjct: 637 RG--NQFWQYRENTKLLHHGSSGKCLAINESRDKLIMEECDASHLRQHW 683


>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
 gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
          Length = 398

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/105 (35%), Positives = 58/105 (55%), Gaps = 4/105 (3%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K+I     P     D GD+S + ALR++L CK F W++ +V  ++    
Sbjct: 297 MRLAEVWMDDYKKIFYAMRPQLKGKDYGDVSGRKALRERLMCKSFKWYLDNVISEL--AI 354

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQ 157
           P+L P +  GE +NLGT  CLD++ +  A    G   CHG G++Q
Sbjct: 355 PDLYP-IGRGEIRNLGTNTCLDTLAKNEAGGEPGMYMCHGMGNNQ 398


>gi|119579301|gb|EAW58897.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 (GalNAc-T12) [Homo
           sapiens]
          Length = 517

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 61/122 (50%), Gaps = 8/122 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 317 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 374

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G T  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 375 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 433

Query: 169 HG 170
           + 
Sbjct: 434 YN 435


>gi|114625882|ref|XP_001157326.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
           troglodytes]
          Length = 483

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 61/122 (50%), Gaps = 8/122 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 283 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 340

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G T  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 341 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 399

Query: 169 HG 170
           + 
Sbjct: 400 YN 401


>gi|341894191|gb|EGT50126.1| CBN-GLY-4 protein [Caenorhabditis brenneri]
          Length = 584

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/138 (30%), Positives = 68/138 (49%), Gaps = 12/138 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K I     P A F++ GDI++++A+RD+L CK F W++  V       +P
Sbjct: 395 RAAEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLDTV-------YP 447

Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLF---RLNAEGQLGH 169
           +L  P    G++  +     CLDSM R      G   CHG+G +Q +    L    +   
Sbjct: 448 QLEVPKKAAGKSVQVKMGHHCLDSMARKENEAPGLFACHGTGGNQEWVFDHLTKTFKNAI 507

Query: 170 GERCVDADKQEKLVEMLK 187
            + C+D  + +K V M+K
Sbjct: 508 TQLCLDFAEDKKPVMMVK 525


>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 683

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 100/225 (44%), Gaps = 50/225 (22%)

Query: 54  LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +R    WL  +KQ   + +G  L    D GD+S +  LRD L C+ F W+++       T
Sbjct: 485 VRLAEVWLDEYKQYYYQRIGDDLG---DFGDVSARKRLRDNLKCRSFDWYVR-------T 534

Query: 112 KFPEL--PPN-LFWGEAKNLGT--QKCLDS-MGR-TAPAVIGTSYCHGSGSSQLFRLNAE 164
            +PEL  P + +  GE +N G     CLDS  GR      +G   CHG G +Q + L+ E
Sbjct: 535 IYPELFVPGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKE 594

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
           G++   E C+D                    AG  +I         L+ C  S G  +  
Sbjct: 595 GEIRRDEACLD-------------------YAGSDVI---------LYPCHGSKG--NQL 624

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           W + +++  ++H   ++CL +  +  +++M  C   +  Q W F+
Sbjct: 625 WIYDDASGTVQHGSSRKCLEMSEDRQKVSMEIC-SGSLRQSWIFQ 668


>gi|397499979|ref|XP_003820707.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
           paniscus]
          Length = 568

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 368 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 425

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F+G  +N G T  C D        ++G       CHG G +Q F   ++ ++
Sbjct: 426 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 483


>gi|112807221|ref|NP_078918.3| polypeptide N-acetylgalactosaminyltransferase 12 [Homo sapiens]
 gi|84028209|sp|Q8IXK2.3|GLT12_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
           AltName: Full=Polypeptide GalNAc transferase 12;
           Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 12;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 12
 gi|25815116|emb|CAC80100.2| UDP-GalNAc-transferase 12 [Homo sapiens]
 gi|151555601|gb|AAI48740.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
           [synthetic construct]
 gi|151556464|gb|AAI48410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
           [synthetic construct]
 gi|261858036|dbj|BAI45540.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 [synthetic
           construct]
          Length = 581

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 8/122 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 381 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 438

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G T  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 439 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 497

Query: 169 HG 170
           + 
Sbjct: 498 YN 499


>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Oryzias latipes]
          Length = 607

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 58/223 (26%), Positives = 91/223 (40%), Gaps = 42/223 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+S Q  LR  L+CK+F WFM+ VA+D+   +P + PP
Sbjct: 399 WMDEYAEYVYQRRPEYRHLSAGDMSAQKELRSHLNCKNFRWFMEEVAWDLPKHYPPVEPP 458

Query: 119 NLFWGEAKNLGTQKCLD-------SMGRTAPAVIG---TSYCHGSGSSQLFRLNAE-GQL 167
              WGE +++G+  C++       S  R    V G    S+ HG   +  +R +   G  
Sbjct: 459 AAAWGEIRSVGSGMCMEIKHFVSGSPIRLESCVKGRGDVSWSHGQVLTFGWREDIRVGDP 518

Query: 168 GHGER-CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
            H  + C DA            PVT                   L++C    G  +  WR
Sbjct: 519 MHTRKVCFDAVSHHS-------PVT-------------------LYDCHGMKG--NQLWR 550

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           +    + L H +   C+       ++ M  CD  +  QQW F+
Sbjct: 551 YRKD-KSLFHPVSNSCIDSSASERRVFMNTCDPTSLSQQWLFE 592


>gi|5834643|emb|CAB55352.1| N-acetylgalactosaminyltransferase T-6 [Mus musculus]
          Length = 623

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 68/124 (54%), Gaps = 10/124 (8%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +    + GDISE++ LR++L C +FSW++ +V  +
Sbjct: 429 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLQLREQLRCHNFSWYLHNVYPE 488

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           ++   P+L P  F+G  KNLGT +CLD     R    +I    CH  G +Q F   ++  
Sbjct: 489 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 544

Query: 167 LGHG 170
           L H 
Sbjct: 545 LRHN 548


>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 648

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 100/225 (44%), Gaps = 50/225 (22%)

Query: 54  LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +R    WL  +KQ   + +G  L    D GD+S +  LRD L C+ F W+++       T
Sbjct: 450 VRLAEVWLDEYKQYYYQRIGDDLG---DFGDVSARKRLRDNLKCRSFDWYVR-------T 499

Query: 112 KFPEL--PPN-LFWGEAKNLGT--QKCLDS-MGR-TAPAVIGTSYCHGSGSSQLFRLNAE 164
            +PEL  P + +  GE +N G     CLDS  GR      +G   CHG G +Q + L+ E
Sbjct: 500 IYPELFVPGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKE 559

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
           G++   E C+D                    AG  +I         L+ C  S G  +  
Sbjct: 560 GEIRRDEACLD-------------------YAGSDVI---------LYPCHGSKG--NQL 589

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           W + +++  ++H   ++CL +  +  +++M  C   +  Q W F+
Sbjct: 590 WIYDDASGTVQHGSSRKCLEMSEDRQKVSMEIC-SGSLRQSWIFQ 633


>gi|22122074|dbj|BAC07181.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 [Homo sapiens]
          Length = 581

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 8/122 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LRDKL CKDF WF++ V  +++   
Sbjct: 381 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 438

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G T  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 439 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 497

Query: 169 HG 170
           + 
Sbjct: 498 YN 499


>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
           [Loxodonta africana]
          Length = 622

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 98/224 (43%), Gaps = 38/224 (16%)

Query: 53  ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I   R+L  + +A     GDISE++ L+++LHC +FSWF+ +V  +
Sbjct: 428 QVRLAEVWMDDYKEIFYRRNLQAAKMAEEKSFGDISERLKLKEQLHCHNFSWFLHNVYPE 487

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQL 167
           ++   P+L P  F+G  K+LGT  CLD             Y CH  G +Q F    +  L
Sbjct: 488 MFV--PDLKPT-FYGAIKSLGTDHCLDVGENNHGGKPLIMYPCHSLGGNQYFEYTTQRDL 544

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
            H    + C+ A+                  AG       LG     F   NS    D  
Sbjct: 545 RHNIAKQLCLHAN------------------AGT------LGLRSCHFTGKNSQVPKDEE 580

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           W      + + ++    CL+   +  Q AM  C+ ++ +Q W F
Sbjct: 581 WEL-TQDQLIRNSGSGTCLTS--QDKQPAMAPCNPSDPHQHWLF 621


>gi|157118275|ref|XP_001653147.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108875773|gb|EAT39998.1| AAEL008252-PA [Aedes aegypti]
          Length = 648

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/194 (27%), Positives = 83/194 (42%), Gaps = 48/194 (24%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPPN--LFWGEAKN--LGTQKC 133
           D GD+S +  LR  L+CK F W++++V       FPE   P+  +  GE +N   GT +C
Sbjct: 481 DFGDVSSRKQLRQHLNCKSFRWYLENV-------FPEQFDPSRAVGRGEFRNGENGTDRC 533

Query: 134 LD-SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
           LD  + R      G + CHG G  Q++    EG++   + C+D D               
Sbjct: 534 LDWPLARNQ---CGVTSCHGRGRHQMWYFTREGEITRKDHCLDYD--------------- 575

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
                        G  L +  C    G  +  W +   T+Q  H + K+CL       +L
Sbjct: 576 -------------GKTLEMNRCHQMGG--NQLWEYAEKTQQFRHFLSKKCLEF--SEGKL 618

Query: 253 AMLRCDENNSYQQW 266
            M +C ++ S Q+W
Sbjct: 619 NMKKCMKSGSGQKW 632


>gi|443721252|gb|ELU10645.1| hypothetical protein CAPTEDRAFT_228331 [Capitella teleta]
          Length = 512

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 53/219 (24%), Positives = 96/219 (43%), Gaps = 40/219 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  ++ GDI  ++ LR+KL C+ FSWF+++V       +P
Sbjct: 324 RAAEVWMDEYKSYYYAEVPSAKSVNFGDIRSRLELREKLKCRPFSWFLQNV-------YP 376

Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
            L      ++ +G  +      C+D++       +G   CH +G +Q + L    ++ H 
Sbjct: 377 SLIVPSEQDVQFGYIQQ--GSMCVDTLSNALGGKVGMFQCHNTGGNQEWVLTKNQKIKHL 434

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
           + C+             +P      A  + +  RL T  N        G+ +  WR  N+
Sbjct: 435 DLCI------------TMP------ASARHMALRLETCAN--------GSRNQNWRLTNN 468

Query: 231 TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
              L+ +    C+ SV  ET+ L + +CD  ++ QQ++F
Sbjct: 469 GHMLKSSRADLCIDSVDHETHGLTLDKCDSGSATQQFQF 507


>gi|195478397|ref|XP_002086495.1| GE23160 [Drosophila yakuba]
 gi|194186285|gb|EDW99896.1| GE23160 [Drosophila yakuba]
          Length = 590

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/256 (23%), Positives = 108/256 (42%), Gaps = 41/256 (16%)

Query: 21  QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
            LFR + + Q+ +  +  ++++++IS        R    WL  +K       P    + +
Sbjct: 350 HLFR-DGKFQVRYTNKDKNSERKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLDSM- 137
           G+++EQ  L+ +LHCK F WF+ ++A D    +P L P  +  G  +++ + K CLD   
Sbjct: 405 GNLAEQRDLKKRLHCKPFKWFLDNLAADFLNLYPILDPAEYASGVLQSVSSPKLCLDRKE 464

Query: 138 GRTAPAVIG--TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
            R     +G  +S        Q + L    +L  G  C++   Q                
Sbjct: 465 PRNGQPKLGPCSSDHIFPSPEQYWSLTNRRELRSGFYCLEVRNQ---------------- 508

Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQL 252
                     G  ++++ C   SG  +  W F + T Q+   +   ++ CL   PE+N +
Sbjct: 509 ----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVVSGQQQNIRHCLEAQPESNSV 556

Query: 253 AMLRCDENNSYQQWRF 268
               CD  N  Q+W+F
Sbjct: 557 TSSVCDLKNHRQRWKF 572


>gi|195495309|ref|XP_002095212.1| GE22271 [Drosophila yakuba]
 gi|194181313|gb|EDW94924.1| GE22271 [Drosophila yakuba]
          Length = 590

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 60/256 (23%), Positives = 108/256 (42%), Gaps = 41/256 (16%)

Query: 21  QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
            LFR + + Q+ +  +  ++++++IS        R    WL  +K       P    + +
Sbjct: 350 HLFR-DGKFQVRYTNKDKNSERKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404

Query: 81  GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD-SM 137
           G+++EQ  L+ +LHCK F WF+ ++A D    +P L P  +  G  +++ + K CLD   
Sbjct: 405 GNLAEQRDLKKRLHCKPFKWFLDNLAADFLNLYPILDPAEYASGVLQSVSSPKLCLDRKE 464

Query: 138 GRTAPAVIG--TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
            R     +G  +S        Q + L    +L  G  C++   Q                
Sbjct: 465 PRNGQPKLGPCSSDHVFPSPEQYWSLTNRRELRSGFYCLEVRNQ---------------- 508

Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQL 252
                     G  ++++ C   SG  +  W F + T Q+   +   ++ CL   PE+N +
Sbjct: 509 ----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQQQNIRHCLEAQPESNSV 556

Query: 253 AMLRCDENNSYQQWRF 268
               CD  N  Q+W+F
Sbjct: 557 TSSVCDLKNHRQRWKF 572


>gi|348519859|ref|XP_003447447.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Oreochromis niloticus]
          Length = 624

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 66/128 (51%), Gaps = 4/128 (3%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +     + +A     GDIS++V LR+KL CK FSW++
Sbjct: 424 QVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKDGAFGDISKRVELREKLQCKSFSWYL 483

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
           ++V  +V+   P+L P L +G  KN+G   CLD+            Y CHG G +Q F  
Sbjct: 484 QNVYPEVF--MPDLNP-LRFGSVKNVGKDSCLDAGENNEGGKQLIMYPCHGLGGNQYFEY 540

Query: 162 NAEGQLGH 169
           +   ++ H
Sbjct: 541 STHHEIRH 548


>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Acyrthosiphon pisum]
          Length = 674

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 98/211 (46%), Gaps = 38/211 (18%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNL--GTQKC 133
           D GDI+ +  LR KL CK F W+++++       +PEL  P + +  GE +NL  G + C
Sbjct: 467 DYGDITSRKDLRRKLKCKSFKWYLENI-------YPELFIPGDAVASGEVRNLGYGNKTC 519

Query: 134 LDSMGR----TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKL 188
           LDS  R      PA  G   CH  G +Q+  + +         CVD+     K V++ + 
Sbjct: 520 LDSPARKTDLNKPA--GLYPCHKMGGNQIKNIVS-------NMCVDSKGDANKPVDLWQC 570

Query: 189 PV----TFCVLAGVKLI------FCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI 238
                  + +L+ +  I          G  + L+ C  S G  +  W + + T+QL H  
Sbjct: 571 HQQGGNQYWMLSKIGEIRRDESCLDYAGNDVILYPCHGSKG--NQYWNYDHKTKQLRHGS 628

Query: 239 LKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            ++CL+++   ++L M  CD     Q W+F+
Sbjct: 629 SRKCLTINSSKDKLLMEECDLMLPKQMWQFE 659


>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Anolis carolinensis]
          Length = 634

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 62/236 (26%), Positives = 103/236 (43%), Gaps = 46/236 (19%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       +  +F +  +  + +      GDIS++  L+ +L CKDF W++
Sbjct: 431 QVITRNQVRLAEVWMDEYKNIFYRRNTEAAKIVKQQTFGDISKRHELKQRLQCKDFKWYL 490

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            +V  + Y   P+L P L  G  KN+G + CLD +G       P ++ T  CHG G +Q 
Sbjct: 491 SNVYPEAYV--PDLNPPLS-GFLKNVGRRACLD-VGENNHGGKPLIMYT--CHGLGGNQY 544

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A  ++ H    E C+ A +                   V+L  C        +N  
Sbjct: 545 FEYSARHEIRHNIQKELCLHASQ-----------------GAVQLNECS-------YNGK 580

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLS---VHPETNQLAMLRCDENNSYQQWRF 268
            S  + +  W      + L +A L  CL+    HP     +M+ CD ++ +Q+W F
Sbjct: 581 KSRTSKEEKWEIRED-QLLYNAELNMCLTGNWEHP-----SMVSCDPSDHFQKWIF 630


>gi|432865221|ref|XP_004070476.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Oryzias latipes]
          Length = 621

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 59/230 (25%), Positives = 100/230 (43%), Gaps = 48/230 (20%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I        + +A     GDIS+++ LR+ LHCK+FSW++  +  +
Sbjct: 425 QVRLAEVWMDDYKKIYYRRNKNAAIMAQEKKYGDISDRLKLREDLHCKNFSWYLNTIYPE 484

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           ++   P+L P  F G  KNLG+  CLD +G       P ++    CH  G +Q F  ++ 
Sbjct: 485 IFV--PDLTPEKF-GAIKNLGSDTCLD-VGENNQGGKPVIM--YMCHNMGGNQYFEYSSH 538

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            +L H    + C+ A    + V      +  C L G                     GT 
Sbjct: 539 KELRHNIGKQLCLQAFDHPEQVR-----IELCQLKG--------------------KGTD 573

Query: 222 DGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
             P   W F      L++    +CL +  +  ++ M  C+  + +QQW F
Sbjct: 574 VAPQQQWIF-TEENLLKNPSSGKCLQL--KGGKVYMDSCNSADMHQQWAF 620


>gi|195576344|ref|XP_002078036.1| GD23236 [Drosophila simulans]
 gi|194190045|gb|EDX03621.1| GD23236 [Drosophila simulans]
          Length = 674

 Score = 65.1 bits (157), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 60/228 (26%), Positives = 96/228 (42%), Gaps = 41/228 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +KQ      P     LD GD++ Q  +R++L CK F WFM  VA D   KF
Sbjct: 458 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 517

Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
           P + PP+   G  +N+     CLD+MG++    +G   C  + +    +Q + L+    L
Sbjct: 518 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSCADNRTHPQPNQFWELSIFRDL 577

Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
              G    C+D  +                              + +++C +  G  +  
Sbjct: 578 RMKGFDSVCLDVHEGPP------------------------NATVWMWSCHSQGG--NQF 611

Query: 225 WRFHNSTRQLEHA-ILKRCLSVHPETNQLAML---RCDENNSYQQWRF 268
           W +   T++L H    KRCL    E N +AM+    C++ N  Q+W F
Sbjct: 612 WYYDRQTQRLVHGENNKRCLEGFVE-NGIAMVVANSCEDGNDRQRWEF 658


>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
           scrofa]
          Length = 506

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 91/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 300 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 359

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 360 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 417

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 418 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-Q 456

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            + H +   C+      +++ M  C+ ++  QQW F+ +
Sbjct: 457 TMYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHI 495


>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
           taurus]
          Length = 652

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 446 WMDEYAEHIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPP 505

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 506 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPG-- 563

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 564 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-Q 602

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 603 TLYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHT 641


>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
          Length = 528

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 322 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 381

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+   T  + +    C  G G     S Q+F       +  G  
Sbjct: 382 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 439

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 440 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 478

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 479 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 515


>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
 gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
          Length = 556

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 9/120 (7%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P       GDISE + LR +L C+ F W++++V  DV      + PN
Sbjct: 367 WMDDYKKFFYQRHPYLTDQSFGDISENLKLRQRLKCRSFRWYLQNVFTDV------ILPN 420

Query: 120 ---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
              +  G+ +N  +  CLD+ GRT+   +G S C+    + LF   +  ++   + C+DA
Sbjct: 421 ETAIATGKVRNPISNMCLDTFGRTSNTFLGLSPCNIQRDTMLFAYTSRKEISWNDACLDA 480


>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
           scrofa]
          Length = 624

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 91/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 418 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 477

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 478 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 535

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 536 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-Q 574

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            + H +   C+      +++ M  C+ ++  QQW F+ +
Sbjct: 575 TMYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHI 613


>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
          Length = 603

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+   T  + +    C  G G     S Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 590


>gi|410978730|ref|XP_003995741.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
           partial [Felis catus]
          Length = 469

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 12/133 (9%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+++V  +++   
Sbjct: 270 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHV-- 327

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        ++G      +CHG G +Q F   +  ++ 
Sbjct: 328 PEDRPGFF-GMLQNKGLRDYCFDYNPPNENQIVGHQVLLYHCHGMGQNQFFEYTSRNEIR 386

Query: 169 HG----ERCVDAD 177
           +     E C+  D
Sbjct: 387 YNTHQPEACIAVD 399


>gi|167519663|ref|XP_001744171.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777257|gb|EDQ90874.1| predicted protein [Monosiga brevicollis MX1]
          Length = 607

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 94/229 (41%), Gaps = 32/229 (13%)

Query: 53  ELRKRSSWLKVFKQI-RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +LR    W+  +K++ + +  PL   +D+GD+S    +R K  CK F WF+K+V  +++ 
Sbjct: 403 KLRAAEVWMDEYKEVVQRVMPPLPRGMDLGDLSAMQEIRRKFQCKPFKWFLKNVYPEMFV 462

Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
              E       GE +N  T  C D++G +   A IG   CH S  +Q F L+  G     
Sbjct: 463 PNDEESIEAS-GEIRNPQTNACFDTLGASHQGAKIGVYPCHHSHGTQEFVLSKAGD---- 517

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
                       V +  +    C+  G           + ++ C  + G  +  W++  +
Sbjct: 518 ------------VRVAAMDFDNCLDRG------NGDGSVGIWPCHQTGG--NQAWKWDRT 557

Query: 231 TRQLEHAILKRCLSVHPETN-----QLAMLRCDENNSYQQWRFKEVRPD 274
           T     A    C+ V  E        L + +CD  +  QQW   + + D
Sbjct: 558 TGLFGDATGDFCVEVRREQTANSPFALHLAKCDPKDPMQQWHVGQPKSD 606


>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Saimiri boliviensis boliviensis]
          Length = 682

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 476 WMDEYAEYIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 535

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 536 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 593

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 594 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 632

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 633 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 671


>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
           anubis]
          Length = 637

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 431 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 490

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 491 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 548

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 549 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 587

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 588 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 624


>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
          Length = 634

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 428 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 487

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+   T  + +    C  G G     S Q+F       +  G  
Sbjct: 488 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 545

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 546 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 584

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 585 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 621


>gi|68392893|ref|XP_688194.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Danio
           rerio]
          Length = 578

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/141 (30%), Positives = 65/141 (46%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    SP A     GD++++  LR +L CKDF WF+ ++  D+  + 
Sbjct: 376 VRAAEVWMDDFKEVYYHRSPHARLEAYGDVTDRRKLRMRLRCKDFRWFLDNIYPDI--QV 433

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P +F G  KN G T  C D        + G       CHG G +Q F  +   ++ 
Sbjct: 434 PEDKPGMF-GMLKNKGMTNYCFDYNPPDEHKIAGHRVILYPCHGMGQNQFFEYSTLQEIR 492

Query: 169 HGER----CVDADKQEKLVEM 185
           +  R    CV AD     + M
Sbjct: 493 YNTRDPAGCVVADPVSTFLTM 513


>gi|359320847|ref|XP_532008.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Canis
           lupus familiaris]
          Length = 578

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+++V  +++   
Sbjct: 379 VRAAEVWMDDFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHV-- 436

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        V+G       CHG G +Q F   ++ ++ 
Sbjct: 437 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQVVGYQVLLYICHGMGQNQFFEYTSQNEIR 495

Query: 169 HG----ERCVDADKQEKLVEM 185
           +     E C+  D    ++ M
Sbjct: 496 YNTHQPEACIAVDAGTDVLVM 516


>gi|313242250|emb|CBY34413.1| unnamed protein product [Oikopleura dioica]
          Length = 644

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 49/222 (22%), Positives = 91/222 (40%), Gaps = 32/222 (14%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   + +      P A  +D GD++E  A+R++L CK F WFMK + YD+   F
Sbjct: 416 VRVVEVWWDEYSKYFYERKPNAKHVDPGDLTEVKAVRERLQCKSFDWFMKEIGYDLPKHF 475

Query: 114 PELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           P + P N   G   N+GT  C+D               H +G      L+    +  G+ 
Sbjct: 476 PLVEPSNAARGHITNVGTNLCVDG--------------HHAGQGDQLALSECASVREGQW 521

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCR--------LGTVLNLFNCFNSSGTVDGP 224
            +   +  +       P++   +   K    R        +G  +  + C    G  +  
Sbjct: 522 MLTWHEDIR-------PLSGSNVENKKASDARKVCWDGTGVGNPIGFWECHTQGG--NQL 572

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +++    +Q  H+  + C + + E  ++ +  CD++N  Q+W
Sbjct: 573 FKYVPDKKQFIHSPSRACATANEEQKKVYLKACDDSNDNQKW 614


>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           isoform 5 [Macaca mulatta]
          Length = 603

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590


>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
           taurus]
 gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
           taurus]
          Length = 620

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 414 WMDEYAEHIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPP 473

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 474 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPG-- 531

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 532 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-Q 570

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 571 TLYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFE 607


>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
 gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Mus musculus]
 gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
          Length = 603

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+   T  + +    C  G G     S Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 590


>gi|410897032|ref|XP_003962003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Takifugu rubripes]
          Length = 624

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 60/230 (26%), Positives = 96/230 (41%), Gaps = 34/230 (14%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVIS  +  L       + ++F +     + L      GDIS+++ LR +L CK FSW++
Sbjct: 424 QVISRNQVRLAEVWMDDYKEIFYRRNQQAAQLVRDKAFGDISQRMDLRARLKCKSFSWYL 483

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
           K++  + +   P+L P L +G  KN+G   CLD+            Y CHG G +Q F  
Sbjct: 484 KNIYPEAF--IPDLNP-LGFGSVKNVGKDSCLDAGENNEGGKRVIMYPCHGLGGNQYFEY 540

Query: 162 NAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
           +   ++ H    E C+                       VKL  C+       +   N+ 
Sbjct: 541 STRHEIRHNIQKELCLHG-----------------AAGAVKLEECQ-------YKGRNTL 576

Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
              +  W+  ++ R +    L +CLS   E   LA   C+  + YQ W F
Sbjct: 577 VGAEQKWQLKDN-RLIYMPELNKCLSARQERPFLAA--CNPMDRYQLWVF 623


>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Mus musculus]
          Length = 603

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+   T  + +    C  G G     S Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 590


>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 599

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 88/217 (40%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  F +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 393 WMDEFAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 452

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 453 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 510

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 511 --DPQHTKK----------FCFDA------VSHSSPVTLYDCHSMKG--NQLWKYRKD-K 549

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+       ++ M  C  ++  QQW F+
Sbjct: 550 TLYHPVSGSCMDCSESDRRVFMNACRPSSPTQQWLFE 586


>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Sarcophilus harrisii]
          Length = 651

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 58/220 (26%), Positives = 95/220 (43%), Gaps = 36/220 (16%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR+ L+CK F WFM  +A+D+   +P + P 
Sbjct: 442 WMDEYAEYIYQRLPEYRHLSTGDVTAQKDLRNHLNCKSFKWFMTEIAWDLPRYYPPVEPA 501

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD--A 176
              WGE +N+GTQ C           IGT   HG+  S L RL         E CV   A
Sbjct: 502 AAAWGEIRNVGTQLC-----------IGTK--HGAPGSPL-RL---------ESCVKGRA 538

Query: 177 DKQEKLVEMLKLPVTFCVLAG-----VKLIFCRL--GTVLNLFNCFNSSGTVDGPWRFHN 229
           +     V++        +  G      K  F  +   + + L++C    G  +  W++ N
Sbjct: 539 EAAWSNVQVFTFSWREDIRPGDPQHTKKFCFDTISHSSPVTLYDCHGMKG--NQLWKYVN 596

Query: 230 STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
             + L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 597 G-KTLYHPVSGSCMDCTESDHRVFMNTCNPSSPTQQWIFQ 635


>gi|109079473|ref|XP_001111560.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           isoform 4 [Macaca mulatta]
          Length = 602

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 396 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 455

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 456 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 513

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 514 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 552

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 553 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 589


>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Otolemur garnettii]
          Length = 603

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAAQKRLRTSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNACNPSSLTQQWLFE 590


>gi|16741554|gb|AAH16585.1| Galnt10 protein [Mus musculus]
          Length = 339

 Score = 64.7 bits (156), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 55/222 (24%), Positives = 91/222 (40%), Gaps = 30/222 (13%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P
Sbjct: 128 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYP 187

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQL 167
            + PP   WGE +N+GT  C D+   T  + +    C  G G     S Q+F       +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDI 247

Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
             G    D    +K          FC  A          + + L++C +  G  +  W++
Sbjct: 248 RPG----DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKY 285

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               + L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 286 RKD-KTLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 326


>gi|268569766|ref|XP_002648333.1| C. briggsae CBR-GLY-4 protein [Caenorhabditis briggsae]
          Length = 523

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 68/144 (47%), Gaps = 18/144 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K I     P A F++ GDI +++A+RD+L CK F W++      VY +  
Sbjct: 326 RAAEVWMDEYKAIYLKNVPSARFVNYGDIGDRLAIRDRLQCKSFKWYLD----TVYPQLA 381

Query: 115 ELPPNL-----FWGEAKNLGTQKCLDSMGRT---APAVIGTSYCHGSGSSQLF---RLNA 163
            L  N+      W  A       CLDSM R    APA+     CHG+G +Q +    L  
Sbjct: 382 SLTRNVSSQKDAWQIAPMKIGHLCLDSMARKENEAPALFA---CHGTGGNQEWIFDDLTK 438

Query: 164 EGQLGHGERCVDADKQEKLVEMLK 187
             +    + C+D   ++K V M+K
Sbjct: 439 TFKNAISQMCLDFSAEKKDVVMVK 462


>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
 gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
          Length = 551

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 345 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 404

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 405 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 462

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 463 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 501

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 502 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 538


>gi|301763571|ref|XP_002917213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
           partial [Ailuropoda melanoleuca]
          Length = 598

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 392 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKTFKWFMTKIAWDLPKFYPPVEPP 451

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE  N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 452 AAAWGEIHNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 509

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 510 --DPQHTKK----------FCFDA------ISSSSPVTLYDCHSMKG--NQLWKYRRD-K 548

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  CD ++  QQW F+
Sbjct: 549 TLYHPVSGGCMDCSESDHRIFMNTCDPSSLTQQWLFE 585


>gi|426229966|ref|XP_004009054.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Ovis
           aries]
          Length = 339

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/224 (23%), Positives = 91/224 (40%), Gaps = 30/224 (13%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P
Sbjct: 128 RVAEVWMDEYAEHIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYP 187

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
            + PP   WGE +N+GT  C D+      + +    C  G G +     Q+F       +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDI 247

Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
             G    D    +K          FC  A          + + L++C +  G  +  W++
Sbjct: 248 RPG----DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKY 285

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
               + L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 286 RKD-QTLYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHT 328


>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Callithrix jacchus]
          Length = 667

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 461 WMDEYAEYIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMMKIAWDLPKFYPPVEPP 520

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 521 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 578

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 579 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 617

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 618 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 654


>gi|281345023|gb|EFB20607.1| hypothetical protein PANDA_005411 [Ailuropoda melanoleuca]
          Length = 551

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 345 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKTFKWFMTKIAWDLPKFYPPVEPP 404

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE  N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 405 AAAWGEIHNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 462

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 463 --DPQHTKK----------FCFDA------ISSSSPVTLYDCHSMKG--NQLWKYRRD-K 501

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  CD ++  QQW F+
Sbjct: 502 TLYHPVSGGCMDCSESDHRIFMNTCDPSSLTQQWLFE 538


>gi|149726707|ref|XP_001501206.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Equus
           caballus]
          Length = 561

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 355 WMDEYAEHIYQRRPEYRHLSAGDVAAQKQLRSSLNCKTFRWFMTQIAWDLPKFYPPVEPP 414

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 415 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 472

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 473 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 511

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 512 TLYHPVSGSCVDCSESDHRIFMNTCNPSSRTQQWLFE 548


>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Strongylocentrotus purpuratus]
          Length = 756

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 13/123 (10%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE---L 116
           WL  +K+     SP A+ ++ G+ +++V LRD+L C+ F W++++V       FPE    
Sbjct: 564 WLDGYKEFFYFMSPSALKVNAGNYTDRVELRDRLGCRSFQWYLENV-------FPEGGWP 616

Query: 117 PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCH--GSGSSQLFRLNAEGQLGHGERCV 174
             N  +GE ++  T  CLD+ GRT P +      H   +  +Q++    E ++ H   C+
Sbjct: 617 GRNKIYGEVRHTATNWCLDTGGRTTP-ITEPMVAHRCDNNVNQIWMYTEEQEIKHSSLCL 675

Query: 175 DAD 177
           D D
Sbjct: 676 DYD 678


>gi|32425807|gb|AAH07224.2| GALNT10 protein, partial [Homo sapiens]
          Length = 247

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 41  WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 100

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 101 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 158

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 159 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 197

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 198 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 236


>gi|167537119|ref|XP_001750229.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771219|gb|EDQ84888.1| predicted protein [Monosiga brevicollis MX1]
          Length = 587

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 60/224 (26%), Positives = 91/224 (40%), Gaps = 45/224 (20%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+I     P    L+ GD+SE+ ALR++L CK F W++ +V  D+    P+    
Sbjct: 390 WMDEYKEIYYEFRPSHRQLETGDVSERKALRERLKCKPFKWYLDNVFPDMMV--PDRHNL 447

Query: 120 LFWGEAKNLGTQKCLDSMG-RTAPAVIGTSYCHGSGSS--QLF-------RLNAEGQLGH 169
              G+ +N  T  CLD++  R A    G   C  S  S  Q+F        +  EG    
Sbjct: 448 YAKGQLRNANTNMCLDTLTPREADMKAGVYACASSPKSENQMFFYTKRFKEIRREGTF-- 505

Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
           G RC+D                    AG            +++ C    G  +  W  H 
Sbjct: 506 GSRCLD-------------------FAGGP------SGSPSMYGCHLMKGNQE--W-IHT 537

Query: 230 STRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
               + H   K CL + P+++    L +  C E N  Q+WRF E
Sbjct: 538 PDHHILHMASKLCLEIEPKSDHGYHLVLNPCFEANERQKWRFSE 581


>gi|195359229|ref|XP_002045319.1| GM11142 [Drosophila sechellia]
 gi|194122575|gb|EDW44618.1| GM11142 [Drosophila sechellia]
          Length = 658

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 58/227 (25%), Positives = 93/227 (40%), Gaps = 39/227 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +KQ      P     LD GD++ Q  +R++L CK F WFM  VA D   KF
Sbjct: 443 RVAEVWMDEYKQYVYKRDPKTYDSLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 502

Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
           P + PP+   G  +N+     CLD+MG++    +G   C  + +    +Q + L+    L
Sbjct: 503 PPVEPPSYSAGIIQNVANPVYCLDNMGKSTEEAVGMFSCADNRTHPQPNQFWELSIFRDL 562

Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
              G    C+D  +                              + ++NC +  G  +  
Sbjct: 563 RMKGFDSVCLDVHEGPP------------------------NATVWMWNCHSQGG--NQF 596

Query: 225 WRFHNSTRQLEHA-ILKRCLS--VHPETNQLAMLRCDENNSYQQWRF 268
           W +   T++L H    KRCL   V     ++    C++ N  Q+W F
Sbjct: 597 WYYDRQTQRLVHGENNKRCLEGFVENGIAKVVANACEDGNDRQRWEF 643


>gi|5419872|emb|CAB46378.1| hypothetical protein [Homo sapiens]
          Length = 276

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 70  WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 129

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 130 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 187

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 188 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 226

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 227 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 265


>gi|334333371|ref|XP_001365881.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Monodelphis domestica]
          Length = 757

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 58/229 (25%), Positives = 89/229 (38%), Gaps = 43/229 (18%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K+I    +  A     GD++E++ LR KL CK+F WF+ +V  +++   
Sbjct: 404 VRAAEVWMDEYKEIYYHRNMHARKEPYGDVTERLQLRAKLKCKNFKWFLDNVYPELHV-- 461

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  KN G T  C D    +   + G+      CHG G +Q F      ++ 
Sbjct: 462 PEDRPGFF-GMLKNRGMTDYCFDYNPPSENEITGSQVILYPCHGMGQNQFFEYTKHNEIR 520

Query: 169 HGER----CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
           +  R    CV  D     + M                 C      N    F   G+    
Sbjct: 521 YNTRQPEGCVAVDAGTDHLTMY---------------LCSKNAPENQKFIFKEDGS---- 561

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQ---LAMLRCDENNSYQQWRFKE 270
                    L H   K+CL    + +      +LR   ++ YQQW F+E
Sbjct: 562 ---------LYHVQSKKCLQAEAKASNGSPAPLLRTCTDSKYQQWFFQE 601


>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
          Length = 658

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 55/228 (24%), Positives = 87/228 (38%), Gaps = 48/228 (21%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +KQ      P    +D GD+SE+  LR +L C DF W++  V        
Sbjct: 467 MRVAEVWMDHYKQYFLDTRPGQNIIDAGDVSERKELRQRLQCHDFKWYLNTVL------- 519

Query: 114 PELPPNLFWGEAKNLGTQK--------CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
               P+LF  +A ++  Q         C+D MG+    V G   CHG G++Q +  +   
Sbjct: 520 ----PDLFIPDANHIQHQGTLHTPDNICVDKMGQRNGGVAGVYPCHGQGTNQAWMYSITN 575

Query: 166 QL-GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
           ++  H   C+DA                            L + ++L  C    G  +  
Sbjct: 576 EIRTHDSLCLDA------------------------WGSTLPSPVHLGRCHGMRGNQE-- 609

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKE 270
           WR+      + H  L  CL      +Q  L + +C   N  Q W + E
Sbjct: 610 WRYDPLRHTMVHVPLNACLEASSVDSQKKLQINKCLSGNQAQVWFWDE 657


>gi|119582045|gb|EAW61641.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
           isoform CRA_c [Homo sapiens]
          Length = 274

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 68  WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 127

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 128 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 185

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 186 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 224

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 225 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 263


>gi|410039926|ref|XP_518048.4| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
           troglodytes]
          Length = 551

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 345 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 404

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 405 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNVQVFTFTWREDIRPG-- 462

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 463 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 501

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 502 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 538


>gi|48734736|gb|AAH72450.1| GALNT10 protein, partial [Homo sapiens]
          Length = 413

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 91/222 (40%), Gaps = 30/222 (13%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P
Sbjct: 202 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYP 261

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
            + PP   WGE +N+GT  C D+      + +    C  G G +     Q+F       +
Sbjct: 262 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDI 321

Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
             G    D    +K          FC  A          + + L++C +  G  +  W++
Sbjct: 322 RPG----DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKY 359

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               + L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 360 RKD-KTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 400


>gi|194759472|ref|XP_001961971.1| GF15238 [Drosophila ananassae]
 gi|190615668|gb|EDV31192.1| GF15238 [Drosophila ananassae]
          Length = 663

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 58/109 (53%), Gaps = 6/109 (5%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQ-KCLD 135
           +D GD++ Q  +R++L CK F WFM  VA D   K+P + PP    G  +N+     CLD
Sbjct: 472 IDAGDLTRQKGVRERLKCKSFHWFMTEVAPDFLEKYPPVEPPAYASGAIQNVAYPIYCLD 531

Query: 136 SMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLGHGERCVDADKQE 180
           SMG+     IG   C  + S    +Q ++L++  +L H E  +  D Q+
Sbjct: 532 SMGKIINDAIGLFSCADNRSHPQANQNWKLSSSRELRHKEESICLDVQD 580


>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Cavia porcellus]
          Length = 602

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 88/217 (40%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 396 WMDDYAEYIYQRRPEYRHLSAGDVVAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 455

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      A +    C  G G +     Q+F       +  G  
Sbjct: 456 AAAWGEIRNVGTGLCADTKHGALGAPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPG-- 513

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 514 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRQD-K 552

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+  +  QQW F+
Sbjct: 553 TLFHPVSGSCMDCSESDHRIFMNTCNPASVTQQWLFE 589


>gi|313227738|emb|CBY22887.1| unnamed protein product [Oikopleura dioica]
          Length = 1030

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPP 118
           W   +K+      P    +D+GD++  +A+++KL CKDFSW+M+ +AYD+   +P  +P 
Sbjct: 818 WDDNYKKYYYERRPENKNVDVGDLTAVLAIKEKLQCKDFSWYMREIAYDIPKYWPMVMPK 877

Query: 119 NLFWGEAKNLGTQKCLDS 136
           N  WG  KN GT  C +S
Sbjct: 878 NQAWGILKNEGTGLCPNS 895


>gi|260790280|ref|XP_002590171.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
 gi|229275360|gb|EEN46182.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
          Length = 1466

 Score = 64.3 bits (155), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 6/149 (4%)

Query: 30   QLGH--GERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
            ++GH  G++        +    S   +R    W+  +K    +  P  M    GDIS++ 
Sbjct: 1243 RVGHVWGKKTYSTGNMTLHDWASRNNMRVAEVWMDHYKVHYYIRRPYLMKRKFGDISDRR 1302

Query: 88   ALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGT 147
             LR++L CKDF W++ +   D+Y   P+  P  + G+ +N GT  CLD   +     +  
Sbjct: 1303 RLRERLQCKDFRWYLDNAFPDLY--IPDDIPGRY-GQVRNNGTNTCLDWTSKPQRE-LEM 1358

Query: 148  SYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
              CH    +Q F L  + QL     C++A
Sbjct: 1359 FPCHHGLGTQFFELTGQNQLRDERSCLEA 1387


>gi|260817709|ref|XP_002603728.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
 gi|229289050|gb|EEN59739.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
          Length = 501

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 45/138 (32%), Positives = 62/138 (44%), Gaps = 10/138 (7%)

Query: 30  QLGHGERCVDADKQVISTQKSSLEL-----RKRSSWLKVFKQIRSLGSPLAMFLDMGDIS 84
           ++GH ER       V S     + +     R    W+  +K+      P    +  GDIS
Sbjct: 273 RVGHIERTDKPYLYVRSNDTKDINIEVNKARVAEVWMDEYKRYLYAREPQLKNISYGDIS 332

Query: 85  EQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSM-GRTAPA 143
           E+ ALR +L C+ F W+M++V  D   +  E      WGE +NL    CLD M GR    
Sbjct: 333 ERQALRKRLGCQSFQWYMENVYPDRLEQTVENGYYRAWGELRNLQAGLCLDLMDGRG--- 389

Query: 144 VIGTSYCHGSGSSQLFRL 161
            +G   CHG G  Q F L
Sbjct: 390 -VGLWDCHGQGGQQFFAL 406


>gi|38195091|ref|NP_938080.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
 gi|51315962|sp|Q86SR1.2|GLT10_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
           AltName: Full=Polypeptide GalNAc transferase 10;
           Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 10;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
 gi|25809274|emb|CAD44532.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
 gi|151556534|gb|AAI48616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
           [synthetic construct]
 gi|157169754|gb|AAI53182.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
           [synthetic construct]
 gi|193785288|dbj|BAG54441.1| unnamed protein product [Homo sapiens]
 gi|261858046|dbj|BAI45545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [synthetic
           construct]
          Length = 603

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590


>gi|410255362|gb|JAA15648.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
           troglodytes]
 gi|410303020|gb|JAA30110.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
           troglodytes]
 gi|410355291|gb|JAA44249.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
           troglodytes]
          Length = 603

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNVQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590


>gi|397517643|ref|XP_003829017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
           paniscus]
          Length = 444

 Score = 63.9 bits (154), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 238 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 297

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 298 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNVQVFTFTWREDIRPG-- 355

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 356 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 394

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 395 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 431


>gi|426350728|ref|XP_004042920.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Gorilla gorilla gorilla]
          Length = 444

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 238 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 297

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 298 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 355

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 356 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 394

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 395 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 431


>gi|441596034|ref|XP_003276624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Nomascus leucogenys]
 gi|119582046|gb|EAW61642.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
           isoform CRA_d [Homo sapiens]
 gi|119582047|gb|EAW61643.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
           isoform CRA_d [Homo sapiens]
          Length = 506

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 300 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 359

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 360 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 417

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 418 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 456

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 457 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 493


>gi|380800197|gb|AFE71974.1| polypeptide N-acetylgalactosaminyltransferase 10, partial [Macaca
           mulatta]
          Length = 565

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 359 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 418

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 419 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 476

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 477 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 515

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 516 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 552


>gi|119389148|pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And
           Mn2+
 gi|119389151|pdb|2D7R|A Chain A, Crystal Structure Of Pp-galnac-t10 Complexed With
           Galnac-ser On Lectin Domain
          Length = 570

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 364 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 423

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 424 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 481

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 482 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 520

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 521 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 557


>gi|348521382|ref|XP_003448205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Oreochromis niloticus]
          Length = 620

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 62/231 (26%), Positives = 100/231 (43%), Gaps = 50/231 (21%)

Query: 53  ELRKRSSWLKVFKQIRSLGSPLAMFLD----MGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I    +  A  +      GDIS ++ LR++LHCK+FSW++  V  +
Sbjct: 424 QVRLAEVWMDDYKKIYYRRNKNAAIMASEHRFGDISARLNLRERLHCKNFSWYLNTVYPE 483

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT---APAVIGTSYCHGSGSSQLFRLNAEG 165
           ++   P+L P    G  KNLG+  CLD+        P ++   +CH  G +Q F   +  
Sbjct: 484 IF--IPDLNPEKS-GSIKNLGSNMCLDAGENNQGGKPLIM--YHCHNMGGNQYFEYTSHK 538

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAG-----VKLIFCRLGTVLNLFNCFNSSGT 220
           +L H     +  KQ             C+ A      VK+  C+L             GT
Sbjct: 539 ELRH-----NIGKQ------------LCLHAAVGSDPVKIELCQL----------KGKGT 571

Query: 221 VDGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
              P   W F      L++    +CL +  +  QL M  C+ +  +QQW F
Sbjct: 572 SVAPEQEWVF-TEENLLKNPSSGKCLVL--KGGQLLMDYCNPSELHQQWTF 619


>gi|358331987|dbj|GAA50722.1| putative polypeptide N-acetylgalactosaminyltransferase 10
           [Clonorchis sinensis]
          Length = 738

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 37/113 (32%), Positives = 59/113 (52%), Gaps = 5/113 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  +K+      P    +D GD+SEQ A+R++LHCK F WFM+ +A+D+  K+P
Sbjct: 521 RVAETWMDEYKEYIYSRRPHYRHIDAGDLSEQKAVRERLHCKPFKWFMETIAFDLTKKYP 580

Query: 115 ELPP-NLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYC---HGSGSSQLFRLN 162
            + P +   G+ + +  Q  C+D+ G T    I    C    G    Q F L+
Sbjct: 581 LVEPVSAATGDVRPVVDQTLCVDAEGATKTEPIKLKKCVEYVGGNGVQKFELS 633


>gi|71051628|gb|AAH98444.1| GALNT10 protein, partial [Homo sapiens]
          Length = 310

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 104 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 163

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 164 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 221

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 222 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 260

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            L H +   C+      +++ M  C+ ++  QQW F+  
Sbjct: 261 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 299


>gi|449507774|ref|XP_004186276.1| PREDICTED: LOW QUALITY PROTEIN:
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
           partial [Taeniopygia guttata]
          Length = 402

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 71/134 (52%), Gaps = 10/134 (7%)

Query: 60  WLKVFKQ---IRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           W+  FK    I S G+P  ++   +  G +     L+ ++ C+ FSW++++V  D  ++ 
Sbjct: 212 WMDDFKDFFYIISPGAPRFVWDKRILYGIVPWCGTLKIRMKCQPFSWYLENVYPD--SQI 269

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P    +L  GE +N+ T +CLD+MGR     +G   CHG G +Q+F   A+ ++   + C
Sbjct: 270 PRRYYSL--GEIRNVETNQCLDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLC 327

Query: 174 VDADKQEKLVEMLK 187
           +D  +    V MLK
Sbjct: 328 LDVSRLNGPVLMLK 341


>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
 gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
           [Pediculus humanus corporis]
          Length = 657

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 58/215 (26%), Positives = 92/215 (42%), Gaps = 43/215 (20%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEA--------KNLGT 130
           D GD+S +  LR +L+CK F W++ ++       +PEL      GEA        K LG 
Sbjct: 455 DFGDVSARKELRKRLNCKSFKWYLDNI-------YPEL---FIPGEAVAGGEVRNKGLGG 504

Query: 131 QKCLDSMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLK 187
           + CLDS  R A     +G   CH  G +Q+   N  GQ      C+D+  K E + + + 
Sbjct: 505 KTCLDSPARKADLHKAVGLFPCHRQGGNQVSN-NWSGQ------CLDSPCKSEDMHKPVG 557

Query: 188 LPVTFCVLAGVKLIFCRLGTV-------------LNLFNCFNSSGTVDGPWRFHNSTRQL 234
           L            +  + G +             + L+ C  S G  +  W ++  T  +
Sbjct: 558 LWPCHKQGGNQYWMLSKAGEIRRDEACLDYAGQDVILYPCHGSKG--NQYWHYNPDTNLI 615

Query: 235 EHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           +H   K+CL++     +L M  CDE    Q+W F+
Sbjct: 616 QHGSSKKCLTMASTKQKLLMEDCDEMEPRQKWIFQ 650


>gi|195587296|ref|XP_002083401.1| GD13712 [Drosophila simulans]
 gi|194195410|gb|EDX08986.1| GD13712 [Drosophila simulans]
          Length = 631

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 49/186 (26%), Positives = 84/186 (45%), Gaps = 35/186 (18%)

Query: 92  KLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY 149
           +L CK F WFM+ VA+D+   +P + PP    G  +N+G Q  CLD++GR     +G   
Sbjct: 453 ELKCKSFKWFMEEVAFDLMKTYPPVDPPAYAMGALQNVGNQNLCLDTLGRKKHNKMGMYA 512

Query: 150 CHGS----GSSQLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFC 203
           C  +      SQ + L+   + +L   + C+D       +     PV             
Sbjct: 513 CADNIKIPQRSQFWELSWKRDVRLRRKKECLDVQ-----IWDANAPVW------------ 555

Query: 204 RLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNS 262
                  L++C +  G  +  W +    + L+H    +RCL + P + ++ + +CD +N 
Sbjct: 556 -------LWDCHSQGG--NQYWYYDYHHKLLKHGTEGRRCLELLPFSQEVVVNKCDSDNR 606

Query: 263 YQQWRF 268
           +QQW F
Sbjct: 607 FQQWNF 612


>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
           lupus familiaris]
          Length = 603

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSALNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE  N+GT  C+D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 457 AAAWGEIHNVGTGLCVDTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDAISNT------SPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590


>gi|28268676|dbj|BAC56890.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 [Homo sapiens]
          Length = 603

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C  G G +     Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQYTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590


>gi|119582048|gb|EAW61644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
           isoform CRA_e [Homo sapiens]
          Length = 339

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 91/222 (40%), Gaps = 30/222 (13%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P
Sbjct: 128 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYP 187

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
            + PP   WGE +N+GT  C D+      + +    C  G G +     Q+F       +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDI 247

Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
             G    D    +K          FC  A          + + L++C +  G  +  W++
Sbjct: 248 RPG----DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKY 285

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               + L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 286 RKD-KTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 326


>gi|345317797|ref|XP_001520970.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Ornithorhynchus anatinus]
          Length = 467

 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 61/243 (25%), Positives = 93/243 (38%), Gaps = 45/243 (18%)

Query: 42  KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
           KQ   ++  +L   +R    W+  +K++    +P A     GD++ +  LR KL C+DF 
Sbjct: 254 KQAPYSRSKALANSVRAAEVWMDGYKELYYHRNPHARLEPYGDVTARRDLRSKLKCRDFK 313

Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
           WF+++V  +++   PE  P  F G  KN G +  C D        V G       CHG G
Sbjct: 314 WFLENVYPELHV--PEDRPGYF-GMLKNKGMENHCFDYNPPDENEVTGQRLILYPCHGMG 370

Query: 155 SSQLFRLNAEGQL----GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLN 210
            +Q F   +  ++     H E C   D     V M                 C+     N
Sbjct: 371 QNQFFEYTSHHEIRYNTRHPEACAAVDVGTDYVTMY---------------LCQENVPEN 415

Query: 211 LFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWR 267
               F   G+             L H   ++CL         N   +LR   N+ +Q+W 
Sbjct: 416 QKFIFQEDGS-------------LLHVQSQKCLQAEANAYNGNPAPLLRPCTNSDHQKWF 462

Query: 268 FKE 270
           FKE
Sbjct: 463 FKE 465


>gi|195385643|ref|XP_002051514.1| GJ11806 [Drosophila virilis]
 gi|194147971|gb|EDW63669.1| GJ11806 [Drosophila virilis]
          Length = 653

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 89/228 (39%), Gaps = 37/228 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K+      P     +D GD+S Q A+R++L CK F WFMK VA D   KF
Sbjct: 440 RVAEVWMDEYKEHVYRRDPATYDNIDAGDLSRQRAVRERLKCKSFDWFMKEVAPDFLIKF 499

Query: 114 PELPPNLFWGEA--KNLGTQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQL 167
           P + P  +   A      T  CLD M   A   +G   C  +      +Q + L +  +L
Sbjct: 500 PPIDPPAYASGAIQSQAYTDFCLDCMNVGANHPVGMYNCAENLTYPQDNQNWALTSHREL 559

Query: 168 GH-GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
               + C+D                                 + +++C +  G  +  W 
Sbjct: 560 RRLDDACLDVQDAHP------------------------NATVWMWDCHHQGG--NQFWY 593

Query: 227 FHNSTRQLEHAIL-KRCLSVHPETNQLAML--RCDENNSYQQWRFKEV 271
           +      L H +  K C+    E     +L  RCDENN  Q+W F +V
Sbjct: 594 YDRQHHWLVHGLRGKNCMEAFVENGVTKVLTNRCDENNVRQRWNFGKV 641


>gi|403298655|ref|XP_003940127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Saimiri boliviensis boliviensis]
          Length = 272

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    SP A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 72  VRAAEVWMDEFKELYYHRSPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 129

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F G  +N G +  C D        ++G+      CHG G +Q F   ++ ++
Sbjct: 130 PEDRPGFF-GMLQNKGLKDYCFDYNPPDENQIVGSQVILYLCHGMGQNQFFEYTSQKEI 187


>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
           [Oryctolagus cuniculus]
          Length = 603

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 51/217 (23%), Positives = 88/217 (40%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC------HGSGSSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+      + +    C          S Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCADTKHWALGSPLRLESCVRDRGEAAWNSMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++  + +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYR-TDK 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+  +  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPASLTQQWLFE 590


>gi|170587206|ref|XP_001898369.1| glycosyl transferase, group 2 family protein [Brugia malayi]
 gi|158594195|gb|EDP32781.1| glycosyl transferase, group 2 family protein [Brugia malayi]
          Length = 582

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 42/135 (31%), Positives = 70/135 (51%), Gaps = 13/135 (9%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    WL  +K +     P A +++ GDI+ ++ L+ +L CKDF W++K +       +P
Sbjct: 392 RAAEVWLGDYKYLYLRKVPSARYVNFGDITARLDLKKRLRCKDFDWYLKEI-------YP 444

Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGR-TAPAVIGTSYCHGSGSSQLFRLNAE-GQLG-- 168
           EL  P+   G          C+DS+GR TA + +G   CHG+G +Q + LN + G L   
Sbjct: 445 ELAIPSKEQGRYLTFRQGNVCIDSLGRHTALSSVGIYRCHGTGGNQEWVLNDKFGVLKSP 504

Query: 169 HGERCVDADKQEKLV 183
           +   C+  D++  L+
Sbjct: 505 YSNLCITDDEKGTLI 519


>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
           9-like isoform 1 [Nasonia vitripennis]
          Length = 646

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 53/234 (22%), Positives = 85/234 (36%), Gaps = 73/234 (31%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
           + GD+S++ ALR  L CK F W++ ++  +++     +      GE +NLG     CLDS
Sbjct: 444 NYGDVSDRKALRKNLGCKSFKWYLDNIYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 499

Query: 137 MGRTA----PA---------------------------------VIGTSYCHGSGSSQLF 159
             R A    PA                                  +G   CH  G +Q +
Sbjct: 500 PARKADLHKPAGLYPCHRQGGNQIRHLASRLCIDSPGNPEDLHQAVGFYECHNQGGNQYW 559

Query: 160 RLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSG 219
            L+  G++   E C+D                              GT + L+ C  S G
Sbjct: 560 MLSKTGEIRRDESCLDYS----------------------------GTDVILYPCHGSKG 591

Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
                W ++  T+Q+ H    +CL++     +L M  C      Q+W F+   P
Sbjct: 592 NQQ--WTYNTQTKQIRHESSGKCLAITESKQRLLMEECSPAAPRQRWAFENYDP 643


>gi|195471079|ref|XP_002087833.1| GE18238 [Drosophila yakuba]
 gi|194173934|gb|EDW87545.1| GE18238 [Drosophila yakuba]
          Length = 659

 Score = 63.5 bits (153), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 3/99 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K+      P     +D GD++ Q A+R++L CK F W+M  VA D   KF
Sbjct: 443 RVAEVWMDEYKEYVYKRDPATYDNVDAGDLTRQRAVRERLKCKSFHWYMTEVAPDFLIKF 502

Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
           P + PP+   G  +N+     CLDSMG+ A   +G   C
Sbjct: 503 PPVEPPSYAAGVIQNVANPVYCLDSMGKNAEEAVGMFSC 541


>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Acyrthosiphon pisum]
          Length = 581

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 49/218 (22%), Positives = 87/218 (39%), Gaps = 41/218 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      PL+  +  G+I++++AL+  L CK F W++ +V       +P
Sbjct: 391 RAAEVWMDQYKRYYYNAVPLSRIVPFGNIADRLALKKNLGCKPFKWYLDNV-------YP 443

Query: 115 E--LPPNL--FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  LP  +  F G  +      CLD++        G   CH  G +Q +     G + H 
Sbjct: 444 ELKLPATVDEFVGSIRQ--GYMCLDTLENQVGKTAGIFPCHDYGGNQEWTFTIGGSIKHD 501

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
             C+              P  +  ++ + +  C                T D  W+F  +
Sbjct: 502 MMCLS-------------PTDYSSMSLIIMKPC--------------DSTTD-EWKFDEN 533

Query: 231 TRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           T+QL    +  CL      + +++  CD  NS Q+W +
Sbjct: 534 TKQLRLKAVNLCLDTLGVHDMISINVCDSLNSNQKWEY 571


>gi|260803980|ref|XP_002596867.1| hypothetical protein BRAFLDRAFT_129116 [Branchiostoma floridae]
 gi|229282127|gb|EEN52879.1| hypothetical protein BRAFLDRAFT_129116 [Branchiostoma floridae]
          Length = 481

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 55/224 (24%), Positives = 86/224 (38%), Gaps = 45/224 (20%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A     G+I  +  LR+KL+CK F W++++V       +P
Sbjct: 286 RTAEVWMDDYKNNFYAARPSAKGKPYGNIQGRKELREKLNCKTFQWYLENV-------YP 338

Query: 115 ELP---PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCH-----GSGSSQLFRLNAEGQ 166
           EL     N+ WGE K     +CLD++G   P  +G   C+     G  S Q + L     
Sbjct: 339 ELDIPGSNMKWGELKQ--ADQCLDTLGHKPPGAVGMQACNLHPGSGMASRQQWALTKRNM 396

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
           + HG                    ++C+ A   +     GT++ L  C       +   R
Sbjct: 397 IQHGS------------------WSWCLAADTAIP----GTLVYLITC----NIQENKMR 430

Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAML--RCDENNSYQQWRF 268
           +    R L H     CL        L +   +CD+    QQW +
Sbjct: 431 WKRQDRLLMHIATGLCLDSVEAAQGLGVTIEQCDKTRVSQQWEY 474


>gi|10435819|dbj|BAB14676.1| unnamed protein product [Homo sapiens]
          Length = 339

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 91/222 (40%), Gaps = 30/222 (13%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P
Sbjct: 128 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYP 187

Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
            + PP   WGE +N+GT  C D+      + +    C  G G +     Q+F       +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDI 247

Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
             G    D    +K          FC  A          + + L++C +  G  +  W++
Sbjct: 248 RPG----DPQYTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKY 285

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
               + L H +   C+      +++ M  C+ ++  QQW F+
Sbjct: 286 RKD-KTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 326


>gi|351699369|gb|EHB02288.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Heterocephalus
           glaber]
          Length = 570

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 65/133 (48%), Gaps = 12/133 (9%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KLHCKDF WF++ V  +++   
Sbjct: 370 VRAAEVWMDEFKELYYHRTPRARLEPYGDVTERRQLRAKLHCKDFRWFLETVYPELHV-- 427

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G ++ C D        V G       CHG G +Q F   ++ ++ 
Sbjct: 428 PEDRPG-FFGMLQNKGLKEYCFDYNPPNENQVEGHQVILYACHGMGQNQFFEYTSQKEIR 486

Query: 169 HG----ERCVDAD 177
           +     E CV  +
Sbjct: 487 YNTHQPETCVAVE 499


>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
           catus]
          Length = 603

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 397 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 515 --DPQHTKK----------FCFDAISNT------SPVTLYDCHSMKG--NQLWKYRKD-K 553

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H +   C+      +++ M  C+  +  QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPASLTQQWLFE 590


>gi|170038563|ref|XP_001847118.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
           quinquefasciatus]
 gi|167882317|gb|EDS45700.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
           quinquefasciatus]
          Length = 531

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 59/218 (27%), Positives = 95/218 (43%), Gaps = 41/218 (18%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNL--GTQKC 133
           D GD+S +  LR++L CK F W++ ++       FPEL      +  GE +N+  G + C
Sbjct: 329 DYGDVSSRKQLREELGCKSFRWYLDNI-------FPELFIPGEAVASGEVRNMGYGNRTC 381

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD-ADKQEKLVEMLKLPV 190
           LD+ G  +     +G   CH  G +Q+   N    L     C+D A K E +     L +
Sbjct: 382 LDAPGGKKNLRKPVGLYPCHNQGGNQV--ANPWSGL-----CIDSAAKPEDM--HTPLGI 432

Query: 191 TFCVLAGVK--LIFCRLGTV-------------LNLFNCFNSSGTVDGPWRFHNSTRQLE 235
             C  AG     +  + G +             + L+ C  S G  +  W +  S+R L 
Sbjct: 433 WPCHQAGGNQYWMLSKTGEIRRDEACLDYAGQDVILYPCHGSKG--NQYWNYSPSSRLLR 490

Query: 236 HAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           H    +CL+++    +L M  CD +   Q+W F+   P
Sbjct: 491 HGSSDKCLAINESKQKLVMADCDASVEAQKWLFQNYDP 528


>gi|194225536|ref|XP_001494993.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Equus
           caballus]
          Length = 460

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 42/141 (29%), Positives = 68/141 (48%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K++    +P A     GD++E+  LR+KL CKDF WF+++V  +++   
Sbjct: 260 VRAAEVWMDGYKELYYHRNPHARLEPFGDVTERKQLREKLRCKDFRWFLENVYPELHV-- 317

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIG---TSY-CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        + G   T Y CHG G +Q F   +  ++ 
Sbjct: 318 PEDRPGCF-GMLQNKGLKDYCFDYNPPNENQITGHQVTLYLCHGMGQNQFFEYTSRNEIR 376

Query: 169 HGER----CVDADKQEKLVEM 185
           +  R    CV  D    ++ M
Sbjct: 377 YNTRQPEGCVAVDAGTDILTM 397


>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
            [Saccoglossus kowalevskii]
          Length = 2434

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/219 (23%), Positives = 89/219 (40%), Gaps = 40/219 (18%)

Query: 55   RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
            R    W+  +K+      P +  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 2242 RAAEVWMDEYKKYYYSAVPSSKNIAFGNIQSRLDLRKKLQCKSFGWYLENV-------YP 2294

Query: 115  EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
            EL      ++ +G  +      C+D++G      +G   CH +G +Q F L  +  + H 
Sbjct: 2295 ELRIPDKKDIAFGALQQ--GHLCMDTLGHFTDGTLGLYECHNTGGNQEFALTKDKAIRHQ 2352

Query: 171  ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
            + C+                         ++  R   V+ L  C  S   ++  W    +
Sbjct: 2353 DLCL------------------------TVMDHRPSGVIKLHGC--SESNLNQKWEQIEN 2386

Query: 231  TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
             RQL+      C+ S    T  L + RC++N   Q WRF
Sbjct: 2387 NRQLKFRGTDLCIDSKSVATVGLVVERCEKNAITQHWRF 2425


>gi|149639508|ref|XP_001513185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Ornithorhynchus anatinus]
          Length = 634

 Score = 63.2 bits (152), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 41/134 (30%), Positives = 71/134 (52%), Gaps = 15/134 (11%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  FK+I     +  + +      GD+S+++ LRD+L CK+F+W++  +  +
Sbjct: 437 QVRLAEVWMDEFKEIFYRRNTEAAKIVKQKAFGDLSKRLELRDRLQCKNFTWYLNTIYPE 496

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
           VY   P+L P L  G  K++G   CLD   +   T P ++ T  CHG G +Q F  + + 
Sbjct: 497 VYV--PDLNPVLS-GYIKSVGRHVCLDVGENNQGTKPLIMYT--CHGLGGNQYFEYSEQH 551

Query: 166 QLGHG---ERCVDA 176
           ++ H    E C+ A
Sbjct: 552 EIRHNIQKELCLHA 565


>gi|344265184|ref|XP_003404666.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 10-like [Loxodonta
           africana]
          Length = 602

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 396 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 455

Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
              WGE  ++GT  C D+  G     +   S   G G +     Q+F       +  G  
Sbjct: 456 AAAWGEIHSVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 513

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +K          FC  A          + + L++C +  G  +  W++    +
Sbjct: 514 --DPQHTKK----------FCFDA------ISHNSPVTLYDCHSMKG--NQLWKYRKD-K 552

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L H I   C+      +++ M  C+ ++  QQW F+
Sbjct: 553 TLYHPISGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 589


>gi|170038567|ref|XP_001847120.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
 gi|167882319|gb|EDS45702.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
          Length = 494

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 45/189 (23%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPPNLFWGEAKNLGTQKCLDSM 137
           D GD+S +  LR +L+C+ F W+M+       T FPE   P+   G  +      CLD  
Sbjct: 337 DFGDVSSRKKLRARLNCRPFRWYME-------TVFPEQFDPSKAVGRGQFRIGGGCLD-- 387

Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAG 197
             T  +VIG   CHG G  QL+   A+G++   + C+D D ++                 
Sbjct: 388 WPTKLSVIG---CHGLGGHQLWFFTADGEITREDHCMDFDSKK----------------- 427

Query: 198 VKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRC 257
                      L +  C    G  +  W F       +HA  +RCL    + N+L M  C
Sbjct: 428 -----------LEMIRCHKQKG--NQMWVFEEKAGHFKHAYYERCLEW--DGNELRMSEC 472

Query: 258 DENNSYQQW 266
            + +  Q+W
Sbjct: 473 IDEDMRQKW 481


>gi|432092277|gb|ELK24900.1| N-acetylgalactosaminyltransferase 7 [Myotis davidii]
          Length = 802

 Score = 62.8 bits (151), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 101/229 (44%), Gaps = 26/229 (11%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 477 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 536

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P  P N+ WGE K   T + L +  R         + H S  + L       + G     
Sbjct: 537 PLPPKNIDWGEGK---THQLLRNSPRRLRERALPPHVHPSILTDLTPQPPCRREG----- 588

Query: 174 VDADKQEKLVEML----KLPVTFCVLAGVKLIFC------RLGTVLNLFNCFNSSGTVDG 223
              + Q+ L+ +      L ++ C + G +  +C        G  + L  C    G  + 
Sbjct: 589 --PEDQDLLLLLFVSSRALTLSQCSIRGFESAYCIDSMGRTNGGFVELGPCHRMGG--NQ 644

Query: 224 PWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
            +R + + + +++    +CL+  P+ +++ +  C+  N Y++W++ +VR
Sbjct: 645 LFRINEANQLMQY---DQCLTKGPDGSKIMITHCNL-NEYKEWQYFKVR 689


>gi|355689595|gb|AER98885.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 [Mustela putorius
           furo]
          Length = 452

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 67/141 (47%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+++V  +++   
Sbjct: 254 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHV-- 311

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 312 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQIMGHQVLLYLCHGMGQNQFFEYTSQKEIR 370

Query: 169 HG----ERCVDADKQEKLVEM 185
           +     E C+  +    ++ M
Sbjct: 371 YNTHQPEACIAVEAGTDILTM 391


>gi|348585909|ref|XP_003478713.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Cavia porcellus]
          Length = 633

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 60/238 (25%), Positives = 107/238 (44%), Gaps = 46/238 (19%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++ A+R +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDLSKRFAIRKRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             V  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NTVYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A       ++L+L    C   G                  
Sbjct: 544 FEYSAQHEIRHSIQKELCLHATS-----DLLQLKA--CAYKGR----------------- 579

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSV---HPETNQLAMLRCDENNSYQQWRFKE 270
           N+  T +  W      + L +  LK CLS    HP     +++ C+ ++  Q+W F +
Sbjct: 580 NTIVTGEQVWEIQKD-QLLYNPFLKLCLSANGKHP-----SLVSCNPSDPLQKWIFNQ 631


>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
 gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
           norvegicus]
 gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
           norvegicus]
          Length = 578

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 97/229 (42%), Gaps = 44/229 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDDYKEHFYNRNPPARKETYGDISERKLLRERLQCKSFDWYLKNVFSNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP      A +    CHG G +Q F   +  ++
Sbjct: 439 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 494

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
                 E C +  +Q+  V M   P               L   +N+   F   GT+   
Sbjct: 495 RFNSVTELCAEVPQQKDYVGMQNCPKD------------GLPIPVNIIWHFKEDGTI--- 539

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
             FH  TR         CLS +   +    + M  CD  +  Q WRF++
Sbjct: 540 --FHPHTRL--------CLSAYRTADGRPSVQMKTCDALDKNQIWRFEK 578


>gi|348544073|ref|XP_003459506.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 3-like
           [Oreochromis niloticus]
          Length = 600

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 62/228 (27%), Positives = 84/228 (36%), Gaps = 45/228 (19%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
           LR    W+  +K    L   + M    +D GDIS++VALR  L CK F W++     +VY
Sbjct: 401 LRVAEVWMDEYKSNVYLAWNIPMENHGIDYGDISQRVALRKSLQCKSFHWYLD----NVY 456

Query: 111 TKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL-- 167
            +       LF+GE +N   +  C+D  G           CHG G  QL R   EGQL  
Sbjct: 457 PEMRRYNDTLFYGEVRNSKASHLCMDQ-GVKENHTATLHPCHGWG-PQLGRYTKEGQLFL 514

Query: 168 ------GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
                 G   RCV  D+     ++L                          NC   +   
Sbjct: 515 GPLGSTGEDTRCVVDDQISNFPQLL--------------------------NCEKVTNVK 548

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
              W F  +   +  A   RCL V P       L   +  S Q+W  K
Sbjct: 549 QKTWHFSQNESIINRAT-GRCLEVVPANVYFGHLLVLQPCSGQRWTIK 595


>gi|395740752|ref|XP_003780818.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 12, partial [Pongo
           abelii]
          Length = 538

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 338 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 395

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F+G  +N G T  C D        ++G       CHG G +Q F   ++ ++
Sbjct: 396 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 453


>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
           livia]
          Length = 560

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 94/226 (41%), Gaps = 37/226 (16%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR+ L+CK F WFM  VA+D+   +P + PP
Sbjct: 340 WMDEYAEYIYQRRPEYRHLSAGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPP 399

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGERCV 174
              WGEA++  T              +GT  C    HG+  S L RL         E CV
Sbjct: 400 AAAWGEARDSATSLLFQIRN------VGTGLCVDTKHGALGSPL-RL---------ENCV 443

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDG 223
             D+ E     +++  TF     ++         FC       + + L++C    G  + 
Sbjct: 444 -KDRGEAAWNNVQV-FTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQ 499

Query: 224 PWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            WR+    + L H +   C+       ++ M  C+ ++  QQW F+
Sbjct: 500 LWRYRKD-KTLFHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 544


>gi|195115611|ref|XP_002002350.1| GI13183 [Drosophila mojavensis]
 gi|193912925|gb|EDW11792.1| GI13183 [Drosophila mojavensis]
          Length = 655

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 95/231 (41%), Gaps = 37/231 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMF-LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K+      P     +D GD++ Q A+R++L CK F W+MK VA D   KF
Sbjct: 439 RVAEVWMDEYKEFVYKRDPATYNNIDAGDLTRQRAVRERLQCKSFDWYMKEVAPDFLVKF 498

Query: 114 PEL-PPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
           P + PP    G  ++L    +CLD +   +   +G   C  + +    +Q + L A  +L
Sbjct: 499 PPVEPPAYASGAVESLAFKDQCLDCLNLGSNHPVGLYACAHNRTHPQENQNWALTAHREL 558

Query: 168 GH-GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
               + C+D                                 + +++C +  G  +  W 
Sbjct: 559 RRLDDACLDVQDSHP------------------------NATVWMWDCHHQGG--NQFWY 592

Query: 227 FHNSTRQLEHAILKR-CLSVHPE--TNQLAMLRCDENNSYQQWRFKEVRPD 274
           +    + L H    R CL    E   +++   RCDE+N  Q+WR  +V  D
Sbjct: 593 YDRKHQWLVHGHHGRNCLEAFVENGVSKVVTNRCDEHNERQRWRIGQVNDD 643


>gi|221330664|ref|NP_001137779.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
           melanogaster]
 gi|442625712|ref|NP_722910.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
           melanogaster]
 gi|25987157|gb|AAN75751.1|AF324752_1 N-acetylgalactosaminyltransferase [Drosophila melanogaster]
 gi|220901927|gb|ACL82986.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
           melanogaster]
 gi|440213268|gb|AAN10370.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
           melanogaster]
          Length = 644

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 3/99 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +KQ      P     LD GD++ Q  +R++L CK F WFM  VA D   KF
Sbjct: 428 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 487

Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
           P + PP+   G  +N+     CLD+MG++    +G   C
Sbjct: 488 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSC 526


>gi|354548807|gb|AER27632.1| AT25481p1 [Drosophila melanogaster]
          Length = 666

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 3/99 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +KQ      P     LD GD++ Q  +R++L CK F WFM  VA D   KF
Sbjct: 450 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 509

Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
           P + PP+   G  +N+     CLD+MG++    +G   C
Sbjct: 510 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSC 548


>gi|51316006|sp|Q8IA42.2|GALT4_DROME RecName: Full=N-acetylgalactosaminyltransferase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4; Short=pp-GaNTase 4
 gi|34042946|gb|AAQ56701.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           [Drosophila melanogaster]
          Length = 659

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 3/99 (3%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +KQ      P     LD GD++ Q  +R++L CK F WFM  VA D   KF
Sbjct: 443 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 502

Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
           P + PP+   G  +N+     CLD+MG++    +G   C
Sbjct: 503 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSC 541


>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
 gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
           [Ixodes scapularis]
          Length = 493

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 60/127 (47%), Gaps = 10/127 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P+A  + MG + E++ LR  L C  F W++ +V       +P
Sbjct: 370 RAAEVWMDDYKKYYYATVPVARNVPMGSVEERLNLRKSLGCHSFQWYLDNV-------YP 422

Query: 115 ELPPNLFWGE--AKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL      GE  A     Q CLD++G +    +G   CHGSG +Q + L A   + HG  
Sbjct: 423 ELKVPAAGGERLASLRQGQMCLDTLGGSEGNPVGLFTCHGSGGNQQWSL-ASRLIAHGGL 481

Query: 173 CVDADKQ 179
           C+  D +
Sbjct: 482 CITVDAE 488


>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
          Length = 637

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 54/223 (24%), Positives = 98/223 (43%), Gaps = 20/223 (8%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  + +     +PL    D GD++ Q  LR++L C+ F WF++ +A D+  ++P
Sbjct: 402 RVAEVWMDEYAKFLYERNPLFEKTDPGDLTAQRELRERLQCRPFKWFLEEIAPDLLIRYP 461

Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRL--NAEGQ 166
              P  F  G  +++  ++ CLDS+   A   IG   C  + +    +Q F L  + + +
Sbjct: 462 VREPQPFASGRVQSVADRRLCLDSLNHQAKQPIGLYTCASNQTHPQNNQFFTLSFHRDIR 521

Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFN-CFNSSGTVDGPW 225
           +   ++C+DA +    V      + F           R   V   F+   NS+G  D P+
Sbjct: 522 VRSNDKCLDASRLNDEV------ILFSCHESQGNQMWRYDYVSTEFSRKQNSTGISDTPF 575

Query: 226 RFHNSTRQLEHAILK--RCLSVHPETNQLAMLRCDENNSYQQW 266
           +    ++ + H      RCL       ++ +  CD +N  Q W
Sbjct: 576 Q---ESKMIIHGKEHNGRCLEADIVKRKVTVGSCDRSNPAQSW 615


>gi|47221042|emb|CAG12736.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 611

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 59/227 (25%), Positives = 82/227 (36%), Gaps = 43/227 (18%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
           LR    W+  +K    L   + M    +D GDIS++VALR  L CK F W++     +VY
Sbjct: 412 LRVAEVWMDQYKSNVYLAWNIPMESHGIDYGDISQRVALRKSLQCKSFEWYLD----NVY 467

Query: 111 TKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL--- 167
            +       LF+GE +N          G           CHG G  QL R   +GQL   
Sbjct: 468 PEMRRYNNTLFYGEIRNSKVSHLCMDQGIKENHTATLHPCHGWG-PQLGRYTKDGQLFLG 526

Query: 168 -----GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
                G   RCV  D+                          + +V  L NC  ++    
Sbjct: 527 PLGSTGEDTRCVVDDQ--------------------------ISSVPQLINCDRATDVKQ 560

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
             W F  +   +  A   RCL V P       L   +  + Q+W  K
Sbjct: 561 MTWHFSQNESIVNQAT-GRCLEVVPANVYFGHLLILQPCTGQRWSIK 606


>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
          Length = 553

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 39/127 (30%), Positives = 64/127 (50%), Gaps = 15/127 (11%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P+A  +  G+I++++ALR+ L+CK F W++++V       +P
Sbjct: 362 RAAEVWMDDYKKYYFAAVPMARTVTFGNITDRLALRNSLNCKPFKWYVENV-------YP 414

Query: 115 EL---PPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL 167
           EL    P +      N G  K    C D+ GR A + IG   CH +G +Q +     G+L
Sbjct: 415 ELLKHLPTVRDPSGTNSGAIKYKSLCFDTYGRGAGSHIGLYACHMTGGNQAWTY-LNGRL 473

Query: 168 GHGERCV 174
            HG  C+
Sbjct: 474 RHGSWCL 480


>gi|426362479|ref|XP_004048391.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Gorilla gorilla gorilla]
          Length = 633

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 433 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 490

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F+G  +N G T  C D        ++G       CHG G +Q F   ++ ++
Sbjct: 491 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 548


>gi|300797173|ref|NP_001180032.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Bos
           taurus]
 gi|296472282|tpg|DAA14397.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Bos
           taurus]
          Length = 571

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL+CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|441593636|ref|XP_003260599.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 isoform
           2 [Nomascus leucogenys]
          Length = 495

 Score = 62.0 bits (149), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 60/122 (49%), Gaps = 8/122 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 295 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 352

Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G T  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 353 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIMGHQVILYLCHGMGQNQFFEYTSQKEIR 411

Query: 169 HG 170
           + 
Sbjct: 412 YN 413


>gi|440891991|gb|ELR45390.1| Polypeptide N-acetylgalactosaminyltransferase 2, partial [Bos
           grunniens mutus]
          Length = 530

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL+CK F W++++V       +P
Sbjct: 342 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 394

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 395 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 452

Query: 171 ERCV 174
           + C+
Sbjct: 453 DLCL 456


>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2 [Papio anubis]
          Length = 571

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DXCL 497


>gi|148670411|gb|EDL02358.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12, isoform CRA_b [Mus
           musculus]
          Length = 222

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+  V  +++   
Sbjct: 22  VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 79

Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G +  CLD        V G       CHG G +Q F      ++ 
Sbjct: 80  PEDRPG-FFGMLQNRGLRGYCLDYNPPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIR 138

Query: 169 HGERCVDA 176
           +  R  +A
Sbjct: 139 YNTRQPEA 146


>gi|296190391|ref|XP_002743190.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
           partial [Callithrix jacchus]
          Length = 571

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 8/121 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 371 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRTKLQCKDFKWFLETVYPELHV-- 428

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G +  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 429 PEDRPG-FFGMLQNKGLKDYCFDYNPPDENQIVGYQVILYLCHGMGQNQFFEYTSQKEIR 487

Query: 169 H 169
           +
Sbjct: 488 Y 488


>gi|391346326|ref|XP_003747427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Metaseiulus occidentalis]
          Length = 622

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 60/123 (48%), Gaps = 13/123 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P A  +  GDISE++ LR+KL CK F W+MK+V       +P
Sbjct: 411 RAAEVWMDDYKKYYYEQVPAAKSVAYGDISERLKLREKLRCKSFEWYMKNV-------YP 463

Query: 115 E--LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGE 171
           E  LP N+     +N    +CLD++G  +       Y CH  G +Q FRL     L   +
Sbjct: 464 ELKLPSNVHGYVRQN---NRCLDTLGAISDGSTVHVYPCHYLGGNQDFRLAKNHLLMVHD 520

Query: 172 RCV 174
            CV
Sbjct: 521 MCV 523


>gi|390332984|ref|XP_784677.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Strongylocentrotus purpuratus]
          Length = 572

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 58/124 (46%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  F++      P A  +  G+I  +  LR+KLHCK F W++++V       +P
Sbjct: 382 RAAEVWMDEFREFYYAAVPSARNVPFGNIQARTELREKLHCKPFKWYLENV-------YP 434

Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G  +      C+D++G      +G   CH SG +Q F    +  + H 
Sbjct: 435 ELRVPDKQDIAFGSIQQ--GSMCMDTLGHVHQGTLGLYDCHNSGGNQEFSFTKDKLIKHS 492

Query: 171 ERCV 174
           + C+
Sbjct: 493 DLCL 496


>gi|380795279|gb|AFE69515.1| polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
           mulatta]
          Length = 243

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 43  VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 100

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQL 167
           PE  P  F+G  +N G +  C D        ++G       CHG G +Q F   ++ ++
Sbjct: 101 PEDRPG-FFGMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEI 158


>gi|194855550|ref|XP_001968569.1| GG24947 [Drosophila erecta]
 gi|190660436|gb|EDV57628.1| GG24947 [Drosophila erecta]
          Length = 659

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 92/227 (40%), Gaps = 39/227 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +KQ      P     +D GD++ Q  +R++L CK F W+M  VA D   K+
Sbjct: 443 RVAEVWMDEYKQYVYNRDPTTYDNVDAGDLTRQRGIRERLKCKSFHWYMTEVAPDFLVKY 502

Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
           P + PP+   G  +N+     CLD+MG++    +G   C  + +    +Q + L+    L
Sbjct: 503 PPVEPPSYAEGVIQNVANPVYCLDNMGKSTEKEVGMFSCADNRTHPQPNQFWELSIFRDL 562

Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
                   C+D  + +                            + ++NC    G  +  
Sbjct: 563 RMKSLDSVCLDVHEPQP------------------------NATVWMWNCHTQGG--NQF 596

Query: 225 WRFHNSTRQLEHA-ILKRCLS--VHPETNQLAMLRCDENNSYQQWRF 268
           W +   T++L H    KRCL   V     ++    C+E N  Q+W F
Sbjct: 597 WYYDRQTQRLVHGETNKRCLEGLVENGVARVVANACEEGNDRQRWEF 643


>gi|296490594|tpg|DAA32707.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Bos
           taurus]
 gi|440907905|gb|ELR57989.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Bos grunniens
           mutus]
          Length = 633

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 55/235 (23%), Positives = 104/235 (44%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGRPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A                  L  V+L  C        +   
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA-----------------ALGAVQLKAC-------AYKGH 579

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
            +  T +  W      + L +   K CLS   E    +++ C+ ++S Q+W F +
Sbjct: 580 KTVATGEQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDSLQKWIFNQ 631


>gi|297668747|ref|XP_002812581.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           1 [Pongo abelii]
 gi|297668749|ref|XP_002812582.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           2 [Pongo abelii]
 gi|297668751|ref|XP_002812583.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           3 [Pongo abelii]
          Length = 633

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 107/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G K +              
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV------------- 583

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
               T +  W      + L +  LK CLS + E    +++ CD ++  Q+W F +
Sbjct: 584 ----TGEQMWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCDPSDPLQKWIFSQ 631


>gi|126331345|ref|XP_001372222.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Monodelphis
           domestica]
          Length = 585

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 42/85 (49%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W   +K       P +  L  GDISE    R+  +CK F WFM+ +AYD+ + +
Sbjct: 466 IRVVEVWWDGYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIISHY 525

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG 138
           P  P N+ WGEA  L     +D  G
Sbjct: 526 PLPPKNVEWGEAPELMRMFGMDGQG 550


>gi|431909863|gb|ELK12965.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Pteropus alecto]
          Length = 543

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 14/155 (9%)

Query: 42  KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
           KQ   ++K +L   +R    W+  FK++    +P A     GD++E+  LR KL CKDF 
Sbjct: 329 KQAPYSRKKALANSVRAAEVWMDEFKELYYHRNPHARLEPFGDVTERRQLRAKLQCKDFK 388

Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
           WF++ V  +++   PE  P  F+G  +N G +  C D        + G       CHG G
Sbjct: 389 WFLETVYPELHV--PEDRPG-FFGMLQNRGLKDYCFDYNPPNEHDITGHQVLLYLCHGMG 445

Query: 155 SSQLFRLNAEGQLGHG----ERCVDADKQEKLVEM 185
            +Q F   ++ ++ +     E C+  +    ++ M
Sbjct: 446 QNQFFEYTSQREIRYNTHQPEACIAVEAGTDILIM 480


>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
           [Cricetulus griseus]
          Length = 715

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 509 WMDEYAEYIYQRRPEYRHLSAGDVVAQKRLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPP 568

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
              WGE +N+GT  C D+   T+   +    C  G G     S Q+F    +  +  G  
Sbjct: 569 AAAWGEIRNVGTGLCTDTKHGTSGLPLRLETCIRGRGEAAWNSMQVFTFTWKEDIRPG-- 626

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
             D    +KL          C  A          + + L++C +  G  +  W++    +
Sbjct: 627 --DPQHTKKL----------CFDA------VSHNSPVTLYDCHSMKG--NQLWKYRKD-K 665

Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
            L   +   C+      +++ M  C+ ++  QQW F+
Sbjct: 666 TLYLPVSGGCMDCSESDHRIFMNTCNPSSLTQQWLFE 702


>gi|355567593|gb|EHH23934.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
           mulatta]
          Length = 457

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 94/229 (41%), Gaps = 43/229 (18%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 257 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 314

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 315 PEDRPGFF-GMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIR 373

Query: 169 HG----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTV 221
           +     E C+                   V AG+ ++    C      N        G++
Sbjct: 374 YNTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL 415

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                FH  +++   A  K         + + +LR   N+ +Q+W FKE
Sbjct: 416 -----FHEQSKKCVQAARKES-----SDSFVPLLRDCTNSDHQKWFFKE 454


>gi|302563901|ref|NP_001181506.1| polypeptide N-acetylgalactosaminyltransferase 12 [Macaca mulatta]
          Length = 581

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 94/229 (41%), Gaps = 43/229 (18%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 381 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 438

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 439 PEDRPGFF-GMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIR 497

Query: 169 HG----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTV 221
           +     E C+                   V AG+ ++    C      N        G++
Sbjct: 498 YNTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL 539

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                FH  +++   A  K         + + +LR   N+ +Q+W FKE
Sbjct: 540 -----FHEQSKKCVQAARKES-----SDSFVPLLRDCTNSDHQKWFFKE 578


>gi|426256000|ref|XP_004021634.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Ovis
           aries]
          Length = 674

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL+CK F W++++V       +P
Sbjct: 486 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 538

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 539 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 596

Query: 171 ERCV 174
           + C+
Sbjct: 597 DLCL 600


>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_a [Homo sapiens]
          Length = 508

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455

Query: 171 ERCV 174
           + C+
Sbjct: 456 DLCL 459


>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
           caballus]
          Length = 539

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 351 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 403

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 404 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 461

Query: 171 ERCV 174
           + C+
Sbjct: 462 DLCL 465


>gi|344268030|ref|XP_003405867.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Loxodonta africana]
          Length = 633

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 107/232 (46%), Gaps = 34/232 (14%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVRQKSFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             V  +VY   P+L P +  G  K+ G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NSVYPEVYV--PDLNP-VISGYIKSFGQHLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
           F  +A+ ++ H       + Q++L          C+ A    +  R  T    +    + 
Sbjct: 544 FEYSAQHEIRH-------NIQKEL----------CLHAAPGPVQLRTCT----YKGHKTV 582

Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
              +  W      + L ++ LK CLS + E    +++ C+ ++S+Q+W F +
Sbjct: 583 AAGEQIWEIQKD-QLLYNSFLKMCLSANGE--HPSLVSCNPSDSFQKWIFSQ 631


>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
           And Ea2
 gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
 gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
          Length = 501

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 313 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 365

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 366 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 423

Query: 171 ERCV 174
           + C+
Sbjct: 424 DLCL 427


>gi|403258871|ref|XP_003921965.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Saimiri
           boliviensis boliviensis]
          Length = 633

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 106/229 (46%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L +  LK CLS + E    +++ C+ ++S Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDSLQKWIFSQ 631


>gi|348569970|ref|XP_003470770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Cavia porcellus]
          Length = 579

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 63/128 (49%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL C+DF WF+++V  +++   
Sbjct: 379 VRAAEVWMDEFKELYYHRNPRARLEPYGDVTERRELRAKLRCRDFRWFLENVYPELHV-- 436

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G ++ C D        V G       CHG G +Q F   ++ ++ 
Sbjct: 437 PEDRPGFF-GMLQNKGMKEYCFDYNPPDENQVWGHQVILYNCHGMGQNQFFEYTSQKEIR 495

Query: 169 HGERCVDA 176
           +  R  +A
Sbjct: 496 YNTRQPEA 503


>gi|281341254|gb|EFB16838.1| hypothetical protein PANDA_002911 [Ailuropoda melanoleuca]
          Length = 496

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR +L CKDF WF+++V  +++   
Sbjct: 297 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHV-- 354

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 355 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIR 413

Query: 169 HG----ERCVDADKQEKLVEM 185
           +     E C+  +  + ++ M
Sbjct: 414 YNTHQPEACIAVEAGKDVLIM 434


>gi|397508104|ref|XP_003824510.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
           paniscus]
          Length = 533

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455

Query: 171 ERCV 174
           + C+
Sbjct: 456 DLCL 459


>gi|395728898|ref|XP_002809364.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2 [Pongo abelii]
          Length = 532

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 344 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 396

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 397 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 454

Query: 171 ERCV 174
           + C+
Sbjct: 455 DLCL 458


>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
           4 [Pan troglodytes]
          Length = 533

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455

Query: 171 ERCV 174
           + C+
Sbjct: 456 DLCL 459


>gi|350592744|ref|XP_001927809.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Sus
           scrofa]
          Length = 571

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|332812181|ref|XP_003308857.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
           troglodytes]
 gi|410227516|gb|JAA10977.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
 gi|410264536|gb|JAA20234.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
 gi|410296424|gb|JAA26812.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
          Length = 576

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 388 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 440

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 441 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 498

Query: 171 ERCV 174
           + C+
Sbjct: 499 DLCL 502


>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455

Query: 171 ERCV 174
           + C+
Sbjct: 456 DLCL 459


>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_b [Homo sapiens]
 gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_b [Homo sapiens]
          Length = 533

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455

Query: 171 ERCV 174
           + C+
Sbjct: 456 DLCL 459


>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
           gorilla gorilla]
          Length = 533

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455

Query: 171 ERCV 174
           + C+
Sbjct: 456 DLCL 459


>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 571

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|148670410|gb|EDL02357.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12, isoform CRA_a [Mus
           musculus]
          Length = 259

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+  V  +++   
Sbjct: 59  VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 116

Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G +  CLD        V G       CHG G +Q F      ++ 
Sbjct: 117 PEDRPG-FFGMLQNRGLRGYCLDYNPPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIR 175

Query: 169 HGERCVDA 176
           +  R  +A
Sbjct: 176 YNTRQPEA 183


>gi|291402210|ref|XP_002717436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Oryctolagus cuniculus]
          Length = 571

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|4758412|ref|NP_004472.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Homo
           sapiens]
 gi|51315838|sp|Q10471.1|GALT2_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
           AltName: Full=Polypeptide GalNAc transferase 2;
           Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 2;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 2; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 2
           soluble form
 gi|971461|emb|CAA59381.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
           sapiens]
 gi|26996816|gb|AAH41120.1| GALNT2 protein [Homo sapiens]
 gi|119590317|gb|EAW69911.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
           CRA_c [Homo sapiens]
 gi|239740418|gb|ACS13744.1| polypeptide N-acetylgalactosaminyltransferase 2 [Homo sapiens]
 gi|307686451|dbj|BAJ21156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [synthetic
           construct]
          Length = 571

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
           [Macaca mulatta]
          Length = 554

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 366 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 418

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 419 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 476

Query: 171 ERCV 174
           + C+
Sbjct: 477 DLCL 480


>gi|158261119|dbj|BAF82737.1| unnamed protein product [Homo sapiens]
          Length = 571

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
           lupus familiaris]
          Length = 588

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 400 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 452

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 453 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 510

Query: 171 ERCV 174
           + C+
Sbjct: 511 DLCL 514


>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
           griseus]
          Length = 494

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 57/228 (25%), Positives = 96/228 (42%), Gaps = 32/228 (14%)

Query: 51  SLELRKRSS--WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           SL+  KR +  W+  + +      P    L  GD+  Q  LR  L+CK F WFM  +A+D
Sbjct: 277 SLQNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVVAQKRLRGSLNCKSFKWFMTKIAWD 336

Query: 109 VYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRL 161
           +   +P + PP   WGE +N+GT  C D+   T+   +    C  G G     S Q+F  
Sbjct: 337 LPKFYPPVEPPAAAWGEIRNVGTGLCTDTKHGTSGLPLRLETCIRGRGEAAWNSMQVFTF 396

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
             +  +  G    D    +KL          C  A          + + L++C +  G  
Sbjct: 397 TWKEDIRPG----DPQHTKKL----------CFDA------VSHNSPVTLYDCHSMKG-- 434

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
           +  W++    + L   +   C+      +++ M  C+ ++  QQW F+
Sbjct: 435 NQLWKYRKD-KTLYLPVSGGCMDCSESDHRIFMNTCNPSSLTQQWLFE 481


>gi|332265853|ref|XP_003281928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
           2 [Nomascus leucogenys]
          Length = 571

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|386780726|ref|NP_001248284.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Macaca
           mulatta]
 gi|384941838|gb|AFI34524.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
 gi|387540526|gb|AFJ70890.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
          Length = 571

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
          Length = 530

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 342 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 394

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 395 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 452

Query: 171 ERCV 174
           + C+
Sbjct: 453 DLCL 456


>gi|410342331|gb|JAA40112.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
 gi|410342333|gb|JAA40113.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
           troglodytes]
          Length = 576

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 388 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 440

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 441 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 498

Query: 171 ERCV 174
           + C+
Sbjct: 499 DLCL 502


>gi|332265851|ref|XP_003281927.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
           1 [Nomascus leucogenys]
          Length = 556

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 368 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 420

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 421 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 478

Query: 171 ERCV 174
           + C+
Sbjct: 479 DLCL 482


>gi|355767580|gb|EHH62635.1| hypothetical protein EGM_21033, partial [Macaca fascicularis]
          Length = 453

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 265 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 317

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 318 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 375

Query: 171 ERCV 174
           + C+
Sbjct: 376 DLCL 379


>gi|260814831|ref|XP_002602117.1| hypothetical protein BRAFLDRAFT_128843 [Branchiostoma floridae]
 gi|229287423|gb|EEN58129.1| hypothetical protein BRAFLDRAFT_128843 [Branchiostoma floridae]
          Length = 301

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 39/112 (34%), Positives = 61/112 (54%), Gaps = 9/112 (8%)

Query: 64  FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
           +K+I     P A+ +D G+I+EQ+  R K  C+ F W+++++  D+    P++      G
Sbjct: 121 YKKIFYALMPEALHIDYGNITEQLEFRRK--CRPFKWYLENIYPDL--GVPQINLQAV-G 175

Query: 124 EAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
           E +NLG Q  C+D++G    A +G   CHG G  Q F L+  G L  G  CV
Sbjct: 176 EIRNLGQQDVCVDTLG---GAHLGLWPCHGQGEHQSFSLDKSGTLKSGSLCV 224


>gi|326674124|ref|XP_696189.4| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 3-like
           [Danio rerio]
          Length = 604

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 60/230 (26%), Positives = 84/230 (36%), Gaps = 45/230 (19%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
           LR    W+  +K    L   L M    +D GD+SE++ALR +L CK F W++     ++Y
Sbjct: 401 LRVAEVWMDQYKSNVYLAWNLPMKNHGIDYGDVSERLALRKRLQCKSFDWYLD----NIY 456

Query: 111 TKFPELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL-- 167
            +       +F+GE +N   T  C+D  G           CHG G  QL R   EG L  
Sbjct: 457 PEMRRYNNTVFYGEIRNTNVTHLCVDQ-GIKENHTATLHPCHGWG-PQLGRYTKEGYLFL 514

Query: 168 ------GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
                 G   RCV  DK                          + +   L NC   S   
Sbjct: 515 GPLGSTGEDTRCVVDDK--------------------------ISSYPQLLNCEKVSSVR 548

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
              W F  +   +  A   RCL V P            + + Q+W  K +
Sbjct: 549 QKTWHFAQNESIISRAT-GRCLEVVPANVYFGYALILRSCTGQKWNIKNL 597


>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
          Length = 589

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 50/213 (23%), Positives = 89/213 (41%), Gaps = 47/213 (22%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPP 118
           W+  + +      P    +D GD+S+  ALR +L CK F W++K+V  D+   +P  LPP
Sbjct: 405 WMDEYAEFFYRRRPYVRKIDAGDLSKAKALRKELKCKSFDWYIKNVIPDLVQYYPPILPP 464

Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
           +  WG  K++ +  C+D   +    V+  S C             EG +     C+DA  
Sbjct: 465 SAAWGRLKHVVSNLCIDPQVKKGSQVV-VSQCQ----------TPEGAV---RTCLDASY 510

Query: 179 QEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI 238
           + K                         ++L  ++C N  G  +  W++    +QL H  
Sbjct: 511 RSK-------------------------SILT-WDCHNQHG--NQLWKYFE--KQLIHPS 540

Query: 239 LKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            K+C +V   +  L M+ C   +   +W ++ +
Sbjct: 541 SKKCATV--ASGALLMMPCSPGDRLFEWEWEHI 571


>gi|402896867|ref|XP_003911504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Papio
           anubis]
          Length = 581

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 93/229 (40%), Gaps = 43/229 (18%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 381 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 438

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 439 PEDRPGFF-GMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIR 497

Query: 169 HG----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTV 221
           +     E C+                   V AG+ ++    C      N        G++
Sbjct: 498 YNTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL 539

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                FH  +++   A  K           + +LR   N+ +Q+W FKE
Sbjct: 540 -----FHEQSKKCVQAARKES-----SDGFVPLLRDCTNSDHQKWFFKE 578


>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
          Length = 569

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 381 RAAEVWMDEFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 433

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 434 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 491

Query: 171 ERCV 174
           + C+
Sbjct: 492 DLCL 495


>gi|441612314|ref|XP_004088076.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Nomascus leucogenys]
          Length = 570

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 382 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 434

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 492

Query: 171 ERCV 174
           + C+
Sbjct: 493 DLCL 496


>gi|149045853|gb|EDL98853.1| rCG55041, isoform CRA_a [Rattus norvegicus]
 gi|149045854|gb|EDL98854.1| rCG55041, isoform CRA_a [Rattus norvegicus]
          Length = 284

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+  V  +++   
Sbjct: 84  VRAAEVWMDDFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 141

Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G +  CLD    +   V G       CHG G +Q F   +  ++ 
Sbjct: 142 PEDRPG-FFGMLENRGLRGYCLDYNPPSENNVEGHQVLLYLCHGMGQNQFFEYTSRQEIR 200

Query: 169 HGERCVDA 176
           +  R  +A
Sbjct: 201 YNTRQPEA 208


>gi|126307024|ref|XP_001369295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Monodelphis domestica]
          Length = 571

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 9/122 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL+CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 435

Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P +             CLD++G  A  V+G   CH SG +Q + L  +  + H + 
Sbjct: 436 ELRVPDHQDIAFGALQQGNNCLDTLGHFADGVVGVYECHNSGGNQEWALTKDKSVKHMDL 495

Query: 173 CV 174
           C+
Sbjct: 496 CL 497


>gi|392347955|ref|XP_232988.5| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Rattus norvegicus]
          Length = 579

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+  V  +++   
Sbjct: 376 VRAAEVWMDDFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 433

Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G +  CLD    +   V G       CHG G +Q F   +  ++ 
Sbjct: 434 PEDRPG-FFGMLENRGLRGYCLDYNPPSENNVEGHQVLLYLCHGMGQNQFFEYTSRQEIR 492

Query: 169 HGERCVDA 176
           +  R  +A
Sbjct: 493 YNTRQPEA 500


>gi|403300209|ref|XP_003940844.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Saimiri
           boliviensis boliviensis]
          Length = 724

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 536 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 588

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 589 ELRVPDHQDIAFG-ALQQGT-NCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 646

Query: 171 ERCV 174
           + C+
Sbjct: 647 DLCL 650


>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
          Length = 577

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 6/111 (5%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K++  L  P     D+GD+SE+ ALR KL CK F W++ +V   +  KF      
Sbjct: 391 WMDDYKRLYYLHRPDLKTKDVGDLSERKALRKKLKCKSFKWYLDNV---IPHKFIPDEGV 447

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVI--GTSYCHGSGSS-QLFRLNAEGQL 167
           + +G  +N  +  CLD++ R   + I  G   C   GSS Q+F L   GQL
Sbjct: 448 VGYGALRNPNSGLCLDTLQRDEKSTITLGIFACQTGGSSAQVFSLTKSGQL 498


>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 (predicted), isoform
           CRA_a [Rattus norvegicus]
          Length = 504

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 316 RAAEVWMDEFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 368

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 369 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 426

Query: 171 ERCV 174
           + C+
Sbjct: 427 DLCL 430


>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
 gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
          Length = 688

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 50/208 (24%), Positives = 90/208 (43%), Gaps = 34/208 (16%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
           D GD++++  LR  L CK F W++ ++       +PEL     ++  GE +NLG   + C
Sbjct: 485 DWGDVTDRKKLRADLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 537

Query: 134 LD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
           LD  S  +     +GT  CH  G +Q+  L           C+DA +     +   + + 
Sbjct: 538 LDAPSGKKHQKKAVGTYPCHRQGGNQIANLPT-------GMCLDAKELSTEGDDTSVSIY 590

Query: 192 FCVLAGVK--LIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAI 238
            C   G     +  + G +    +C + +G             +  W +  +T+QL H  
Sbjct: 591 ECHGQGGNQYWMLSKTGEIRRDDSCLDYAGKEVTLYPCHGGKGNQFWSYRANTKQLHHGT 650

Query: 239 LKRCLSVHPETNQLAMLRCDENNSYQQW 266
             +CL++    ++L M  C+ + + QQW
Sbjct: 651 SGKCLAISDSKDKLQMEECNVSVARQQW 678


>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
           catus]
          Length = 653

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 465 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 517

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 518 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 575

Query: 171 ERCV 174
           + C+
Sbjct: 576 DLCL 579


>gi|301758254|ref|XP_002914993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Ailuropoda melanoleuca]
          Length = 540

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR +L CKDF WF+++V  +++   
Sbjct: 341 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHV-- 398

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        ++G       CHG G +Q F   ++ ++ 
Sbjct: 399 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIR 457

Query: 169 HG----ERCVDADKQEKLVEM 185
           +     E C+  +  + ++ M
Sbjct: 458 YNTHQPEACIAVEAGKDVLIM 478


>gi|109476381|ref|XP_001066416.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Rattus norvegicus]
          Length = 576

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+  V  +++   
Sbjct: 376 VRAAEVWMDDFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 433

Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F+G  +N G +  CLD    +   V G       CHG G +Q F   +  ++ 
Sbjct: 434 PEDRPG-FFGMLENRGLRGYCLDYNPPSENNVEGHQVLLYLCHGMGQNQFFEYTSRQEIR 492

Query: 169 HGERCVDA 176
           +  R  +A
Sbjct: 493 YNTRQPEA 500


>gi|395519661|ref|XP_003763961.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Sarcophilus harrisii]
          Length = 631

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 56/227 (24%), Positives = 105/227 (46%), Gaps = 44/227 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  FK+I     +  + +      GDIS+++ ++ +L CK+F+W++ +V  +
Sbjct: 434 QVRLAEVWMDEFKEIFYRRNTEAAKIVKQKTFGDISKRLEIKHRLQCKNFTWYLNNVYPE 493

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           +Y   P+L P +  G  +N G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 494 IYV--PDLNP-VISGYIQNKGRHLCLD-VGENNLGGKPLIMYT--CHGLGGNQYFEYSAQ 547

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
            ++ H      + +QE  +  ++ PV    C   G K +                  T+D
Sbjct: 548 HEIRH------SIQQELCLHAVQGPVQLNTCSYKGQKTL------------------TID 583

Query: 223 -GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
              W      + L +  L  CL+ + E    +++ C+ ++ +QQW F
Sbjct: 584 VQKWEIQKD-QLLYNPSLNLCLTGNGE--HPSLVPCNPSDVFQQWDF 627


>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           4 [Ciona intestinalis]
          Length = 582

 Score = 60.8 bits (146), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 98/227 (43%), Gaps = 47/227 (20%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           LR    W+  +K+   + +P A   + GDIS +  LR+ L C DF W++ +V  D++   
Sbjct: 386 LRAAEVWMDDYKRHFYIRNPPASKENYGDISARKDLRNSLQCHDFKWYLDNVYPDLHV-- 443

Query: 114 PELPPNLFWGEAKNLGTQK-CLD--------SMGRTAPAVIGTSYCHGSGSSQLFRLNAE 164
           PE  P  ++G  +N G    CLD        + GR   ++ G   CHG G +Q F   ++
Sbjct: 444 PEDRPG-YYGAFRNSGMSSFCLDYAPPQHNPTGGRV--SIFG---CHGQGGNQFFEYTSK 497

Query: 165 GQL---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFN--CFNSSG 219
            ++      E C+ A + + +  +  +P      A  + +    GT+ ++ N  C ++S 
Sbjct: 498 REVRFNSEKEMCMSAVEDDTITMLDCVPDQRDSPASQEWVHSHDGTIQSVMNDKCLHASM 557

Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           + DG                       PE   L +  CD  + +QQW
Sbjct: 558 SKDG----------------------KPE---LLLKTCDPGSRFQQW 579


>gi|27370010|ref|NP_766281.1| polypeptide N-acetylgalactosaminyltransferase 12 [Mus musculus]
 gi|51315979|sp|Q8BGT9.1|GLT12_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
           AltName: Full=Polypeptide GalNAc transferase 12;
           Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 12;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 12
 gi|26329325|dbj|BAC28401.1| unnamed protein product [Mus musculus]
 gi|26334957|dbj|BAC31179.1| unnamed protein product [Mus musculus]
 gi|33991661|gb|AAH56425.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 12 [Mus musculus]
 gi|52851351|dbj|BAD52068.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase [Mus musculus]
 gi|74140287|dbj|BAE33836.1| unnamed protein product [Mus musculus]
          Length = 576

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 59/128 (46%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF+  V  +++   
Sbjct: 376 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 433

Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  CLD        V G       CHG G +Q F      ++ 
Sbjct: 434 PEDRPGFF-GMLQNRGLRGYCLDYNPPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIR 492

Query: 169 HGERCVDA 176
           +  R  +A
Sbjct: 493 YNTRQPEA 500


>gi|291382916|ref|XP_002708201.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
           [Oryctolagus cuniculus]
          Length = 476

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 12/141 (8%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 277 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERRQLRAKLQCKDFKWFLETVYPELHV-- 334

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        + G       CHG G +Q F   ++ ++ 
Sbjct: 335 PEDRPGFF-GMLQNKGLKNFCFDYNPPDENQITGHQVILYTCHGMGQNQFFEYTSQMEIR 393

Query: 169 HG----ERCVDADKQEKLVEM 185
           +     E CV  +  + ++ M
Sbjct: 394 YNTHQPEGCVAVEADKDVLVM 414


>gi|301611308|ref|XP_002935181.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Xenopus (Silurana) tropicalis]
          Length = 532

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 62/133 (46%), Gaps = 12/133 (9%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    WL  +K+I    +P A     GD+SE++ L+++L CK F WF+ ++  +++   
Sbjct: 364 VRAAEVWLDEYKEIYYHRNPHARKEPYGDVSERLQLKERLQCKHFKWFLDNIYPELHV-- 421

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  KN G +  C D        V G       CHG G +Q F   +  ++ 
Sbjct: 422 PEDKPGRF-GMLKNKGMENVCFDYNPPNENDVSGQQIILYPCHGMGQNQFFEYTSYNEIR 480

Query: 169 HGER----CVDAD 177
           +  R    C  AD
Sbjct: 481 YNTRQPEGCASAD 493


>gi|395736404|ref|XP_002816158.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pongo
           abelii]
          Length = 527

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 1/78 (1%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
           W+  + +      P    L  GD++ Q  LR  L+CK F WFM  +A+D+   +P + PP
Sbjct: 262 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRTSLNCKSFKWFMTKIAWDLPKFYPPVEPP 321

Query: 119 NLFWGEAKNLGTQKCLDS 136
              WGE +N+GT  C D+
Sbjct: 322 AAAWGEIRNVGTGLCADT 339


>gi|149730677|ref|XP_001496099.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Equus
           caballus]
          Length = 633

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 105/229 (45%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++ A++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFAIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K+ G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSFGQSLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTV-----------------ATG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L +  L+ CL+ + E    +++ C+ ++  Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLQMCLTANGE--HPSLVSCNPSDPLQKWIFSQ 631


>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
          Length = 597

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 67/192 (34%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
           D GD+S +  LR  L CK F W++ ++       +PEL                      
Sbjct: 458 DFGDVSARKKLRSDLGCKSFKWYLDNI-------YPEL---------------------- 488

Query: 139 RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGV 198
                V G S   G    + + L+ +G++   E CVD                       
Sbjct: 489 ----FVPGESVAKG----EYWMLSKDGEIRRDESCVDY---------------------- 518

Query: 199 KLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCD 258
                  G+ + +F C    G  +  WR+++ T +L+HA+ ++CL +  +  +L M+ C 
Sbjct: 519 ------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTRDGAKLEMVTCQ 570

Query: 259 ENNSYQQWRFKE 270
            ++ YQ W+FKE
Sbjct: 571 YDDPYQHWKFKE 582


>gi|432111808|gb|ELK34851.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Myotis davidii]
          Length = 539

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 363 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFRWYLENV-------YP 415

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 416 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 473

Query: 171 ERCV 174
           + C+
Sbjct: 474 DLCL 477


>gi|268564602|ref|XP_002647197.1| C. briggsae CBR-GLY-10 protein [Caenorhabditis briggsae]
          Length = 623

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 50/220 (22%), Positives = 87/220 (39%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P     D GD+     +R+KL CK F WFMK +A+D    +P + P 
Sbjct: 413 WMDEYKETLYKHRPGIGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAIEPK 472

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER-- 172
               GE +++ +  C+D+  +      G   C       G  Q  RL     +    R  
Sbjct: 473 ASAEGEIRHVASNLCIDTQFKEQNQRFGLRKCASEDKDGGGEQDLRLTRWHDIRPKGRKI 532

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C D         + K P+                    LF+C +  G  +  +++    +
Sbjct: 533 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRMPQK 566

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
           Q+ H +  +CL+        L M +CD ++  QQW ++ +
Sbjct: 567 QIYHPVSGQCLTADENGKGFLHMKKCDSSSKLQQWAWQTI 606


>gi|432110716|gb|ELK34193.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Myotis davidii]
          Length = 466

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 14/147 (9%)

Query: 42  KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
           KQ   ++K +L   +R    W+  FK++    +P A     GD++E+  LR KL CKDF 
Sbjct: 252 KQAPYSRKKALANSVRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFK 311

Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
           WF++ V  +++   PE  P  F G  +N G +  C D    +   + G       CHG G
Sbjct: 312 WFLETVYPELHV--PEDRPGFF-GMLQNKGLKDYCFDYNPPSEHDLTGHQVLLYLCHGMG 368

Query: 155 SSQLFRLNAEGQLGHG----ERCVDAD 177
            +Q F   ++ ++ +     E C+  +
Sbjct: 369 QNQFFEHTSQNEIRYNTHQPEACIAVE 395


>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
           [Desmodus rotundus]
          Length = 582

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 50/179 (27%), Positives = 84/179 (46%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 385 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHV--P 442

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 443 EDRPG-WHGAIRSMGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 501

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+K V M   P   F V   +   F   GT+ + F+  C ++  T +G
Sbjct: 502 SVTELCAEVPEQKKHVGMQNCPKDGFPVPTNIIWHFKEDGTIFHPFSGLCLSAYRTPEG 560


>gi|261260064|sp|A8Y236.2|GLT10_CAEBR RecName: Full=Putative polypeptide
           N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
           10; AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 10; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 10
          Length = 629

 Score = 60.8 bits (146), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 50/220 (22%), Positives = 87/220 (39%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P     D GD+     +R+KL CK F WFMK +A+D    +P + P 
Sbjct: 419 WMDEYKETLYKHRPGIGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAIEPK 478

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER-- 172
               GE +++ +  C+D+  +      G   C       G  Q  RL     +    R  
Sbjct: 479 ASAEGEIRHVASNLCIDTQFKEQNQRFGLRKCASEDKDGGGEQDLRLTRWHDIRPKGRKI 538

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C D         + K P+                    LF+C +  G  +  +++    +
Sbjct: 539 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRMPQK 572

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
           Q+ H +  +CL+        L M +CD ++  QQW ++ +
Sbjct: 573 QIYHPVSGQCLTADENGKGFLHMKKCDSSSKLQQWAWQTI 612


>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
           norvegicus]
          Length = 578

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 60/229 (26%), Positives = 97/229 (42%), Gaps = 44/229 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R+   W+  +K+     +P A      DISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 REAEVWMDDYKEHFYNRNPPARKETYDDISERKLLRERLQCKSFDWYLKNVFSNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP      A +    CHG G +Q F   +  ++
Sbjct: 439 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 494

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
                 E C +  +Q+  V M   P               L   +N+   F   GT+   
Sbjct: 495 RFNSVTELCAEVPQQKDYVGMQNCPKD------------GLPIPVNIIWHFKEDGTI--- 539

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
             FH  TR         CLS +   +    + M  CD  +  Q WRF++
Sbjct: 540 --FHPHTRL--------CLSAYRTADGRPSVQMKTCDALDKNQIWRFEK 578


>gi|354468855|ref|XP_003496866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Cricetulus griseus]
 gi|344247257|gb|EGW03361.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Cricetulus
           griseus]
          Length = 535

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 347 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLDNV-------YP 399

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 400 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 457

Query: 171 ERCV 174
           + C+
Sbjct: 458 DLCL 461


>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
 gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           AltName: Full=Polypeptide GalNAc transferase 4;
           Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
 gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
 gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 [Mus musculus]
 gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
 gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
 gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 [Mus musculus]
          Length = 578

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 60/229 (26%), Positives = 96/229 (41%), Gaps = 44/229 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GD+SE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDLSERKLLRERLKCKSFDWYLKNVFSNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP      A +    CHG G +Q F   +  ++
Sbjct: 439 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 494

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
                 E C +  +Q+  V M   P               L   +N+   F   GT+   
Sbjct: 495 RFNSVTELCAEVPQQKDYVGMQNCPKD------------GLPVPVNIIWHFKEDGTI--- 539

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
             FH  TR         CLS +        + M  CD  +  Q WRF++
Sbjct: 540 --FHPHTRL--------CLSAYRTAEGRPSVHMKTCDALDKNQLWRFEK 578


>gi|395531657|ref|XP_003767891.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Sarcophilus harrisii]
          Length = 542

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 9/122 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 354 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 406

Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P +             CLD++G  A  V+G   CH SG +Q + L  +  + H + 
Sbjct: 407 ELRVPDHQDIAFGALQQGNNCLDTLGHFADGVVGVYECHNSGGNQEWALTKDKSVKHMDL 466

Query: 173 CV 174
           C+
Sbjct: 467 CL 468


>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
 gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
          Length = 534

 Score = 60.5 bits (145), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 37/104 (35%), Positives = 55/104 (52%), Gaps = 7/104 (6%)

Query: 40  ADKQVISTQKSSLELRKRSSWLKVFKQI---RSLGSPLAMF-LDMGDISEQVALRDKLHC 95
            D  V+   K+ +  R    W+  +KQ    R L  P  +   D+GD+S Q  LR +L C
Sbjct: 309 TDNDVLQILKNYM--RVAEVWMDDYKQYYYKRMLRGPKNVTNFDLGDLSSQKPLRQRLGC 366

Query: 96  KDFSWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQKCLDSMG 138
           +DF WFM+ VA D+   +P   P++   G  +++GT  CLDS G
Sbjct: 367 RDFGWFMREVASDLVKHYPLKDPDVLQQGRIQSVGTGLCLDSDG 410


>gi|308452095|ref|XP_003088913.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
 gi|308244364|gb|EFO88316.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
          Length = 620

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 52/223 (23%), Positives = 87/223 (39%), Gaps = 34/223 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P     D GD+     +R+KL CK F WFMK +A+D    +P + P 
Sbjct: 410 WMDEYKETLYKHRPGVGSADAGDLKLMKGVREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 469

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
               GE ++  +  C+D+  +      G   C       G  Q  RL     +    R  
Sbjct: 470 ASAQGEIRHGASNLCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDVRPKGRKI 529

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C D         + K P+                    LF+C +  G  +  +++  S +
Sbjct: 530 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRVSQK 563

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEVRPD 274
           Q+ H +  +CLS        L M +CD  +  Q+W ++ +  D
Sbjct: 564 QIYHPVSGQCLSADENGKGFLYMKKCDSTSDLQKWTWQTIDND 606


>gi|195148068|ref|XP_002014996.1| GL18655 [Drosophila persimilis]
 gi|194106949|gb|EDW28992.1| GL18655 [Drosophila persimilis]
          Length = 646

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 91/229 (39%), Gaps = 38/229 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K+ + S        +D GD++ Q A+R++L CK F W++K VA D   KF
Sbjct: 432 RVAEVWMDEYKEHVYSRDPQTYNNIDAGDLTRQRAIRERLQCKSFDWYLKEVAPDFLKKF 491

Query: 114 PELP-PNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
           P +  PN   G  +N+     CLD+M   A   +G   C  + +    +Q + L  + +L
Sbjct: 492 PPVELPNYASGAFQNVAFATHCLDTMNLGAHNAVGLYSCAENRTHPQFNQFWLLTPDREL 551

Query: 168 GH--GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
            H     C+D                                 + +++C N  G  +  W
Sbjct: 552 RHNGASNCLDVQD------------------------VHANATVWMWSCHNQGG--NQFW 585

Query: 226 RFHNSTRQLEHAIL-KRCLS--VHPETNQLAMLRCDENNSYQQWRFKEV 271
            +  + + L HA   KRC+   V      +    CD  N  Q+W    V
Sbjct: 586 YYDRAHKWLVHAPNGKRCMEGIVENGIGTVYTNECDPANDRQKWEIGYV 634


>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
          Length = 571

 Score = 60.5 bits (145), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 9/122 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P +             CLD++G  A  V+G   CH +G +Q + L  E  + H + 
Sbjct: 436 ELRVPDHQDIALGXLQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDL 495

Query: 173 CV 174
           C+
Sbjct: 496 CL 497


>gi|198474477|ref|XP_001356707.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
 gi|198138408|gb|EAL33772.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
          Length = 646

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 91/229 (39%), Gaps = 38/229 (16%)

Query: 55  RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           R    W+  +K+ + S        +D GD++ Q A+R++L CK F W++K VA D   KF
Sbjct: 432 RVAEVWMDEYKEHVYSRDPQTYNNIDAGDLTRQRAIRERLQCKSFDWYLKEVAPDFLKKF 491

Query: 114 PELP-PNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
           P +  PN   G  +N+     CLD+M   A   +G   C  + +    +Q + L  + +L
Sbjct: 492 PPVELPNYASGAFQNVAFATHCLDTMNLGAHNAVGLYSCAENRTHPQFNQFWLLTPDREL 551

Query: 168 GH--GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
            H     C+D                                 + +++C N  G  +  W
Sbjct: 552 RHNGASNCLDVQDMHA------------------------NATVWMWSCHNQGG--NQFW 585

Query: 226 RFHNSTRQLEHAIL-KRCLS--VHPETNQLAMLRCDENNSYQQWRFKEV 271
            +  + + L HA   KRC+   V      +    CD  N  Q+W    V
Sbjct: 586 YYDRAHKWLVHAPNGKRCMEGIVENGIGTVYTNECDPANDRQKWEIGYV 634


>gi|417403505|gb|JAA48553.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
           rotundus]
          Length = 633

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 104/229 (45%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
            Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 AYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----NACTYKGHKTV-----------------ATG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 586 EQKWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631


>gi|410910520|ref|XP_003968738.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 3-like
           [Takifugu rubripes]
          Length = 598

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 62/251 (24%), Positives = 87/251 (34%), Gaps = 43/251 (17%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H  R        I+       LR    W+  ++    L   + M    +D GDIS++
Sbjct: 375 RVAHIARVKKPYHSNIAYHTRRNALRVAEVWMDEYRSNVYLAWNIPMENHGIDYGDISQR 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
           VALR  L CK F W+++    +VY +       LF+GE +N          G        
Sbjct: 435 VALRKSLQCKSFEWYLE----NVYPEMRRYNNTLFYGEIRNSKVSHLCMDQGIKENHTAT 490

Query: 147 TSYCHGSGSSQLFRLNAEGQL--------GHGERCVDADKQEKLVEMLKLPVTFCVLAGV 198
              CHG G  QL R   EGQL        G   RCV  D+                    
Sbjct: 491 LHPCHGWG-PQLGRYTKEGQLFLGPLGSTGEDTRCVVDDQ-------------------- 529

Query: 199 KLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCD 258
                 L +   L NC   +      W F  +   +  A   RCL V P       L   
Sbjct: 530 ------LSSFPQLINCDRVTNAKQMTWHFSQNESIVNQAT-GRCLEVVPANVYFGHLLIL 582

Query: 259 ENNSYQQWRFK 269
           +  + Q+W  K
Sbjct: 583 QPCTGQRWIIK 593


>gi|226482458|emb|CAX73828.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
          Length = 603

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 96/235 (40%), Gaps = 42/235 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P    LD GD++EQ  +R++L CK F WFM  VA+D+  K+P
Sbjct: 385 RVAEVWMDEYKEYLYKRRPSYRHLDPGDLTEQFKIRERLKCKSFKWFMTEVAFDLVKKYP 444

Query: 115 ELPP-NLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCH----GSGSSQLFRLNAEGQ-- 166
            + P ++  GE +++     CLD+   +    +    C     G  S Q F  N      
Sbjct: 445 LIEPISVAVGEIRSVSDSSLCLDATDASEYTPVKLRPCDKDNPGVSSVQKFEYNYRQDIR 504

Query: 167 -LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
            L HG  C+D                   L   K +F        L++C    G  +  W
Sbjct: 505 VLNHGT-CLDEPG----------------LKNEKTVF--------LYSCHGQGG--NQRW 537

Query: 226 RFH------NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPD 274
           +        ++  +L H     CL    +   +++  CD  +  Q+W +++++ D
Sbjct: 538 KIQPVNNDKSNPIRLIHGANAGCLEADTKNRVVSVKPCDYGSFTQRWTWEKLQFD 592


>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
          Length = 570

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 382 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 434

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 492

Query: 171 ERCV 174
           + C+
Sbjct: 493 DLCL 496


>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
          Length = 526

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 338 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 390

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 391 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 448

Query: 171 ERCV 174
           + C+
Sbjct: 449 DLCL 452


>gi|26338209|dbj|BAC32790.1| unnamed protein product [Mus musculus]
          Length = 570

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 382 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 434

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 492

Query: 171 ERCV 174
           + C+
Sbjct: 493 DLCL 496


>gi|226482456|emb|CAX73827.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
          Length = 603

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 96/235 (40%), Gaps = 42/235 (17%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P    LD GD++EQ  +R++L CK F WFM  VA+D+  K+P
Sbjct: 385 RVAEVWMDEYKEYLYKRRPSYRHLDPGDLTEQFKIRERLKCKSFKWFMTEVAFDLVKKYP 444

Query: 115 ELPP-NLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCH----GSGSSQLFRLNAEGQ-- 166
            + P ++  GE +++     CLD+   +    +    C     G  S Q F  N      
Sbjct: 445 LIEPISVAVGEIRSVSDSSLCLDATDASEYTPVKLRPCDKDNPGVSSVQKFEYNYRQDIR 504

Query: 167 -LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
            L HG  C+D                   L   K +F        L++C    G  +  W
Sbjct: 505 VLNHGT-CLDEPG----------------LKNEKTVF--------LYSCHGQGG--NQRW 537

Query: 226 RFH------NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPD 274
           +        ++  +L H     CL    +   +++  CD  +  Q+W +++++ D
Sbjct: 538 KIQPVNNDKSNPIRLIHGANAGCLEADTKNRVVSVKPCDYGSFTQRWTWEKLQFD 592


>gi|195030214|ref|XP_001987963.1| GH10909 [Drosophila grimshawi]
 gi|193903963|gb|EDW02830.1| GH10909 [Drosophila grimshawi]
          Length = 668

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 82/204 (40%), Gaps = 36/204 (17%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLG-TQKCLD 135
           +D GD++ Q A+R++L CK F WFMK VA D    FP + PP    G  +++     CLD
Sbjct: 479 IDAGDLTRQRAVRERLQCKSFDWFMKEVAPDFLVHFPPVEPPPYASGAIQSVAYPNYCLD 538

Query: 136 SMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGH-GERCVDADKQEKLVEMLKLPV 190
           SMG  +   +G   C         SQ + L+   +L      C+D               
Sbjct: 539 SMGLGSNNAVGMFSCAEDKESPQDSQNWVLSIHRELRRIDHECLDVQD------------ 586

Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPET 249
                         +   + ++ C N  G  +  W +      + H +  K+C+    E 
Sbjct: 587 ------------SHVNATVWMWTCHNQGG--NQFWYYDREKYWIVHGLHGKQCMEAFVEN 632

Query: 250 N--QLAMLRCDENNSYQQWRFKEV 271
           N  ++    C+E N  Q+W F  V
Sbjct: 633 NVAKVCTNACEEGNVRQRWNFGMV 656


>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
           musculus]
 gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
           AltName: Full=Polypeptide GalNAc transferase 2;
           Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 2;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 2; Contains: RecName:
           Full=Polypeptide N-acetylgalactosaminyltransferase 2
           soluble form
 gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [Mus musculus]
          Length = 570

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 382 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 434

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 492

Query: 171 ERCV 174
           + C+
Sbjct: 493 DLCL 496


>gi|344278311|ref|XP_003410938.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Loxodonta africana]
          Length = 572

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 384 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 436

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 437 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 494

Query: 171 ERCV 174
           + C+
Sbjct: 495 DLCL 498


>gi|67970320|dbj|BAE01503.1| unnamed protein product [Macaca fascicularis]
          Length = 250

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 47  QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 106

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 107 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 160

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G K +              
Sbjct: 161 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV------------- 200

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
               T +  W      + L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 201 ----TGEQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 248


>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
           aries]
          Length = 633

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A +    V  LK     C   G K +              
Sbjct: 544 FEYSAQREIRHNIQKELCLHAAQG---VVQLKA----CAYKGRKTV-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
               T +  W      + L +   K CLS   E    +++ C+ ++S Q+W F +
Sbjct: 583 ---ATGEQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDSLQKWIFNQ 631


>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
          Length = 544

 Score = 60.1 bits (144), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 356 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 408

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 409 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 466

Query: 171 ERCV 174
           + C+
Sbjct: 467 DLCL 470


>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 2 [Mus musculus]
          Length = 548

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 360 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 412

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 413 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 470

Query: 171 ERCV 174
           + C+
Sbjct: 471 DLCL 474


>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
           (Silurana) tropicalis]
 gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
           tropicalis]
 gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
           tropicalis]
          Length = 582

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 71/145 (48%), Gaps = 16/145 (11%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K++    +P A   + GDISE+  LR++L CK F W++K+V  D++   P
Sbjct: 385 RAAEVWMDGYKELFYNRNPPARKENYGDISERKLLRERLQCKSFDWYLKNVFPDLH--IP 442

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP      A +    CHG G +Q F      ++
Sbjct: 443 EDRPG-WHGAVRSMGISNECLDY---NAPDHNPTGAHLSLFGCHGQGGNQFFEYTTMREI 498

Query: 168 GHG---ERCVDADKQEKLVEMLKLP 189
                 E C +  +Q+  + M+  P
Sbjct: 499 RFNSVTELCAEVPEQQTYIGMMHCP 523


>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
          Length = 536

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 348 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 400

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 401 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 458

Query: 171 ERCV 174
           + C+
Sbjct: 459 DLCL 462


>gi|349605004|gb|AEQ00388.1| Polypeptide N-acetylgalactosaminyltransferase 3-like protein,
           partial [Equus caballus]
          Length = 337

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 102/222 (45%), Gaps = 44/222 (19%)

Query: 60  WLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE 115
           W+  +K+I     +  + +      GD+S++ A++ +L CK+F+W++ ++  +VY   P+
Sbjct: 147 WMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFAIKHRLQCKNFTWYLNNIYPEVYV--PD 204

Query: 116 LPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAEGQLGHG- 170
           L P +  G  K+ G   CLD +G       P ++ T  CHG G +Q F  +A+ ++ H  
Sbjct: 205 LNP-VISGYIKSFGQSLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQHEIRHNI 260

Query: 171 --ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
             E C+ A   + LV++       C   G K +                  T +  W   
Sbjct: 261 QKELCLHA--AQGLVQLKA-----CTYKGHKTV-----------------ATGEQIWEIQ 296

Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
              + L +  L+ CL+ + E    +++ C+ ++  Q+W F +
Sbjct: 297 KD-QLLYNPFLQMCLTANGE--HPSLVSCNPSDPLQKWIFSQ 335


>gi|109099998|ref|XP_001096023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           1 [Macaca mulatta]
 gi|297264195|ref|XP_002798936.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           2 [Macaca mulatta]
 gi|355564937|gb|EHH21426.1| hypothetical protein EGK_04492 [Macaca mulatta]
 gi|355750584|gb|EHH54911.1| hypothetical protein EGM_04018 [Macaca fascicularis]
          Length = 633

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 105/229 (45%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631


>gi|332234083|ref|XP_003266237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Nomascus leucogenys]
          Length = 633

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G K +              
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV------------- 583

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
               T +  W      + L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 584 ----TGEQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631


>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
          Length = 618

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++ +V       +P
Sbjct: 385 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 437

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 438 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 495

Query: 171 ERCV 174
           + C+
Sbjct: 496 DLCL 499


>gi|402888519|ref|XP_003907606.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Papio
           anubis]
          Length = 633

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 105/229 (45%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631


>gi|125815709|ref|XP_001338018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Danio
           rerio]
          Length = 604

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 47/149 (31%), Positives = 66/149 (44%), Gaps = 16/149 (10%)

Query: 29  GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
            ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE
Sbjct: 374 ARVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMNNPGVDFGDVSE 433

Query: 86  QVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTA 141
           +VALR K++CK F W+++HV       +PE+      + +GE +N   +  CLD      
Sbjct: 434 RVALRKKMNCKSFRWYLEHV-------YPEMRIYNNTITYGEVRNSKASGYCLDQGSEED 486

Query: 142 PAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
              I    CHG  SSQL R   EG L  G
Sbjct: 487 ERAILYP-CHGM-SSQLARYTTEGLLQLG 513


>gi|291391661|ref|XP_002712292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Oryctolagus cuniculus]
          Length = 633

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 70/135 (51%), Gaps = 17/135 (12%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++++L CK+F+W++  V  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKNRLQCKNFTWYLNTVYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   PEL P +  G  K +G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PELNP-VISGYIKTVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDA 176
            ++ H    E C+ A
Sbjct: 550 NEIRHNIQKELCLHA 564


>gi|326918600|ref|XP_003205576.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
           6-like, partial [Meleagris gallopavo]
          Length = 303

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR  L CKDF WFM  VA+DV   +P
Sbjct: 228 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 287

Query: 115 EL-PPNLFWGEAKN 127
            + PP   WGE +N
Sbjct: 288 PVEPPPAAWGEVRN 301


>gi|326430087|gb|EGD75657.1| hypothetical protein PTSG_07775 [Salpingoeca sp. ATCC 50818]
          Length = 711

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 67/125 (53%), Gaps = 5/125 (4%)

Query: 53  ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
           +LR    W+  +K+I    +      D GD+SE+ A+R++L+C  F WF+++V  D++  
Sbjct: 418 DLRVAEVWMDEYKEI-VYDTRGFYGKDPGDLSERRAIRERLNCHSFKWFLENVHPDLWV- 475

Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
            P++ P  F G   +   + C+D+M       IG   CHG  ++Q +RL  +G + +G  
Sbjct: 476 -PDIHPK-FSGIISDPSKRICVDNMQHDRGGPIGMYGCHGM-TTQRWRLMNDGVISNGVA 532

Query: 173 CVDAD 177
           C+  D
Sbjct: 533 CLRWD 537


>gi|341897758|gb|EGT53693.1| CBN-GLY-10 protein [Caenorhabditis brenneri]
          Length = 620

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/220 (22%), Positives = 88/220 (40%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P     D GD+    A+R+KL CK F WFMK +A+D    +P + P 
Sbjct: 410 WMDEYKETLYKHRPGVGNADAGDLKLMKAVREKLQCKSFDWFMKEIAFDQDKYYPAIEPK 469

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
               GE ++  +  C+D+  +      G   C       G  Q  RL     +    R  
Sbjct: 470 PSAQGEIRHAASNLCIDTQFKEQNQRFGLRKCTSDEKDGGGEQDLRLTRWHDIRPKGRKI 529

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C D  +      + K P+                    LF+C +  G  +  +++  + +
Sbjct: 530 CFDVSQS-----IDKAPII-------------------LFDCHSMKG--NQLFKYRVAAK 563

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
           Q+ H +  +CL+        L M +CD  +  Q+W ++ +
Sbjct: 564 QIYHPVSGQCLTGDENGKGFLHMKKCDSTSDLQKWTWQTI 603


>gi|308457549|ref|XP_003091148.1| CRE-GLY-10 protein [Caenorhabditis remanei]
 gi|308258137|gb|EFP02090.1| CRE-GLY-10 protein [Caenorhabditis remanei]
          Length = 620

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 86/220 (39%), Gaps = 34/220 (15%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P     D GD+     +R+KL CK F WFMK +A+D    +P + P 
Sbjct: 410 WMDEYKETLYKHRPGVGSADAGDLKLMKGVREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 469

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
               GE ++  +  C+D+  +      G   C       G  Q  RL     +    R  
Sbjct: 470 ASAQGEIRHGASNLCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDVRPKGRKI 529

Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
           C D         + K P+                    LF+C +  G  +  +++  S +
Sbjct: 530 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRVSQK 563

Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
           Q+ H +  +CLS        L M +CD  +  Q+W ++ +
Sbjct: 564 QIYHPVSGQCLSADENGKGFLYMKKCDSTSDLQKWTWQTI 603


>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Callithrix jacchus]
          Length = 633

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 105/229 (45%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKTFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGHPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L +  L  CLS + E    +++ C+ ++S Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLNMCLSANGE--HPSLVSCNPSDSLQKWIFSQ 631


>gi|300797404|ref|NP_001179787.1| polypeptide N-acetylgalactosaminyltransferase 3 [Bos taurus]
          Length = 633

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 103/235 (43%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGRPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A                  L  V+L  C        +   
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA-----------------ALGAVQLKAC-------AYKGH 579

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
            +    +  W      + L +   K CLS   E    +++ C+ ++S Q+W F +
Sbjct: 580 KTVAIGEQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDSLQKWIFNQ 631


>gi|401399841|ref|XP_003880648.1| putative UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3 [Neospora caninum
           Liverpool]
 gi|325115059|emb|CBZ50615.1| putative UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3 [Neospora caninum
           Liverpool]
          Length = 635

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 63/235 (26%), Positives = 100/235 (42%), Gaps = 46/235 (19%)

Query: 46  STQKSSLELRKRSS--WLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
           S+   S+ + K  +  W+  +  +  R +G P   +     + ++   R +  CK F WF
Sbjct: 424 SSPGDSIAINKMRTMMWMDEYADLAWRVIGQPKVNYRPE-SLEKRREWRKRKGCKSFRWF 482

Query: 102 MKHV--AYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPA-VIGTSYCHGSGSSQL 158
           M++V    DV T   ++P   + G  +N   + CLD+MG  +P   +G   CHG G +Q 
Sbjct: 483 MENVFPEGDVVT-LDDVP---YLGPLRNAKVEMCLDNMGWASPGHAVGLGRCHG-GETQT 537

Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
           F      ++GH     D +               C+    KL +CR              
Sbjct: 538 FMFFR--KVGHVMPVNDDEA--------------CLQPSGKLDWCR-------------- 567

Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
           GT    W F  S+ QL     K+CLS      QL M+ CD+ ++YQ W +K  +P
Sbjct: 568 GTSQFWWDF-TSSGQLMFRETKQCLSAF--GRQLRMVDCDDTDAYQIWSWKAYKP 619


>gi|327262105|ref|XP_003215866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Anolis carolinensis]
          Length = 575

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/219 (25%), Positives = 94/219 (42%), Gaps = 41/219 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ L+ +L+CK F W++++V       +P
Sbjct: 387 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRLNCKPFKWYLENV-------YP 439

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 440 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 497

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
           + C        L  + + P                G+++ L  C  + G     W    S
Sbjct: 498 DLC--------LTVVDRAP----------------GSLIKLQGCRENDGR--QKWEQIES 531

Query: 231 TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
             +L H     CL S + +   L +  C+ + S QQW+F
Sbjct: 532 NAKLRHVGSNLCLDSRNAKNGGLTVEVCNPSLS-QQWKF 569


>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
 gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
          Length = 455

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 58/110 (52%), Gaps = 5/110 (4%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  + +      P A  +  GDIS+++ LR+KL C+ F W+M++V  ++    P
Sbjct: 313 RAAEAWMDGYAKFYYKREPGARGVPYGDISDRLKLREKLKCRSFKWYMRNVYPEL--NVP 370

Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAE 164
           E   + F GE +  G  KCLDS+G      + T  CHG G +Q + +  +
Sbjct: 371 EGVNDKF-GELRQGG--KCLDSIGGKPGDRVSTFPCHGGGGNQAWDMTKD 417


>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
           gallus]
          Length = 566

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR +L CK F W++++V       +P
Sbjct: 378 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENV-------YP 430

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 431 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 488

Query: 171 ERCV 174
           + C+
Sbjct: 489 DLCL 492


>gi|47199070|emb|CAF89237.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 129

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 12/107 (11%)

Query: 80  MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLG-TQKCLDSMG 138
            GD++E+  LR+KL CK F WF+K++  D++   PE  P +F G  KN G    C D   
Sbjct: 5   FGDVTERRKLREKLGCKSFGWFLKNIYPDLHV--PEDNPGMF-GMLKNRGKADHCFDYNP 61

Query: 139 RTAPAVIGTSY----CHGSGSSQLFRLNAEGQLGHGER----CVDAD 177
                V+G       CHG G +Q F  + +G+L +  R    CV  D
Sbjct: 62  TDDDVVVGERVILYPCHGMGQNQFFEYSKDGELRYNTRTPAGCVMGD 108


>gi|326428122|gb|EGD73692.1| glycosyl transferase [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 51/205 (24%), Positives = 80/205 (39%), Gaps = 51/205 (24%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYD-VYTKFPELPPNLFWGEAKNLGTQKCLDSM 137
           D+G +   + L+ +L CK F W++K+V  + + T   ++      G  KN     C+D+M
Sbjct: 393 DIGPLDHMLELKQQLQCKSFDWYLKNVYPEGIITDLSDIRA---LGTVKNTANNMCVDNM 449

Query: 138 GRT-APAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
               A A +G   CHG GS                                   TF +LA
Sbjct: 450 QHMYADAKMGAYACHGHGSQ----------------------------------TFLILA 475

Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTV-------DGPWRFHNSTRQLEHAILKRCLSVHPET 249
             K     +  V NL  C  S  T+       D  W +  +   L++   ++CL+V  + 
Sbjct: 476 RTK----EIRPVGNLELCLTSDTTISWCESRHDVTWDYDQANMLLKNTKTQKCLAVG-DG 530

Query: 250 NQLAMLRCDENNSYQQWRFKEVRPD 274
           N L    CD +N   +W F +  PD
Sbjct: 531 NNLVTEDCDLHNDRHKWTFDDKHPD 555


>gi|395836156|ref|XP_003791031.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Otolemur garnettii]
          Length = 571

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G     V+G   CH +G +Q + L  E  + H 
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFTDGVVGVYECHNAGGNQEWALTKEKAVKHI 493

Query: 171 ERCV 174
           + C+
Sbjct: 494 DLCL 497


>gi|395844920|ref|XP_003795196.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Otolemur garnettii]
          Length = 633

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/231 (23%), Positives = 102/231 (44%), Gaps = 48/231 (20%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  ++ 
Sbjct: 496 VYV--PDLNP-VISGYIKSIGKPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSSL 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFNCFNSSG 219
            ++ H    E C+ A K          PV    C   G K +                  
Sbjct: 550 REIRHNIQKELCLHAAKG---------PVQLKACAYKGHKTVV----------------- 583

Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           T++  W      + L + + K CLS + E   L  + C+ ++  Q+W F +
Sbjct: 584 TIEQIWEIQKD-QLLYNPLFKMCLSANGEHPNL--VSCNPSDPLQKWIFSQ 631


>gi|47216162|emb|CAG10036.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 534

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 100/233 (42%), Gaps = 36/233 (15%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +KQ     +P A     GDIS ++ LR+KL C+ F+W++K++       +
Sbjct: 321 VRAAEVWMDSYKQHFYNRNPAARKETYGDISGRLLLREKLKCQSFTWYLKNI-------Y 373

Query: 114 PEL--PPNL--FWGEAKNLG-TQKCLDSMGRTAP--AVIGTSY----CHGSGSSQLFRLN 162
           PEL  P +   + G  +NLG + +CLD     AP  +V G       CHG G +Q F   
Sbjct: 374 PELHIPEDRAGWHGAVRNLGISSECLDY---NAPEHSVTGAQLSLFGCHGQGGNQYFEYT 430

Query: 163 AEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSG 219
           ++ ++      E C +  + +  V M   P          L   R     +L     ++ 
Sbjct: 431 SQKEIRFNTVTELCAEVLEGKVAVGMRHCPGDGESRPASILWEFRQVRPPSLRKERENNE 490

Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA---MLRCDENNSYQQWRFK 269
             DG          + H     C++V+  +   A   M RCD  +  QQW+F+
Sbjct: 491 KPDG---------SIYHPHTDTCITVYRTSTGRADAKMARCDPGDKTQQWKFE 534


>gi|195030212|ref|XP_001987962.1| GH10908 [Drosophila grimshawi]
 gi|193903962|gb|EDW02829.1| GH10908 [Drosophila grimshawi]
          Length = 684

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 44/81 (54%), Gaps = 2/81 (2%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLG-TQKCLD 135
           +D GD++ Q A+R++L CK F WFMK VA D    FP + PP    G  +++     CLD
Sbjct: 495 IDAGDLTRQRAVRERLQCKSFDWFMKEVAPDFLVHFPPVEPPPYASGAIQSVAYPNYCLD 554

Query: 136 SMGRTAPAVIGTSYCHGSGSS 156
           SMG+     +G   C  +  S
Sbjct: 555 SMGKGWKHAVGLFTCAQNKKS 575


>gi|169790935|ref|NP_938048.2| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Mus
           musculus]
 gi|148688066|gb|EDL20013.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9, isoform CRA_a [Mus
           musculus]
          Length = 604

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 16/148 (10%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
           +ALR+KL C+ F W++++V       +PE+      L +GE +N   +  CLD       
Sbjct: 435 LALREKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 487

Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
             I    CHG  SSQL R +AEG L  G
Sbjct: 488 RAILYP-CHGM-SSQLVRYSAEGLLQLG 513


>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
 gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
          Length = 518

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/116 (27%), Positives = 55/116 (47%), Gaps = 3/116 (2%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P       GDIS+++ LR KL CK+F W++K+V  +++     +   
Sbjct: 328 WMDGYKRYFYKRQPHLKGHPFGDISKRLELRKKLKCKNFDWYIKNVVPEIFLPNSSI--- 384

Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
           +  GE +N  +  C+DS+G    A IG   CH    +Q        ++   + C D
Sbjct: 385 IARGELRNPASGDCIDSLGAGEHAYIGIYKCHKQMGNQYLVYTKNEEIIVDDNCFD 440


>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Taeniopygia guttata]
          Length = 669

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR +L CK F W++++V       +P
Sbjct: 481 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENV-------YP 533

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 534 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 591

Query: 171 ERCV 174
           + C+
Sbjct: 592 DLCL 595


>gi|169790937|ref|NP_001116111.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform B [Mus
           musculus]
 gi|74142966|dbj|BAE42509.1| unnamed protein product [Mus musculus]
 gi|148688067|gb|EDL20014.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9, isoform CRA_b [Mus
           musculus]
          Length = 237

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 16/148 (10%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 8   RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
           +ALR+KL C+ F W++++V       +PE+      L +GE +N   +  CLD       
Sbjct: 68  LALREKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 120

Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
             I    CHG  SSQL R +AEG L  G
Sbjct: 121 RAILYP-CHGM-SSQLVRYSAEGLLQLG 146


>gi|327283203|ref|XP_003226331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Anolis carolinensis]
          Length = 644

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 66/145 (45%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 377 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMMNPGVDFGDVSER 436

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAK-NLGTQKCLDSMGRTAPAVI 145
           VALR +L C+ F W+++    +VY++       + +GE + N  +  CLD         I
Sbjct: 437 VALRQRLQCRSFKWYLE----NVYSEMRIYNNTITYGEVRNNKASGYCLDQGAEDDDKAI 492

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R ++EG L  G
Sbjct: 493 LYP-CHGM-SSQLVRYSSEGLLQLG 515


>gi|190358441|ref|NP_001121823.1| polypeptide N-acetylgalactosaminyltransferase 2 [Danio rerio]
          Length = 559

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 57/122 (46%), Gaps = 9/122 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK       P A  +  G+I  ++ ++ +L CK F W++++V       +P
Sbjct: 371 RAAEVWMDDFKNFYYAAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLENV-------YP 423

Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
           EL  P +           Q CLD++G  A  V+G   CH +G +Q + L  +  + H + 
Sbjct: 424 ELRVPDHQDIAFGALQQGQNCLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHMDL 483

Query: 173 CV 174
           C+
Sbjct: 484 CL 485


>gi|21912817|gb|AAM81087.1|AF321881_1 UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1 [Toxoplasma gondii]
 gi|21912813|gb|AAM81085.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1 [Toxoplasma gondii]
 gi|221487005|gb|EEE25251.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1, putative
           [Toxoplasma gondii GT1]
          Length = 751

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 13/112 (11%)

Query: 53  ELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +LR    W+  + +I  L  P L     +GD++E  ALRD+L CKDF+W++K++      
Sbjct: 543 KLRTAHVWMGEYARIVELVIPRLPQDKPLGDLTELKALRDRLKCKDFNWYLKNI------ 596

Query: 112 KFPEL-PPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
            +PEL PPNL     G  +N     CLD++  T    IG   CH    +Q F
Sbjct: 597 -YPELEPPNLAHAMTGAMRNPKFNCCLDTL-TTKNQEIGVYPCHFEHGTQAF 646


>gi|431895640|gb|ELK05066.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Pteropus alecto]
          Length = 367

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR  L CK F W++++V       +P
Sbjct: 179 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKTLACKPFKWYLENV-------YP 231

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 232 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 289

Query: 171 ERCV 174
           + C+
Sbjct: 290 DLCL 293


>gi|237831699|ref|XP_002365147.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1, putative
           [Toxoplasma gondii ME49]
 gi|211962811|gb|EEA98006.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1, putative
           [Toxoplasma gondii ME49]
 gi|221506688|gb|EEE32305.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1, putative
           [Toxoplasma gondii VEG]
          Length = 751

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 13/112 (11%)

Query: 53  ELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +LR    W+  + +I  L  P L     +GD++E  ALRD+L CKDF+W++K++      
Sbjct: 543 KLRTAHVWMGEYARIVELVIPRLPQDKPLGDLTELKALRDRLKCKDFNWYLKNI------ 596

Query: 112 KFPEL-PPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
            +PEL PPNL     G  +N     CLD++  T    IG   CH    +Q F
Sbjct: 597 -YPELEPPNLAHAMTGAMRNPKFNCCLDTL-TTKNQEIGVYPCHFEHGTQAF 646


>gi|301604565|ref|XP_002931923.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Xenopus (Silurana) tropicalis]
          Length = 604

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 46/145 (31%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           VALR KL C+ F W+++    +VY +       + +GE +N   T  CLD         I
Sbjct: 435 VALRQKLQCRSFKWYLE----NVYPEMRVYNNTITYGELRNSKATGYCLDQGSEDDNKAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R ++EG L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSSEGLLQLG 513


>gi|56756104|gb|AAW26230.1| SJCHGC09400 protein [Schistosoma japonicum]
          Length = 737

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 38/134 (28%), Positives = 66/134 (49%), Gaps = 15/134 (11%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           LR    WL  +K+         M +D+G++SE+ ALR+KL C  F W++ ++       +
Sbjct: 496 LRLADVWLDDYKRFYHARIGFEM-VDIGNVSERKALREKLKCHSFDWYLTNI-------Y 547

Query: 114 PEL---PPNLFWGEAKNLGTQKCLD----SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
           PEL      L  G+ ++     CLD    S   ++  +I T  CH  G +Q + L++E +
Sbjct: 548 PELFVPSKALASGDIESAAGPHCLDAPLPSENDSSSVIIKTRPCHKQGGNQFWLLSSENE 607

Query: 167 LGHGERCVDADKQE 180
           +   + C D+  Q+
Sbjct: 608 IRRDDYCFDSGIQK 621


>gi|358332242|dbj|GAA50924.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
          Length = 403

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 89/192 (46%), Gaps = 37/192 (19%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDS- 136
           +D GD+SE+ A+ ++L C  F W++K    +VY +F     ++  G+ ++     C+DS 
Sbjct: 239 IDFGDVSERKAILERLKCHSFDWYLK----NVYPEFFVPTDSVAHGDIESEAGPHCIDSP 294

Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
           +      ++G   CH  G +Q + ++  G++    +C D                    A
Sbjct: 295 LKGDGKVIVGMWPCHREGGNQYWLMSKLGEIRRDNKCWD--------------------A 334

Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLR 256
           G++     +G V  LF+C    G        + S  +++H   +RC+++  +   L+M  
Sbjct: 335 GIE-----VGRVA-LFDCHGVRGNQHF---VYTSNNEIKHG--ERCVTLSEDNKSLSMEY 383

Query: 257 CDENNSYQQWRF 268
           C  N++ Q+W+F
Sbjct: 384 CS-NSARQRWKF 394


>gi|158300338|ref|XP_001238288.2| AGAP012253-PA [Anopheles gambiae str. PEST]
 gi|157013111|gb|EAU75785.2| AGAP012253-PA [Anopheles gambiae str. PEST]
          Length = 593

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 55/232 (23%), Positives = 92/232 (39%), Gaps = 48/232 (20%)

Query: 54  LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +R    W+  +K+I     G P  +  + G ++ + A+R+   CK FS+++++       
Sbjct: 396 IRLAEVWMDEYKRIIFDIYGIPHYLEEEFGSVATRKAIRESAKCKPFSYYLENA------ 449

Query: 112 KFPE----LPPNLFWGEAKN--LGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
            FPE    L P  F GE  N  LG   CL    RT    +G + C      Q +  N   
Sbjct: 450 -FPEMHNPLVPGAFRGEVHNMALGNGSCLTY--RTRDRFLGMAPCDHLEKDQYWTHNYYQ 506

Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
           +L     C+DA                            +GTV+ ++ C  S G  +  W
Sbjct: 507 ELNSYRNCIDA----------------------------VGTVVEVYQCHRSRG--NQAW 536

Query: 226 RFHNSTRQLEHAILKRCLSVHPETNQLAML-RCDENNSYQQWRFKEVRPDWS 276
           +    ++Q+       CL+++ +T    +L +CD     QQW    +  D S
Sbjct: 537 KVLVESQQILSVARNLCLALNLQTKTTLLLEKCDATKPSQQWNVSYIELDVS 588


>gi|301614636|ref|XP_002936794.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Xenopus (Silurana) tropicalis]
          Length = 625

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 55/95 (57%), Gaps = 8/95 (8%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
           + GD+S+++ LR +L CK+F+W++ ++  ++Y   PE  P L  G+ KN+G   CLD  G
Sbjct: 458 EYGDLSKRLDLRHRLQCKNFTWYLNNIYPEMYV--PERHP-LIHGDLKNVGRDLCLDVGG 514

Query: 139 RT---APAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
                 P ++ +  CHG G +Q F   ++ ++ H 
Sbjct: 515 ENHGDKPLIMYS--CHGLGGNQYFEYTSKHEIRHN 547


>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Cavia porcellus]
          Length = 531

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR +L C+ F W++++V       +P
Sbjct: 343 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCRPFQWYLENV-------YP 395

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 396 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 453

Query: 171 ERCV 174
           + C+
Sbjct: 454 DLCL 457


>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
 gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
          Length = 415

 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 49/83 (59%), Gaps = 11/83 (13%)

Query: 82  DISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW--GEAKNLGTQKCLDSM 137
           D+S+++ALR+KL CK+F W++ +V       +PEL  P   F   G+ +N  +  CLD++
Sbjct: 339 DVSQRLALREKLKCKNFKWYLDNV-------YPELEVPDTNFAASGQVRNPSSDMCLDTL 391

Query: 138 GRTAPAVIGTSYCHGSGSSQLFR 160
           G+     +G   CHG G +Q+ R
Sbjct: 392 GKKDDTPLGLYQCHGQGGNQVKR 414


>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Ornithorhynchus anatinus]
          Length = 628

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR +L CK F W++++V       +P
Sbjct: 440 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLSCKPFKWYLENV-------YP 492

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 493 ELRVPDHQDIAFG-ALQQGT-NCLDTLGHFADGVVGVYECHNAGGNQEWALTKDRSVKHM 550

Query: 171 ERCV 174
           + C+
Sbjct: 551 DLCL 554


>gi|397507787|ref|XP_003824367.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
           paniscus]
          Length = 633

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/235 (23%), Positives = 106/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQSLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G K +              
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KGCTYKGHKTVV------------- 583

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
               T +  W      + L +   K CLS + E    +++ C+ ++  Q+W F +
Sbjct: 584 ----TGEQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWVFSQ 631


>gi|390347275|ref|XP_003726736.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
           [Strongylocentrotus purpuratus]
          Length = 507

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 60/123 (48%), Gaps = 14/123 (11%)

Query: 72  SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LPPNL-FWGEAKNL 128
           +P    +D GDIS ++ +RD+L CK+F W++++V       +PE  LP N    G   N 
Sbjct: 319 TPEYQAVDAGDISSRIRIRDELKCKNFEWYLENV-------YPENILPLNFQALGRFMNE 371

Query: 129 GTQKCLDSM----GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVE 184
           G   C+D +    GR   A +  + C     +Q F  N   QL H   C+ A + +  V 
Sbjct: 372 GVNLCIDVLHATDGRRMGAHLAVNACREGALAQTFSWNDLSQLRHDRFCITAVEGDNHVM 431

Query: 185 MLK 187
           +L+
Sbjct: 432 LLE 434


>gi|157114758|ref|XP_001652407.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108883560|gb|EAT47785.1| AAEL001146-PA [Aedes aegypti]
          Length = 552

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 84/218 (38%), Gaps = 38/218 (17%)

Query: 54  LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +R    W+  + Q+     G P     + GD+S++  LR+ L+CK F W++++       
Sbjct: 364 VRVAEVWMDQYAQVVYDMFGGP-EFRGNFGDVSDRKKLRESLNCKSFKWYLENA------ 416

Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
            FPEL   + +G              G+     +G ++C         R    G     E
Sbjct: 417 -FPELEDPVSYGVGH-----------GKFTNLGVGKNFCP--------RYRKAGYTFRME 456

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
            C D D Q  +  ML    T  V           G  L +F C    G  +  WR+  ST
Sbjct: 457 PCTDDDYQHWVHNMLGEISTSNVCLDYD------GITLYMFECHKGQG--NQKWRYSKST 508

Query: 232 RQLEHAILKRCLSVHPETN-QLAMLRCDENNSYQQWRF 268
           +Q  +   K CL V P    +L   +C+     Q+W F
Sbjct: 509 KQFTNVKNKVCLDVGPAPEMKLVAEKCNATKESQKWEF 546


>gi|348534088|ref|XP_003454535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
           [Oreochromis niloticus]
          Length = 559

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ ++ +L+CK F W++++V       +P
Sbjct: 371 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLEMKKRLNCKPFKWYLENV-------YP 423

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G  +  G   CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 424 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 481

Query: 171 ERCV 174
           + C+
Sbjct: 482 DLCL 485


>gi|432098371|gb|ELK28171.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Myotis davidii]
          Length = 633

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 52/228 (22%), Positives = 100/228 (43%), Gaps = 42/228 (18%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
           VY   P+L P +  G  K+ G   CLD   +   + P ++ T  CHG G +Q F  +A+ 
Sbjct: 496 VYV--PDLNP-VISGYIKSFGQPLCLDVGENNQGSKPLILYT--CHGLGGNQYFEYSAQH 550

Query: 166 QLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
           ++ H    E C+ A           + +  C   G K +                  T +
Sbjct: 551 EIRHNIQKELCLHAAPG-------PVQLKTCTYKGHKTV-----------------ATGE 586

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
             W        L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 587 QIWEIQKDLL-LYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631


>gi|440897124|gb|ELR48890.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Bos
           grunniens mutus]
          Length = 499

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR +L CKDF WF+  V  +++   
Sbjct: 299 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERRQLRARLGCKDFKWFLNTVYPELHV-- 356

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F G  +N G +  C D        + G       CHG G +Q F   ++ ++
Sbjct: 357 PEDRPGFF-GMLQNKGLRDYCFDYNPPNEHEITGHQVILYRCHGMGQNQFFEYTSQNEI 414


>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
           glaber]
          Length = 567

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR +L C+ F W++++V       +P
Sbjct: 379 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCQPFQWYLENV-------YP 431

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  E  + H 
Sbjct: 432 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 489

Query: 171 ERCV 174
           + C+
Sbjct: 490 DLCL 493


>gi|426222263|ref|XP_004005316.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
           partial [Ovis aries]
          Length = 467

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 8/122 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR +L CKDF WF+  V  +++   
Sbjct: 267 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERRQLRARLGCKDFKWFLNTVYPELHV-- 324

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
           PE  P  F G  +N G +  C D        + G       CHG G +Q F   ++ ++ 
Sbjct: 325 PEDRPGFF-GMLQNKGLKDYCFDYNPPNEHEITGHQVILYRCHGMGQNQFFEYTSQNEIR 383

Query: 169 HG 170
           + 
Sbjct: 384 YN 385


>gi|313234048|emb|CBY19624.1| unnamed protein product [Oikopleura dioica]
          Length = 827

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 62/112 (55%), Gaps = 10/112 (8%)

Query: 54  LRKRSSWLKVFKQIR-SLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDV-YT 111
           +R    WL  FK++    G+P+   +D G++S ++ LR+ L C+ FSW+M++V  ++ YT
Sbjct: 697 IRVADVWLDEFKEVYFRRGAPILKNIDPGNMSSRIQLRENLQCQPFSWYMENVLPELDYT 756

Query: 112 KFPELPPNLFWGE----AKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
              +L   +F GE    A+N   ++C DS G+   A I    CHG   +Q +
Sbjct: 757 MNDDL---IFAGEIISQARNRTNRQCFDSTGKDN-AQIQIFPCHGLLGNQYY 804


>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Ailuropoda melanoleuca]
          Length = 578

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 84/179 (46%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L C+ F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAVRSMGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+K V M   P   F + A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKKYVGMQNCPKDGFPIPANIIWHFKEDGTIFHPHSGLCLSAFRTPEG 556


>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
          Length = 491

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 77/164 (46%), Gaps = 11/164 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L C+ F W++K+V  +++   P
Sbjct: 320 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHV--P 377

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 378 EDRPG-WHGAVRSMGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 436

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLN 210
              E C +  +Q+K V M   P   F + A +   F   GT+ +
Sbjct: 437 SVTELCAEVPEQKKYVGMQNCPKDGFPIPANIIWHFKEDGTIFH 480


>gi|114581503|ref|XP_515871.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
           troglodytes]
 gi|410331347|gb|JAA34620.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Pan
           troglodytes]
          Length = 633

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 55/235 (23%), Positives = 106/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G K +              
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KGCTYKGHKTVV------------- 583

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
               T +  W      + L +   K CLS + E    +++ C+ ++  Q+W F +
Sbjct: 584 ----TGEQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWVFSQ 631


>gi|326436017|gb|EGD81587.1| polypeptide N-acetylgalactosaminyltransferase 14 [Salpingoeca sp.
           ATCC 50818]
          Length = 645

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 94/229 (41%), Gaps = 34/229 (14%)

Query: 53  ELRKRSSWLKVFKQI-RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +LR    W+  +K I + + SPL     +G +     +R + +CK F W++++V  +++ 
Sbjct: 442 KLRAAEVWMDEYKDIVKRVMSPLPRGKTLGPLDMMFDIRKRHNCKPFKWYLENVYPEMFV 501

Query: 112 KFPELPPNLFW-GEAKNLGTQKCLDSMG-RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
             P  P ++   GE +N  T  C D++G R   A IG   CH S  +Q F L+  G    
Sbjct: 502 --PYDPDHVVASGEVRNPHTDACFDTLGARHQGARIGAYPCHHSHGTQEFVLSTGGD--- 556

Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
                        + +  +    C+  G           + ++ C  + G     W +  
Sbjct: 557 -------------IRVAAMDFDTCLDRG------NGDGSIGIWPCHQTGGNQK--WNYDP 595

Query: 230 STRQLE----HAILKRCLSVHPETN-QLAMLRCDENNSYQQWRFKEVRP 273
            + +L     H   +   +  P +  +L +  CD  N  QQW FKE  P
Sbjct: 596 KSGRLSDGDGHICAEAQRTATPSSPFKLVLSPCDLTNPAQQWIFKEAEP 644


>gi|281343220|gb|EFB18804.1| hypothetical protein PANDA_020124 [Ailuropoda melanoleuca]
          Length = 275

 Score = 58.2 bits (139), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 38/73 (52%), Gaps = 1/73 (1%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R   +W+  F +      P    L  GDIS Q  LR +L CKDF WFM  VA+DV   +P
Sbjct: 203 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 262

Query: 115 EL-PPNLFWGEAK 126
            + PP   WGE +
Sbjct: 263 PVEPPPAAWGEVR 275


>gi|301775621|ref|XP_002923231.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Ailuropoda melanoleuca]
          Length = 601

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDSFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W++     +VY +       L +GE +N   +  CLD         I
Sbjct: 435 LALRQRLKCRSFKWYLD----NVYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +AEG L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSAEGLLQLG 513


>gi|301608341|ref|XP_002933751.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Xenopus (Silurana) tropicalis]
          Length = 586

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/231 (23%), Positives = 98/231 (42%), Gaps = 49/231 (21%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K +        + +A     GDIS+++ L+  L CK+F+W+++++  +
Sbjct: 389 QVRLAEVWMDDYKVLYYRRNEQAAKIAKEKSFGDISKRLKLKADLQCKNFTWYLENIYPE 448

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD-----SMGRTAPAVIGTSYCHGSGSSQLFRLNA 163
           ++   P+  P  ++G  KN GTQ C+D     + G   P +     CHG G +Q F  + 
Sbjct: 449 MFV--PDRDPT-YYGAIKNEGTQNCIDVGENNNYGSQLPIMYP---CHGMGGNQYFEYST 502

Query: 164 EGQLGHGERCVDADKQEKLVEMLKLPVTFCV---LAGVKLIFCRLGTVLNLFNCFNSSGT 220
             +L H                LK  +  C       VKL+ C+       +   N++  
Sbjct: 503 HKELRHN---------------LKTQLCLCSKYEPGPVKLVDCQ-------YKGQNTTVP 540

Query: 221 VDGPWRFHNSTRQLEHAILKRCLSV---HPETNQLAMLRCDENNSYQQWRF 268
            +  W F    R + +     CL+    HP T +     C+ ++  Q W F
Sbjct: 541 GNEEWEF-TKDRLIRNVGSNMCLTAQGDHPSTAE-----CNPDDIDQLWSF 585


>gi|281340802|gb|EFB16386.1| hypothetical protein PANDA_012334 [Ailuropoda melanoleuca]
          Length = 600

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDSFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W++     +VY +       L +GE +N   +  CLD         I
Sbjct: 435 LALRQRLKCRSFKWYLD----NVYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +AEG L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSAEGLLQLG 513


>gi|154152027|ref|NP_001093795.1| polypeptide N-acetylgalactosaminyltransferase 12 [Bos taurus]
 gi|151553796|gb|AAI48122.1| GALNT12 protein [Bos taurus]
          Length = 579

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/119 (29%), Positives = 57/119 (47%), Gaps = 8/119 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR +L CKDF WF+  V  +++   
Sbjct: 379 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERRQLRARLGCKDFKWFLNTVYPELHV-- 436

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
           PE  P  F+G  +N G +  C D        + G       CHG G +Q F   ++ ++
Sbjct: 437 PEDRPG-FFGMLQNKGLRDYCFDYNPPNEHEITGHQVILYRCHGMGQNQFFEYTSQNEI 494


>gi|153266878|ref|NP_004473.2| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
 gi|209572629|sp|Q14435.2|GALT3_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=Polypeptide GalNAc transferase 3;
           Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 3;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3
 gi|62822129|gb|AAY14678.1| unknown [Homo sapiens]
 gi|109731077|gb|AAI13568.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
           sapiens]
 gi|109731742|gb|AAI13566.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
           sapiens]
 gi|119631729|gb|EAX11324.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3), isoform
           CRA_b [Homo sapiens]
 gi|313883200|gb|ADR83086.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3)
           [synthetic construct]
          Length = 633

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 103/225 (45%), Gaps = 44/225 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +  W      + L +  LK CLS + E    +++ C+ ++  Q+W
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKW 627


>gi|74004468|ref|XP_535940.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
           1 [Canis lupus familiaris]
          Length = 632

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 34/232 (14%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 429 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 488

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  + Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 489 NTIYPEAYV--PDLNP-VISGYIKSIGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 542

Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
           F  +A+ ++ H       + Q++L          C+ A   L+  R  T    +    + 
Sbjct: 543 FEYSAQHEIRH-------NIQKEL----------CLHAAQGLVQLRACT----YKGHRTV 581

Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
            + +  W      + L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 582 ASGEQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 630


>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
 gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
          Length = 442

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 37/126 (29%), Positives = 63/126 (50%), Gaps = 15/126 (11%)

Query: 39  DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDF 98
           DA K+++   +   E+     W+  +K+I     P     DMGD++++  LR  LHC+DF
Sbjct: 325 DAWKKLVHNNRRMAEV-----WMDDYKEIYYRKHPEYRKYDMGDVTQRKLLRKGLHCRDF 379

Query: 99  SWFMKHVAYDVYTKFPELPPNLFWGEAKNLG------TQKCLDSMGRTAPAVIGTSYCHG 152
           SW++ HV   +Y   P++ P +  G+  ++       T  CLD + +      G   CHG
Sbjct: 380 SWYLSHVFPTLYV--PDIRP-IAHGQVSHVTSISSEQTGLCLDVI-KAGKEPAGVFPCHG 435

Query: 153 SGSSQL 158
            G +Q+
Sbjct: 436 KGGTQV 441


>gi|291290949|ref|NP_001167507.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Xenopus
           laevis]
 gi|83405263|gb|AAI10707.1| Unknown (protein for MGC:130697) [Xenopus laevis]
          Length = 622

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 4/128 (3%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVIS  +  L       +  ++ +     + +      GD+S+++ L+  LHCK+F+W++
Sbjct: 420 QVISRNQVRLAEVWMDDYKIIYYRRNDQAAKMVKEKSFGDVSKRLKLKADLHCKNFTWYL 479

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
           +++  +++   P+  P  + G  KN G QKCLD             Y CHG G +Q F  
Sbjct: 480 ENIYPELFV--PDRDPT-YSGAVKNEGAQKCLDVGENNHGGKPLIMYPCHGMGGNQYFEY 536

Query: 162 NAEGQLGH 169
           +   +L H
Sbjct: 537 STHKELRH 544


>gi|1617312|emb|CAA63371.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
           (GalNAc-T3) [Homo sapiens]
          Length = 633

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 103/225 (45%), Gaps = 44/225 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLRCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +  W      + L +  LK CLS + E    +++ C+ ++  Q+W
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKW 627


>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
 gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
          Length = 721

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 54/242 (22%), Positives = 97/242 (40%), Gaps = 60/242 (24%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
           D GD+S++  LR  L CK F W++ ++       +PEL     ++  GE +NLG   + C
Sbjct: 482 DWGDVSDRRKLRTDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 534

Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFR------LNAEGQL------------GHGERC 173
           LD+    +     +GT  CH  G +Q+        L+A+ +             G G   
Sbjct: 535 LDAPAGKKHQKKAVGTYPCHRQGGNQIANVPKGMCLDAKEKSEEETPVSVYECHGQGGNQ 594

Query: 174 VDAD-------------KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTV------------ 208
           V A                E L+    + + +  +     +  + G +            
Sbjct: 595 VSASMSTSSELRKAGGGDSESLIPGFSISLLYGFIFQSYWMLSKAGEIRRDDSCLDYAGK 654

Query: 209 -LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWR 267
            + LF C    G  +  W +  +T+QL H    +CL++    ++L M  C+++ + QQW 
Sbjct: 655 DVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLLMEECNQSLNRQQWT 712

Query: 268 FK 269
            +
Sbjct: 713 LE 714


>gi|315139010|ref|NP_001186711.1| POC1B-GALNT4 protein isoform 2 [Homo sapiens]
 gi|194387348|dbj|BAG60038.1| unnamed protein product [Homo sapiens]
          Length = 406

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 81/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 209 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 266

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P    G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 267 EDRPGWH-GAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 325

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 326 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 384


>gi|405967231|gb|EKC32417.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 570

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 42/198 (21%), Positives = 80/198 (40%), Gaps = 33/198 (16%)

Query: 76  MFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLD 135
           +F D GD++++  LR++L C  F WF+K+V  D++     +       +AK +     +D
Sbjct: 399 LFGDYGDVTDRKKLRERLQCHSFDWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVD 458

Query: 136 SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
           +     P  +    CH  G +Q + L+  G++   + C+D    E ++            
Sbjct: 459 NHNYHKP--VNMWPCHNQGGNQYWMLSKNGEIRRDDGCLDYSGGESVI------------ 504

Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAML 255
                          ++ C    G  +  +R  NS   + HA  ++C+    +  +L M 
Sbjct: 505 ---------------VYPCHGQKGNQEWQYREDNS---IYHANTQKCMETSVDGQKLTMK 546

Query: 256 RCDENNSYQQWRFKEVRP 273
            C   +  Q W +K   P
Sbjct: 547 TCTGIDR-QIWTWKRKSP 563


>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
           [Anolis carolinensis]
          Length = 592

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 58/242 (23%), Positives = 94/242 (38%), Gaps = 62/242 (25%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K++    +P A     GD++E+  LR+KL CKDF W++ ++       +
Sbjct: 390 VRAAEVWMDSYKELYYHRNPHARMEPYGDVTERRLLREKLKCKDFKWYLDNI-------Y 442

Query: 114 PEL--PPNL--FWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAE 164
           PEL  P +   ++G  KN G    C D        + G       CHG G +Q F   + 
Sbjct: 443 PELHVPEDRLGYFGMLKNKGMANFCFDYNPPNEHDITGHVVILYPCHGMGQNQFFEYTSY 502

Query: 165 GQL----GHGERCVDADKQEKLVEML-------KLPVTFCVLAGVKLIFCRLGTVLNL-- 211
            ++     H E C   D     ++M        K+P         K +    GT++++  
Sbjct: 503 HEIRYNTRHPEACAAVDPGTDYLKMYLCQDDIQKIP------ENQKFMLREDGTLVHMQT 556

Query: 212 FNCFNS-SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
             C  + S   +G                          N   +LR   N+ YQ+W FKE
Sbjct: 557 LKCLQAESNAYNG--------------------------NPAPVLRPCSNSEYQKWFFKE 590

Query: 271 VR 272
            R
Sbjct: 591 KR 592


>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
          Length = 626

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 11/135 (8%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K++  +       +D+GD S +  LR  L CK F W++++V  D   KF     +
Sbjct: 434 WMDDYKELFYMHRGDLKTIDIGDTSARKKLRKDLKCKSFKWYLENVLPD---KFIMTEHS 490

Query: 120 LFWGEAKN--LGTQKCLDSMGRTA--PAVIGTSYCHGS-GSSQLFRLNAEGQLGHGERCV 174
           L +G   N   G Q CLD++ R    P  +G   CH     SQ+F L+  GQL   E C 
Sbjct: 491 LGYGRVMNDAFGKQLCLDNLQRNEDQPYNLGQYPCHAQMAMSQVFALSKLGQLRREESCA 550

Query: 175 DADKQ---EKLVEML 186
           +       E  V+ML
Sbjct: 551 EVQDNVSAEAPVKML 565


>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
           laevis]
 gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
          Length = 582

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 57/228 (25%), Positives = 96/228 (42%), Gaps = 44/228 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K++    +P A   + GDISE+  LR++L CK F W++K V  +++   P
Sbjct: 385 RAAEVWMDGYKELFYNRNPPAQKENYGDISERKLLRERLQCKSFDWYLKKVFPELH--IP 442

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP      A +    CHG G +Q F    + ++
Sbjct: 443 EDRPG-WHGAVRSMGISSECLDY---NAPEHNPTGAHLSLFGCHGQGGNQFFEYTTKREI 498

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
                 E C +   Q+  + M+      C   G                   +S   +  
Sbjct: 499 RFNSVTELCAEVPDQQTYIGMMH-----CTKDG-------------------NSVPKNIA 534

Query: 225 WRFHNSTRQLEHAILKRCLSVH--PETN-QLAMLRCDENNSYQQWRFK 269
           W F N    + H     C+S +  PE    + M  CD ++  Q W+F+
Sbjct: 535 WEFRNDG-TIYHPSSGMCISSYRTPEGRADVQMKGCDPSDKNQLWKFE 581


>gi|440909764|gb|ELR59641.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Bos grunniens
           mutus]
          Length = 604

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDGFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W++ +    VY +       L +GE +N   +  CLD         I
Sbjct: 435 LALRQRLKCRSFKWYLDN----VYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +AEG L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSAEGLLQLG 513


>gi|410968769|ref|XP_003990872.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3 [Felis catus]
          Length = 633

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  + Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NTIYPEAYV--PDLNP-VISGYIKSIGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G K +              
Sbjct: 544 FEYSAQREIRHNIQKELCLHA--AQGLVQLRA-----CAYKGHKTV-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
            +SG  +  W      + L +  LK CLS + E    +++ C+ ++  Q+W F +
Sbjct: 583 -ASG--EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631


>gi|291190646|ref|NP_001167159.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
 gi|223648406|gb|ACN10961.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
          Length = 560

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  FK       P A  +  G+I  ++ ++ +L C+ F W++++V       +P
Sbjct: 372 RAAEVWMDEFKNFYYAAVPSARNVPYGNIQSRMEMKKRLGCQPFKWYLENV-------YP 424

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G  +  G   CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 425 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 482

Query: 171 ERCV 174
           + C+
Sbjct: 483 DLCL 486


>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
 gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
          Length = 621

 Score = 57.4 bits (137), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 65/233 (27%), Positives = 98/233 (42%), Gaps = 40/233 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVIS  +  L       + ++F +     + +A     GDI+E+  LR++LHCK+F+W++
Sbjct: 421 QVISRNQVRLAEVWMDDYKEIFYRRNQQAAQMAREKTYGDITERRRLRERLHCKNFTWYL 480

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
           ++V  +++   P+L P  + G  KN GT  CLD             Y CHG G +Q F  
Sbjct: 481 QNVYPEMFV--PDLNPTSY-GAIKNEGTNSCLDVGENNHGGKPLIMYPCHGMGGNQYFEY 537

Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAG---VKLIFCRLGTVLNLFNCFNSS 218
             +  L H     +  KQ             C+ AG   V+L  CR       F    + 
Sbjct: 538 TTQRDLRH-----NVGKQ------------LCLRAGAGPVQLGECR-------FRGQPTR 573

Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSV---HPETNQLAMLRCDENNSYQQWRF 268
              D  W          HA  + CL+    HP     AM  CD ++ +Q W F
Sbjct: 574 VPPDEEWELTQDHLIKNHAS-RMCLTARGKHP-----AMAPCDPSDPHQLWSF 620


>gi|169790926|ref|NP_001116108.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Homo
           sapiens]
          Length = 603

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 433

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 489

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512


>gi|143811393|sp|Q9HCQ5.3|GALT9_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 9;
           AltName: Full=Polypeptide GalNAc transferase 9;
           Short=GalNAc-T9; Short=pp-GaNTase 9; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 9;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9
          Length = 603

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 433

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 489

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512


>gi|193786287|dbj|BAG51570.1| unnamed protein product [Homo sapiens]
          Length = 460

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 231 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 290

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 291 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 346

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 347 LYP-CHGM-SSQLVRYSADGLLQLG 369


>gi|38327556|ref|NP_068580.2| polypeptide N-acetylgalactosaminyltransferase 9 isoform B [Homo
           sapiens]
 gi|21360816|gb|AAM49722.1|AF458594_1 hypothetical protein [Homo sapiens]
 gi|127802610|gb|AAH93819.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9 (GalNAc-T9) [Homo
           sapiens]
 gi|127802793|gb|AAH93817.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9 (GalNAc-T9) [Homo
           sapiens]
 gi|313882776|gb|ADR82874.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9 (GalNAc-T9)
           (GALNT9), transcript variant B [synthetic construct]
          Length = 237

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 8   RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 67

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 68  LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 123

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 124 LYP-CHGM-SSQLVRYSADGLLQLG 146


>gi|432891064|ref|XP_004075530.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 3-like
           [Oryzias latipes]
          Length = 529

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 65/146 (44%), Gaps = 20/146 (13%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
           LR    W+  +K    L   + M    +D GDIS++V LR  L CK F W++     +VY
Sbjct: 330 LRVAEVWMDEYKSNVYLAWNIPMENHGIDYGDISQRVVLRKNLQCKSFQWYLD----NVY 385

Query: 111 TKFPELPPNLFWGEAKNLG-TQKCLDS-MGRTAPAVIGTSYCHGSG--------SSQLFR 160
            +       LF+GE +N   +  C+D  M     A++    CHG G         SQLF 
Sbjct: 386 PEMRRYNNTLFYGEIRNSKVSHLCMDQGMKENHTAILHP--CHGWGPQLGRYTKDSQLF- 442

Query: 161 LNAEGQLGHGERCVDADKQEKLVEML 186
           L   G  G   RCV  D+     ++L
Sbjct: 443 LGPLGSTGEDTRCVVDDQVSSFPQLL 468


>gi|10336504|dbj|BAB13699.2| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 603

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 433

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 489

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512


>gi|410912128|ref|XP_003969542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Takifugu rubripes]
          Length = 558

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ L+ ++ CK F W++++V       +P
Sbjct: 370 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENV-------YP 422

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G  +  G   CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 423 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 480

Query: 171 ERCV 174
           + C+
Sbjct: 481 DLCL 484


>gi|189066640|dbj|BAG36187.1| unnamed protein product [Homo sapiens]
          Length = 633

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 102/225 (45%), Gaps = 44/225 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           VY   P+L P +  G  K+ G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSAGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A   + LV++       C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
           +  W      + L +  LK CLS + E    +++ C+ ++  Q+W
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKW 627


>gi|334326859|ref|XP_001378345.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
           [Monodelphis domestica]
          Length = 603

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 374 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 433

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       + +GE +N   +  CLD         I
Sbjct: 434 LALRQRLQCRSFKWYLE----NVYPEMRIYNNTITYGEVRNSKASGYCLDQGAEDDDRAI 489

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +AEG L  G
Sbjct: 490 LYP-CHGM-SSQLVRYSAEGLLQLG 512


>gi|350592396|ref|XP_003483457.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Sus scrofa]
          Length = 217

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 8   RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDGFKSHVYMAWNIPMTNPGVDFGDVSER 67

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W++ +    VY +       L +GE +N   +  CLD         I
Sbjct: 68  LALRQRLKCRSFKWYLDN----VYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 123

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +AEG L  G
Sbjct: 124 LYP-CHGM-SSQLVRYSAEGLLQLG 146


>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
           catus]
          Length = 582

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 46/164 (28%), Positives = 78/164 (47%), Gaps = 11/164 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 385 RAAEVWMDQYKEHFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHV--P 442

Query: 115 ELPPNLFWGEAKNLG-TQKCLD-SMGRTAPAVIGTSY--CHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD +   + P     S   CHG G +Q F   +  ++   
Sbjct: 443 EDRPG-WHGAIRSMGISSECLDYNSPDSNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 501

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLN 210
              E C +  +Q+  V M   P   F + A +   F   GT+ +
Sbjct: 502 SVTELCAEVPEQKNYVGMQNCPKDGFPIPANIIWHFEEDGTIFH 545


>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           2 [Sus scrofa]
          Length = 578

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 59/226 (26%), Positives = 96/226 (42%), Gaps = 38/226 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSIGISSECLDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
              E C +  +Q+  V M   P     +              N+   F   GT+     F
Sbjct: 498 SVTELCAEVPEQKNHVGMQNCPKDGSPIP------------TNIIWHFKEDGTI-----F 540

Query: 228 HNSTRQLEHAILKRCLSVH--PETNQLAMLR-CDENNSYQQWRFKE 270
           H       H+ L  CLS +  PE      +R CD  +  Q W+F++
Sbjct: 541 H------PHSGL--CLSAYRTPEGRPDVQMRTCDAADKNQIWKFEK 578


>gi|119618953|gb|EAW98547.1| hCG31690 [Homo sapiens]
          Length = 477

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 231 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 290

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 291 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 346

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 347 LYP-CHGM-SSQLVRYSADGLLQLG 369


>gi|397487071|ref|XP_003814635.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Pan
           paniscus]
          Length = 603

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 433

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 489

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512


>gi|157114760|ref|XP_001652408.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
 gi|108883561|gb|EAT47786.1| AAEL001151-PA [Aedes aegypti]
          Length = 592

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 93/235 (39%), Gaps = 51/235 (21%)

Query: 44  VISTQKSSLE---LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDF 98
           +I T+K  +    LR    W+  +KQI     G P     ++GD+S +  +R+K  CK F
Sbjct: 381 LIQTKKDVVRANSLRLAEVWMDEYKQIIFDIYGLPRYPVEEIGDVSHRKQIREKAKCKTF 440

Query: 99  SWFMKHVAYDVYTKFPELPPNL----FWGEAKN--LGTQKCLDSMGRTAPAVIGTSYCHG 152
            ++       V   FPE+   +    F GE KN  LG   CL+    T    + T  C  
Sbjct: 441 KYY-------VQAAFPEMNNPMVEGAFHGEVKNMALGNDTCLEYQLDTNTVRMAT--CDH 491

Query: 153 SGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLF 212
             + Q +  N   +L   + C+D                              G  + ++
Sbjct: 492 QETGQFWAHNYYQELNSHKHCLDY----------------------------TGDTMGVY 523

Query: 213 NCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLR-CDENNSYQQW 266
            C  S G  +  W++   T+Q++     +CL +  ET +  ++  CDE    Q+W
Sbjct: 524 GCHRSRG--NQAWQYVKKTKQIKSVKHGKCLGLSLETMKTLLVEDCDEKKESQKW 576


>gi|169790950|ref|NP_001116116.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Rattus
           norvegicus]
 gi|149063717|gb|EDM14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9 (predicted), isoform
           CRA_b [Rattus norvegicus]
          Length = 604

 Score = 57.0 bits (136), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 16/148 (10%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
           +ALR KL C+ F W++++V       +PE+      L +GE +N   +  CLD       
Sbjct: 435 LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 487

Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
             I    CHG  SSQL R +A+G L  G
Sbjct: 488 RAILYP-CHGM-SSQLVRYSADGLLQLG 513


>gi|449279337|gb|EMC86972.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Columba livia]
          Length = 519

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 290 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 349

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       + +GE +N   +  CLD         I
Sbjct: 350 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 405

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R ++EG L  G
Sbjct: 406 LYP-CHGM-SSQLVRYSSEGVLQLG 428


>gi|301608339|ref|XP_002933739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
           [Xenopus (Silurana) tropicalis]
          Length = 622

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 56/93 (60%), Gaps = 8/93 (8%)

Query: 80  MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGR 139
            GDIS+++ L+  L CK+F+W+++++  +++   P+  P  ++G+ KN G+Q CLD+  +
Sbjct: 457 FGDISKRLKLKADLQCKNFTWYLENIYPEMFV--PDRDPT-YYGKVKNEGSQNCLDAGEK 513

Query: 140 T---APAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
                P ++  + C+G G +Q F  +   +L H
Sbjct: 514 NHGGKPLIM--NLCNGMGGTQYFEYSTHKELRH 544


>gi|449476915|ref|XP_004176604.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 9 [Taeniopygia
           guttata]
          Length = 604

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       + +GE +N   +  CLD         I
Sbjct: 435 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R ++EG L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSSEGVLQLG 513


>gi|354479146|ref|XP_003501774.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Cricetulus griseus]
          Length = 610

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 16/148 (10%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 381 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 440

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
           +ALR KL C+ F W++++V       +PE+      L +GE +N   +  CLD       
Sbjct: 441 LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 493

Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
             I    CHG  SSQL R +A+G L  G
Sbjct: 494 RAILYP-CHGM-SSQLVRYSADGLLQLG 519


>gi|405977048|gb|EKC41520.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
          Length = 635

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 65/128 (50%), Gaps = 14/128 (10%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K++        +  D GDISE+V LR +L+CK F WF+ +V       +
Sbjct: 439 MRLAEVWMDEYKRLFYTHRKDLLGQDYGDISERVELRKRLNCKSFKWFLDNV-------Y 491

Query: 114 PE--LPPNLF--WGEAKNLGTQKCLDSMGRTAPAV--IGTSYC-HGSGSSQLFRLNAEGQ 166
           PE  +P      WG  +N  +  CLD++ +    V  +G   C +G+ ++++F L+   +
Sbjct: 492 PEKFIPDENVHAWGMVRNPPSNLCLDTLQKDEKTVFDMGIYSCQNGASANEVFSLSINDE 551

Query: 167 LGHGERCV 174
           L   E C+
Sbjct: 552 LRREEACL 559


>gi|360043880|emb|CCD81426.1| putative n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 526

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 51/225 (22%), Positives = 92/225 (40%), Gaps = 36/225 (16%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+      P    +D GD+S+Q  LR +L CK F WFM  VA+D+   +P     
Sbjct: 318 WMDEYKEYIYRSLPECRKVDPGDLSQQHNLRKRLQCKSFKWFMTEVAFDLTKAYPPPEEV 377

Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER- 172
           LF  GE +++     C+D+   T  + +  S+C     G    Q F  + +  +   +R 
Sbjct: 378 LFATGEIRSVAFPYLCIDAARATKRSPVKLSFCSAISKGHNYMQSFEYSFKQDIKAVKRS 437

Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGT----VDGPWRF 227
            C+D    +        PV F                   + C   +G     +   +R 
Sbjct: 438 LCLDVPDAKP-----NKPVLF-------------------YPCHGQNGNQHWHLQSAYRN 473

Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
           + +   + H+    CL ++     + + RC  N   Q+W ++++R
Sbjct: 474 NENPVIIAHSTSGLCLDLNLRGLSVFVRRCSSNLRTQRWNWRDLR 518


>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
          Length = 575

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 378 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 435

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 436 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 494

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 495 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 553


>gi|157820525|ref|NP_001100621.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform B [Rattus
           norvegicus]
 gi|149063716|gb|EDM14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9 (predicted), isoform
           CRA_a [Rattus norvegicus]
          Length = 237

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 16/148 (10%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 8   RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
           +ALR KL C+ F W++++V       +PE+      L +GE +N   +  CLD       
Sbjct: 68  LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 120

Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
             I    CHG  SSQL R +A+G L  G
Sbjct: 121 RAILYP-CHGM-SSQLVRYSADGLLQLG 146


>gi|355753170|gb|EHH57216.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
           fascicularis]
          Length = 542

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 53/228 (23%), Positives = 89/228 (39%), Gaps = 49/228 (21%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  FK++    +P A     GD++E+  LR KL CKDF WF++ V  +++   
Sbjct: 350 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 407

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLGH 169
           PE  P  F           C D        ++G       CHG G +Q F   ++ ++ +
Sbjct: 408 PEDRPGFF--------GMYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIRY 459

Query: 170 G----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTVD 222
                E C+                   V AG+ ++    C      N        G++ 
Sbjct: 460 NTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL- 500

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
               FH  +++   A  K         + + +LR   N+ +Q+W FKE
Sbjct: 501 ----FHEQSKKCVQAARKES-----SDSFVPLLRDCTNSDHQKWFFKE 539


>gi|401407036|ref|XP_003882967.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1 [Neospora caninum
           Liverpool]
 gi|325117383|emb|CBZ52935.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1 [Neospora caninum
           Liverpool]
          Length = 762

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 13/112 (11%)

Query: 53  ELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
           +LR    W+  + +I  L  P L     +GD++E  ALRD+L CKDF+W++++V      
Sbjct: 554 KLRAAHVWMGEYARIVELVVPRLPQGQSLGDLTELKALRDRLQCKDFNWYLENV------ 607

Query: 112 KFPEL-PPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
            +PEL PPN+     G  +N     C+D++  T    IG   CH    +Q F
Sbjct: 608 -YPELEPPNVAHALTGAMRNPKFNCCVDTL-TTKNQEIGVYPCHFEHGTQAF 657


>gi|344246331|gb|EGW02435.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Cricetulus
           griseus]
          Length = 557

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 328 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 387

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR KL C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 388 LALRQKLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 443

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 444 LYP-CHGM-SSQLVRYSADGLLQLG 466


>gi|301603772|ref|XP_002931524.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 2-like [Xenopus
           (Silurana) tropicalis]
          Length = 569

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ L+  L CK F W++++V       +P
Sbjct: 381 RAAEVWMDDYKNFYYAAVPSARNVPYGNIQSRLELKKNLGCKPFKWYLENV-------YP 433

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 434 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 491

Query: 171 ERCV 174
           + C+
Sbjct: 492 DLCL 495


>gi|345791109|ref|XP_543349.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Canis
           lupus familiaris]
          Length = 600

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 371 RVAHIERSKKPYNNDIDYYAKRNALRAAEVWMDSFKSHVYMAWNIPMTNPGVDFGDVSER 430

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 431 LALRQRLKCRSFKWYLE----NVYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDHAI 486

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R + EG L  G
Sbjct: 487 LYP-CHGM-SSQLVRYSTEGLLQLG 509


>gi|47217176|emb|CAG11012.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 598

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ L+ ++ CK F W++++V       +P
Sbjct: 409 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENV-------YP 461

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G  +  G   CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 462 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 519

Query: 171 ERCV 174
           + C+
Sbjct: 520 DLCL 523


>gi|432852860|ref|XP_004067421.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
           [Oryzias latipes]
          Length = 556

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 59/124 (47%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ ++ +L CK F W++ +V       +P
Sbjct: 368 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLDNV-------YP 420

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +G  +  G   CLD++G  A  V+G   CH +G +Q + L  +  + H 
Sbjct: 421 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGIYECHNAGGNQEWALTKDKSVKHM 478

Query: 171 ERCV 174
           + C+
Sbjct: 479 DLCL 482


>gi|402911950|ref|XP_003918561.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Papio anubis]
          Length = 237

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 8   RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 68  LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 123

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 124 LYP-CHGM-SSQLVRYSADGLLQLG 146


>gi|380813218|gb|AFE78483.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Macaca
           mulatta]
          Length = 603

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 433

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 489

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512


>gi|62148926|dbj|BAD93347.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-3 [Rattus
           norvegicus]
          Length = 633

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 104/235 (44%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NTIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGDKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A +    V  LK     CV  G + I              
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGRRTI-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                 +  W      + L + + + CLS + E    +++ CD  +  Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFRMCLSSNGE--HPSLVPCDTTDLLQKWIFSQ 631


>gi|410922253|ref|XP_003974597.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Takifugu rubripes]
          Length = 604

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 64/146 (43%), Gaps = 10/146 (6%)

Query: 29  GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
            ++ H ER        I        LR    W+  +K    +   + M    +D GD+SE
Sbjct: 374 ARVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEYKSHVYMAWNIPMNNPGVDFGDVSE 433

Query: 86  QVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAV 144
           ++ALR +L C+ F W+++H    VY +       + +GE +N   +  CLD         
Sbjct: 434 RLALRKRLQCRSFRWYLEH----VYPEMRVYNNTITYGEVRNSKASGYCLDQGAEDDDKA 489

Query: 145 IGTSYCHGSGSSQLFRLNAEGQLGHG 170
           I    CHG  SSQL R + EG L  G
Sbjct: 490 ILYP-CHGM-SSQLARYSTEGLLQLG 513


>gi|334330196|ref|XP_003341314.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 3-like [Monodelphis
           domestica]
          Length = 631

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 104/227 (45%), Gaps = 44/227 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  FK+I     +  + +      GDIS+++ +R +L CK+F+W++ ++  +
Sbjct: 434 QVRLAEVWMDEFKEIFYRRNTEAAKIVKQKAYGDISKRLDIRHRLQCKNFTWYLNNIYPE 493

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
           +Y   P+L P +  G  +N+G   CLD +G       P ++ T  CH  G +Q F  + +
Sbjct: 494 IYV--PDLNP-VISGYIQNIGRHLCLD-VGENNQGGKPLIMYT--CHFLGGNQYFEXSEQ 547

Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
            ++ H      + ++E  +  L+ PV    C   G K                  + TVD
Sbjct: 548 HEIRH------SIQKELCLHALQGPVQMKACSYKGQK------------------TFTVD 583

Query: 223 -GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
              W        L +  L RCL+ + E    +++ C+ ++ +Q+W F
Sbjct: 584 EQKWEIRKD-HLLYNPSLHRCLTGNGE--HPSLVPCNPSDVFQKWDF 627


>gi|193786146|dbj|BAG51429.1| unnamed protein product [Homo sapiens]
          Length = 354

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 66/249 (26%), Positives = 98/249 (39%), Gaps = 36/249 (14%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 125 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 184

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 185 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 240

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL 205
               CHG  SSQL R +A+G L  G           L     LP + C      L+    
Sbjct: 241 -LYPCHGM-SSQLVRYSADGLLQLG----------PLGSTAFLPDSKC------LVDDGT 282

Query: 206 GTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN-----QLAMLRCDEN 260
           G +  L  C + +      W F  S   +  A   RCL V    +     +L + RC   
Sbjct: 283 GRMPTLKKCEDVARPTQRLWDFTQSGPIVSRAT-GRCLEVEMSKDANFGLRLVVQRC--- 338

Query: 261 NSYQQWRFK 269
            S Q+W  +
Sbjct: 339 -SGQKWMIR 346


>gi|426374733|ref|XP_004054218.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Gorilla gorilla gorilla]
          Length = 237

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 8   RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 68  LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 123

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 124 LYP-CHGM-SSQLVRYSADGLLQLG 146


>gi|355564838|gb|EHH21338.1| hypothetical protein EGK_04374, partial [Macaca mulatta]
          Length = 524

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 295 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 354

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 355 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 410

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 411 LYP-CHGM-SSQLVRYSADGLLQLG 433


>gi|355786668|gb|EHH66851.1| hypothetical protein EGM_03916, partial [Macaca fascicularis]
          Length = 524

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 295 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 354

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 355 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 410

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 411 LYP-CHGM-SSQLVRYSADGLLQLG 433


>gi|354487360|ref|XP_003505841.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
           [Cricetulus griseus]
          Length = 633

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/235 (24%), Positives = 103/235 (43%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             V  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NTVYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A +    V  LK     CV  G + +              
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTV-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                 +  W      + L + + K CLS   E   L  + CD ++  Q+W F +
Sbjct: 583 ---APGEQIWDIQKG-QLLYNPLFKMCLSATGEHPNL--VPCDTSDLLQKWIFSQ 631


>gi|51490862|emb|CAD44536.1| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
          Length = 168

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 91/194 (46%), Gaps = 40/194 (20%)

Query: 80  MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGR 139
            GD+S++  ++ +L CK+F+W++ ++  +VY   P+L P +  G  K++G   CLD +G 
Sbjct: 2   FGDLSKRFEIKHRLRCKNFTWYLNNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGE 57

Query: 140 T----APAVIGTSYCHGSGSSQLFRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF 192
                 P ++ T  CHG G +Q F  +A+ ++ H    E C+ A   + LV++       
Sbjct: 58  NNQGGKPLIMYT--CHGLGGNQYFEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KA 108

Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
           C   G K +                  T +  W      + L +  LK CLS + E    
Sbjct: 109 CTYKGHKTVV-----------------TGEQIWEIQKD-QLLYNPFLKMCLSANGE--HP 148

Query: 253 AMLRCDENNSYQQW 266
           +++ C+ ++  Q+W
Sbjct: 149 SLVSCNPSDPLQKW 162


>gi|395513519|ref|XP_003760971.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
           [Sarcophilus harrisii]
          Length = 570

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 341 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 400

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       + +GE +N   +  CLD         I
Sbjct: 401 LALRQRLQCRSFKWYLE----NVYPEMRIYNNTITYGEVRNSKASGYCLDQGAEDDDRAI 456

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +AEG L  G
Sbjct: 457 LYP-CHGM-SSQLVRYSAEGLLQLG 479


>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Meleagris gallopavo]
          Length = 579

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 69/142 (48%), Gaps = 10/142 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A   + GDISE+  LR++L CK F+W++++V  +++   P
Sbjct: 382 RAAEVWMDEYKEHFYNRNPPARKENYGDISERKLLRERLKCKSFNWYLRNVFSELHV--P 439

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSM---GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD +        A +    CHG G +Q F   +  +    
Sbjct: 440 EDRPG-WHGAVRSVGISSECLDYVLPEHNPTGAHLSLFGCHGQGGNQFFEYTSNKEFRFN 498

Query: 171 ---ERCVDADKQEKLVEMLKLP 189
              E C +  +QE  + M   P
Sbjct: 499 SVTELCAEVPEQEDFIGMRSCP 520


>gi|405967230|gb|EKC32416.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
          Length = 347

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 41/195 (21%), Positives = 78/195 (40%), Gaps = 33/195 (16%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
           D GD++++  LR++L C  F WF+K+V  D++     +       +AK +     +D+  
Sbjct: 179 DYGDVTDRKKLRERLQCHSFDWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHN 238

Query: 139 RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGV 198
              P  +    CH  G +Q + L+  G++   + C+D    E ++               
Sbjct: 239 YHKP--VNMWPCHNQGGNQYWMLSKNGEIRRDDGCLDYSGGESVI--------------- 281

Query: 199 KLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCD 258
                       ++ C    G  +  +R  NS   + HA  ++C+    +  +L M  C 
Sbjct: 282 ------------VYPCHGQKGNQEWQYREDNS---IYHANTQKCMETSVDGQKLTMKTCT 326

Query: 259 ENNSYQQWRFKEVRP 273
             +  Q W +K   P
Sbjct: 327 GIDR-QIWTWKRKSP 340


>gi|348550781|ref|XP_003461209.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Cavia porcellus]
          Length = 604

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 435 LALRQRLKCRSFRWYLE----NVYPEMRVYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSADGLLQLG 513


>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           1 [Sus scrofa]
 gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Sus scrofa]
          Length = 582

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 95/226 (42%), Gaps = 38/226 (16%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 385 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHV--P 442

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 443 EDRPG-WHGAIRSIGISSECLDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIRFN 501

Query: 171 ---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
              E C +  +Q+  V M   P     +              N+   F   GT+  P   
Sbjct: 502 SVTELCAEVPEQKNHVGMQNCPKDGSPIP------------TNIIWHFKEDGTIFHP--- 546

Query: 228 HNSTRQLEHAILKRCLSVH--PETNQLAMLR-CDENNSYQQWRFKE 270
                   H+ L  CLS +  PE      +R CD  +  Q W+F++
Sbjct: 547 --------HSGL--CLSAYRTPEGRPDVQMRTCDAADKNQIWKFEK 582


>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
 gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
          Length = 494

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 13/124 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P+A     G I  +  LR +L CK F W++++V       +P
Sbjct: 305 RTAEVWMDEYKRFYYAARPMARSALYGSIKSRKELRKRLQCKPFKWYLQNV-------YP 357

Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           EL      ++ +GE K    + CLD++G  A   +G   CHG   +Q + L  +  + H 
Sbjct: 358 ELQIPDSQDVSFGELKQ--GKSCLDTLGSQAGGSVGMFDCHGQAGNQEWALTKKSTVRHL 415

Query: 171 ERCV 174
           + C+
Sbjct: 416 DLCL 419


>gi|324522446|gb|ADY48063.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Ascaris suum]
          Length = 210

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 73/153 (47%), Gaps = 26/153 (16%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           LR  + WL  + +      P    ++ GDISE  A+R++LHCK F W+++++  ++    
Sbjct: 6   LRAANVWLDEYLEKFYEAKPYLRGVNFGDISEMKAIRERLHCKPFKWYLENIYPELLPN- 64

Query: 114 PELP-------PNLFW---GEAKNL---GTQKCLDS------MGRTAPAVIGTSYCHGSG 154
             LP       PN +W   G+  ++   GT  CL        + R A A++    CH S 
Sbjct: 65  -NLPSRESLQRPNPYWKSTGDKYHIKLHGTTLCLAGESNNGRISRGARAIL--ERCHKSA 121

Query: 155 SSQLFRLNAEGQL---GHGERCVDADKQEKLVE 184
             Q++R    G+L   G    C+D+ K  +L++
Sbjct: 122 RQQMWRWTKLGELRPMGSSSLCLDSLKGPRLLK 154


>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
           N-acetylgalactosaminyltransferase 4 [Equus caballus]
          Length = 703

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 19/182 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR +L CK F W++K+V  +++   P
Sbjct: 506 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRKRLKCKSFDWYLKNVFSNLHV--P 563

Query: 115 ELPPNLFWGEAKNLGT-QKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G   +CLD     AP      A +    CHG G +Q F   ++ ++
Sbjct: 564 EDRPG-WHGAIRSMGIPSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEI 619

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
                 E C +  +Q+  V M   P   F + A +   F   GT+ +  +  C ++  T 
Sbjct: 620 RFNSVTELCAEVPEQKNHVGMQNCPKDGFSIPANIIWHFKEDGTIFHPHSGQCLSAYWTP 679

Query: 222 DG 223
           +G
Sbjct: 680 EG 681


>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
           abelii]
          Length = 578

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
           troglodytes]
 gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
           paniscus]
 gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
 gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
           troglodytes]
          Length = 578

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
           gorilla gorilla]
          Length = 578

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|326923175|ref|XP_003207815.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Meleagris gallopavo]
          Length = 709

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 78/200 (39%), Gaps = 51/200 (25%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLF--WGEAKNLGTQKCLDS 136
           ++GD+S+Q ALR++L C+ F W++++V       +P+L   L    G   N+ T +C+ +
Sbjct: 545 ELGDLSQQKALRERLRCRSFRWYLENV-------YPDLAAPLVKAGGLLVNVATARCIAA 597

Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
              +    +    C  +  +Q F       L HGE CV                      
Sbjct: 598 QNTS----LALQACDVNNKAQHFNYTWLRLLRHGELCV---------------------- 631

Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ---LEHAILKR-----CLSVHPE 248
                   +G    L  C       D  WR  +   +    EH +++      CL VHP 
Sbjct: 632 ------VPMGAAPGLRPC--GGWESDLAWRHSSMAARPELAEHLVVQHQHPPTCLLVHPV 683

Query: 249 TNQLAMLRCDENNSYQQWRF 268
              L +  C   N YQQWRF
Sbjct: 684 LLALRLGACSATNRYQQWRF 703


>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
 gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
           AltName: Full=Polypeptide GalNAc transferase 4;
           Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 4;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 4
 gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
           sapiens]
          Length = 578

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
           sapiens]
 gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
 gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
           [synthetic construct]
          Length = 578

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
 gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
           norvegicus]
 gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
          Length = 633

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/235 (23%), Positives = 104/235 (44%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  +VY   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NTIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGDKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A +    V  LK     CV  G + I              
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                 +  W      + L + + + CLS + E    +++ CD  +  Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFRMCLSSNGE--HPSLVPCDTTDLLQKWIFSQ 631


>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Oryctolagus cuniculus]
          Length = 579

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A   D GDISE+  LR++L CK F W++K+V   ++   P
Sbjct: 382 RAAEVWMDDYKEHFYNRNPPARKEDYGDISERKLLRERLKCKSFDWYLKNVFSSLHV--P 439

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 440 EDRPG-WHGAIRSKGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 498

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +++  V M   P   F + A +   F   GT+ +  +  C ++  T +G
Sbjct: 499 SVTELCAEVPEKKNHVGMQNCPKDGFPIPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 557


>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
           anubis]
          Length = 578

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 46/164 (28%), Positives = 75/164 (45%), Gaps = 11/164 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSKGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLN 210
              E C +  +Q+  V M   P   F V A +   F   GT+ +
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFLVPANIIWHFKEDGTIFH 541


>gi|348516166|ref|XP_003445610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
           [Oreochromis niloticus]
          Length = 604

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 16/149 (10%)

Query: 29  GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
            ++ H ER        I        LR    W+  +K    +   + M    +D GD+SE
Sbjct: 374 ARVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEYKSHVYMAWNIPMNNPGVDFGDVSE 433

Query: 86  QVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTA 141
           ++ALR +L C+ F W+++HV       +PE+      + +GE +N   +  CLD      
Sbjct: 434 RLALRKRLQCRSFRWYLEHV-------YPEMRIYNNTITYGEVRNSKASGYCLDQGSEDD 486

Query: 142 PAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
              I    CHG  SSQL R + EG L  G
Sbjct: 487 DKAILYP-CHGM-SSQLARYSTEGLLQLG 513


>gi|351698595|gb|EHB01514.1| Polypeptide N-acetylgalactosaminyltransferase 9, partial
           [Heterocephalus glaber]
          Length = 519

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 290 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMTSPGVDFGDVSER 349

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 350 LALRQRLKCRSFRWYLE----NVYPEMRVYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 405

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 406 LYP-CHGM-SSQLVRYSADGLLQLG 428


>gi|296213304|ref|XP_002753213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
           [Callithrix jacchus]
          Length = 311

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/252 (26%), Positives = 100/252 (39%), Gaps = 42/252 (16%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 82  RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 141

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
           +ALR KL C+ F W++++V       +PE+      L +GE +N   +  CLD       
Sbjct: 142 LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 194

Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIF 202
             I    CHG  SSQL R +A+G L  G           L     LP + C++   +   
Sbjct: 195 RAI-LYPCHGM-SSQLVRYSADGLLQLG----------PLGSTAFLPDSKCLVDDGR--- 239

Query: 203 CRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN-----QLAMLRC 257
              G    L  C + +      W F  S   +  A   RCL V    +     +L + RC
Sbjct: 240 ---GRTPTLKKCEDVARPAQRLWDFTQSGPIVSRAT-GRCLEVEMAKDANFGLRLVVQRC 295

Query: 258 DENNSYQQWRFK 269
               S Q+W  +
Sbjct: 296 ----SGQKWMIR 303


>gi|51316092|sp|Q9GM01.1|GALT9_MACFA RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 9;
           AltName: Full=Polypeptide GalNAc transferase 9;
           Short=GalNAc-T9; Short=pp-GaNTase 9; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 9;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9
 gi|11041469|dbj|BAB17277.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Macaca
           fascicularis]
          Length = 606

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 377 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 436

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 437 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 492

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 493 -LYPCHGM-SSQLVRYSADGLLQLG 515


>gi|67613609|ref|XP_667313.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3 [Cryptosporidium
           hominis TU502]
 gi|54658437|gb|EAL37084.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3 [Cryptosporidium
           hominis]
          Length = 732

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 96/228 (42%), Gaps = 40/228 (17%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           LR    W+  F QI  + +P    + +G   + + L+ KL CK F WF+ +VA + Y   
Sbjct: 526 LRTARVWMDEFYQITEMLAPNPN-IKLGSFDDMLHLKKKLKCKPFRWFLDNVAPETYIT- 583

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSM-GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
            +L   LF GE +N      CLDSM G+T    IG  +CHG   +Q F ++      H +
Sbjct: 584 -QLDHLLFVGEIRNKKLNNICLDSMGGQTDGDKIGVFHCHGKKGTQAFMMS-----NHTQ 637

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
           +     K+           ++C+ + +K   C    + N+             WR  N  
Sbjct: 638 QIRIVSKE-----------SYCIGSNLKYAACSNSEITNI-------------WRLENEM 673

Query: 232 RQLEHAILKR-CLSVHPETN-----QLAMLRCDENNSYQQWRFKEVRP 273
            +      K  CLS+  + +     +  +L CD N+  Q W   + +P
Sbjct: 674 IKANVEPNKYVCLSLTEDNDSSTKHKAELLDCDPNDPSQHWNVNKFKP 721


>gi|395840154|ref|XP_003792930.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
           [Otolemur garnettii]
          Length = 604

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 435 LALRQRLKCRSFRWYLE----NVYPEMRIYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +A+G L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSADGLLQLG 513


>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
           sapiens]
          Length = 578

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|66363212|ref|XP_628572.1| extracellular protein with a signal peptide followed by family 2
           glycosyltransferase and ricin domains [Cryptosporidium
           parvum Iowa II]
 gi|46229580|gb|EAK90398.1| extracellular protein with a signal peptide followed by family 2
           glycosyltransferase and ricin domains [Cryptosporidium
           parvum Iowa II]
 gi|448826184|gb|AAQ95174.2| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T1, partial
           [Cryptosporidium parvum]
          Length = 637

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 91/221 (41%), Gaps = 41/221 (18%)

Query: 53  ELRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
           +LR    W+  F  +  R +G P    +D G + E++ LR++L C  F WF+++V  +  
Sbjct: 426 KLRTLYLWMDEFGDLAWRVMGRPR---VDTGPLDERIKLRERLRCNSFKWFLENVNPEAE 482

Query: 111 TKFPELPPNLFWGEAKNLGTQKCLDSMGRTAP-AVIGTSYCHGSGSSQLFRLNAEGQLGH 169
            K  +  P  + G  KN+G+  C+D+ G   P   +    CH +G +Q F      +  H
Sbjct: 483 VKSIDDVP--YIGNIKNIGSNLCIDTDGFNNPGGKVKLWSCH-TGETQNFMYFKTSK--H 537

Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
               ++ +               C+    KL +C   +                 W    
Sbjct: 538 WMVTINDES--------------CITEKFKLDWCNEHSYH---------------WDVEI 568

Query: 230 STRQLEHAILKRCLS-VHPETNQLAMLRCDENNSYQQWRFK 269
            ++    A++KR  S +    NQL +  C+ NN +QQW  +
Sbjct: 569 ISKSPLTALIKRSNSCITAVDNQLKLTPCETNNKFQQWEIE 609


>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
          Length = 1150

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/146 (27%), Positives = 66/146 (45%), Gaps = 16/146 (10%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLDS 136
           D GD+++  ALR+KL CK F WF+K VA ++   +P + P  F  G  +++     C+D+
Sbjct: 555 DAGDLTKMKALREKLQCKPFEWFLKEVAPEILQLYPPVEPEPFASGAIQSIAEPTLCIDT 614

Query: 137 MGRTAPAVIGTSYC-----HGSGSSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLP 189
           M R     IG   C     H    +Q F L  + + Q    E+C D  +        + P
Sbjct: 615 MQRPRGNPIGMHPCDSDLIHPKNMNQYFVLSWHRDIQQKSDEQCFDVPESAP-----RSP 669

Query: 190 VTFCVLAGVKLI--FCRLGTVLNLFN 213
           VT      +K +    R   ++  ++
Sbjct: 670 VTIYTCHNIKYLQHLPRTSVIIPFYD 695



 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/67 (31%), Positives = 37/67 (55%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P+   +D GD+++   LR +L+CK F WF+++VA D+   +P
Sbjct: 933 RVAEVWMDEYKKYVYRKDPMRFSIDAGDLTKMKELRRRLNCKPFRWFIENVAPDLIDWYP 992

Query: 115 ELPPNLF 121
            + P  F
Sbjct: 993 PIEPEPF 999


>gi|354484387|ref|XP_003504370.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
           [Cricetulus griseus]
          Length = 532

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 95/229 (41%), Gaps = 44/229 (19%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K      +P A     GDISE+  LR++L CK F W++++V  +++   P
Sbjct: 335 RAAEVWMDEYKDHFYNRNPPARKEVYGDISERKLLRERLKCKSFHWYLENVFSNLHV--P 392

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP      A +    CHG G +Q F   +  ++
Sbjct: 393 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 448

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
                 E C +  +Q+  V M   P               L   +N+   F   GT+   
Sbjct: 449 RFNSVTELCAEVPEQKHHVGMQNCPKD------------GLPAPVNIIWLFKEDGTI--- 493

Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
             FH  T          CLS    +     + M  CD ++  Q WRF++
Sbjct: 494 --FHPHT--------GLCLSAFRTSEGRPSVQMKTCDAHDKNQIWRFEK 532


>gi|410057869|ref|XP_001172345.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9, partial
           [Pan troglodytes]
          Length = 288

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/171 (29%), Positives = 76/171 (44%), Gaps = 15/171 (8%)

Query: 6   PAV--IGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKV 63
           PAV  +G +     GS ++   +   ++ H ER        I        LR    W+  
Sbjct: 84  PAVGGVGAAVWQCGGSMEVLPCS---RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDD 140

Query: 64  FKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNL 120
           FK    +   + M    +D GD+SE++ALR +L C+ F W+++    +VY +       L
Sbjct: 141 FKSHVYMAWNIPMTNPGVDFGDVSERLALRQRLKCRSFKWYLE----NVYPEMRVYNNTL 196

Query: 121 FWGEAKN-LGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
            +GE +N   +  CLD         I    CHG  SSQL R +A+G L  G
Sbjct: 197 TYGEVRNSKASGYCLDQGAEDGDRAI-LYPCHGM-SSQLVRYSADGLLQLG 245


>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
           aries]
          Length = 582

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/182 (28%), Positives = 84/182 (46%), Gaps = 19/182 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V   ++   P
Sbjct: 385 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHV--P 442

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY------CHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP    TS       CHG G +Q F   +  ++
Sbjct: 443 EDRPG-WHGAIRSIGISSECLDY---NAPDNNPTSANLSLFGCHGQGGNQFFEYTSNKEI 498

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
                 E C +  + +K V M   P   F + A +   F   GT+ +  +  C ++  T 
Sbjct: 499 RFNSVTELCAEVPELKKHVGMQNCPKDGFPIPANIIWHFKEDGTIFHPHSGLCLSAYRTP 558

Query: 222 DG 223
           +G
Sbjct: 559 EG 560


>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
          Length = 578

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEIPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|158289989|ref|XP_311577.4| AGAP010367-PA [Anopheles gambiae str. PEST]
 gi|157018424|gb|EAA07231.4| AGAP010367-PA [Anopheles gambiae str. PEST]
          Length = 587

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 91/223 (40%), Gaps = 37/223 (16%)

Query: 60  WLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPP 118
           W+  +KQ I           D GD+S+   +R+KL CK F WF++ VA ++   +P + P
Sbjct: 374 WMDEYKQYIYDRDPERYAKTDAGDMSKMKTIREKLMCKPFKWFLQEVAPEIIELYPPVEP 433

Query: 119 NLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS-----GSSQLF--RLNAEGQLGH 169
             +  G  +++     C+D+M R     IG   C  S       +Q F    + + Q  +
Sbjct: 434 EPYASGSIQSVADSSLCIDTMQRGRGEPIGLYPCSNSLIEPTNHNQYFVHSWHRDIQHKY 493

Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
           GE C D  +                         + G+ + +F C    G  +  +++ +
Sbjct: 494 GEGCFDVPQ------------------------SKPGSPVTIFTCHMHQG--NQFFQYDH 527

Query: 230 STRQLEHAILKR-CLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
            TRQ+     K  C       ++L + RC+     Q+WR  ++
Sbjct: 528 DTRQIYAGKDKSFCFDADGMNDKLILNRCNRKQESQRWRMGQM 570


>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
           grunniens mutus]
          Length = 566

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 84/182 (46%), Gaps = 19/182 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V   ++   P
Sbjct: 369 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHV--P 426

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY------CHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP    TS       CHG G +Q F   +  ++
Sbjct: 427 EDRPG-WHGAIRSIGISSECLDY---NAPDNNPTSANLSLFGCHGQGGNQFFEYTSNKEI 482

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
                 E C +  + +K V M   P   F + A +   F   GTV +  +  C ++  T 
Sbjct: 483 RFNSVTELCAEVPELKKHVGMQNCPKDGFPIPANIIWHFKEDGTVFHPHSGLCLSAFRTQ 542

Query: 222 DG 223
           +G
Sbjct: 543 EG 544


>gi|281344573|gb|EFB20157.1| hypothetical protein PANDA_017448 [Ailuropoda melanoleuca]
          Length = 539

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 55/110 (50%), Gaps = 13/110 (11%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 343 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 395

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFR 160
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q +R
Sbjct: 396 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQGWR 443


>gi|118098416|ref|XP_415088.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Gallus gallus]
          Length = 604

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 434

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       + +GE +N   +  CLD         I
Sbjct: 435 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 490

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +++G L  G
Sbjct: 491 LYP-CHGM-SSQLVRYSSDGVLQLG 513


>gi|260809642|ref|XP_002599614.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
 gi|229284894|gb|EEN55626.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
          Length = 432

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 60/113 (53%), Gaps = 10/113 (8%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K +     P    ++ G++  +  LR  LHC DFSW++++V  ++Y   P++ P 
Sbjct: 320 WMDDYKNVIYRKHPDYKTVETGNVMPRKVLRKALHCHDFSWYVQNVYPNLYV--PDVRP- 376

Query: 120 LFWGEAKNLGTQKCLDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           + +G+ +  G  KCLD++   +  P + G   CHG G +Q +     G++  G
Sbjct: 377 VAYGQVRMTG--KCLDAVSPEKEQPKLFG---CHGLGGNQYWEFTRAGEVRFG 424


>gi|195379310|ref|XP_002048423.1| GJ11361 [Drosophila virilis]
 gi|194155581|gb|EDW70765.1| GJ11361 [Drosophila virilis]
          Length = 540

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/221 (23%), Positives = 89/221 (40%), Gaps = 33/221 (14%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    WL  +K       P    + +G + +Q ALR +L+CK F WF+ H+  D   ++P
Sbjct: 329 RVAEVWLDEYKDKLYANMPHLTHIKVGSLKKQKALRQRLNCKPFKWFLDHLGRDFLEQYP 388

Query: 115 -ELPPNLFWGEAKNLGTQK-CLD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHG 170
            + P +  +G  ++L     CL+ +     P ++            Q + L+    L   
Sbjct: 389 MDEPLDFAFGAVQSLAAPTLCLERTESAEHPQLLPCDEDLMYPKLEQKWSLSHFRDLHSS 448

Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
            RC++  +Q    E+                         L+ C + +G  +  W F  +
Sbjct: 449 SRCLEVHQQHPNAEIW------------------------LWQCHHHAG--NQFWSFDLN 482

Query: 231 TRQLEHA---ILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           ++Q+ H      +RCL    E  ++    CD NN  QQW+F
Sbjct: 483 SKQIIHGQNRAEQRCLEALVEQRKVVANICDANNLRQQWKF 523


>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
 gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
 gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
          Length = 578

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 84/182 (46%), Gaps = 19/182 (10%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V   ++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY------CHGSGSSQLFRLNAEGQL 167
           E  P  + G  +++G + +CLD     AP    TS       CHG G +Q F   +  ++
Sbjct: 439 EDRPG-WHGAIRSIGISSECLDY---NAPDNNPTSANLSLFGCHGQGGNQFFEYTSNKEI 494

Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
                 E C +  + +K V M   P   F + A +   F   GTV +  +  C ++  T 
Sbjct: 495 RFNSVTELCAEVPELKKHVGMQNCPKDGFPIPANIIWHFKEDGTVFHPHSGLCLSAFRTQ 554

Query: 222 DG 223
           +G
Sbjct: 555 EG 556


>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 7/124 (5%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +K+           +D GD+S+++ LR KL C +F+W++K++  ++    
Sbjct: 335 MRVAEVWMDHYKKFFYATQTKLHMVDYGDVSDRLELRRKLGCHNFTWYLKNIIPEMI--L 392

Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
           P    N F GE +N  T  CLDS    +   +    C  + S Q+F L ++ QL  G+ C
Sbjct: 393 PVDDANYF-GEIRNDATGLCLDS---ASGKPLRVDIC-AATSDQIFTLTSDHQLRIGKEC 447

Query: 174 VDAD 177
           +  D
Sbjct: 448 LMKD 451


>gi|170038571|ref|XP_001847122.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
 gi|167882321|gb|EDS45704.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
          Length = 560

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 58/244 (23%), Positives = 96/244 (39%), Gaps = 52/244 (21%)

Query: 40  ADKQVISTQKSSLELRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKD 97
           A+K V+        LR    W+  +KQ+     G P     ++GD+S +  +R K +CK 
Sbjct: 351 ANKDVVRANS----LRLAEVWMDEYKQVIFDIYGLPRYPVEEVGDVSSRKEVRRKANCKT 406

Query: 98  FSWFMKHVAYDVYTKFPELPPNL----FWGEAKN--LGTQKCLDSMGRTAPAVIGTSYCH 151
           F ++++       T +PE+   L    F GE KN  LG   CL      A   +G + C 
Sbjct: 407 FRYYIE-------TAYPEMKNPLIEGAFRGEVKNAALGNDTCLTY--HAATNTVGMASCD 457

Query: 152 GSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNL 211
            +  SQ +  N   +L   + C+D                              G+ L +
Sbjct: 458 HAEKSQFWVHNYYQELNSYKHCLDY----------------------------TGSELGV 489

Query: 212 FNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKE 270
           F C    G  +  W++   + Q++     +CL++  E+   L M  CD +   Q+W    
Sbjct: 490 FGCHRGRG--NQAWQYKIDSGQIKSVKHGKCLALDMESRTFLKMEDCDGSKEVQKWLVTP 547

Query: 271 VRPD 274
           V  D
Sbjct: 548 VSLD 551


>gi|444727591|gb|ELW68073.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Tupaia chinensis]
          Length = 554

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 13/107 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL+CK F W++++V       +P
Sbjct: 300 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 352

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q
Sbjct: 353 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQ 397


>gi|71994070|ref|NP_001022877.1| Protein GLY-9, isoform b [Caenorhabditis elegans]
 gi|58081905|emb|CAI46621.1| Protein GLY-9, isoform b [Caenorhabditis elegans]
          Length = 325

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 49/215 (22%), Positives = 87/215 (40%), Gaps = 39/215 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K++  +        D+GD++ +  LR +L+CK F WF+ ++A     KF      
Sbjct: 138 WMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIA---KGKFIMDEDV 194

Query: 120 LFWGEAKNL--GTQKCLDSMGRTA--PAVIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
           + +G    +  GT+ C D++ R      ++G  +C G GSS QL  L+ EG L     C 
Sbjct: 195 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCA 254

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS-TRQ 233
             +                    +++  C      N              W + N   R 
Sbjct: 255 SEEN-----------------GNIRMKTCSKKAQFN------------ERWAYENKMIRN 285

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           L+        ++ P  N + ++ CDE + +Q+W F
Sbjct: 286 LKSGKCMSTANLKPGDNAI-VVECDEKDEHQKWNF 319


>gi|221504299|gb|EEE29974.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3, putative
           [Toxoplasma gondii VEG]
          Length = 635

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/187 (28%), Positives = 78/187 (41%), Gaps = 41/187 (21%)

Query: 90  RDKLHCKDFSWFMKHV--AYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPA-VIG 146
           R +  CK F WFM++V    DV T   ++P   + G  +N     CLD+MG  +P   +G
Sbjct: 471 RKRKGCKSFRWFMENVFPEGDVVT-LDDVP---YLGPLRNDKIGMCLDNMGWASPGHAVG 526

Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLG 206
             YCHG G +Q F      ++GH     D +               C+    +L +CR  
Sbjct: 527 LEYCHG-GDTQTFMFFR--KVGHVMPVNDDEA--------------CLQPSGRLDWCR-- 567

Query: 207 TVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
                       GT    W F  S+ QL     K+CLS      +L M+ CD+ + YQ W
Sbjct: 568 ------------GTAQFWWDF-TSSGQLMFRETKQCLSAF--GRKLRMVECDDTDPYQIW 612

Query: 267 RFKEVRP 273
            +    P
Sbjct: 613 SWTAYNP 619


>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Anolis carolinensis]
          Length = 583

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/228 (24%), Positives = 94/228 (41%), Gaps = 42/228 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A   + GD+SE+  LR KL C +F W++K++  +++   P
Sbjct: 386 RAAEVWMDDYKEHFYNRNPPARKENFGDLSERKLLRKKLQCNNFDWYLKNIFPNLHV--P 443

Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMG---RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  +++G + +CLD          A +    CHG G +Q F      ++   
Sbjct: 444 EDRPG-WHGAIRSMGISSECLDYNSPEHNPTGAHVSLFGCHGQGGNQFFEYTVNQEIRFN 502

Query: 171 ---ERCVDADKQEKLVEMLKLPV--TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
              E C +  +Q+  V M   P   TF +   +K  F   GT+                 
Sbjct: 503 SVTELCAEVPEQKDFVGMRTCPKDGTF-IPETIKWQFREDGTIY---------------- 545

Query: 226 RFHNSTRQLEHAILKRCLSVH--PETN-QLAMLRCDENNSYQQWRFKE 270
                     H    +CLS +  PE    + M  C+  +  Q W+F++
Sbjct: 546 ----------HPHSAKCLSAYRTPEGRADIQMTACNAGDKNQAWKFEK 583


>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
 gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
          Length = 578

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 48/179 (26%), Positives = 82/179 (45%), Gaps = 13/179 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  +R++L CK F W++K+V  +++   P
Sbjct: 381 RVAEVWMDEYKEHFYNRNPPARKEAYGDISERKLIRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497

Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
              E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFLVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
 gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
           AltName: Full=Protein-UDP
           acetylgalactosaminyltransferase 9; AltName:
           Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
 gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
          Length = 579

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 89/215 (41%), Gaps = 39/215 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K++  +        D+GD++ +  LR +L+CK F WF+ ++A   +    ++   
Sbjct: 392 WMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDV--- 448

Query: 120 LFWGEAKNL--GTQKCLDSMGRTAP--AVIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
           + +G    +  GT+ C D++ R      ++G  +C G GSS QL  L+ EG L     C 
Sbjct: 449 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCA 508

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS-TRQ 233
             +                    +++  C            +     +  W + N   R 
Sbjct: 509 SEEN-----------------GNIRMKTC------------SKKAQFNERWAYENKMIRN 539

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           L+        ++ P  N + ++ CDE + +Q+W F
Sbjct: 540 LKSGKCMSTANLKPGDNAI-VVECDEKDEHQKWNF 573


>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
          Length = 518

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 2/99 (2%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
           D GD+S + ALR++LHCK F W++ ++  D++     L      G   +     CLDS  
Sbjct: 338 DYGDVSARKALRERLHCKSFKWYLDNIYPDLFIPGESLASGEVNGVFNSQSQPACLDSAA 397

Query: 139 --RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
             +     I    CH  G +Q + L+  G++   E C D
Sbjct: 398 DKKAYNKAIKLWPCHNMGGNQYWMLSKSGEIRRDEGCFD 436


>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
           1 [Nomascus leucogenys]
          Length = 578

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 77/178 (43%), Gaps = 11/178 (6%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR++L CK F W++K+V  +++   P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438

Query: 115 ELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG- 170
           E  P           + +CLD          A +    CHG G +Q F   +  ++    
Sbjct: 439 EDRPGWHGAIHSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFNS 498

Query: 171 --ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
             E C +  +Q+  V M   P   F V A +   F   GT+ +  +  C ++  T +G
Sbjct: 499 VTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556


>gi|195172682|ref|XP_002027125.1| GL20074 [Drosophila persimilis]
 gi|194112938|gb|EDW34981.1| GL20074 [Drosophila persimilis]
          Length = 597

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 21/129 (16%)

Query: 39  DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKD 97
           D D   I+T + +L       W+  F  I  L  P L    D+GD++ +V LR KL CK 
Sbjct: 395 DRDTHGINTARMAL------VWMDEFINIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKS 448

Query: 98  FSWFMKHVAYDVYTKFPE-LPPN---LFWGEAKNLGTQKCLDSM--GRTAPAVIGTSYCH 151
           F+W++K++       +PE   PN   + WG+ K++ +  CLD +      P  +G   C 
Sbjct: 449 FAWYLKNI-------YPEKFVPNADVVGWGKVKSVSSNLCLDDLLQNNEKPYNVGLYPCG 501

Query: 152 GS-GSSQLF 159
                SQLF
Sbjct: 502 KVLQKSQLF 510


>gi|237841027|ref|XP_002369811.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3, putative
           [Toxoplasma gondii ME49]
 gi|37622165|gb|AAQ95173.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3 [Toxoplasma gondii]
 gi|37724831|gb|AAO20278.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3 [Toxoplasma gondii]
 gi|211967475|gb|EEB02671.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3, putative
           [Toxoplasma gondii ME49]
 gi|221483678|gb|EEE21990.1| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3, putative
           [Toxoplasma gondii GT1]
          Length = 635

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/187 (28%), Positives = 78/187 (41%), Gaps = 41/187 (21%)

Query: 90  RDKLHCKDFSWFMKHV--AYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPA-VIG 146
           R +  CK F WFM++V    DV T   ++P   + G  +N     CLD+MG  +P   +G
Sbjct: 471 RKRKGCKSFRWFMENVFPEGDVVT-LDDVP---YLGPLRNDKIGMCLDNMGWASPGHAVG 526

Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLG 206
             YCHG G +Q F      ++GH     D +               C+    +L +CR  
Sbjct: 527 LEYCHG-GDTQTFMFFR--KVGHVMPVNDDEA--------------CLQPSGRLDWCR-- 567

Query: 207 TVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
                       GT    W F  S+ QL     K+CLS      +L M+ CD+ + YQ W
Sbjct: 568 ------------GTAQFWWDF-TSSGQLMFRETKQCLSAF--GRKLRMVECDDTDPYQIW 612

Query: 267 RFKEVRP 273
            +    P
Sbjct: 613 SWTAYNP 619


>gi|125810093|ref|XP_001361353.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
 gi|54636528|gb|EAL25931.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
          Length = 597

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 21/129 (16%)

Query: 39  DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKD 97
           D D   I+T + +L       W+  F  I  L  P L    D+GD++ +V LR KL CK 
Sbjct: 395 DRDTHGINTARMAL------VWMDEFINIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKS 448

Query: 98  FSWFMKHVAYDVYTKFPE-LPPN---LFWGEAKNLGTQKCLDSM--GRTAPAVIGTSYCH 151
           F+W++K++       +PE   PN   + WG+ K++ +  CLD +      P  +G   C 
Sbjct: 449 FAWYLKNI-------YPEKFVPNADVVGWGKVKSVSSNLCLDDLLQNNEKPYNVGLYPCG 501

Query: 152 GS-GSSQLF 159
                SQLF
Sbjct: 502 KVLQKSQLF 510


>gi|198414821|ref|XP_002124597.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
           4, partial [Ciona intestinalis]
          Length = 230

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 64/128 (50%), Gaps = 8/128 (6%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLA-MFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
           +R    WL  +K+   +  PL+    D+GD++++  LR +L+C DF W++ +V  + +  
Sbjct: 36  MRLAEVWLDDYKRHFYVRVPLSERNQDVGDVTKRKELRQRLNCHDFKWYLDNVFPEQFV- 94

Query: 113 FPELPPNLFWGEAKNLGTQKCLD-SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG- 170
            PE  P  +      L    CLD S   T  AV+    CHG  S+Q F   ++ ++ +  
Sbjct: 95  -PEDRPGFYGPIESVLLKDNCLDVSFDATNQAVVQMYQCHGM-SNQFFEYTSKNEIRYNQ 152

Query: 171 --ERCVDA 176
             E+C+D 
Sbjct: 153 ISEKCLDT 160


>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
          Length = 579

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 46/215 (21%), Positives = 89/215 (41%), Gaps = 39/215 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K++  +        D+GD++ +  LR +L+CK F WF+ ++A   +    ++   
Sbjct: 392 WMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDV--- 448

Query: 120 LFWGEAKNL--GTQKCLDSMGRTAP--AVIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
           + +G    +  GT+ C D++ R      ++G  +C G GSS QL  L+ EG L     C 
Sbjct: 449 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCA 508

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS-TRQ 233
             +                    +++  C            +     +  W + N   R 
Sbjct: 509 SEEN-----------------GNIRMKTC------------SKKAQFNERWAYENKMIRN 539

Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
           L+        ++ P  N + ++ CDE + +Q+W F
Sbjct: 540 LKSGKCMSTANLKPGDNAI-VVECDEKDEHQKWNF 573


>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
 gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
           AltName: Full=Polypeptide GalNAc transferase 3;
           Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
           Full=Protein-UDP acetylgalactosaminyltransferase 3;
           AltName: Full=UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 3
 gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
 gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mus musculus]
          Length = 633

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 100/229 (43%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++  +  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
            Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  +A+
Sbjct: 496 AYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A +    V  LK     CV  G + I                    
Sbjct: 550 REIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-----------------APG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L + + K CLS + E   L  + CD  +  Q+W F +
Sbjct: 586 EQIWEIRKD-QLLYNPLFKMCLSSNGEHPNL--VPCDATDLLQKWIFSQ 631


>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
          Length = 507

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/196 (26%), Positives = 87/196 (44%), Gaps = 37/196 (18%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
           D GD S+++ LR  L CK F W++++V  +   KF      + +G  +N  +  CLD++ 
Sbjct: 338 DAGDFSDRLQLRRDLKCKSFKWYLENVYPE---KFIPDENVIAYGMVRNPSSNLCLDTLS 394

Query: 139 RTAPAV--IGTSYCHGSGSS-QLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
           +    V  +G   C G  SS QLF L+   +L   E C+D+   E               
Sbjct: 395 KDEKMVFNLGLYGCQGGVSSNQLFSLSQSNELRREESCMDSVGGE--------------- 439

Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN--QLA 253
                     G+ + L  C  S G  +  W ++  T+ + H+  K CL    E N  ++ 
Sbjct: 440 ----------GSPVKLMPCHGSRGHQE--WTYNLETKVIVHSSGK-CLDKVEEKNGGEVK 486

Query: 254 MLRCDENNSYQQWRFK 269
           + +C+  +S Q W F+
Sbjct: 487 VEKCNGKDS-QIWEFE 501


>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mus musculus]
          Length = 633

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 102/235 (43%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  + Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A +    V  LK     CV  G + I              
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                 +  W      + L + + K CLS + E   L  + CD  +  Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFKMCLSSNGEHPNL--VPCDATDLLQKWIFSQ 631


>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
           musculus]
          Length = 633

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 57/235 (24%), Positives = 102/235 (43%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  + Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A +    V  LK     CV  G + I              
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                 +  W      + L + + K CLS + E   L  + CD  +  Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFKMCLSSNGEHPNL--VPCDATDLLQKWIFSQ 631


>gi|326929546|ref|XP_003210923.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Meleagris gallopavo]
          Length = 481

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 65/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 252 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 311

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       + +GE +N   +  CLD         I
Sbjct: 312 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 367

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +++G L  G
Sbjct: 368 LYP-CHGM-SSQLVRYSSDGVLQLG 390


>gi|256083753|ref|XP_002578103.1| peptidase [Schistosoma mansoni]
          Length = 1860

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 2/98 (2%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+      P    +D GD+S+Q  LR +L CK F WFM  VA+D+   +P
Sbjct: 313 RVAEVWMDEYKEYIYRSLPECRKVDPGDLSQQHNLRKRLQCKSFKWFMTEVAFDLTKAYP 372

Query: 115 ELPPNLF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYC 150
                LF  GE +++     C+D+   T  + +  S+C
Sbjct: 373 PPEEVLFATGEIRSVAFPYLCIDAARATKRSPVKLSFC 410


>gi|432851710|ref|XP_004067046.1| PREDICTED: putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 4-like
           [Oryzias latipes]
          Length = 609

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/177 (27%), Positives = 77/177 (43%), Gaps = 11/177 (6%)

Query: 29  GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
            ++ H ER      + +++      LR    W+  FK    +   + M    +D+GDISE
Sbjct: 380 ARIAHIERAHKPYTEDLTSNVRRNALRVAEVWMDEFKSHVYMAWNIPMEDSGIDIGDISE 439

Query: 86  QVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAV 144
           + ALR +L CK F W++     ++Y +       + +G  KN L +  CLD  G     V
Sbjct: 440 RKALRKRLQCKTFRWYL----VNIYPEMRMYSDTIAYGVLKNSLKSDLCLDQ-GPDGDNV 494

Query: 145 IGTSYCHG-SGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLP-VTFCVLAGVK 199
                CHG +  S  +    E  +G     +D D  + LV++   P +T C  A  K
Sbjct: 495 PILYLCHGMTPQSVYYTQTQELHIGLLSPTIDDDDNKCLVDVNSKPRLTECSYAAAK 551


>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
           [Saccoglossus kowalevskii]
          Length = 621

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 60/135 (44%), Gaps = 20/135 (14%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           WL  +K +     P  M +D GDIS++V LR  L CK F WF++++  D +  F  +   
Sbjct: 417 WLDEYKYLFYANQPETMKIDPGDISQRVQLRKNLQCKSFRWFLQNIYPDSHYNFAFVGV- 475

Query: 120 LFWGEAKNLGTQKCLD-----------SMGRTAPAVIGTSY----CHGSGSSQLFRLNAE 164
              G+ KN+ +  CLD             G+ A  V   +     CH  G  QLF    +
Sbjct: 476 ---GQLKNVASGACLDFGKAAGHGGKEFKGKDATNVTSNTVELWPCH-DGKIQLFIRTDK 531

Query: 165 GQLGHGERCVDADKQ 179
            +  +   C+D + Q
Sbjct: 532 KEFRYIHMCLDYNVQ 546


>gi|444726436|gb|ELW66970.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Tupaia chinensis]
          Length = 651

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 423 RVAHIERTRKPYSGDIDYYARRNALRAAEVWMDGFKSHVYMAWNIPMTNPGVDFGDVSER 482

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W++     +VY +       L +GE +N   +  CLD         I
Sbjct: 483 LALRRRLKCRSFKWYLD----NVYPEMRVYNDTLTYGEVRNSKASGYCLDQGAEDDDQAI 538

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R +AEG L  G
Sbjct: 539 -LYPCHGM-SSQLVRYSAEGLLQLG 561


>gi|195120520|ref|XP_002004772.1| GI19414 [Drosophila mojavensis]
 gi|193909840|gb|EDW08707.1| GI19414 [Drosophila mojavensis]
          Length = 604

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 21/129 (16%)

Query: 39  DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKD 97
           D D   I+T + +L       W+  +  +  L  P L    D+GD++ +V LR KL CK 
Sbjct: 401 DRDTHGINTARMAL------VWMDEYINVFFLNRPDLKFHPDIGDVTHRVVLRKKLRCKS 454

Query: 98  FSWFMKHVAYDVYTKFPE-LPPNL---FWGEAKNLGTQKCLDSM--GRTAPAVIGTSYCH 151
           F W++K+V       +PE   PN+    WG+ K + +  CLD +      P  +G   C 
Sbjct: 455 FEWYLKNV-------YPEKFVPNMNVKAWGKVKAVNSNLCLDDLLNNNEKPYNLGLYACG 507

Query: 152 GS-GSSQLF 159
            +   SQLF
Sbjct: 508 KALQKSQLF 516


>gi|431894865|gb|ELK04658.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Pteropus alecto]
          Length = 633

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/235 (22%), Positives = 103/235 (43%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDDYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
            ++  +VY   P+L P +  G  K+ G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSFGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  + + ++ H    E C+ A   + LV++       C   G K +              
Sbjct: 544 FEYSVQHEIRHNIQKELCLHA--AQGLVQL-----KACAYKGHKTV-------------- 582

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
                 +  W        L + + K CLS + E    +++ C+ ++  Q+W F +
Sbjct: 583 ---AAGEQIWEIQKD-HLLYNPLFKMCLSANGE--HPSLVSCNPSDLLQKWIFSQ 631


>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
           [Rhipicephalus pulchellus]
          Length = 626

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 12/123 (9%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K       P    +  GDISE+ ALR+KL CK F W++ +V      KF      
Sbjct: 420 WMDNYKYYFYQNRPELRKISFGDISERKALRNKLQCKSFKWYLDNV---YPNKFVPSEKV 476

Query: 120 LFWGEAKNLGTQKCLDSMGR----TAPAVIGTSYCH---GSGSSQLFRLNAEGQLGHGER 172
             +G A+N  T  CLDSM      T P  +G   CH    SG +QL       ++   + 
Sbjct: 477 FAFGNARNPNTGMCLDSMSHNYDNTEP--LGIYPCHKDTNSGGNQLVSYTWRHEIRKEDS 534

Query: 173 CVD 175
           C +
Sbjct: 535 CAE 537


>gi|355689613|gb|AER98891.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 3 [Mustela putorius
           furo]
          Length = 302

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 106/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 100 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 159

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  + Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 160 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 213

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G + +              
Sbjct: 214 FEYSAQHEIRHNIQRELCLHA--AQGLVQLRA-----CAYKGHRTV-------------- 252

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
            +SG  +  W      + L +   K CLS + E    +++ C+ ++  Q+W F +
Sbjct: 253 -ASG--EQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 301


>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
           [Takifugu rubripes]
          Length = 583

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 57/229 (24%), Positives = 96/229 (41%), Gaps = 46/229 (20%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           +R    W+  +KQ     +P A     GDIS ++ LRDKL C+ F+W++K++  D++   
Sbjct: 386 VRAAEVWMDSYKQHFYNRNPPARKETYGDISGRLLLRDKLKCQSFNWYLKNIYPDLH--I 443

Query: 114 PELPPNLFW-GEAKNLG-TQKCLDSMGRTAP--AVIGTSY----CHGSGSSQLFRLNAEG 165
           PE      W G  ++LG   +CLD     AP  +V G       CHG G +Q F   ++ 
Sbjct: 444 PE--DRAGWHGAVRHLGINSECLDY---NAPEHSVTGAHLSLFGCHGQGGNQYFEYTSQK 498

Query: 166 QLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
           ++      E C +  +    + M       C   G      R+ ++L             
Sbjct: 499 EIRFNTVTELCAEVVEGRASIGM-----KHCPGDGE----TRIPSIL------------- 536

Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWRF 268
             W F      + H     C++ +  +     + M+RC+  +  QQW+F
Sbjct: 537 --WEFRQDG-SIYHPHTDMCITAYRTSAGRTDVKMMRCNPGDKNQQWKF 582


>gi|301784471|ref|XP_002927652.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like,
           partial [Ailuropoda melanoleuca]
          Length = 463

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 34/107 (31%), Positives = 53/107 (49%), Gaps = 13/107 (12%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K       P A  +  G+I  ++ LR KL CK F W++++V       +P
Sbjct: 315 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 367

Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
           EL      ++ +G A   GT  CLD++G  A  V+G   CH +G +Q
Sbjct: 368 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQ 412


>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
          Length = 520

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 8/124 (6%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K+   +        D+GD+SE+ ALR KL CK F W++++VA + +     L  N
Sbjct: 330 WMDDYKKFYYIHRLDLKEKDVGDLSERKALRQKLKCKSFKWYLENVAKNKFV----LDEN 385

Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC---HGSGSSQLFRLNAEGQLGHGERCVD 175
           +  +G  +N  ++ CLD++ R     I  S     +G   +Q+F L  +G L     C  
Sbjct: 386 VAAFGSLRNPSSELCLDTLQRDEGEAIPLSVFPCQNGKSEAQIFSLTNDGILRRELTCAK 445

Query: 176 ADKQ 179
            D+ 
Sbjct: 446 IDRD 449


>gi|66357692|ref|XP_626024.1| extracellular protein with a ricin domain and a family 2
           glycosyltransferase domain [Cryptosporidium parvum Iowa
           II]
 gi|46227210|gb|EAK88160.1| extracellular protein with a ricin domain and a family 2
           glycosyltransferase domain [Cryptosporidium parvum Iowa
           II]
 gi|448826186|gb|AAQ95175.2| UDP-N-acetyl-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase T3, partial
           [Cryptosporidium parvum]
          Length = 732

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 96/228 (42%), Gaps = 40/228 (17%)

Query: 54  LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
           LR    W+  F QI  + +P    + +G   + + L+ KL CK F WF+ +VA + Y   
Sbjct: 526 LRTARVWMDEFYQITEMLAPNPN-IKLGSFDDMLHLKKKLKCKPFRWFLDNVAPETYIT- 583

Query: 114 PELPPNLFWGEAKNLGTQK-CLDSM-GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
            +L   LF GE +N      CLDSM G+T    IG  +CHG   +Q F ++      H +
Sbjct: 584 -QLDHLLFVGEIRNKKLDNICLDSMGGQTDGDKIGVFHCHGKKGTQAFMMS-----NHTQ 637

Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
           +     K+           ++C+ + +K   C    + N+             WR  N  
Sbjct: 638 QIRIVSKE-----------SYCIGSNLKYAACSNSEMTNV-------------WRLENEM 673

Query: 232 RQLEHAILKR-CLSVHPETN-----QLAMLRCDENNSYQQWRFKEVRP 273
            +      K  CLS+  + +     +  +L CD N+  Q W   + +P
Sbjct: 674 IKANVEPNKYVCLSLTEDNDSSTKHKAELLDCDPNDPSQHWNVNKFKP 721


>gi|301783121|ref|XP_002926975.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
           [Ailuropoda melanoleuca]
 gi|281344477|gb|EFB20061.1| hypothetical protein PANDA_016676 [Ailuropoda melanoleuca]
          Length = 632

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 106/235 (45%), Gaps = 40/235 (17%)

Query: 43  QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
           QVI+  +  L       + ++F +  +  + +      GD+S++  ++ +L CK+F+W++
Sbjct: 429 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 488

Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
             +  + Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q 
Sbjct: 489 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 542

Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
           F  +A+ ++ H    E C+ A   + LV++       C   G + +              
Sbjct: 543 FEYSAQHEIRHNIQRELCLHA--AQGLVQLRA-----CAYKGHRTV-------------- 581

Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
            +SG  +  W      + L +   K CLS + E    +++ C+ ++  Q+W F +
Sbjct: 582 -ASG--EQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 630


>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
          Length = 522

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/142 (26%), Positives = 69/142 (48%), Gaps = 10/142 (7%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A   + GD+SE+  LR++L CK F+W++K++  +++   P
Sbjct: 325 RAAEVWMDEYKEHFYNRNPSARKENYGDLSERKILRERLKCKSFNWYLKNIFAELHV--P 382

Query: 115 ELPPNLFWGEAKNLG-TQKCLD-SMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++ G   +CLD ++    P  A +    CHG G +Q F   +  ++   
Sbjct: 383 EDRPG-WHGAIRSAGIASECLDYALPENHPTGAHLSLFGCHGQGGNQFFEYTSNKEIRFN 441

Query: 171 ---ERCVDADKQEKLVEMLKLP 189
              E C +  + E  + M   P
Sbjct: 442 SVTELCAEVPEHEDFITMRNCP 463


>gi|432934421|ref|XP_004081934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
           [Oryzias latipes]
          Length = 758

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 84/199 (42%), Gaps = 49/199 (24%)

Query: 79  DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAK--NLGTQKCLDS 136
           D+G+++EQ+ LR KL CK+F W++++V       +PEL   L   E    N G ++CL +
Sbjct: 593 DIGNLTEQIELRKKLKCKNFKWYLENV-------YPELDAPLVKAEGLVFNQGLRRCL-A 644

Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
           M + +   +    C  +  SQ F       +  G+ C+    Q+ L              
Sbjct: 645 MFKDS---LSFETCDLNKQSQHFSYTWLRHVRQGDLCIAP--QQSL-------------- 685

Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL-------EHAILKRCLSVHPET 249
                        ++ +C N+   +   W FH S++         E A    CL      
Sbjct: 686 ----------NSFSVVSCDNAKAELR--W-FHKSSKSTLADHFIAEFASQHMCLEAESRD 732

Query: 250 NQLAMLRCDENNSYQQWRF 268
           + L +  CD NNS+Q+W+F
Sbjct: 733 DSLHLSPCDSNNSFQKWQF 751


>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
          Length = 579

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/216 (23%), Positives = 87/216 (40%), Gaps = 41/216 (18%)

Query: 60  WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
           W+  +K++  +        D+GD++ +  LR +L+CK F WF+ ++A     KF      
Sbjct: 392 WMDDYKRLYYMHREDLRTKDVGDLTSRHELRKRLNCKSFKWFLDNIA---KGKFIMDEDV 448

Query: 120 LFWGEAKNL--GTQKCLDSMGRTAPA--VIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
           + +G    +  GT+ C D++ R      ++G  +C G GSS QL  L+ EG L     C 
Sbjct: 449 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSREGNLRRENTCA 508

Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
             +                    V++  C      N               R+    + L
Sbjct: 509 SEEN-----------------GNVRMKTCSKKAQFNE--------------RWAYEKQML 537

Query: 235 EHAILKRCLSVH--PETNQLAMLRCDENNSYQQWRF 268
            +    +CLS       +   ++ CDEN+ +Q+W F
Sbjct: 538 RNLKSGKCLSTANLKAGDDAVVIDCDENDPHQKWNF 573


>gi|403292275|ref|XP_003937178.1| PREDICTED: uncharacterized protein LOC101029503 [Saimiri boliviensis
            boliviensis]
          Length = 1450

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 46/145 (31%), Positives = 66/145 (45%), Gaps = 12/145 (8%)

Query: 30   QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
            ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 1221 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 1280

Query: 87   VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
            +ALR KL C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 1281 LALRQKLKCRSFKWYLE----NVYPEMRTYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 1336

Query: 146  GTSYCHGSGSSQLFRLNAEG--QLG 168
                CHG  SSQL R +A+G  QLG
Sbjct: 1337 -LYPCHGM-SSQLVRYSADGLLQLG 1359


>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
          Length = 697

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 14/106 (13%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCL 134
           +D GD+SE+ ALR+KL C+ F W++ ++       +PEL      L  G+ ++     CL
Sbjct: 524 IDFGDVSERKALREKLKCRSFDWYLTNI-------YPELFIPSKALASGDIESAAGPHCL 576

Query: 135 DSM----GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
           DS     G     VI    CH  G +Q + L+   ++   E C D+
Sbjct: 577 DSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDS 622


>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
          Length = 654

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 14/106 (13%)

Query: 78  LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCL 134
           +D GD+SE+ ALR+KL C+ F W++ ++       +PEL      L  G+ ++     CL
Sbjct: 524 IDFGDVSERKALREKLKCRSFDWYLTNI-------YPELFIPSKALASGDIESAAGPHCL 576

Query: 135 DSM----GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
           DS     G     VI    CH  G +Q + L+   ++   E C D+
Sbjct: 577 DSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDS 622


>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
           glaber]
          Length = 582

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 95/228 (41%), Gaps = 42/228 (18%)

Query: 55  RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
           R    W+  +K+     +P A     GDISE+  LR +L CK F W++K+V  +++   P
Sbjct: 385 RAAEVWMDDYKEHFYNRNPPARKEAYGDISERKLLRKQLRCKSFDWYLKNVFSNLHV--P 442

Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
           E  P  + G  ++LG + +CLD          A +    CHG G +Q F   +  ++   
Sbjct: 443 EDRPG-WHGAIRSLGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 501

Query: 171 ---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGT--VLNLFNCFNSSGTVDGPW 225
              E C +  +Q+  V M   P              R G+   +N+   F   GT+  P 
Sbjct: 502 SVTELCAEVPEQKTYVGMQTCP--------------RDGSPAPVNIIWHFKEDGTIFHP- 546

Query: 226 RFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
                     H+ L  CLS +        + M  C+  +  Q WRF++
Sbjct: 547 ----------HSGL--CLSAYRTAEGRPDVQMRTCEALDKNQIWRFEK 582


>gi|291412932|ref|XP_002722732.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
           [Oryctolagus cuniculus]
          Length = 535

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 64/145 (44%), Gaps = 10/145 (6%)

Query: 30  QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
           ++ H ER        I        LR    W+  FK    +   + M    +D GD+SE+
Sbjct: 306 RVAHIERSRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 365

Query: 87  VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
           +ALR +L C+ F W+++    +VY +       L +GE +N   +  CLD         I
Sbjct: 366 LALRRRLKCRSFQWYLE----NVYPEMRVYSDTLAYGEVRNSKASGYCLDQGSEDDDQAI 421

Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
               CHG  SSQL R + +G L  G
Sbjct: 422 LYP-CHGM-SSQLVRYSTDGLLQLG 444


>gi|350593559|ref|XP_003133495.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Sus
           scrofa]
          Length = 633

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 53/229 (23%), Positives = 100/229 (43%), Gaps = 44/229 (19%)

Query: 53  ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
           ++R    W+  +K+I     +  + +      GD+S++  ++ +L CK+F+W++ ++  +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKRRLQCKNFTWYLNNIYPE 495

Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
            Y   P+L P +  G  K++G   CLD +G       P ++ T  CHG G +Q F  + +
Sbjct: 496 AYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSVQ 549

Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
            ++ H    E C+ A +    V  LK     C   G K +                  T 
Sbjct: 550 HEIRHNIQKELCLHAAQG---VVQLKT----CAYKGHKTV-----------------ATG 585

Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
           +  W      + L +   K CLS   E    +++ C+ ++  Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDPLQKWIFSQ 631


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.135    0.428 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,392,518,095
Number of Sequences: 23463169
Number of extensions: 172500921
Number of successful extensions: 344855
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1338
Number of HSP's successfully gapped in prelim test: 389
Number of HSP's that attempted gapping in prelim test: 340991
Number of HSP's gapped (non-prelim): 3150
length of query: 277
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 137
effective length of database: 9,074,351,707
effective search space: 1243186183859
effective search space used: 1243186183859
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)