BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy11641
(277 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|195567537|ref|XP_002107316.1| GD15630 [Drosophila simulans]
gi|194204722|gb|EDX18298.1| GD15630 [Drosophila simulans]
Length = 411
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 106/203 (52%), Positives = 137/203 (67%), Gaps = 30/203 (14%)
Query: 73 PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 239 PLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASDG 298
Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 299 CLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------- 345
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T QL
Sbjct: 346 ----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQL 388
Query: 253 AMLRCDENNSYQQWRFKEVRPDW 275
++ CD N+SYQQW FKE+RP W
Sbjct: 389 SLGHCDVNDSYQQWWFKEIRPRW 411
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 303 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 347
>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon
pisum]
Length = 588
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 104/205 (50%), Positives = 140/205 (68%), Gaps = 30/205 (14%)
Query: 73 PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
PLA +LDMGDIS+Q+ L+DKL CKDFSWFM++VAYDVYTKFPELPPNL+WGE +N+G
Sbjct: 414 PLARYLDMGDISKQLELKDKLQCKDFSWFMENVAYDVYTKFPELPPNLYWGELRNIGKTT 473
Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
CLD+ G P+++G CHG G++QLFRLN +GQL GERC+ AD+Q
Sbjct: 474 CLDTRGHQPPSLVGLELCHGQGNNQLFRLNTKGQLSVGERCIFADRQ------------- 520
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
VKL+ C L GTVDGPW++ ++ + L H+I ++CLSVHP ++Q+
Sbjct: 521 ----NVKLVVCPL-------------GTVDGPWQYESTNKLLLHSITRKCLSVHPSSDQV 563
Query: 253 AMLRCDENNSYQQWRFKEVRPDWSS 277
++ CD NN+ QQW FK++ P W+S
Sbjct: 564 ILMPCDSNNANQQWIFKQIVPSWTS 588
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%)
Query: 2 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
G P+++G CHG G++QLFRLN +GQL GERC+ AD+Q +
Sbjct: 479 GHQPPSLVGLELCHGQGNNQLFRLNTKGQLSVGERCIFADRQNV 522
>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae]
gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae]
Length = 591
Score = 228 bits (582), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 107/204 (52%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGLQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPVTQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
LA+ CD N+SYQQW FKE+RP W
Sbjct: 568 LALGHCDVNDSYQQWWFKEIRPRW 591
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGLQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527
>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta]
gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta]
Length = 591
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527
>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide
N-acetylgalactosaminyltransferase 2 [Drosophila
melanogaster]
Length = 591
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527
>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 7;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7;
AltName: Full=dGalNAc-T2
gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster]
gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct]
Length = 591
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527
>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba]
gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba]
Length = 591
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ CD N+SYQQW FKE+RP W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIRPRW 591
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527
>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster]
Length = 486
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 313 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 372
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 373 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 420
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 421 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 462
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ CD N+SYQQW FKE+RP W
Sbjct: 463 LSLGHCDVNDSYQQWWFKEIRPRW 486
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 378 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 422
>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni]
gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni]
Length = 587
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDI+EQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 414 EPLARYLDMGDITEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 473
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERC++AD+Q
Sbjct: 474 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCIEADRQ------------ 521
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 522 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPTTQQ 563
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ RCD N+SYQQW FKE+ P W
Sbjct: 564 LSLARCDVNDSYQQWWFKEIHPRW 587
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERC++AD+Q I
Sbjct: 479 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCIEADRQGI 523
>gi|16769336|gb|AAL28887.1| LD27021p [Drosophila melanogaster]
Length = 214
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 106/204 (51%), Positives = 137/204 (67%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 41 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 100
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 101 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 148
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 149 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 190
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ CD N+SYQQW FKE+RP W
Sbjct: 191 LSLGHCDVNDSYQQWWFKEIRPRW 214
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 106 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 150
>gi|195172039|ref|XP_002026809.1| GL27027 [Drosophila persimilis]
gi|194111748|gb|EDW33791.1| GL27027 [Drosophila persimilis]
Length = 567
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 105/204 (51%), Positives = 136/204 (66%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDI+EQ+AL+ +L CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 394 EPLARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 453
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 454 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 501
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW++++ T+ L H + K+C+++HP T Q
Sbjct: 502 -----GIKLAVCRL-------------GTVDGPWQYNDHTKHLMHRVHKKCMALHPTTQQ 543
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L + CD N+SYQQW FKE+RP W
Sbjct: 544 LTLGHCDVNDSYQQWWFKEIRPRW 567
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 459 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 503
>gi|125980684|ref|XP_001354365.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
gi|54642673|gb|EAL31418.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
Length = 591
Score = 226 bits (577), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 105/204 (51%), Positives = 136/204 (66%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDI+EQ+AL+ +L CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDITEQLALKKRLGCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW++++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNDHTKHLMHRVHKKCMALHPTTQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L + CD N+SYQQW FKE+RP W
Sbjct: 568 LTLGHCDVNDSYQQWWFKEIRPRW 591
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527
>gi|195345467|ref|XP_002039290.1| GM22807 [Drosophila sechellia]
gi|194134516|gb|EDW56032.1| GM22807 [Drosophila sechellia]
Length = 591
Score = 226 bits (576), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 105/204 (51%), Positives = 136/204 (66%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM H+AYDVY KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDHIAYDVYDKFPGLPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW+++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWQYNEHTKHLMHRVHKKCMALHPATQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ CD N+SYQQW FKE+ P W
Sbjct: 568 LSLGHCDVNDSYQQWWFKEIHPRW 591
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G +YCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLTYCHGGGNNQLVRLNAAGQLGVGERCVEADRQGI 527
>gi|383860243|ref|XP_003705600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Megachile
rotundata]
Length = 581
Score = 223 bits (568), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 139/212 (65%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA LD GDISEQ+A + + CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 400 YKEFFYTREPLAQLLDHGDISEQLAFKKRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 459
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N+ T CLD+M + P+++ TS+CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 460 ELRNVATGTCLDTMSHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCISADSQ---- 515
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
GVK +FCRL GTVDGPW++ T+ L H + K+C+
Sbjct: 516 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 549
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
++HP+T QL+++ CD NN+YQQW F ++RP+W
Sbjct: 550 ALHPQTQQLSLMPCDINNTYQQWSFHQIRPEW 581
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 25/43 (58%), Positives = 34/43 (79%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
M + P+++ TS+CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCISADSQ 515
>gi|170060398|ref|XP_001865784.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
gi|167878898|gb|EDS42281.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
Length = 356
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 104/212 (49%), Positives = 134/212 (63%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA FLDMGDISEQ+AL+++L CK F W+M +VAYDV K+P LP NLFWG
Sbjct: 175 YKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPALPANLFWG 234
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E KN GT+KC+D++GR PA+IG +CHG G +QL RLNA GQLG GERC++AD Q
Sbjct: 235 ELKNSGTEKCVDALGRQPPAIIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNQ---- 290
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
G+KL FCRL GTVDGPW++ +T L H K C+
Sbjct: 291 -------------GIKLAFCRL-------------GTVDGPWQYDETTSTLLHRTYKMCM 324
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
HP+T LA++ CD +N+YQ W+F + P W
Sbjct: 325 GYHPQTRALALMPCDVHNAYQSWKFHTITPRW 356
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 34/45 (75%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
+GR PA+IG +CHG G +QL RLNA GQLG GERC++AD Q I
Sbjct: 248 LGRQPPAIIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNQGI 292
>gi|195400935|ref|XP_002059071.1| GJ15190 [Drosophila virilis]
gi|194141723|gb|EDW58140.1| GJ15190 [Drosophila virilis]
Length = 591
Score = 219 bits (559), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 135/204 (66%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDI+EQ+AL+ +L+CK F WFM ++AYDV KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDITEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW ++ T+ L H + K+C+++HP T Q
Sbjct: 526 -----GIKLAICRL-------------GTVDGPWSYNEHTKHLLHRVHKKCMALHPTTQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ +CD N+ YQ+W FKE+RP W
Sbjct: 568 LSLGQCDVNDGYQKWLFKEIRPHW 591
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQGI 527
>gi|91081797|ref|XP_973938.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270006291|gb|EFA02739.1| hypothetical protein TcasGA2_TC008465 [Tribolium castaneum]
Length = 583
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 141/212 (66%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ P+A FLDMGDISEQ+AL+ +L+CK F WFM++VAYDV K+PELPPN+ +G
Sbjct: 402 YKEFFYTREPMARFLDMGDISEQLALKKRLNCKSFEWFMENVAYDVLDKYPELPPNIHFG 461
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +++ KCLD++G P+++ +CHG G++QL RLNA+GQLG GERC++AD Q
Sbjct: 462 ELRSVAVSKCLDTLGHGPPSLMAIQHCHGYGNNQLIRLNAKGQLGVGERCIEADAQ---- 517
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
G+KL FCR+ GTVDGPW + +T + H + K+C+
Sbjct: 518 -------------GIKLAFCRM-------------GTVDGPWLYDENTHTILHRVHKKCI 551
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
++HP+T+ L+++ CD NN+YQQW FKE+RP W
Sbjct: 552 ALHPQTSHLSLMPCDINNAYQQWIFKEIRPKW 583
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 34/45 (75%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
+G P+++ +CHG G++QL RLNA+GQLG GERC++AD Q I
Sbjct: 475 LGHGPPSLMAIQHCHGYGNNQLIRLNAKGQLGVGERCIEADAQGI 519
>gi|307212076|gb|EFN87959.1| N-acetylgalactosaminyltransferase 7 [Harpegnathos saltator]
Length = 563
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 134/204 (65%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA FLD GDISEQ+ +++ CK F W+M +VAYDV KFPELPPN+ WGE +N+ T+
Sbjct: 390 EPLARFLDHGDISEQLMFKERRKCKSFQWYMDNVAYDVLDKFPELPPNIHWGELQNVATK 449
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLD+MG P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q
Sbjct: 450 SCLDTMGHAPPSLMATSHCHGFGNNQLMRLNAKGQLGVGERCVEADGQ------------ 497
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
GVK FCRL GT DGPW++ T+ L H + K+C+++HP+T Q
Sbjct: 498 -----GVKYAFCRL-------------GTADGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 539
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L+++ CD NNSYQQW F++V P W
Sbjct: 540 LSLMPCDINNSYQQWIFRQVHPQW 563
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q +
Sbjct: 455 MGHAPPSLMATSHCHGFGNNQLMRLNAKGQLGVGERCVEADGQGV 499
>gi|157106440|ref|XP_001649323.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108879843|gb|EAT44068.1| AAEL004538-PA [Aedes aegypti]
Length = 596
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 102/212 (48%), Positives = 133/212 (62%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA FLDMGDISEQ+AL+++L CK F W+M +VAYDV K+P LP NLFWG
Sbjct: 415 YKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPALPANLFWG 474
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E KN GT+KC+D++GR PA+IG +CHG G +QL RLNA GQLG GERC++AD
Sbjct: 475 ELKNSGTEKCVDALGRQPPAMIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNM---- 530
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
G+KL FCR+ GTVDGPW++ +T L H K C+
Sbjct: 531 -------------GIKLAFCRM-------------GTVDGPWQYDENTSTLLHRTYKMCM 564
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
HP+T LA++ CD +N+YQ W+F + P W
Sbjct: 565 GYHPQTRALALMPCDVHNAYQSWKFHTITPRW 596
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 33/45 (73%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
+GR PA+IG +CHG G +QL RLNA GQLG GERC++AD I
Sbjct: 488 LGRQPPAMIGLQHCHGQGHNQLIRLNAAGQLGVGERCIEADNMGI 532
>gi|195039904|ref|XP_001990971.1| GH12336 [Drosophila grimshawi]
gi|193900729|gb|EDV99595.1| GH12336 [Drosophila grimshawi]
Length = 591
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 134/204 (65%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA +LDMGDISEQ+AL+ +L+CK F WFM ++AYDV KFP LP NL WGE +++ +
Sbjct: 418 EPLARYLDMGDISEQLALKKRLNCKSFQWFMDNIAYDVVDKFPALPANLHWGELRSVASD 477
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLDSMG PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q
Sbjct: 478 GCLDSMGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQ------------ 525
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+KL CRL GTVDGPW ++ T+ L H + K+C+++HP Q
Sbjct: 526 -----GIKLAVCRL-------------GTVDGPWSYNEHTKHLLHRVHKKCMALHPAGQQ 567
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L++ +CD N+ YQ+W FKE+RP W
Sbjct: 568 LSLGQCDVNDGYQKWLFKEIRPHW 591
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 30/45 (66%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG PA++G SYCHG G++QL RLNA GQLG GERCV+AD+Q I
Sbjct: 483 MGHQPPAIMGLSYCHGGGNNQLVRLNAVGQLGVGERCVEADRQGI 527
>gi|307169192|gb|EFN62008.1| N-acetylgalactosaminyltransferase 7 [Camponotus floridanus]
Length = 580
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 133/204 (65%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA LD GDISEQ+A +++ CK+F W+M +VAYDV KFPELPPN+ WGE +N+ T
Sbjct: 407 EPLARLLDHGDISEQLAFKERKKCKNFQWYMDNVAYDVLDKFPELPPNIHWGELRNVATG 466
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLD+MG P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q
Sbjct: 467 SCLDTMGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGIGERCVEADGQ------------ 514
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+K FCRL GT DGPW++ T+ L H + K+C+++HP+T Q
Sbjct: 515 -----GIKYAFCRL-------------GTADGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 556
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L+++ CD NN+YQQW F ++ P W
Sbjct: 557 LSLMLCDINNTYQQWNFHQIHPRW 580
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG P+++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q I
Sbjct: 472 MGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGIGERCVEADGQGI 516
>gi|156537099|ref|XP_001602659.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Nasonia
vitripennis]
Length = 583
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 99/204 (48%), Positives = 133/204 (65%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA LD GDI+EQ+ + + CK F WFM ++AYDV KFPELPPN+ WGE KN+ TQ
Sbjct: 410 EPLARLLDHGDITEQLEFKRRKGCKSFQWFMDNIAYDVLDKFPELPPNIHWGEMKNVATQ 469
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLD+MG P ++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q
Sbjct: 470 MCLDTMGHAPPNLMATSHCHGFGNNQLIRLNAKGQLGVGERCVEADGQ------------ 517
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
GVKL FCRL GTVDGPW++ ++ L H + K+C+++HP+T+
Sbjct: 518 -----GVKLAFCRL-------------GTVDGPWQYDEKSKTLLHRVHKKCMALHPQTHH 559
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L+++ CD NN+YQQW F ++ P+W
Sbjct: 560 LSLMPCDMNNAYQQWTFVQISPNW 583
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 35/45 (77%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG P ++ TS+CHG G++QL RLNA+GQLG GERCV+AD Q +
Sbjct: 475 MGHAPPNLMATSHCHGFGNNQLIRLNAKGQLGVGERCVEADGQGV 519
>gi|380013105|ref|XP_003690610.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
7-like [Apis florea]
Length = 581
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 135/212 (63%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA LD GDISEQ+ + + CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 400 YKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 459
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N+ T CLD+M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 460 ELRNVATGTCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ---- 515
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
GVK +FCRL GTVDGPW++ T+ L H + K+C+
Sbjct: 516 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 549
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
++HP+T QL+++ CD NN+YQQW F +V P W
Sbjct: 550 ALHPQTQQLSLMPCDINNTYQQWSFHQVHPQW 581
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 33/43 (76%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ 515
>gi|242016390|ref|XP_002428804.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212513501|gb|EEB16066.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 579
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 138/212 (65%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA +L+MGDIS+Q+A++++L CK F WFM+ VA D+ TKFPELPPN+ WG
Sbjct: 398 YKEFFYTREPLARYLNMGDISKQLAMKERLQCKSFQWFMEEVAPDLLTKFPELPPNVHWG 457
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +NL T KCLD++G PA + S CHG G++QL RLN++GQLG GERC+D KQ
Sbjct: 458 ELRNLATLKCLDTLGHGPPARMSVSECHGFGNNQLVRLNSQGQLGIGERCIDGGKQ---- 513
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
G++L+FCRL G+VDGPW++ R L H ++ CL
Sbjct: 514 -------------GIRLVFCRL-------------GSVDGPWQYDEKGRTLLHVKMQMCL 547
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
++HP+T+QL+++ CD NNS+ QW FKE+ P W
Sbjct: 548 ALHPQTSQLSLMPCDYNNSFHQWIFKEIHPKW 579
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 25/45 (55%), Positives = 32/45 (71%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
+G PA + S CHG G++QL RLN++GQLG GERC+D KQ I
Sbjct: 471 LGHGPPARMSVSECHGFGNNQLVRLNSQGQLGIGERCIDGGKQGI 515
>gi|350400167|ref|XP_003485756.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Bombus
impatiens]
Length = 582
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 135/212 (63%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA LD GDISEQ+ + + CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 401 YKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 460
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N+ T CLD+M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 461 ELRNIATGMCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADGQ---- 516
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
GVK +FCRL GTVDGPW++ T+ L H + K+C+
Sbjct: 517 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 550
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
++HP+T QL+++ CD NN+YQQW F ++ P W
Sbjct: 551 ALHPQTQQLSLMPCDINNTYQQWSFHQIHPKW 582
Score = 57.8 bits (138), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 33/43 (76%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 474 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADGQ 516
>gi|340718182|ref|XP_003397550.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Bombus
terrestris]
Length = 581
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 96/204 (47%), Positives = 132/204 (64%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA LD GDISEQ+ + + CK F W+M++VAYDV+ KFPELPPN+ WGE +N+ T
Sbjct: 408 EPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWGELRNIATG 467
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLD+M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 468 MCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADSQ------------ 515
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
GVK +FCRL GTVDGPW++ T+ L H + K+C+++HP+T Q
Sbjct: 516 -----GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 557
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L+++ CD NN+YQQW F ++ P W
Sbjct: 558 LSLMPCDVNNTYQQWSFHQIHPKW 581
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 33/43 (76%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADSQ 515
>gi|332023194|gb|EGI63450.1| N-acetylgalactosaminyltransferase 7 [Acromyrmex echinatior]
Length = 614
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 131/203 (64%), Gaps = 30/203 (14%)
Query: 73 PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
PLA LD GDISEQ+A + + CK F WF+ +VAYDV KFPELPPN+ WGE +N +
Sbjct: 442 PLARLLDHGDISEQLAFKKRKKCKSFQWFVDNVAYDVLDKFPELPPNIHWGELRNTASGS 501
Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
CLD+MG + P+++ TS+CHG G++QL RLNA+GQLG GERC+DAD Q
Sbjct: 502 CLDTMGHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIDADGQR------------ 549
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
+K FCRL GTVDGPW++ T+ L H + K+C+++HP+T QL
Sbjct: 550 -----IKYAFCRL-------------GTVDGPWQYDEKTKTLLHRVYKKCIALHPQTQQL 591
Query: 253 AMLRCDENNSYQQWRFKEVRPDW 275
+++ CD NN+YQQW F +V P W
Sbjct: 592 SLMPCDINNAYQQWSFHQVHPRW 614
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 37/45 (82%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG + P+++ TS+CHG G++QL RLNA+GQLG GERC+DAD Q I
Sbjct: 506 MGHSPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIDADGQRI 550
>gi|322798640|gb|EFZ20244.1| hypothetical protein SINV_10970 [Solenopsis invicta]
Length = 580
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 98/204 (48%), Positives = 132/204 (64%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
PLA LD GDISEQ+A +++ CK F W++ +VAYDV KFPELPPN+ WGE +N+ T
Sbjct: 407 EPLARLLDHGDISEQLAFKERKKCKSFQWYIDNVAYDVLDKFPELPPNIHWGELRNVATG 466
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLD+MG P+++ TS+CHG G++QL RLNA+GQLG GERC++AD Q
Sbjct: 467 SCLDTMGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIEADGQ------------ 514
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
GVK FCRL GT DGPW++ T+ L H + K+C+++HP+T Q
Sbjct: 515 -----GVKYAFCRL-------------GTADGPWQYDEKTKTLLHRVHKKCMALHPQTQQ 556
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L+++ CD NN+YQQW F +V P W
Sbjct: 557 LSLMPCDINNTYQQWSFHQVHPRW 580
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 26/45 (57%), Positives = 36/45 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG P+++ TS+CHG G++QL RLNA+GQLG GERC++AD Q +
Sbjct: 472 MGHAPPSLMATSHCHGFGNNQLIRLNAKGQLGVGERCIEADGQGV 516
>gi|328781461|ref|XP_395266.4| PREDICTED: n-acetylgalactosaminyltransferase 7 [Apis mellifera]
Length = 581
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 98/212 (46%), Positives = 134/212 (63%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA LD GDISEQ+ + + CK F W+M++VAYDV+ KFPELPPN+ WG
Sbjct: 400 YKEFFYTREPLAQLLDHGDISEQLEFKRRKRCKSFQWYMENVAYDVFDKFPELPPNIHWG 459
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N T CLD+M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 460 ELRNGATGTCLDTMSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ---- 515
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
GVK +FCRL GTVDGPW++ T+ L H + K+C+
Sbjct: 516 -------------GVKFVFCRL-------------GTVDGPWQYDEKTKTLLHRVHKKCM 549
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
++HP+T QL+++ CD NN+YQQW F +V P W
Sbjct: 550 ALHPQTQQLSLMPCDINNTYQQWSFHQVHPQW 581
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 33/43 (76%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
M + P+++ T+ CHG G++QL RLNA+GQLG GERC+ AD Q
Sbjct: 473 MSHSPPSLMATTDCHGFGNNQLIRLNAKGQLGVGERCISADNQ 515
>gi|158289457|ref|XP_311182.4| AGAP000656-PA [Anopheles gambiae str. PEST]
gi|157018524|gb|EAA06901.4| AGAP000656-PA [Anopheles gambiae str. PEST]
Length = 598
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 100/212 (47%), Positives = 132/212 (62%), Gaps = 30/212 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA FLDMGDISEQ+AL+++L CK F W+M +VAYDV K+P LP N+ WG
Sbjct: 417 YKEYFYTREPLARFLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPMLPANVKWG 476
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N+G +KC+D++GR PAVIG CHG G +QL RLN GQLG GERC++A E
Sbjct: 477 ELQNVGKEKCVDALGRQPPAVIGLQQCHGQGHNQLIRLNGAGQLGVGERCIEAYNSE--- 533
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
+KL FCRL GTVDGPW++ + L H K+C+
Sbjct: 534 --------------IKLAFCRL-------------GTVDGPWQYDEQSGTLLHRTHKKCM 566
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
P+T QLA++ CD NN+YQ W+FKE++P W
Sbjct: 567 GYQPQTRQLALMPCDINNAYQSWKFKEIQPHW 598
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 29/40 (72%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
+GR PAVIG CHG G +QL RLN GQLG GERC++A
Sbjct: 490 LGRQPPAVIGLQQCHGQGHNQLIRLNGAGQLGVGERCIEA 529
>gi|321473823|gb|EFX84789.1| hypothetical protein DAPPUDRAFT_209135 [Daphnia pulex]
Length = 521
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 96/210 (45%), Positives = 136/210 (64%), Gaps = 30/210 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ P A FLDMG+I++Q+ ++ +L+CK F+WFM+ VAYDV K+PELP NL WG
Sbjct: 341 YKEFFYTREPTARFLDMGNITQQLEMKKRLNCKSFAWFMEEVAYDVLDKYPELPANLHWG 400
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N ++CLD+MG P+++G S+CHG G++QLFRLN GQLG GERCV+AD Q
Sbjct: 401 ELRNTAARQCLDTMGHQPPSLMGISHCHGFGNNQLFRLNKAGQLGVGERCVNADSQ---- 456
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
GVKL+ CRL G+V+GPW + +++ L H +LKRCL
Sbjct: 457 -------------GVKLVVCRL-------------GSVEGPWEYDETSKTLLHKVLKRCL 490
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
+VHP+ N L +L CD N+Y +W F+E++P
Sbjct: 491 AVHPQNNHLTLLPCDTANNYHRWEFREIQP 520
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/45 (60%), Positives = 35/45 (77%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVI 45
MG P+++G S+CHG G++QLFRLN GQLG GERCV+AD Q +
Sbjct: 414 MGHQPPSLMGISHCHGFGNNQLFRLNKAGQLGVGERCVNADSQGV 458
>gi|443298648|gb|AGC81884.1| N-acetylgalactosaminyltransferase, partial [Bombyx mori]
Length = 499
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 125/204 (61%), Gaps = 30/204 (14%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQ 131
P+A FLDMGDISEQVAL+++L CK F WFM++VAYDVY KFP+LP N+ WG KN T
Sbjct: 326 EPMARFLDMGDISEQVALKERLKCKSFGWFMENVAYDVYDKFPKLPKNVHWGMVKNKATG 385
Query: 132 KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
CLD+MG+ APA IGTS CHG G+SQLFRLN GQLG GERCV+ D
Sbjct: 386 ACLDTMGKAAPAYIGTSSCHGMGNSQLFRLNEAGQLGVGERCVETDGD------------ 433
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
VK CRL GTVDGPW ++ QL H + CL++ P + Q
Sbjct: 434 -----NVKQAICRL-------------GTVDGPWSYNEERHQLVHRLHGHCLTLQPHSGQ 475
Query: 252 LAMLRCDENNSYQQWRFKEVRPDW 275
L + CD NN+YQQW K+ P+W
Sbjct: 476 LGLAPCDPNNTYQQWTVKQKTPNW 499
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 30/41 (73%), Positives = 33/41 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
MG+ APA IGTS CHG G+SQLFRLN GQLG GERCV+ D
Sbjct: 391 MGKAAPAYIGTSSCHGMGNSQLFRLNEAGQLGVGERCVETD 431
>gi|312383497|gb|EFR28562.1| hypothetical protein AND_03374 [Anopheles darlingi]
Length = 874
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 98/210 (46%), Positives = 131/210 (62%), Gaps = 30/210 (14%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA +LDMGDISEQ+AL+++L CK F W+M +VAYDV K+P LP NLFWG
Sbjct: 415 YKEYFYTREPLAQYLDMGDISEQLALKERLQCKSFQWYMDNVAYDVLDKYPMLPANLFWG 474
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N G +KC+D++GR PA+IG CHG G +QL RLNA GQLG GERC++A
Sbjct: 475 ELQNTGMEKCVDALGRQPPAIIGLQVCHGQGHNQLIRLNAAGQLGVGERCIEAYN----- 529
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
A +KL FCRL GTVDGPW++ T L H K+C+
Sbjct: 530 ------------ADIKLAFCRL-------------GTVDGPWQYDEHTGTLLHRTHKKCM 564
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
P+ QLA++ CD NN+YQ W+F++++P
Sbjct: 565 GYQPQNRQLALMPCDINNAYQSWKFQKIQP 594
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 30/40 (75%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
+GR PA+IG CHG G +QL RLNA GQLG GERC++A
Sbjct: 488 LGRQPPAIIGLQVCHGQGHNQLIRLNAAGQLGVGERCIEA 527
>gi|357602062|gb|EHJ63261.1| putative n-acetylgalactosaminyltransferase [Danaus plexippus]
Length = 499
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 122/203 (60%), Gaps = 30/203 (14%)
Query: 73 PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
P+A FLDMGDISEQVALRDKL+CK FSW+M++VAYDVY KFP+LP N+ WG KN
Sbjct: 327 PMARFLDMGDISEQVALRDKLNCKSFSWYMENVAYDVYDKFPKLPKNVHWGMVKNKAIGL 386
Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
CLD+MG+ AP+ IG CHG+G++QL+RLN GQLG GERC++AD
Sbjct: 387 CLDTMGKAAPSYIGIQSCHGAGNNQLYRLNEAGQLGVGERCLEADTD------------- 433
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
+K CRL GTVDGPWR+ L H + CL++ P + L
Sbjct: 434 ----SLKQTICRL-------------GTVDGPWRYDKERSHLIHRLHSYCLTLQPNSRTL 476
Query: 253 AMLRCDENNSYQQWRFKEVRPDW 275
+ CD NN+YQQW + P W
Sbjct: 477 GLAPCDPNNTYQQWTITQKNPKW 499
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 25/41 (60%), Positives = 33/41 (80%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
MG+ AP+ IG CHG+G++QL+RLN GQLG GERC++AD
Sbjct: 391 MGKAAPSYIGIQSCHGAGNNQLYRLNEAGQLGVGERCLEAD 431
>gi|427797631|gb|JAA64267.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 85/213 (39%), Positives = 123/213 (57%), Gaps = 30/213 (14%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ PLA + D GD+ +Q+ALR+KL CK F WFMK+VAYDV FP LP N
Sbjct: 457 WMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLLPRN 516
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L+WGE ++ T +CLD+MG P+ + CHG+G +Q+FRLNAEGQLG GERC+DA
Sbjct: 517 LYWGEIRHDATDQCLDAMGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSH 576
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
+ +++C L GTVDGPW + T+ L H
Sbjct: 577 -----------------SMDVVYCSL-------------GTVDGPWEYSVDTKHLYHKTH 606
Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
K+CL+ + ++ + +CD++N Q W F+E++
Sbjct: 607 KKCLTYNKRNYKVHLEKCDDSNLSQHWTFREIQ 639
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 33/52 (63%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSL 52
MG P+ + CHG+G +Q+FRLNAEGQLG GERC+DA + SL
Sbjct: 534 MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSHSMDVVYCSL 585
>gi|427797629|gb|JAA64266.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 85/213 (39%), Positives = 122/213 (57%), Gaps = 30/213 (14%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ PLA + D GD+ +Q+ALR+KL CK F WFMK+VAYDV FP LP N
Sbjct: 457 WMDEYKEYFYTREPLATYYDAGDLKQQLALREKLKCKSFRWFMKNVAYDVLKNFPLLPRN 516
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L+WGE ++ T +CLD+MG P+ + CHG+G +Q+FRLNAEGQLG GERC+DA
Sbjct: 517 LYWGEIRHDATDQCLDAMGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSH 576
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
+ +++C L GTVDGPW + T+ L H
Sbjct: 577 -----------------SMDVVYCSL-------------GTVDGPWEYSVDTKHLYHKTH 606
Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
K+CL+ + ++ + +CD+ N Q W F+E++
Sbjct: 607 KKCLTYNKRNYKVHLEKCDDGNLSQHWTFREIQ 639
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 33/52 (63%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSL 52
MG P+ + CHG+G +Q+FRLNAEGQLG GERC+DA + SL
Sbjct: 534 MGAHPPSTAALTACHGTGGNQVFRLNAEGQLGLGERCMDASSHSMDVVYCSL 585
>gi|449664489|ref|XP_002168298.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Hydra
magnipapillata]
Length = 599
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 85/212 (40%), Positives = 121/212 (57%), Gaps = 24/212 (11%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+ PLA F+D GD+SEQ+AL+ +++CK F+WFM+ +AYDV K+PE PPN WG
Sbjct: 403 YKEFFYTREPLARFIDTGDVSEQMALKKRMNCKSFTWFMEEIAYDVLKKYPEPPPNAHWG 462
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E +N+ T C+D++ R+ P IG S CH SG +QL+RLN GQL GE CV +K +
Sbjct: 463 EVRNIATNLCIDTLNRSPPYRIGLSGCHKSGGNQLWRLNTLGQLASGEWCVRYEKASNEI 522
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
E +L + +C SGT GPW+F S+ H L CL
Sbjct: 523 EN-ELSMEWC-----------------------PSGTPSGPWKFDASSGFFTHTGLNMCL 558
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEVRPDW 275
V+ T +L M RCD + + +W+ +E+RP W
Sbjct: 559 IVNSRTLKLNMARCDNSKEHHKWKIEEIRPHW 590
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 27/42 (64%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 42
+ R+ P IG S CH SG +QL+RLN GQL GE CV +K
Sbjct: 476 LNRSPPYRIGLSGCHKSGGNQLWRLNTLGQLASGEWCVRYEK 517
>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
Length = 640
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 124/235 (52%), Gaps = 36/235 (15%)
Query: 42 KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
K VIST +R +W+ +++ + P A GDIS Q+ LR +LHCK F W+
Sbjct: 440 KPVISTNM----VRVIKTWMDEYEKYYYIREPSAKHRSPGDISAQLELRKRLHCKSFKWY 495
Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
M+ VAYDV +P LP N WGEAKNL T KC+D+MGR P ++G + CHG G +QL RL
Sbjct: 496 MEKVAYDVVYSYPFLPENHVWGEAKNLQTSKCIDTMGRAIPGIVGATPCHGYGGNQLIRL 555
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
N +GQL GE C+ LG L +C + GTV
Sbjct: 556 NKKGQLTQGEWCMTP----------------------------LGNQLQTGHC--AKGTV 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAML--RCDENNSYQQWRFKEVRPD 274
DGP+++ ++ Q++ + CL+ + + +CD +NS Q+W+++E+ D
Sbjct: 586 DGPFQYDEASEQIKMTLSDTCLTADSSMSSSTLSLKKCDPSNSLQKWKWREIFSD 640
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 38
MGR P ++G + CHG G +QL RLN +GQL GE C+
Sbjct: 531 MGRAIPGIVGATPCHGYGGNQLIRLNKKGQLTQGEWCM 568
>gi|268555252|ref|XP_002635614.1| C. briggsae CBR-GLY-7 protein [Caenorhabditis briggsae]
Length = 601
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 116/217 (53%), Gaps = 31/217 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R +W+ + + P A ++ GDIS Q+ALRDKL CK F W+M++VAYDV +
Sbjct: 411 MRVVKTWMDDYSKYYLTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLQSY 470
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P LPPN WGEA+N T KCLD MG P +G S CHG G +QL RLN +GQ+ GE C
Sbjct: 471 PLLPPNDVWGEARNPATGKCLDRMG-GIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWC 529
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ A+ +++ CV GTV+G W + +T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGTVNGNWIYDRNTQQ 559
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
++H + C++V + + + C E+N Q++ +KE
Sbjct: 560 IQHNAKRMCVTVSESGSDVILETCTEDNQRQKFVWKE 596
Score = 44.7 bits (104), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 6 PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
P +G S CHG G +QL RLN +GQ+ GE C+ A+
Sbjct: 498 PGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTAN 533
>gi|17561826|ref|NP_503512.1| Protein GLY-7 [Caenorhabditis elegans]
gi|51315810|sp|O61397.1|GALT7_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 7;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7
gi|3047203|gb|AAC13677.1| GLY7 [Caenorhabditis elegans]
gi|373219860|emb|CCD70652.1| Protein GLY-7 [Caenorhabditis elegans]
Length = 601
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 77/217 (35%), Positives = 117/217 (53%), Gaps = 31/217 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R +W+ + + P A ++ GDIS Q+ALRDKL CK F W+M++VAYDV +
Sbjct: 411 MRVVKTWMDDYSKYYLTREPQATNVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSY 470
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P LPPN WGEA+N T KCLD MG P +G + CHG G +QL RLN +GQ+ GE C
Sbjct: 471 PMLPPNDVWGEARNPATGKCLDRMG-GIPGPMGATGCHGYGGNQLIRLNVQGQMAQGEWC 529
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ A+ +++ CV GTV+G W + T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGTVNGFWSYDRKTKQ 559
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ H+ ++C++V +++ + C E+N Q++ +KE
Sbjct: 560 IIHSQKRQCITVSESGSEVTLQTCTEDNERQKFVWKE 596
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 17/36 (47%), Positives = 24/36 (66%)
Query: 6 PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
P +G + CHG G +QL RLN +GQ+ GE C+ A+
Sbjct: 498 PGPMGATGCHGYGGNQLIRLNVQGQMAQGEWCLTAN 533
>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
Length = 562
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/235 (35%), Positives = 119/235 (50%), Gaps = 38/235 (16%)
Query: 42 KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
K VIST LR +W+ +++ + P A GDIS Q+ LR++L CK F W+
Sbjct: 364 KPVISTNM----LRVIKTWMDEYEKYYYIREPSAKHRLPGDISSQLKLRERLKCKSFEWY 419
Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
M+ VAYDV +P P N WGEAKN T KC+D++G+T P ++G CHG G +QL RL
Sbjct: 420 MEKVAYDVIVSYPLPPENHVWGEAKNHATGKCIDTIGQTIPGIVGAMPCHGYGGNQLIRL 479
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
N EGQL GE C+ + G NL + GTV
Sbjct: 480 NKEGQLTQGEWCITP------------------VGG------------NLVTKYCVKGTV 509
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
DGP+ + + Q+ I CL+ + L+M +CD N +Q+W+++E+ D
Sbjct: 510 DGPFAYDEKSEQI--MIDDLCLTAAKSMSSPTLSMEKCDSGNQWQKWKWQEIYID 562
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 38
+G+T P ++G CHG G +QL RLN EGQL GE C+
Sbjct: 455 IGQTIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCI 492
>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
Length = 560
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/235 (35%), Positives = 119/235 (50%), Gaps = 38/235 (16%)
Query: 42 KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
K VIST LR +W+ +++ + P A GDIS Q+ LR++L CK F W+
Sbjct: 362 KPVISTNM----LRVIKTWMDEYEKYYYIREPSAKHRLPGDISSQLKLRERLKCKSFEWY 417
Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
M+ VAYDV +P P N WGEAKN T KC+D++G+T P ++G CHG G +QL RL
Sbjct: 418 MEKVAYDVIVSYPLPPENHVWGEAKNHATGKCIDTIGQTIPGIVGAMPCHGYGGNQLIRL 477
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
N EGQL GE C+ + G NL + GTV
Sbjct: 478 NKEGQLTQGEWCITP------------------VGG------------NLVTKYCVKGTV 507
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
DGP+ + + Q+ I CL+ + L+M +CD N +Q+W+++E+ D
Sbjct: 508 DGPFAYDEKSEQI--MIDDLCLTAAKSMSSPTLSMEKCDSGNQWQKWKWQEIYID 560
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 20/38 (52%), Positives = 26/38 (68%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 38
+G+T P ++G CHG G +QL RLN EGQL GE C+
Sbjct: 453 IGQTIPGIVGAMPCHGYGGNQLIRLNKEGQLTQGEWCI 490
>gi|402586829|gb|EJW80766.1| glycosyltransferase [Wuchereria bancrofti]
Length = 409
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/235 (35%), Positives = 118/235 (50%), Gaps = 38/235 (16%)
Query: 42 KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
K VIST LR +W+ + + + P A G+IS Q+ LR+ L CK F W+
Sbjct: 211 KPVISTNM----LRVIKTWMDEYDKYYYIREPSAKHRLPGNISSQLKLRESLKCKSFKWY 266
Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
M+ VAYDV +P P N WGEAKN T KC+D+MGR P ++G + CHG G +QL RL
Sbjct: 267 MEKVAYDVIVSYPFPPENHVWGEAKNHATGKCIDTMGRPVPGIVGATPCHGYGGNQLIRL 326
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
N +GQL GE C+ A + G NL GTV
Sbjct: 327 NMKGQLAQGEWCITA------------------VHG------------NLITNHCIKGTV 356
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
DGP+ ++ + Q+ I CL+ + L+M +CD N +Q+W+++E+ D
Sbjct: 357 DGPFTYNEESEQI--IIRNLCLTAAKSISSSVLSMEKCDSENQWQKWKWREIYID 409
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR P ++G + CHG G +QL RLN +GQL GE C+ A
Sbjct: 302 MGRPVPGIVGATPCHGYGGNQLIRLNMKGQLAQGEWCITA 341
>gi|308506779|ref|XP_003115572.1| CRE-GLY-7 protein [Caenorhabditis remanei]
gi|308256107|gb|EFP00060.1| CRE-GLY-7 protein [Caenorhabditis remanei]
Length = 601
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/217 (34%), Positives = 116/217 (53%), Gaps = 31/217 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R +W+ + + P A ++ GDIS Q+ALRDKL CK F W+M++VAYDV +
Sbjct: 411 MRVVKTWMDDYSKYYLTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSY 470
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P LPPN WGEA+N T KCLD MG P +G S CHG G +QL RLN +GQ+ GE C
Sbjct: 471 PLLPPNDVWGEARNPATGKCLDRMG-GIPGPLGASGCHGYGGNQLIRLNVQGQMAQGEWC 529
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ A+ +++ CV G+V G + + +T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGSVSGNFVYDRNTQQ 559
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
++H + C+++ + +++ C E+N Q++ +KE
Sbjct: 560 IQHNTKRMCITISEAGSDVSLETCTEDNQRQKFVWKE 596
Score = 44.3 bits (103), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 18/36 (50%), Positives = 24/36 (66%)
Query: 6 PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
P +G S CHG G +QL RLN +GQ+ GE C+ A+
Sbjct: 498 PGPLGASGCHGYGGNQLIRLNVQGQMAQGEWCLTAN 533
>gi|341881851|gb|EGT37786.1| hypothetical protein CAEBREN_30257 [Caenorhabditis brenneri]
gi|341887866|gb|EGT43801.1| CBN-GLY-7 protein [Caenorhabditis brenneri]
Length = 601
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 77/217 (35%), Positives = 115/217 (52%), Gaps = 31/217 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R +W+ +++ P A ++ GDIS Q+ALRDKL CK F W+M++VAYDV +
Sbjct: 411 MRVVKTWMDDYEKYYLTREPQAAHVNPGDISAQLALRDKLQCKSFKWYMENVAYDVLKSY 470
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P LPPN WG A+N T KCLD MG P +G S CHG G +QL RLN +GQL GE C
Sbjct: 471 PLLPPNDVWGGAQNPATGKCLDRMG-GIPGPLGASGCHGYGGNQLLRLNVQGQLAQGEWC 529
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ A+ +++ CV G+V+G W + +T+Q
Sbjct: 530 LTANG-------IRIQANHCV-----------------------KGSVNGNWVYDRNTQQ 559
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ H + C++V + + + C E+N Q++ +KE
Sbjct: 560 ILHNQKRMCITVAESGSDVILDTCTEDNQRQKFVWKE 596
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 24/36 (66%)
Query: 6 PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 41
P +G S CHG G +QL RLN +GQL GE C+ A+
Sbjct: 498 PGPLGASGCHGYGGNQLLRLNVQGQLAQGEWCLTAN 533
>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 645
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 83/235 (35%), Positives = 117/235 (49%), Gaps = 38/235 (16%)
Query: 42 KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
K VIST LR +W+ + + + P A G+IS Q+ LR L CK F W+
Sbjct: 447 KPVISTNM----LRVIKTWMDEYDKYYYIREPSARHRLPGNISSQLKLRKSLKCKSFKWY 502
Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL 161
M+ VAYDV +P P N WGEAKN T KC+D+MGR P ++G + CHG G +QL RL
Sbjct: 503 MEKVAYDVVVSYPFPPENHVWGEAKNHATGKCIDTMGRPVPGIVGATPCHGYGGNQLIRL 562
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
N +GQL GE C+ A + G NL GTV
Sbjct: 563 NRKGQLAQGEWCITA------------------VHG------------NLITNHCIKGTV 592
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKEVRPD 274
DGP+ ++ + Q+ I CL+ + L+M +CD N +Q+W+++E+ D
Sbjct: 593 DGPFTYNEESEQI--IIRNLCLTAAKSISSSVLSMEKCDSENQWQKWKWREIYID 645
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 21/40 (52%), Positives = 27/40 (67%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR P ++G + CHG G +QL RLN +GQL GE C+ A
Sbjct: 538 MGRPVPGIVGATPCHGYGGNQLIRLNRKGQLAQGEWCITA 577
>gi|241651003|ref|XP_002411252.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215503882|gb|EEC13376.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 478
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/168 (42%), Positives = 93/168 (55%), Gaps = 30/168 (17%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P+A D GD+S Q+ LR L CK F WFMK+V+YDV FP LP N
Sbjct: 340 WMDEYKEYFYTREPMARHYDPGDLSGQLELRQSLGCKGFDWFMKNVSYDVLKNFPLLPRN 399
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
+ WGE + + T +CLD+M P+ + S CHG+G +Q+FRLNAEGQLG GERCVDA
Sbjct: 400 IHWGEIRTMVTGQCLDTMNAHPPSTVSVSSCHGTGGNQIFRLNAEGQLGVGERCVDASSH 459
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
++L+FC L GTVDGPW +
Sbjct: 460 S-----------------MQLVFCSL-------------GTVDGPWEY 477
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 25/40 (62%), Positives = 30/40 (75%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
M P+ + S CHG+G +Q+FRLNAEGQLG GERCVDA
Sbjct: 417 MNAHPPSTVSVSSCHGTGGNQIFRLNAEGQLGVGERCVDA 456
>gi|391336074|ref|XP_003742408.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Metaseiulus
occidentalis]
Length = 593
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/214 (36%), Positives = 116/214 (54%), Gaps = 35/214 (16%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W +K+ P+A D G++++Q+ LR +L C+ FSWFMK +AYDV FP LP N
Sbjct: 412 WFDEYKEYFYTREPMARDYDAGNLTQQLELRKRLQCRPFSWFMKEIAYDVLKHFPYLPRN 471
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
WG K + CLDSM P +G S CHG G +QLFRLN EGQLG GERCVDA+K+
Sbjct: 472 NRWGHIKPVSEDGCLDSMRAHPPTNVGVSGCHGGGGNQLFRLNEEGQLGVGERCVDANKE 531
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI- 238
L +C+L GTV+GPW + +T+Q++H
Sbjct: 532 NMF-----------------LTYCKL-------------GTVNGPWTYDPNTKQMKHTRD 561
Query: 239 -LKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ RCL + + ++L + +C + Q++ ++E+
Sbjct: 562 GVIRCLQI--DRDRLFLAKCSAIDD-QKYDWEEI 592
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 19/24 (79%), Positives = 22/24 (91%)
Query: 20 SQLFRLNAEGQLGHGERCVDADKQ 43
+QLFRLN EGQLG GERCVDA+K+
Sbjct: 508 NQLFRLNEEGQLGVGERCVDANKE 531
>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 606
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/209 (30%), Positives = 92/209 (44%), Gaps = 32/209 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W +K+ P ++ + G+I +Q+A R+K HCK F WFMK +A D+ K+P N
Sbjct: 422 WFDDYKEYFYRSKPESLLVSTGNIEKQLAFREKFHCKSFGWFMKEIAPDIIEKYPLPHAN 481
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
+WGE + C+DSMG +G SYCHG+G +QLFR+ GQL ++C +
Sbjct: 482 KYWGEIRTKKGSLCVDSMGSKDGGRVGMSYCHGAGGNQLFRVTENGQLRIHDQCAYDHYK 541
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
E V+L C G G W F +T L HA
Sbjct: 542 E-----------------VRLRRC---------------GGSGGGWSFDEATGLLHHAGT 569
Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+CL N++ + +C Q W
Sbjct: 570 SQCLEHKAGVNEIILTKCASIKPSQLWEI 598
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV-DADKQV 44
MG +G SYCHG+G +QLFR+ GQL ++C D K+V
Sbjct: 499 MGSKDGGRVGMSYCHGAGGNQLFRVTENGQLRIHDQCAYDHYKEV 543
>gi|395542397|ref|XP_003773119.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
[Sarcophilus harrisii]
Length = 797
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 95/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 606 IRVVEVWWDGYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 665
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG + CH G +QLFR+N QL ++C
Sbjct: 666 PLPPKNIEWGEIRGTETAHCIDSMGHANGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 725
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ + + K+ +T C L N FN W++ + +
Sbjct: 726 LTKEP-----DGTKIMITHCSL-----------------NEFNE-------WQYFKNLHR 756
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL P +Q+ + CD N S Q+W +
Sbjct: 757 FTHITSGKCLDRSPVLHQVFLSECDSNKSTQKWEMNNI 794
>gi|332030162|gb|EGI69956.1| N-acetylgalactosaminyltransferase 6 [Acromyrmex echinatior]
Length = 603
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 105/220 (47%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ + + P LD GD+SEQ ALR KLHCK F+WFMK++A+D+ +P + P+
Sbjct: 394 WMDEYAEYLYKRRPHLRTLDPGDLSEQKALRKKLHCKPFNWFMKNIAFDLVEVYPPIEPD 453
Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER- 172
F +GE +N+G T+ CLDS R ++ C Q FRL + +R
Sbjct: 454 DFAYGEIRNMGVTELCLDSKKRKRDELVVVDTCMKDDQKVSGEQEFRLTWHKDIRPKDRT 513
Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+D K E+ K PV NL+ C G + WR+
Sbjct: 514 ECLDVSKGEE-----KAPV-------------------NLYPCHGKQG--NQLWRYDVEK 547
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ L+H RCL P + ++ + CD+++ Q+WR ++V
Sbjct: 548 QWLQHGYSSRCLDTDPSSKRVFVTICDKSSPTQKWRIEQV 587
>gi|395840004|ref|XP_003792860.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 2 [Otolemur
garnettii]
Length = 657
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 95/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNIDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L G K W++ + +
Sbjct: 586 L-----TKGPDGSKIMITHCSLNGFK------------------------EWQYFKNLYR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|260789880|ref|XP_002589972.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
gi|229275159|gb|EEN45983.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
Length = 522
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 97/218 (44%), Gaps = 30/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K+ P GDISE VA R++ HCK F WFM+ +AYD+ F
Sbjct: 332 MRVVEVWWDEYKEYFYASRPEIRNHPYGDISELVAFRERNHCKSFKWFMEEIAYDIVEHF 391
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N WGE + + + CLDSMG +G CHG G +Q+FRLN ++ ++C
Sbjct: 392 PLPPKNKIWGEIRGIDSGLCLDSMGHKDGGEMGIGGCHGMGGNQMFRLNEANEMAQYDQC 451
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ +GT + L +C G W F +
Sbjct: 452 IHV----------------------------VGTQVKLDHCDTEHTRYRG-WEFDEDKKM 482
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H I K CL +Q+ M CDE++ Q+W F +V
Sbjct: 483 LRHKIGK-CLDRSEILHQVFMSDCDESSKTQKWEFNQV 519
>gi|395840002|ref|XP_003792859.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 1 [Otolemur
garnettii]
Length = 657
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 95/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIPSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNIDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L G K W++ + +
Sbjct: 586 L-----TKGPDGSKIMITHCSLNGFK------------------------EWQYFKNLYR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|354484375|ref|XP_003504364.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cricetulus
griseus]
Length = 784
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 593 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 652
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 653 PLPPKNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHKMGGNQLFRINEVNQLMQYDQC 712
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 713 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 743
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD N Q+W +
Sbjct: 744 FTHIPSGKCLDRSDVLHQVFISNCDSNKMTQKWEMNNI 781
>gi|403295730|ref|XP_003938783.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Saimiri boliviensis
boliviensis]
Length = 659
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 95/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 468 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 527
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 528 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 587
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 588 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 618
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H + +CL +Q+ + CD + + Q+W +
Sbjct: 619 FTHILSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 656
>gi|344235654|gb|EGV91757.1| N-acetylgalactosaminyltransferase 7 [Cricetulus griseus]
Length = 607
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 416 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 475
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 476 PLPPKNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHKMGGNQLFRINEVNQLMQYDQC 535
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 536 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 566
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD N Q+W +
Sbjct: 567 FTHIPSGKCLDRSDVLHQVFISNCDSNKMTQKWEMNNI 604
>gi|345790686|ref|XP_543898.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Canis lupus
familiaris]
Length = 721
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 530 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 589
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 590 PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 649
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 650 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKSLHR 680
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 681 FTHIPSGKCLDRSDVLHQVFISNCDSSKMTQKWEMNNI 718
>gi|32425405|gb|AAH35303.1| GALNT7 protein, partial [Homo sapiens]
Length = 495
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 304 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 363
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 364 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 423
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 424 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 454
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 455 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 492
>gi|13938655|gb|AAH07484.1| Galnt7 protein, partial [Mus musculus]
Length = 362
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 171 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 230
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 231 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 290
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 291 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 321
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 322 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 359
>gi|426346015|ref|XP_004040686.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gorilla gorilla
gorilla]
Length = 650
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 459 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 518
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 519 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 578
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 579 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 609
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 610 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 647
>gi|348566877|ref|XP_003469228.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cavia
porcellus]
Length = 637
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 446 VRVVEVWWDEYKDYFYASRPESKALLYGDISELKKFREDHNCKSFKWFMEEIAYDITLHY 505
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMGRT ++ CH G +QLFR+N QL ++C
Sbjct: 506 PLPPKNVDWGEIRGFETAYCIDSMGRTNGGIVELGPCHRMGGNQLFRINEANQLMQYDQC 565
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 566 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 596
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD N Q+W +
Sbjct: 597 FTHIPSGKCLDRSEVLHQVFISDCDSNKMTQKWEMNNI 634
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MGRT ++ CH G +QLFR+N QL ++C+ + L + W
Sbjct: 529 MGRTNGGIVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 588
Query: 61 LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
+ FK + R P LD ++ QV + D
Sbjct: 589 -QYFKNLHRFTHIPSGKCLDRSEVLHQVFISD 619
>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
Length = 540
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 105/220 (47%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ + + P LD GD+SEQ ALR KLHCK F+WFMK++A+D+ +P + P+
Sbjct: 333 WMDEYAEYIYKRRPHLRTLDPGDLSEQKALRTKLHCKSFNWFMKNIAFDLVEVYPPIEPD 392
Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER- 172
F +GE +N+G T+ CLDS R V+ C Q FRL + +R
Sbjct: 393 DFAFGEIRNMGATELCLDSKKRKRDEVVVMDICMKDDPKMSGEQEFRLTWHKDIRPKDRT 452
Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+D + E+ K PV+ L+ C G + WR+
Sbjct: 453 DCLDVSRGEE-----KAPVS-------------------LYPCHGKQG--NQLWRYDVEK 486
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ L+H RCL P + ++ + CD+++ Q+WR ++V
Sbjct: 487 QWLQHGYSSRCLDTDPGSKRVFVTACDKSSLTQKWRIEQV 526
>gi|355687724|gb|EHH26308.1| hypothetical protein EGK_16238, partial [Macaca mulatta]
Length = 615
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 484 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 544 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 574
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 575 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 612
>gi|29612635|gb|AAH49907.1| Galnt7 protein, partial [Mus musculus]
Length = 365
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 174 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 233
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 234 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 293
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 294 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 324
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 325 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 362
>gi|355778494|gb|EHH63530.1| hypothetical protein EGM_16517, partial [Macaca fascicularis]
Length = 615
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 484 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 544 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 574
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 575 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 612
>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
[Saccoglossus kowalevskii]
Length = 634
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 68/125 (54%), Gaps = 2/125 (1%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAYDVYTK 112
LR W+ +K+ P GDI+EQV R + HC DF WFM VAYD+ +
Sbjct: 450 LRVAEVWMDEYKEYFYRMKPELRGKPYGDITEQVQFRQE-HCPHDFKWFMDEVAYDITER 508
Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
FP + N+ WGE + +G+ KC+DSMGR+ + CHG G SQL RLN G+ E
Sbjct: 509 FPLISKNIGWGEVRGVGSSKCVDSMGRSPSGKVALYGCHGYGGSQLLRLNEGGEFRVNEE 568
Query: 173 CVDAD 177
C+ D
Sbjct: 569 CLYTD 573
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 2/60 (3%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MGR+ + CHG G SQL RLN G+ E C+ D + ++ RK+ W
Sbjct: 533 MGRSPSGKVALYGCHGYGGSQLLRLNEGGEFRVNEECLYTDGSTVKLERCV--PRKQYGW 590
>gi|74139820|dbj|BAE31754.1| unnamed protein product [Mus musculus]
gi|74191634|dbj|BAE30388.1| unnamed protein product [Mus musculus]
gi|74198878|dbj|BAE30662.1| unnamed protein product [Mus musculus]
Length = 546
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 355 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 414
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 415 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 474
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 475 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 505
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 506 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 543
>gi|402870854|ref|XP_003899414.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Papio anubis]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHK 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|332820787|ref|XP_003310650.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan troglodytes]
gi|410227832|gb|JAA11135.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410262380|gb|JAA19156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410297750|gb|JAA27475.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410332293|gb|JAA35093.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|148696676|gb|EDL28623.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_a [Mus
musculus]
Length = 615
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 484 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 544 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 574
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 575 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 612
>gi|148696677|gb|EDL28624.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_b [Mus
musculus]
Length = 615
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 484 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 544 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 574
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 575 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 612
>gi|74212590|dbj|BAE31034.1| unnamed protein product [Mus musculus]
Length = 298
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 107 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 166
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 167 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 226
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 227 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 257
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 258 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 295
>gi|269784707|ref|NP_653332.3| N-acetylgalactosaminyltransferase 7 isoform 1 [Mus musculus]
gi|51315950|sp|Q80VA0.2|GALT7_MOUSE RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|13650041|gb|AAK37549.1|AF349573_1 UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 [Mus musculus]
gi|30851602|gb|AAH52461.1| Galnt7 protein [Mus musculus]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 617 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 654
>gi|157502212|ref|NP_059119.2| N-acetylgalactosaminyltransferase 7 [Homo sapiens]
gi|51315961|sp|Q86SF2.1|GALT7_HUMAN RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|28279289|gb|AAH46129.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|28704077|gb|AAH47468.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|119625166|gb|EAX04761.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|193786832|dbj|BAG52155.1| unnamed protein product [Homo sapiens]
gi|325464563|gb|ADZ16052.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7)
[synthetic construct]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|383412007|gb|AFH29217.1| N-acetylgalactosaminyltransferase 7 [Macaca mulatta]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|441620192|ref|XP_003258074.2| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
[Nomascus leucogenys]
Length = 636
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 445 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 504
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 505 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 564
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 565 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 595
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 596 FTHIPSGKCLDRSEVLHQVFISNCDSGKTTQKWEMNNI 633
>gi|269784709|ref|NP_001161453.1| N-acetylgalactosaminyltransferase 7 isoform 2 [Mus musculus]
gi|26331462|dbj|BAC29461.1| unnamed protein product [Mus musculus]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 617 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 654
>gi|26329091|dbj|BAC28284.1| unnamed protein product [Mus musculus]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 617 FTHITSGKCLDRSEVLHQVFISTCDSSKMTQKWEMNNI 654
>gi|197101721|ref|NP_001124628.1| N-acetylgalactosaminyltransferase 7 [Pongo abelii]
gi|75042656|sp|Q5RFJ6.1|GALT7_PONAB RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|55725190|emb|CAH89461.1| hypothetical protein [Pongo abelii]
Length = 657
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|301753757|ref|XP_002912714.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Ailuropoda
melanoleuca]
gi|281338294|gb|EFB13878.1| hypothetical protein PANDA_000463 [Ailuropoda melanoleuca]
Length = 657
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPFGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 617 FTHIPSGKCLDRSDVLHQVFISDCDSSKMTQKWEMNNI 654
Score = 38.1 bits (87), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG+T + CH G +QLFR+N QL ++C+ + L + W
Sbjct: 549 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 608
Query: 61 LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
+ FK + R P LD D+ QV + D
Sbjct: 609 -QYFKSLHRFTHIPSGKCLDRSDVLHQVFISD 639
>gi|313220437|emb|CBY31290.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 103/223 (46%), Gaps = 38/223 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAYDVYTKF 113
R +W + + + P A LD GDI QV R K HC DF+WFMK + Y V ++
Sbjct: 425 RVIETWWDDWSKFFYVARPEAKTLDFGDIGPQVEFR-KNHCPYDFNWFMKEIGYGVALEY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N FWGE + +CLDSMG T + T YCH G +QLFRLN QL ++C
Sbjct: 484 PTPPLNKFWGEIRARQGNQCLDSMGHTNGGSVETYYCHKQGGNQLFRLNEGLQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ G+ L L +C S+G G W + T
Sbjct: 544 L----------------------------FMSGSYLKLSHC--SAGDKAG-WDYDPKTGM 572
Query: 234 LEHAILKR---CLSVHPET--NQLAMLRCDENNSYQQWRFKEV 271
+ + K+ C+S+ +T +++ + CD N+ Q++ F E+
Sbjct: 573 ISYGEKKKQRFCISLISKTSISKVKFVECDVNDDNQKFDFNEI 615
>gi|397505872|ref|XP_003823466.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan paniscus]
Length = 657
Score = 100 bits (250), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKVMITHCDLNECK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|198419403|ref|XP_002128971.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
intestinalis]
Length = 631
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 95/220 (43%), Gaps = 32/220 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R +W +K P + + GDIS QV R + +CK F WFM+++AYD+ +
Sbjct: 441 IRVVETWWDEYKDYFYASRPETLNMPYGDISAQVKYRQEHNCKSFKWFMENIAYDIVEHY 500
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG-RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
P P N WGE + + T +CLDSMG + + + YCH G +QLFRL+ QL ++
Sbjct: 501 PLPPKNKEWGEVRGVDTNQCLDSMGHKNGDGALESGYCHRMGGNQLFRLSEANQLAQYDQ 560
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
CV + V+ + +C T W +
Sbjct: 561 CVTIQR----------------------------GVVTVMHC---DTTQFREWDHDLINK 589
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
+ H +CL +++ + CD N Q+W +++
Sbjct: 590 SVRHTTSGKCLERGELVHKVFLALCDANKPTQRWEINQIQ 629
>gi|355689625|gb|AER98895.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 [Mustela putorius
furo]
Length = 230
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 39 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 98
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 99 PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRLGGNQLFRINEANQLMQYDQC 158
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ S +
Sbjct: 159 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKSLYR 189
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 190 FTHIPSGKCLDRSDVLHQVFISDCDSSKMTQKWEMNNI 227
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 24/91 (26%), Positives = 35/91 (38%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG+T + CH G +QLFR+N QL ++C+ + L + W
Sbjct: 122 MGKTNGGFVELGPCHRLGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 181
Query: 61 LKVFKQIRSLGSPLAMFLDMGDISEQVALRD 91
R P LD D+ QV + D
Sbjct: 182 QYFKSLYRFTHIPSGKCLDRSDVLHQVFISD 212
>gi|296195170|ref|XP_002745262.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Callithrix jacchus]
Length = 657
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKIMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKMTQKWEMNNI 654
>gi|344288243|ref|XP_003415860.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Loxodonta
africana]
Length = 657
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVEWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + +CD + Q+W +
Sbjct: 617 FTHTPSGKCLDRSEVLHQVFISKCDSSKMTQKWEMNNI 654
>gi|338722468|ref|XP_001915592.2| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Equus
caballus]
Length = 621
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 430 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 489
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMGRT + CH G +QLFR+N QL ++C
Sbjct: 490 PLPPKNVDWGEIRGFETVYCIDSMGRTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 549
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 550 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 580
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 581 FTHIPSGKCLDRSEVLHQVFISNCDSSKMTQKWEINNI 618
>gi|410956565|ref|XP_003984911.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Felis catus]
Length = 772
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 581 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 640
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 641 PLPPKNVEWGEIRGLETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 700
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 701 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKNLHR 731
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 732 FTHIPSGKCLDRSDILHQVFISNCDSSKMTQKWEMNNI 769
>gi|313226887|emb|CBY22032.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 70/223 (31%), Positives = 102/223 (45%), Gaps = 38/223 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAYDVYTKF 113
R +W + + + P A LD GDI QV R K HC DF+WFMK + Y V ++
Sbjct: 425 RVIETWWDDWSKFFYVARPEAKTLDFGDIGPQVEFR-KNHCPYDFNWFMKEIGYGVALEY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N FWGE + +CLDSMG T + T YCH G +QLFRLN QL ++C
Sbjct: 484 PTPPLNKFWGEIRARQGNQCLDSMGHTNGGSVETYYCHKQGGNQLFRLNEGLQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ G+ L L +C S+G G W + T
Sbjct: 544 L----------------------------FMSGSYLKLSHC--SAGDKAG-WDYDPKTGM 572
Query: 234 LEHAILKR---CLSVHPET--NQLAMLRCDENNSYQQWRFKEV 271
+ + K+ C+S+ +T +++ + CD N Q++ F E+
Sbjct: 573 ISYGEKKKQRFCISLISKTSISKVKFVECDVNGDNQKFDFNEI 615
>gi|6318186|emb|CAB60270.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 7 [Homo
sapiens]
Length = 657
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +C+ F WFM+ +AYD+ + +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESQALPYGDISELKKFREDHNCQSFKWFMEEIAYDITSHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGADGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 617 FTHIPSGKCLDRSEVLHQVFISNCDSSKTTQKWEMNNI 654
>gi|326918604|ref|XP_003205578.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Meleagris
gallopavo]
Length = 665
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 474 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 533
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 534 PLPPKNVEWGEIRGFETAYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 593
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 594 L-----TKGPDGSKIMITHCNLNEYK------------------------DWQYFKNLHR 624
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 625 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 662
>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
saltator]
Length = 605
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 101/212 (47%), Gaps = 40/212 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ ++ +P A +D+GD+SE+V LR++L CK F W+++++ +PE P
Sbjct: 418 WMDEWRDFYYAMNPGARNVDVGDVSERVKLRERLKCKSFRWYLENI-------YPESPMP 470
Query: 120 L---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
L + G+ KN+ Q CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 471 LDYYYLGDVKNVEAQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
+ V++++ C G + W +++ T+ + H
Sbjct: 531 ASPQGPVKIVR--------------------------CHGMGG--NQAWVYNDETKMIRH 562
Query: 237 AILKRCLSV--HPETNQLAMLRCDENNSYQQW 266
CLS + +Q + +CD NN Q+W
Sbjct: 563 TNTGHCLSKPHSGDASQPVLTQCDPNNIGQKW 594
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530
>gi|363733313|ref|XP_420521.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gallus gallus]
Length = 636
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 445 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 504
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 505 PLPPKNVEWGEIRGFETAYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 564
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 565 L-----TKGPDGSKIMITHCNLNEYK------------------------DWQYFKNLHR 595
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 596 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 633
>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
echinatior]
Length = 605
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 103/218 (47%), Gaps = 42/218 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A +D+GD+SE++ LR++L CK F W+++++ +P
Sbjct: 413 RVAEVWMDEWRDFYYAMNPGARNVDVGDVSERIKLRERLKCKSFRWYLENI-------YP 465
Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P L + G+ KN+ TQ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 466 ESPMPLDYYYLGDVKNIETQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 525
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + V++++ C G + W + + T
Sbjct: 526 MCLDAANPQGPVKIVR--------------------------CHGMGG--NQAWVYSDET 557
Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
+ ++H CLS P + Q + +CD NN Q+W
Sbjct: 558 KMIKHTNTGYCLS-KPHSGDPAQPVLAQCDVNNIGQRW 594
Score = 37.7 bits (86), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530
>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
floridanus]
Length = 608
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/217 (27%), Positives = 101/217 (46%), Gaps = 40/217 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A +D+GD+SE++ LR++L CK F W+++++ +P
Sbjct: 416 RVAEVWMDEWRDFYYAMNPGARNVDVGDVSERIKLRERLKCKSFRWYLENI-------YP 468
Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P L + G+ KN+ Q CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 469 ESPMPLDYYYLGDVKNVEMQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 528
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + V++++ C G + W ++N T
Sbjct: 529 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNNET 560
Query: 232 RQLEHAILKRCLSV--HPETNQLAMLRCDENNSYQQW 266
+ + H CLS + +Q + CD NN Q+W
Sbjct: 561 KMIRHTNTGHCLSKPHSGDASQPVLAPCDANNIGQRW 597
Score = 37.7 bits (86), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 494 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 533
>gi|12621080|ref|NP_075215.1| N-acetylgalactosaminyltransferase 7 [Rattus norvegicus]
gi|51315737|sp|Q9R0C5.1|GALT7_RAT RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|4092503|gb|AAC99426.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T6 [Rattus
norvegicus]
gi|149032267|gb|EDL87173.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_a
[Rattus norvegicus]
Length = 657
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ +
Sbjct: 466 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITAHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 526 PLPPRNVEWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 585
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 586 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 616
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD Q+W +
Sbjct: 617 FTHIASGKCLDRSEVLHQVFISSCDNGKMTQKWEMNNI 654
>gi|440908503|gb|ELR58512.1| N-acetylgalactosaminyltransferase 7, partial [Bos grunniens mutus]
Length = 615
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 484 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 544 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 574
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H +CL +Q+ + CD + Q+W +
Sbjct: 575 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 612
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG+T + CH G +QLFR+N QL ++C+ + L + W
Sbjct: 507 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 566
Query: 61 LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
+ FK + L P LD ++ QV + D
Sbjct: 567 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 597
>gi|359067894|ref|XP_002689501.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
Length = 617
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 426 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 485
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 486 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 545
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 546 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 576
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H +CL +Q+ + CD + Q+W +
Sbjct: 577 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 614
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG+T + CH G +QLFR+N QL ++C+ + L + W
Sbjct: 509 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 568
Query: 61 LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
+ FK + L P LD ++ QV + D
Sbjct: 569 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 599
>gi|291241093|ref|XP_002740445.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7-like [Saccoglossus
kowalevskii]
Length = 594
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 91/207 (43%), Gaps = 31/207 (14%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P + L GDIS+Q+ + K +CK+F WFMK VA D+ +P N
Sbjct: 410 WMDDYKEFFYRSKPESQLLHFGDISKQLEFKTKHNCKNFDWFMKEVAPDLLAVYPVPAAN 469
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
WGE K+ + C+D+MG IG S CHG G +QLFR+ + + E C+
Sbjct: 470 QAWGEIKSNTNKVCVDTMGNREGGTIGISGCHGQGGNQLFRITEDHEFRIHELCL----- 524
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
+ + + VKL C G W F S +
Sbjct: 525 ------------YEIYSEVKLRRC--------------DGKSKYSWFFDESKGWISLKDK 558
Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQW 266
CL ++ +L+M +CD +N Q+W
Sbjct: 559 NLCLELNSNLRRLSMKKCDISNGLQKW 585
>gi|119896052|ref|XP_602855.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
Length = 772
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 581 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 640
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 641 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 700
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 701 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 731
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H +CL +Q+ + CD + Q+W +
Sbjct: 732 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 769
Score = 37.4 bits (85), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG+T + CH G +QLFR+N QL ++C+ + L + W
Sbjct: 664 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 723
Query: 61 LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
+ FK + L P LD ++ QV + D
Sbjct: 724 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 754
>gi|426222421|ref|XP_004005390.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Ovis aries]
Length = 865
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 674 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 733
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 734 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 793
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 794 LT-----KGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 824
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H +CL +Q+ + CD + Q+W +
Sbjct: 825 LTHISSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 862
>gi|296484976|tpg|DAA27091.1| TPA: N-acetylgalactosaminyltransferase 7-like [Bos taurus]
Length = 781
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 94/218 (43%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 590 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 649
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 650 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 709
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 710 L-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLHR 740
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H +CL +Q+ + CD + Q+W +
Sbjct: 741 LTHIPSGKCLDRSEVLHQVFISDCDSSKMTQKWEINNI 778
Score = 37.4 bits (85), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG+T + CH G +QLFR+N QL ++C+ + L + W
Sbjct: 673 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKVMITHCNLNEFKEW 732
Query: 61 LKVFKQIRSLGS-PLAMFLDMGDISEQVALRD 91
+ FK + L P LD ++ QV + D
Sbjct: 733 -QYFKNLHRLTHIPSGKCLDRSEVLHQVFISD 763
>gi|109076193|ref|XP_001085532.1| PREDICTED: n-acetylgalactosaminyltransferase 7 [Macaca mulatta]
Length = 630
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/208 (27%), Positives = 91/208 (43%), Gaps = 29/208 (13%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K P + L GDISE R+ +CK F WFM+ +AYD+ + +P P N+ WG
Sbjct: 449 YKDYFYASRPESQALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHYPLPPKNVDWG 508
Query: 124 EAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLV 183
E + T C+DSMG+T + CH G +QLFR+N QL ++C+ K
Sbjct: 509 EIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCL-----TKGA 563
Query: 184 EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
+ K+ +T C L K W++ + + H +CL
Sbjct: 564 DGSKVMITHCNLNEFK------------------------EWQYFKNLHRFTHIPSGKCL 599
Query: 244 SVHPETNQLAMLRCDENNSYQQWRFKEV 271
+Q+ + CD + + Q+W +
Sbjct: 600 DRSEVLHQVFISNCDSSKTTQKWEMNNI 627
>gi|335301041|ref|XP_001926518.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Sus scrofa]
Length = 712
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 521 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 580
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 581 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 640
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 641 LT-----KGADGSKVMITHCNLNEFK------------------------EWQYLKNLHR 671
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H CL +Q+ + CD + Q+W +
Sbjct: 672 LTHIPSGMCLDRSEVLHQVFISNCDSSKMTQKWEINNI 709
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 43/93 (46%), Gaps = 4/93 (4%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD--ADKQVISTQKSSLELRKRS 58
MG+T + CH G +QLFR+N QL ++C+ AD + +L K
Sbjct: 604 MGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGADGSKVMITHCNLNEFKEW 663
Query: 59 SWLKVFKQIRSLGSPLAMFLDMGDISEQVALRD 91
+LK ++ + P M LD ++ QV + +
Sbjct: 664 QYLKNLHRLTHI--PSGMCLDRSEVLHQVFISN 694
>gi|327268630|ref|XP_003219099.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Anolis
carolinensis]
Length = 654
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 90/218 (41%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P L GDI++ R+ CK F WFM+ +AYD+ + +
Sbjct: 463 VRVVEVWWDEYKDYFYASRPETKALAYGDITDLKKFREDHRCKSFKWFMEEIAYDITSHY 522
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 523 PLPPKNVEWGEIRGFETSYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 582
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 583 LT-----KGPDGTKVVITHCNLNEYK------------------------EWQYFKNLHR 613
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD N + Q W +
Sbjct: 614 FTHIPSGKCLDRSEVLHQVFLSECDSNKATQSWEMNNI 651
>gi|417411949|gb|JAA52393.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 615
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +C+ F WFM+ +AYD+ + +
Sbjct: 424 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCRSFKWFMEEIAYDITSHY 483
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMGRT + CH G +QLFR+N QL ++C
Sbjct: 484 PLPPKNVDWGEIRGFETVYCIDSMGRTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 543
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 544 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKNLHR 574
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 575 FTHIPSGKCLDRSDVLHQVFLSSCDSSKMAQKWEMNNI 612
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 38/90 (42%), Gaps = 2/90 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MGRT + CH G +QLFR+N QL ++C+ + L + W
Sbjct: 507 MGRTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQCLTKGPDGSKIMITHCNLNEFKEW 566
Query: 61 LKVFKQI-RSLGSPLAMFLDMGDISEQVAL 89
+ FK + R P LD D+ QV L
Sbjct: 567 -QYFKNLHRFTHIPSGKCLDRSDVLHQVFL 595
>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
Length = 615
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 100/220 (45%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ + P +D GD++EQ ALRDKL CK F WFM+++A+D+ +P + P+
Sbjct: 399 WMDEYADFIYRRRPHLRAMDPGDLTEQKALRDKLKCKSFKWFMENIAFDLVEVYPPIEPD 458
Query: 120 LF-WGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLGHGER- 172
F +GE +N+G CLD+ G+ I YC Q F+L + R
Sbjct: 459 DFAYGEMRNIGVPNLCLDAKGKGKDEEIAVDYCQKDTPKIKGEQEFQLTWHKDIRPNRRT 518
Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+D + + K PVT L+ C G + WR++
Sbjct: 519 ECLDVSRGDD-----KSPVT-------------------LYPCHGKQG--NQLWRYNVEK 552
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ L+H RCL + P + ++ + CD +S Q+WR + V
Sbjct: 553 QWLQHGYGSRCLDMDPASKKVYVSNCDPTSSTQKWRIEHV 592
>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
Length = 602
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/220 (29%), Positives = 105/220 (47%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ + + P LD GD+SEQ ALR KLHCK F+WF++++A+D+ +P + P+
Sbjct: 393 WMDEYAEYIYKRRPHLRALDPGDLSEQKALRVKLHCKPFNWFIENIAFDLVEVYPPIEPD 452
Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER- 172
F +GE +N+G T+ CLDS R +I C Q FRL + +R
Sbjct: 453 DFAYGEIRNMGATELCLDSKKRKRDELIVVDTCVKDDPKVSGEQEFRLTWHKDIRPKDRT 512
Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+D + E+ K PV+ L+ C G + WR+
Sbjct: 513 DCLDVSRGEE-----KAPVS-------------------LYPCHGKQG--NQLWRYDIEK 546
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ L+H RCL P + ++ + CD+++ Q+WR ++V
Sbjct: 547 QWLQHGYSSRCLDTDPGSKRVFVTTCDKSSPTQKWRIEQV 586
>gi|449270894|gb|EMC81540.1| N-acetylgalactosaminyltransferase 7, partial [Columba livia]
Length = 613
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 93/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 422 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 481
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 482 PLPPKNVDWGEIRGFETSYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 541
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C N + D W++ + +
Sbjct: 542 L-----TKGPDGSKIMITHC----------------------NQNEYKD--WQYFKNLHR 572
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 573 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 610
>gi|449500526|ref|XP_002187477.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Taeniopygia
guttata]
Length = 828
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 637 VRVVEVWWDEYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSYY 696
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 697 PLPPKNVDWGEIRGFETSYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 756
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C N F W+F + +
Sbjct: 757 L-----TKGPDGSKIMITHCNQ-----------------NEFKE-------WQFFKNLHR 787
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + + Q+W +
Sbjct: 788 FTHIPSGKCLDRSEVLHQVFISECDSSKATQKWEMNNI 825
>gi|47575716|ref|NP_001001200.1| polypeptide N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
tropicalis]
gi|45501097|gb|AAH67317.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
tropicalis]
Length = 653
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 88/218 (40%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P L GDI R+ +CK F WFM+ +AYD+ F
Sbjct: 462 IRVVEVWWDEYKDYFYASRPETKALPYGDIGPLKKFREDHNCKSFKWFMEEIAYDITKYF 521
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T ++ CH G +QLFR+N QL ++C
Sbjct: 522 PLPPKNVEWGEIRGFETNYCIDSMGHTNGGIVELGGCHRMGGNQLFRINEANQLMQYDQC 581
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K P V+ L C L L W++ + +
Sbjct: 582 -----------LYKGPDGTQVM----LTHCNLKEFLE--------------WQYFKNMHR 612
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD N Q+W +
Sbjct: 613 FLHISTGKCLDRSEVLHQVFLSDCDANKPTQKWEMNNI 650
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG T ++ CH G +QLFR+N QL ++C+ + L++ W
Sbjct: 545 MGHTNGGIVELGGCHRMGGNQLFRINEANQLMQYDQCLYKGPDGTQVMLTHCNLKEFLEW 604
Query: 61 LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
+ FK + R L LD ++ QV L D
Sbjct: 605 -QYFKNMHRFLHISTGKCLDRSEVLHQVFLSD 635
>gi|431918328|gb|ELK17555.1| N-acetylgalactosaminyltransferase 7 [Pteropus alecto]
Length = 717
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 92/218 (42%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 526 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 585
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T CH G +QLFR+N QL ++C
Sbjct: 586 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFAELGPCHRMGGNQLFRINEANQLMQYDQC 645
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 646 L-----TKGPDGSKIMITHCNLNEFK------------------------EWQYFKNLNR 676
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD + Q+W +
Sbjct: 677 FTHIPSGKCLDRSEVLHQVFLSNCDSSKMTQKWEMNNI 714
>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis mellifera]
Length = 606
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 104/221 (47%), Gaps = 42/221 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A + +GD+SE++ LR++L CK F W+++++ +P
Sbjct: 414 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 466
Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P L + G+ +N+ TQ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 467 ESPMPLDYYYLGDVQNVDTQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 526
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + V++++ C G + W ++ T
Sbjct: 527 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 558
Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFK 269
+ ++H + CLS P +N Q + CD +N Q+W +
Sbjct: 559 KMIKHTNIGHCLS-KPRSNDAMQPVLAPCDPHNIGQKWTMR 598
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531
>gi|51490866|emb|CAD44538.1| polypeptide N-acetylgalactosaminyltransferase 7 [Homo sapiens]
Length = 166
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/192 (27%), Positives = 86/192 (44%), Gaps = 29/192 (15%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT 140
GDISE R+ +C+ F WFM+ +AYD+ + +P P N+ WGE + T C+DSMG+T
Sbjct: 2 GDISELKKFREDHNCQSFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKT 61
Query: 141 APAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKL 200
+ CH G +QLFR+N QL ++C+ K + K+ +T C L K
Sbjct: 62 NGGFVELGPCHRMGGNQLFRINEANQLMQYDQCL-----TKGADGSKVMITHCNLNEFK- 115
Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
W++ + + H +CL +Q+ + CD +
Sbjct: 116 -----------------------EWQYFKNLHRFTHIPSGKCLDRSEVLHQVFISNCDSS 152
Query: 261 NSYQQWRFKEVR 272
+ Q+W +
Sbjct: 153 KTTQKWEMNNIH 164
>gi|387019377|gb|AFJ51806.1| n-acetylgalactosaminyltransferase 7-like [Crotalus adamanteus]
Length = 658
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 90/218 (41%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W FK P L GDIS+ R+ CK F WFM+ +AYD+ + +
Sbjct: 467 VRVVEVWWDEFKDYFYASRPETKALAYGDISDLKNFREDHKCKSFKWFMEEIAYDITSHY 526
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 527 PLPPRNVEWGEIRGFEAAYCIDSMGHTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 586
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 587 LI-----KGPDGSKVMITHCNLNEYK------------------------EWQYFKNLYR 617
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +Q+ + CD N + Q+W +
Sbjct: 618 FTHISSGKCLDRSEVLHQVFISDCDSNKASQKWEMNNI 655
>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Megachile rotundata]
Length = 605
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 102/218 (46%), Gaps = 42/218 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A + +GD+SE++ LR++L CK F W+++++ +P
Sbjct: 413 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 465
Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P L + G+ +N+ TQ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 466 ESPMPLDYYYLGDVQNIDTQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 525
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + V++++ C G + W ++ T
Sbjct: 526 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 557
Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
+ ++H CLS P +N Q + CD +N Q+W
Sbjct: 558 KMIKHTNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKW 594
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530
>gi|351701091|gb|EHB04010.1| N-acetylgalactosaminyltransferase 7, partial [Heterocephalus
glaber]
Length = 616
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 92/219 (42%), Gaps = 31/219 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM +AYD+ T
Sbjct: 425 VRVVEVWWDEYKDYFYASRPESKALLYGDISELKKFREDHNCKSFKWFMDEIAYDI-TSH 483
Query: 114 PELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
P LPP N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++
Sbjct: 484 PPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQ 543
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C+ K + K+ +T C L K W++ +
Sbjct: 544 CL-----TKGPDGSKVMITHCNLNEFK------------------------EWQYFKNLH 574
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ H +CL +Q+ + CD Q+W +
Sbjct: 575 RFTHIPSGKCLDRSEVLHQVFISNCDSGKMTQKWEMNNI 613
>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus impatiens]
Length = 606
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/213 (28%), Positives = 101/213 (47%), Gaps = 42/213 (19%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
WL +K +P A + +GD+SE++ LR++L CK F W+++++ +PE P
Sbjct: 419 WLDQWKYFYYNINPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YPESPMP 471
Query: 120 L---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
L + G+ +N+ TQ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 472 LDYYYLGDVQNVETQSCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
+ V++++ C G + W ++ T+ ++H
Sbjct: 532 ASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEETKMIKH 563
Query: 237 AILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
CLS P +N Q + CD +N Q+W
Sbjct: 564 TNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKW 595
Score = 37.7 bits (86), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531
>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus impatiens]
Length = 606
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 102/218 (46%), Gaps = 42/218 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A + +GD+SE++ LR++L CK F W+++++ +P
Sbjct: 414 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 466
Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P L + G+ +N+ TQ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 467 ESPMPLDYYYLGDVQNVETQSCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 526
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + V++++ C G + W ++ T
Sbjct: 527 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 558
Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
+ ++H CLS P +N Q + CD +N Q+W
Sbjct: 559 KMIKHTNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKW 595
Score = 37.7 bits (86), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531
>gi|345307492|ref|XP_001507110.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Ornithorhynchus anatinus]
Length = 873
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 63/121 (52%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 439 VRVVEVWWDDYKDYFYASRPETKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 498
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T V+ CH G +QLFR+N QL ++C
Sbjct: 499 PLPPKNVEWGEIRGFDTAYCIDSMGHTNGGVVELGPCHRMGGNQLFRINEANQLMQYDQC 558
Query: 174 V 174
+
Sbjct: 559 L 559
>gi|444727227|gb|ELW67729.1| N-acetylgalactosaminyltransferase 7 [Tupaia chinensis]
Length = 606
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 65/121 (53%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 274 IRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 333
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + L T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 334 PLPPRNIDWGEIRGLETAYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEVNQLMQYDQC 393
Query: 174 V 174
+
Sbjct: 394 L 394
>gi|47191417|emb|CAG14579.1| unnamed protein product [Tetraodon nigroviridis]
Length = 174
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 84/199 (42%), Gaps = 29/199 (14%)
Query: 73 PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
P ++ L GDISE R++ CK F WFM+ +AYD+ +P P N+ WGE + L T
Sbjct: 2 PESLTLAYGDISELKRFREEHRCKSFKWFMEEIAYDIPLHYPMPPKNVDWGEIRGLDTSY 61
Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
C+DSMG T + CH G +QLFR+N QL ++C+
Sbjct: 62 CIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQCLTRGTDN------------ 109
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
+GV + C W++ + H +CL +++
Sbjct: 110 ---SGVIITHC--------------DQNQHTEWKYFKDLHRFTHVTTGKCLDRSDLLHKV 152
Query: 253 AMLRCDENNSYQQWRFKEV 271
+ CD + + Q+W +
Sbjct: 153 FISDCDTSKTTQKWEMNNI 171
>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis florea]
Length = 605
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 102/221 (46%), Gaps = 42/221 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A + +GD+SE++ LR +L CK F W+++++ +P
Sbjct: 413 RVAEVWMDEWRDFYYAMNPGARNVAVGDVSERIKLRQRLKCKSFRWYLENI-------YP 465
Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P L + G+ +N+ TQ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 466 ESPMPLDYYYLGDVQNVDTQTCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 525
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + V++++ C G + W ++ T
Sbjct: 526 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 557
Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFK 269
+ ++H CLS P +N Q + CD +N Q+W +
Sbjct: 558 KMIKHTNTGHCLS-KPRSNDAMQPVLAPCDPHNIGQKWTMR 597
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 491 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 530
>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus terrestris]
gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus terrestris]
Length = 606
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 102/218 (46%), Gaps = 42/218 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A + +GD+SE++ LR++L CK F W+++++ +P
Sbjct: 414 RVAEVWMDEWRDFYYAMNPGARSVAVGDVSERIKLRERLKCKSFRWYLENI-------YP 466
Query: 115 ELPPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P L + G+ +N+ TQ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 467 ESPMPLDYFYLGDVQNVETQSCLDTMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD 526
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + V++++ C G + W ++ T
Sbjct: 527 MCLDAASPQGPVKIVR--------------------------CHGMGG--NQAWVYNEET 558
Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
+ ++H CLS P +N Q + CD +N Q+W
Sbjct: 559 KMIKHTNTGHCLS-KPRSNDAMQPVLASCDLHNIGQKW 595
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 492 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 531
>gi|410927898|ref|XP_003977377.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
7-like [Takifugu rubripes]
Length = 664
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 93/239 (38%), Gaps = 29/239 (12%)
Query: 33 HGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDK 92
HG + V S+ +R W +K P + L GDISE R++
Sbjct: 452 HGWQGNPPPAHVGSSPTLKNYVRVVEVWWDEYKDYFYASRPETLTLAYGDISELKKFREE 511
Query: 93 LHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHG 152
CK F WFM+ +AYD+ +P P N+ WGE + L + C+DSMG T + CH
Sbjct: 512 HRCKSFKWFMEEIAYDIPLHYPLPPKNVEWGEIRGLDSSYCIDSMGHTNGGSVEIGPCHR 571
Query: 153 SGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLF 212
G +QLFR+N QL ++C+ V +T C
Sbjct: 572 MGGNQLFRINDANQLMQYDQCLTRGADNSAVI-----ITHC------------------- 607
Query: 213 NCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
W++ + H +CL +++ + CD + + Q+W +
Sbjct: 608 -----DQNQHNEWKYFKDLHRFTHVTTGKCLDRSDLLHKVFISDCDTSKTTQKWEMNNI 661
>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Acyrthosiphon pisum]
Length = 615
Score = 94.0 bits (232), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 103/215 (47%), Gaps = 41/215 (19%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
W+ +K +P A +++GD+SE++ALR+KL CK F W+++++ +PE +P
Sbjct: 427 WMDEWKHFYYAINPGASNVEVGDVSERLALREKLKCKSFRWYLENI-------YPESQMP 479
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE KN+ +Q+CLD+M R + +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 480 LDYYYLGEIKNVDSQQCLDTMSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDA 539
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
++ V LI C G + W + + ++H
Sbjct: 540 SN---------------IVGPVSLI-----------RCHGLEG--NQAWVYDSKEMTIKH 571
Query: 237 AILKRCLSVHPETNQLAML--RCDENNSYQQWRFK 269
+CL +Q A + CD + S QQW K
Sbjct: 572 KTTDQCLEHSMSADQYAAILNECDGSRS-QQWTMK 605
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 23/44 (52%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQV 44
M R + +G SYCHG G +Q+F Q+ + C+DA V
Sbjct: 500 MSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIV 543
>gi|390348396|ref|XP_787966.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 403
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 54/127 (42%), Positives = 64/127 (50%), Gaps = 4/127 (3%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHC-KDFSWFMKHVAYDVYTK 112
+R WL FK+ P GDI EQ+ R K HC K F WFM+ VA+D K
Sbjct: 203 MRVAEVWLDEFKEFFYTSRPDLRGKPYGDIGEQIRFR-KHHCPKSFKWFMEEVAFDSLEK 261
Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGR--TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
FP PN WGE K+ T C+DSMG TA G YCHG G +Q FRL GQ+
Sbjct: 262 FPPPQPNQAWGEIKSDHTGMCVDSMGHQATAGGEAGVYYCHGMGGNQRFRLTGPGQIMFN 321
Query: 171 ERCVDAD 177
+ C D
Sbjct: 322 DYCFYVD 328
>gi|66472462|ref|NP_001018477.1| N-acetylgalactosaminyltransferase 7 [Danio rerio]
gi|63100869|gb|AAH95642.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7 [Danio rerio]
Length = 652
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 89/218 (40%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDIS R++ CK F WFM+ +AYD+ +
Sbjct: 461 VRVVEVWWDDYKDYFYASRPETLTLAYGDISTLKKFREEHRCKSFKWFMEEIAYDIPLHY 520
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 521 PLPPKNVEWGEIRGYETSYCIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQC 580
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ G+ + + +C + T W++ +
Sbjct: 581 LTKGGD--------------------------GSAVIITHCNLNEHT---EWKYFKDLHR 611
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +CL +++ + CD + S Q+W +
Sbjct: 612 FTHVPTGKCLDRSDVLHKVYIADCDNSKSSQKWEMNNI 649
Score = 38.9 bits (89), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 40/92 (43%), Gaps = 2/92 (2%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSW 60
MG T + CH G +QLFR+N QL ++C+ + + L + + W
Sbjct: 544 MGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQCLTKGGDGSAVIITHCNLNEHTEW 603
Query: 61 LKVFKQI-RSLGSPLAMFLDMGDISEQVALRD 91
K FK + R P LD D+ +V + D
Sbjct: 604 -KYFKDLHRFTHVPTGKCLDRSDVLHKVYIAD 634
>gi|432847870|ref|XP_004066191.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oryzias
latipes]
Length = 653
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 86/218 (39%), Gaps = 30/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R++ CK F WFM+ +AYD+ +
Sbjct: 463 VRVVEVWWDEYKDFFYASRPETLTLAYGDISELKRFREEHRCKSFKWFMEEIAYDIPVHY 522
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 523 PLPPKNVEWGEIRGFETSYCIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQC 582
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ A V + C W++ +
Sbjct: 583 LTRGDN----------------AAVIITHCDRNQYTE--------------WKYFKDLHR 612
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
H +C+ +++ + CD + + Q+W +
Sbjct: 613 FTHMPTGKCMDRSDLLHKVFISDCDTSKTTQKWEMNNI 650
>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 605
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 101/215 (46%), Gaps = 38/215 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A +++GDI+ ++ LR+ L CK F W++ ++ +P
Sbjct: 410 RVAEVWMDEWRDFYYAMNPGAKKIEVGDITSRLKLREDLKCKSFRWYLTNI-------YP 462
Query: 115 E--LPPNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + F G+ KN+ T++CLD+MGR + +G SYCHG G +Q+F Q+ +
Sbjct: 463 ESTMPLDYYFLGDIKNVETEQCLDTMGRKSGENVGMSYCHGYGGNQVFSYTKRHQITADD 522
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA V VKL+ C G + W+++N T
Sbjct: 523 NCLDAAS---------------VRGPVKLV-----------RCHGMGGNQE--WKYYNDT 554
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+L+H +CL + + LR + + QQW
Sbjct: 555 FELKHVHTGQCLDKPSDRDSTPQLRSCDGSRGQQW 589
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 23/40 (57%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR + +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 488 MGRKSGENVGMSYCHGYGGNQVFSYTKRHQITADDNCLDA 527
>gi|350597048|ref|XP_003361960.2| PREDICTED: hypothetical protein LOC100155807 [Sus scrofa]
Length = 1191
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 73/146 (50%), Gaps = 5/146 (3%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 1019 VRVVEVWWDEYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 1078
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG+T + CH G +QLFR+N QL ++C
Sbjct: 1079 PLPPKNVDWGEIRGFETVYCIDSMGKTNGGFVELGPCHRMGGNQLFRINEANQLMQYDQC 1138
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVK 199
+ K + K+ +T C L K
Sbjct: 1139 LT-----KGADGSKVMITHCNLNEFK 1159
>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Acyrthosiphon pisum]
Length = 615
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 61/220 (27%), Positives = 104/220 (47%), Gaps = 41/220 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A +++GD+SE++ALR+KL CK F W+++++ +P
Sbjct: 422 RVAEVWMDEWRDFYYAMNPGASNVEVGDVSERLALREKLKCKSFRWYLENI-------YP 474
Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + ++ GE KN+ +Q+CLD+M R + +G SYCHG G +Q+F Q+ +
Sbjct: 475 ESQMPLDYYYLGEIKNVDSQQCLDTMSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDD 534
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA ++ V LI C G + W + +
Sbjct: 535 NCLDASN---------------IVGPVSLI-----------RCHGLEG--NQAWVYDSKE 566
Query: 232 RQLEHAILKRCLSVHPETNQLAML--RCDENNSYQQWRFK 269
++H +CL +Q A + CD + S QQW K
Sbjct: 567 MTIKHKTTDQCLEHSMSADQYAAILNECDGSRS-QQWTMK 605
Score = 37.4 bits (85), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 16/44 (36%), Positives = 23/44 (52%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQV 44
M R + +G SYCHG G +Q+F Q+ + C+DA V
Sbjct: 500 MSRKSGEKVGMSYCHGLGGNQVFAYTKRSQIMSDDNCLDASNIV 543
>gi|348538240|ref|XP_003456600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oreochromis
niloticus]
Length = 649
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 89/220 (40%), Gaps = 33/220 (15%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDIS+ R++ CK F WFM+ +AYD+ +
Sbjct: 458 VRVVEVWWDDYKDYFYASRPETLTLAYGDISDLKRFREEHRCKSFKWFMEEIAYDIPLHY 517
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + T C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 518 PLPPKNVEWGEIRGFDTSYCIDSMGHTNGGNVELGPCHRMGGNQLFRINEANQLMQYDQC 577
Query: 174 --VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
+ DK + + + C + W++
Sbjct: 578 LTIGGDK-----------------SAIIITHCDTNQHME--------------WKYFKDL 606
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ H +CL +++ + CD + + Q+W +
Sbjct: 607 HRFTHVPTGKCLDRSDLLHKVFVSDCDTSKTTQKWEMNNI 646
>gi|443727149|gb|ELU14019.1| hypothetical protein CAPTEDRAFT_197005 [Capitella teleta]
Length = 613
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 60/225 (26%), Positives = 101/225 (44%), Gaps = 35/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P + G+I+EQ+A+R KL+CK F WFM+ +A+D+ K+P
Sbjct: 399 RVAEVWMDEYAEFLYKRRPQYRSIQPGNITEQLAIRKKLNCKPFKWFMEEIAFDLPKKYP 458
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGH 169
+ PP + GE +N+G C+D+ + G C G G Q ++ +
Sbjct: 459 PIEPPAVAEGEMRNVGANLCVDTRFKGQGETFGLEKCAKDEPGIGGEQRLQITWHKDIRP 518
Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW-R 226
G+R C D + K PV L+NC G W +
Sbjct: 519 GKRSFCFDVS-----TSVEKAPVI-------------------LYNCHGMKGNQ---WFK 551
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ T Q+ H I + CL E ++ M CD++++ Q+W+F++V
Sbjct: 552 YDTETSQVFHPISRHCLDCDAERKEIFMSSCDKDSNTQKWKFEKV 596
>gi|148237032|ref|NP_001084848.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Xenopus
laevis]
gi|47124654|gb|AAH70527.1| MGC78803 protein [Xenopus laevis]
Length = 653
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/218 (26%), Positives = 91/218 (41%), Gaps = 29/218 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W ++ P L GDIS R+ +CK F WFM+ +AYD+ +
Sbjct: 462 VRVVEVWWDEYRDYFYASRPETKALAYGDISALKKFREDHNCKSFKWFMEEIAYDIPNYY 521
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE + + C+DSMG T + CH G +QLFR+N QL ++C
Sbjct: 522 PLPPRNVDWGEIRGFESGYCIDSMGHTNGGLAELGGCHRMGGNQLFRINEANQLMQYDQC 581
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+ K + K+ +T C L K W++ + +
Sbjct: 582 LT-----KGTDGSKVILTHCNLNEYK------------------------EWQYFKNMHR 612
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L + +CL +Q+ + CD N + Q+W +
Sbjct: 613 LLYIPSGKCLDRSEVLHQVFLSDCDSNKTTQKWEMNNI 650
>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 3 [Bombus impatiens]
Length = 607
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 55/193 (28%), Positives = 93/193 (48%), Gaps = 42/193 (21%)
Query: 80 MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNL---FWGEAKNLGTQKCLDS 136
+GD+SE++ LR++L CK F W+++++ +PE P L + G+ +N+ TQ CLD+
Sbjct: 440 VGDVSERIKLRERLKCKSFRWYLENI-------YPESPMPLDYYYLGDVQNVETQSCLDT 492
Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
MGR +G SYCHG G +Q+F Q+ + C+DA + V++++
Sbjct: 493 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDAASPQGPVKIVR--------- 543
Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN---QLA 253
C G + W ++ T+ ++H CLS P +N Q
Sbjct: 544 -----------------CHGMGG--NQAWVYNEETKMIKHTNTGHCLS-KPRSNDAMQPV 583
Query: 254 MLRCDENNSYQQW 266
+ CD +N Q+W
Sbjct: 584 LAPCDPHNIGQKW 596
Score = 37.7 bits (86), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 493 MGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDDMCLDA 532
>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
Length = 639
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 64/221 (28%), Positives = 102/221 (46%), Gaps = 43/221 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K +P A +++GD+S + +LR KL CK F W++++V +P
Sbjct: 442 RVAEVWMDRWKDFFYEMNPGARSVEVGDVSSRRSLRKKLQCKSFRWYLENV-------YP 494
Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + F+ GE +N TQ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 495 ESQMPLDYFFLGEIRNAETQTCLDTMGRKGGENVGISYCHGLGGNQVFAYTKRQQIMSDD 554
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA + +V++++ C G + W +
Sbjct: 555 NCLDATGTDGIVKLIR--------------------------CHGMGG--NQAWLYEAQD 586
Query: 232 RQLEHAILKRCLSVHPETNQL---AMLRCDENNSYQQWRFK 269
+ H RCLS PE+ + + RCD +S QQW K
Sbjct: 587 GLIRHVNSGRCLS-KPESRDVTLPVLRRCD-GSSGQQWIMK 625
Score = 38.1 bits (87), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 520 MGRKGGENVGISYCHGLGGNQVFAYTKRQQIMSDDNCLDA 559
>gi|390345015|ref|XP_787987.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 2
[Strongylocentrotus purpuratus]
gi|390345017|ref|XP_003726244.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 1
[Strongylocentrotus purpuratus]
Length = 670
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 73/228 (32%), Positives = 108/228 (47%), Gaps = 36/228 (15%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R WL +K PL G+ISEQV R + K F WFM+ +A+D ++
Sbjct: 470 MRVAEVWLDEYKDSFYATKPLLEGKPCGNISEQVEYRKQNCPKSFKWFMEEIAFDNLQEY 529
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG-RTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
P N WGE +++ ++ CLD+ G + P A +G CHG G +QLFR+N GQL
Sbjct: 530 PPPAKNKIWGEIRSVDSELCLDTNGAQNDPTHATMGMEGCHGMGGNQLFRINTRGQL--- 586
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLA---GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
+VE FC+ A KLIF + F S T+ W +
Sbjct: 587 -----------VVED-----GFCLYARPNTEKLIFAKCTAE----KSFTSETTM---WDY 623
Query: 228 HNSTRQLEH-AILKRCLSVHPETNQLAMLRCDENNSYQQWRF--KEVR 272
+S RQ+ H KRCL + ET+++ + C QQ++F +E+R
Sbjct: 624 -DSDRQIIHLERRKRCLDHNQETSKVFVTPCQHGEISQQFKFYKEEIR 670
>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
Length = 588
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 95/214 (44%), Gaps = 34/214 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ + +P M GD+S++ LR+ L CK F W++ +V D + P
Sbjct: 401 WMDDYKKFFYIVNPTVMKHKYGDVSDRKTLRNDLQCKSFQWYLDNVYPD--AQIPRRYKV 458
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE KN G CLD+MGR +G CHG G +Q+F + ++ + C+D
Sbjct: 459 L--GEIKNTGANICLDTMGRKENKKVGCYSCHGQGGNQVFSFTMDNEIRIDDLCLDVANS 516
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
+ V M+K C + G + W ++ T QL H
Sbjct: 517 KGPVMMVK--------------------------CHHQKG--NQYWEYNIKTNQLIHTNS 548
Query: 240 KRCLS--VHPETNQLAMLRCDENNSYQQWRFKEV 271
K+CL+ + N+ M RCD + Q+W F+ +
Sbjct: 549 KQCLTKPIARSANEPRMERCDGSIDTQKWSFRNM 582
>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
Length = 626
Score = 90.9 bits (224), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 96/199 (48%), Gaps = 44/199 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
D GDIS + LR+ L CK F W++ ++ +PEL ++ GE +N Q +C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEVRNSAVQPARC 502
Query: 134 LDSM-GR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD M GR +GT CHG G +Q + L+ +G++ E CVD
Sbjct: 503 LDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDESCVDY--------------- 547
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+ + +F C G + WR+++ T +L+HA+ ++CL + + +
Sbjct: 548 -------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAK 592
Query: 252 LAMLRCDENNSYQQWRFKE 270
L M+ C ++ YQ W+FKE
Sbjct: 593 LEMVACQYDDPYQHWKFKE 611
>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
Length = 626
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 96/199 (48%), Gaps = 44/199 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
D GDIS + LR+ L CK F W++ ++ +PEL ++ GE +N Q +C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEVRNSAVQPARC 502
Query: 134 LDSM-GR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD M GR +GT CHG G +Q + L+ +G++ E CVD
Sbjct: 503 LDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDESCVDY--------------- 547
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+ + +F C G + WR+++ T +L+HA+ ++CL + + +
Sbjct: 548 -------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAK 592
Query: 252 LAMLRCDENNSYQQWRFKE 270
L M+ C ++ YQ W+FKE
Sbjct: 593 LEMVACQYDDPYQHWKFKE 611
>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
Length = 630
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 96/199 (48%), Gaps = 44/199 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
D GD+S + LR L CK F W++ ++ FPEL ++ GE +N Q +C
Sbjct: 454 DFGDVSARKKLRSDLGCKSFKWYLDNI-------FPELFVPGESVAKGEVRNSAVQPARC 506
Query: 134 LDSM-GRTAPA-VIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD M GR +GT CHG G +Q + L+ +G++ E CVD
Sbjct: 507 LDCMVGRHEKNRPVGTYQCHGQGGNQYWMLSKDGEIRRDESCVDY--------------- 551
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G+ + +F C G + WR+++ T +L+HA+ ++CL + + +
Sbjct: 552 -------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTRDGAK 596
Query: 252 LAMLRCDENNSYQQWRFKE 270
L M++C ++ YQ W+FKE
Sbjct: 597 LEMVQCQYDDPYQHWKFKE 615
>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 97/211 (45%), Gaps = 39/211 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ SP A GD+SE+ ALR++L CK F W+++++ +PE +P
Sbjct: 400 WLDEWKEFYYQMSPGARKASAGDVSERRALRERLKCKSFRWYLENI-------YPESQMP 452
Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ F GE +N+ T CLD+MGR + IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 453 LDYYFLGEIRNVKTHNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 512
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
LG V NL C G + W + + + ++H
Sbjct: 513 SNA-------------------------LGPV-NLVRCHGMGGNQE--WIYDDEEKTIKH 544
Query: 237 AILKRCLSVHPETN-QLAMLRCDENNSYQQW 266
CL+ E + +LR + QQW
Sbjct: 545 VNSGNCLTRASEDDPSTPLLRPCNYSEGQQW 575
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR + IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 473 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 512
>gi|405950576|gb|EKC18555.1| Putative polypeptide N-acetylgalactosaminyltransferase 10
[Crassostrea gigas]
Length = 526
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 100/232 (43%), Gaps = 33/232 (14%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P +D GD+SEQ A+RDKLHCK F WFM+ VA+D+ +P
Sbjct: 318 RVAEVWMDEYAEYLYKRRPHYRNIDPGDVSEQKAIRDKLHCKPFKWFMEEVAFDLPKFYP 377
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS---SQLFRLNAEGQLGHG 170
+ PP GE +N CLD+ + C G Q F + G
Sbjct: 378 PVEPPPFASGEVRNKAANMCLDTRYKGQNERFDLQPCLKDGKGGGEQQFEFTWHKDIRPG 437
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
+R V D + + K PV LFNC G + ++++
Sbjct: 438 KRTVCFDVSQSIK---KAPVI-------------------LFNCHGMGG--NQRFKYNID 473
Query: 231 TRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK-----EVRPDWSS 277
T+Q+ H I +CL + +++ M CD+ + Q W+ + EVR +W +
Sbjct: 474 TQQIYHVISNQCLDCDRDRHEIFMNPCDKTSETQMWKVEHVHEDEVRKEWDA 525
>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
Length = 632
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 74/133 (55%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP M +D GD+S + LRD LHCK FSW+++++ D ++ P
Sbjct: 390 RLAEVWMDDFKDFFYIISPGVMRVDYGDVSSRKGLRDALHCKPFSWYLENIYPD--SQIP 447
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +C+D+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 448 RRYYSL--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCL 505
Query: 175 DADKQEKLVEMLK 187
D + V MLK
Sbjct: 506 DVSRLNGPVLMLK 518
>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Monodelphis domestica]
Length = 559
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW++++V D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Sarcophilus harrisii]
Length = 559
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW++++V D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISTRVGLRHKLQCKPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
Length = 499
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D ++P +
Sbjct: 313 WMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRH 367
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 368 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 427
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 428 LNGPVTMLK 436
>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pongo abelii]
Length = 612
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D ++P +
Sbjct: 373 WMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRH 427
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 428 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 487
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 488 LNGPVTMLK 496
>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Callithrix jacchus]
gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
boliviensis boliviensis]
Length = 559
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Nomascus leucogenys]
gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pan troglodytes]
gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
paniscus]
gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
gorilla gorilla]
gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [synthetic
construct]
gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
fascicularis]
gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
Length = 559
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
anubis]
Length = 559
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|90078941|dbj|BAE89150.1| unnamed protein product [Macaca fascicularis]
Length = 311
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D ++P +
Sbjct: 125 WMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRH 179
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 180 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 239
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 240 LNGPVTMLK 248
>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 565
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/216 (25%), Positives = 94/216 (43%), Gaps = 38/216 (17%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE---- 115
W+ +K SP + + GD++ + LRDKL CK F W+++++ +PE
Sbjct: 379 WMDDYKTFFYKISPGSKKSEYGDVTNRKQLRDKLQCKSFKWYLENI-------YPESQFM 431
Query: 116 LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
+ N+ GE +N+ T++CLD+MGR +G CHG G +Q+F + +L H + C+D
Sbjct: 432 MDYNMI-GEIRNMETKQCLDNMGRKENNKVGIYACHGQGGNQIFAWTKKKELKHDDLCLD 490
Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLE 235
A +Q ++++L C N G + W F+ +
Sbjct: 491 ASRQSGFNDIMQL------------------------RCHNQGGNQE--WSFNRELNVIT 524
Query: 236 HAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
HA CL LR + QQW ++
Sbjct: 525 HATSGLCLDKADTGKDTPTLRVCDKRLSQQWELVDI 560
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 16/43 (37%), Positives = 24/43 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 43
MGR +G CHG G +Q+F + +L H + C+DA +Q
Sbjct: 452 MGRKENNKVGIYACHGQGGNQIFAWTKKKELKHDDLCLDASRQ 494
>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Anolis carolinensis]
Length = 498
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 73/134 (54%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD++ + ALRD L CK FSW++++V D
Sbjct: 294 RLAEVWMDEFKDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPD-----S 348
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P F GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C
Sbjct: 349 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 408
Query: 174 VDADKQEKLVEMLK 187
+D + V MLK
Sbjct: 409 LDVSRLNGPVIMLK 422
>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Cavia porcellus]
Length = 559
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRIGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|340386578|ref|XP_003391785.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like,
partial [Amphimedon queenslandica]
Length = 237
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/217 (28%), Positives = 95/217 (43%), Gaps = 33/217 (15%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+ P GDISE++ LR KLHCK F W+M+ +VY++
Sbjct: 41 MRLAEVWMDEYKEHYYAKRPSIRSRSYGDISERLELRKKLHCKSFRWYME----NVYSEM 96
Query: 114 PELPPNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
P NL + G N ++ C D+MG + CHG G +Q F + G+L +
Sbjct: 97 PLPLENLRYGGYVMNPQSKLCFDTMGNREYGKVSLFTCHGIGGNQEFGFSDSGELIFDDD 156
Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+D +E+ G + + NC G + W ++N T
Sbjct: 157 LCLDISSKER------------------------GGKVGILNCHGLGG--NQKWEYNNVT 190
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
R L+H + K C+ P N L M C E+ Q+W F
Sbjct: 191 RHLQHVVTKLCVEKGP-GNILVMNNCMESFPSQEWIF 226
>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Anolis carolinensis]
Length = 556
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 98/220 (44%), Gaps = 37/220 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD++ + ALRD L CK FSW++++V D
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPD-----S 421
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P F GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C
Sbjct: 422 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 481
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+D + V MLK C + G + W +
Sbjct: 482 LDVSRLNGPVIMLK--------------------------CHHMRG--NQLWEYDAERLT 513
Query: 234 LEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
L H +CL E +++ M C E+ S QQW + +
Sbjct: 514 LRHLNSNQCLDEPAEDDKMVPTMKDCSESRS-QQWLLRNM 552
>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 600
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 48/131 (36%), Positives = 74/131 (56%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ SP A GD+SE+ ALR+KL CK F W+++++ +PE +P
Sbjct: 402 WLDEWKEFYYQMSPGARKASAGDVSERRALREKLKCKSFRWYLENI-------YPESQMP 454
Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ F GE +N +Q CLD+MGR + IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 455 LDYYFLGEIRNEESQNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 514
Query: 177 DKQEKLVEMLK 187
V +++
Sbjct: 515 SNALGPVNLVR 525
Score = 40.0 bits (92), Expect = 1.00, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR + IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 475 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 514
>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
Length = 624
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 94/197 (47%), Gaps = 42/197 (21%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
D GDIS + LR+ L CK F W++ ++ +PEL ++ GE +N T +CLD
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGELRNAQTSQCLD 502
Query: 136 S-MGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
S +G T Y CH G +Q + L+ +G++ E CVD
Sbjct: 503 SAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESCVDY----------------- 545
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA 253
G+ + +F C G + WR+++ T +L+HA+ ++CL + + +L
Sbjct: 546 -----------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAKLE 592
Query: 254 MLRCDENNSYQQWRFKE 270
M+ C ++ YQ W+FKE
Sbjct: 593 MVACQYDDPYQHWKFKE 609
>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Anolis carolinensis]
Length = 557
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 98/220 (44%), Gaps = 37/220 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD++ + ALRD L CK FSW++++V D
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVTVRKALRDNLKCKPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P F GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C
Sbjct: 423 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
+D + V MLK C + G + W +
Sbjct: 483 LDVSRLNGPVIMLK--------------------------CHHMRG--NQLWEYDAERLT 514
Query: 234 LEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
L H +CL E +++ M C E+ S QQW + +
Sbjct: 515 LRHLNSNQCLDEPAEDDKMVPTMKDCSESRS-QQWLLRNM 553
>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
Length = 624
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/197 (28%), Positives = 94/197 (47%), Gaps = 42/197 (21%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
D GDIS + LR+ L CK F W++ ++ +PEL ++ GE +N T +CLD
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGELRNAQTSQCLD 502
Query: 136 S-MGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
S +G T Y CH G +Q + L+ +G++ E CVD
Sbjct: 503 SAVGEEVENKAITPYPCHEQGGNQYWMLSKDGEIRRDESCVDY----------------- 545
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA 253
G+ + +F C G + WR+++ T +L+HA+ ++CL + + +L
Sbjct: 546 -----------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGAKLE 592
Query: 254 MLRCDENNSYQQWRFKE 270
M+ C ++ YQ W+FKE
Sbjct: 593 MVACQYDDPYQHWKFKE 609
>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase 10-like [Metaseiulus
occidentalis]
Length = 590
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 64/221 (28%), Positives = 95/221 (42%), Gaps = 33/221 (14%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P L GD+S+Q ALR KL CK F WFM+ VA+D ++P
Sbjct: 385 RVAEVWMDEYKEFLYNRRPHYRTLGYGDVSKQKALRKKLKCKPFKWFMETVAFDQPLRYP 444
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLGH 169
+ PP+ WG +N+G KCLD+ + C S Q F L L
Sbjct: 445 PVEPPDFAWGAIRNVGADKCLDTKFKEQGKRFSLETCISSNGDVSGEQNFVLTWHKDLRP 504
Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
+R C D EK K PV L+ C G + +++
Sbjct: 505 AKRNVCFDVSSGEK-----KAPVV-------------------LWTCHGMHG--NQLFKY 538
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ +T+QL H I +C+ P ++ M CD ++ Q+W F
Sbjct: 539 NVNTKQLLHPITAQCIDCDPGNLEIFMNPCDVESNTQKWLF 579
>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Ornithorhynchus anatinus]
Length = 558
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS ++ LR KL CK FSW++++V D ++P +
Sbjct: 372 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPD-----SQIPRH 426
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 427 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 486
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 487 LNGPVTMLK 495
>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Anolis carolinensis]
Length = 559
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW++++V D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Ornithorhynchus anatinus]
Length = 559
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/129 (36%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS ++ LR KL CK FSW++++V D ++P +
Sbjct: 373 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENVYPD-----SQIPRH 427
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 428 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 487
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 488 LNGPVTMLK 496
>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
Acetylgalactosaminyltransferase-T1
Length = 472
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D ++P +
Sbjct: 286 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----SQIPRH 340
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 341 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 400
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 401 LNGPVTMLK 409
>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Taeniopygia guttata]
Length = 559
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW++++V D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Meleagris gallopavo]
gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
Length = 559
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW++++V D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
(Silurana) tropicalis]
gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus (Silurana) tropicalis]
Length = 498
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+SE+ ALR+ L C FSW+++ V D
Sbjct: 294 RLAEVWMDDFKDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPD-----S 348
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P F GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C
Sbjct: 349 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 408
Query: 174 VDADKQEKLVEMLK 187
+D + V MLK
Sbjct: 409 LDVSRLNGPVIMLK 422
>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 632
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 56/202 (27%), Positives = 96/202 (47%), Gaps = 47/202 (23%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ---- 131
D GD+S++ ALR++L CK F W++ ++ +PEL ++ GE +N G +
Sbjct: 456 DYGDVSDRKALRERLKCKSFKWYLDNI-------YPELFVPGDSVAKGEVRNYGYKEGGG 508
Query: 132 --KCLDS-MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKL 188
+CLDS +G + CHG G +Q + L+ +G++ E C+D
Sbjct: 509 APQCLDSVVGEDVHKDVTPYPCHGQGGNQYWMLSKDGEIRRDESCID------------- 555
Query: 189 PVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPE 248
AG ++ +F C G + WR+++ QL H + +CL + +
Sbjct: 556 ------YAGANVM---------IFPCHGMKGNQE--WRYNHKLHQLMHVVSDKCLEMSRD 598
Query: 249 TNQLAMLRCDENNSYQQWRFKE 270
++L + CD +N YQQW F+E
Sbjct: 599 GSKLLVNTCDSSNPYQQWAFQE 620
>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 559
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
adamanteus]
Length = 556
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 98/214 (45%), Gaps = 35/214 (16%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + ALR+ L CK FSW++++V D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLEYVYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCMDVSRL 487
Query: 180 EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
V MLK C + G + W + L H
Sbjct: 488 NGPVIMLK--------------------------CHHMRG--NQLWEYDAERLTLRHVNS 519
Query: 240 KRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
+CL E +++ M C E+ S QQW + +
Sbjct: 520 NQCLDEPSEDDKMVPTMKDCSESRS-QQWLLRNM 552
>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|148664577|gb|EDK96993.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
gi|148664578|gb|EDK96994.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
Length = 400
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D ++P +
Sbjct: 214 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----SQIPRH 268
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 269 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 328
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 329 LNGPVTMLK 337
>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Canis lupus familiaris]
gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Ailuropoda melanoleuca]
gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
Length = 559
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Meleagris gallopavo]
Length = 556
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + ALR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVDTNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVTMLK 495
>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
gallus]
Length = 556
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + ALR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSARKALREALKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVDTNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVTMLK 495
>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
Length = 591
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D ++P +
Sbjct: 405 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----SQIPRH 459
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 460 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 519
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 520 LNGPVTMLK 528
>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
Length = 483
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 71/129 (55%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+SE+ ALR+ L C FSW+++ V D ++P
Sbjct: 299 WMDDFKDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPD-----SQIPRR 353
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 354 YFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSR 413
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 414 LNGPVIMLK 422
>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus laevis]
gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
Length = 556
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 72/134 (53%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+SE+ ALR+ L C FSW+++ V D
Sbjct: 367 RLAEVWMDDFKDFFYIISPGVVKVDYGDVSERKALRENLKCNPFSWYLETVYPD-----S 421
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P F GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C
Sbjct: 422 QIPRRYFSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLC 481
Query: 174 VDADKQEKLVEMLK 187
+D + V MLK
Sbjct: 482 LDVSRLNGPVIMLK 495
>gi|16769916|gb|AAL29177.1| SD10722p [Drosophila melanogaster]
Length = 666
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 106/224 (47%), Gaps = 36/224 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K + S G L +D GD++EQ A+R KL+CK F WFMK VA+D+ +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCKSFKWFMKEVAFDLMKTY 509
Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
P + PP+ G +N+G Q CLD++GR +G C + +Q + L+ +
Sbjct: 510 PPVDPPSYAMGALQNVGNQNLCLDTLGRKKHNKMGMYACADNIKTPQRTQFWELSWKRDL 569
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+L + C+D + PV L++C + G + W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603
Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ +QL+H +RCL + P + ++ +CD +N +QQW F
Sbjct: 604 YYDYRHKQLKHGTEGRRCLELLPFSQEVVANKCDTDNRFQQWNF 647
>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Danio rerio]
Length = 556
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 74/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + ALR+ L CK FSW++++V D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVRVDYGDVSSRKALRESLKCKPFSWYLENVYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +C+D+MGR +G CHG G +Q+F A+ ++ + C+DA +
Sbjct: 430 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDASRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVVMLK 495
>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
Length = 630
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 72/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ P A GD+S++ ALRD+L CK F W++++V +PE +P
Sbjct: 444 WLDDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 496
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 557 SSSNGPVNMVR 567
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
Length = 541
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 72/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ P A GD+S++ ALRD+L CK F W++++V +PE +P
Sbjct: 355 WLDDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 407
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 408 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 467
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 468 SSSNGPVNMVR 478
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 428 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 467
>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
caballus]
Length = 453
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D ++P +
Sbjct: 267 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----SQIPRH 321
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 322 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 381
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 382 LNGPVTMLK 390
>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
Length = 675
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 72/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ P A GD+S++ ALRD+L CK F W++++V +PE +P
Sbjct: 489 WLDDWKEFYYSFYPGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 541
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 542 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 602 SSSNGPVNMVR 612
Score = 37.4 bits (85), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 562 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601
>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oreochromis niloticus]
Length = 557
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDI+ + ALR KL CK FSW+++++ D ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDITSRTALRQKLQCKPFSWYLENIYPD-----SQIPRH 425
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
+ GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 486 LNGPVMMLK 494
>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
Length = 553
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW++++V D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRRKLQCRPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 559
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GD++ ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDVASRIGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Takifugu rubripes]
Length = 556
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP M +D GD+S + LRD L CK FSW+++++ D ++ P +
Sbjct: 372 WMDDFKDFFYIISPGVMRVDYGDVSSRKGLRDALRCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +C+D+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVLMLK 495
>gi|395823173|ref|XP_003804166.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1 [Otolemur garnettii]
Length = 539
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 348 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 402
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 403 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 462
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 463 LDVSKLNGPVTMLK 476
>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Equus caballus]
Length = 559
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Monodelphis domestica]
Length = 556
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/128 (35%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + ALR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKALRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F AE ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKDNEKVGMFNCHGMGGNQVFSYTAEKEIRTDDFCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 SGPVIMLK 495
>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
mutus]
Length = 564
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 373 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 427
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 428 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 487
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 488 LDVSKLNGPVTMLK 501
>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
catus]
Length = 559
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
[Bos taurus]
Length = 519
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D ++P +
Sbjct: 333 WMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----SQIPRH 387
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 388 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 447
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 448 LNGPVTMLK 456
>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
Length = 559
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDI+ ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDIASRLGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Loxodonta africana]
Length = 559
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oryzias latipes]
Length = 557
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS + +LR KL CK FSW+++++ D ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDISTRTSLRQKLQCKPFSWYLENIYPD-----SQIPRH 425
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
+ GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 486 LNGPVMMLK 494
>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Sus scrofa]
Length = 559
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
taurus]
gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
taurus]
gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
Length = 559
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
aries]
Length = 559
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 514
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/202 (30%), Positives = 96/202 (47%), Gaps = 43/202 (21%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLGT-QKCL 134
D GDIS+++ALRDKL C F W++ V +PEL P + GE +NLG+ CL
Sbjct: 346 DYGDISDRIALRDKLKCHSFKWYIDEV-------YPELFVPGDAIGSGEMRNLGSGGMCL 398
Query: 135 DS-MGRTA-PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
DS G+++ +G CHG G +Q + + G++ E C+D
Sbjct: 399 DSPAGKSSLHKAVGLYPCHGQGGNQYWLYSKNGEIRRDEACLD----------------- 441
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
AG +I L+ C S G + W + T+ L H ++CL + +L
Sbjct: 442 --YAGTDVI---------LYPCHGSKG--NQYWIYDQQTKLLRHGSSRKCLEMSLSKTEL 488
Query: 253 AMLRCDENNSYQQWRFKEVRPD 274
+ CDE+N+ Q+W+F + R D
Sbjct: 489 LVRGCDEDNNRQKWKFGKYRED 510
>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
laevis]
gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
Length = 559
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDI+ +V LR KL CK FSW++++V D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDIATRVGLRHKLQCKPFSWYLENVYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + + GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTASKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVIMLK 496
>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
Length = 591
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 96/224 (42%), Gaps = 33/224 (14%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P +D GD+++Q ALR+KLHCK F WFM+ VA+D+ K+P
Sbjct: 379 RVAEVWMDEYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYP 438
Query: 115 ELPPNLF-WGEAKNLGTQK-CLDSMGRTAPAVIGTSYC---HGSGSSQLFRLNAEGQLGH 169
+ P F GE +NL + C+DS + VIG + C Q F L L
Sbjct: 439 PIEPGDFGVGEIRNLAAPELCVDSGHKDRDQVIGLAECVKGTNKNGEQNFALTWHKDLRV 498
Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
+ C+D +++ L+ C S G + WR+
Sbjct: 499 KGKTLCLDVSDPNDKADIV------------------------LYPCHGSQG--NQYWRY 532
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ H RCL P +L + +CD+ + Q+WRF+ V
Sbjct: 533 DVEKQWFLHGGNPRCLDCDPGQRRLYVTKCDDQSKTQKWRFENV 576
>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
Length = 613
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 96/224 (42%), Gaps = 33/224 (14%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P +D GD+++Q ALR+KLHCK F WFM+ VA+D+ K+P
Sbjct: 401 RVAEVWMDEYAEYLYKRRPHYRDIDPGDLTKQKALREKLHCKPFKWFMEKVAFDLPLKYP 460
Query: 115 ELPPNLF-WGEAKNLGTQK-CLDSMGRTAPAVIGTSYC---HGSGSSQLFRLNAEGQLGH 169
+ P F GE +NL + C+DS + VIG + C Q F L L
Sbjct: 461 PIEPGDFGVGEIRNLAAPELCVDSGHKDRDQVIGLAECVKGTNKNGEQNFALTWHKDLRV 520
Query: 170 GER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
+ C+D +++ L+ C S G + WR+
Sbjct: 521 KGKTLCLDVSDPNDKADIV------------------------LYPCHGSQG--NQYWRY 554
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ H RCL P +L + +CD+ + Q+WRF+ V
Sbjct: 555 DVEKQWFLHGGNPRCLDCDPGQRRLYVTKCDDQSKTQKWRFENV 598
>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
Length = 623
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 96/200 (48%), Gaps = 49/200 (24%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLG--TQKC 133
D GDIS + LR+ L CK F W++ ++ +PEL ++ GE +N G ++C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEMRNAGGKNRQC 502
Query: 134 LD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV 190
+D S G+T +G CH G +Q + L+ +G++ E CVD
Sbjct: 503 IDYKPSGGKT----VGMYQCHNQGGNQYWMLSKDGEIRRDESCVDY-------------- 544
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
G+ + +F C G + WR+++ T +L+HA+ ++CL + +
Sbjct: 545 --------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGA 588
Query: 251 QLAMLRCDENNSYQQWRFKE 270
+L M+ C ++ YQ W+FKE
Sbjct: 589 KLEMVACQYDDPYQHWKFKE 608
>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
Length = 623
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 96/200 (48%), Gaps = 49/200 (24%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLG--TQKC 133
D GDIS + LR+ L CK F W++ ++ +PEL ++ GE +N G ++C
Sbjct: 450 DFGDISSRKKLREDLGCKSFKWYLDNI-------YPELFVPGESVAKGEMRNAGGKNRQC 502
Query: 134 LD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV 190
+D S G+T +G CH G +Q + L+ +G++ E CVD
Sbjct: 503 IDYKPSGGKT----VGMYQCHNQGGNQYWMLSKDGEIRRDESCVDY-------------- 544
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
G+ + +F C G + WR+++ T +L+HA+ ++CL + +
Sbjct: 545 --------------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTKDGA 588
Query: 251 QLAMLRCDENNSYQQWRFKE 270
+L M+ C ++ YQ W+FKE
Sbjct: 589 KLEMVACQYDDPYQHWKFKE 608
>gi|74215848|dbj|BAE28617.1| unnamed protein product [Mus musculus]
Length = 330
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D G+IS ++ LR KL CK FSW+++++ D ++P +
Sbjct: 144 WMDEFKNFFYIISPGVTKVDYGNISSRLGLRRKLQCKPFSWYLENIYPD-----SQIPRH 198
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 199 YFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 258
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 259 LNGPVTMLK 267
>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Takifugu rubripes]
Length = 557
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 69/129 (53%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDI+ + ALR KL CK FSW+++ + D ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDIATRTALRQKLQCKPFSWYLESIYPD-----SQIPRH 425
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
+ GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 486 LNGPVMMLK 494
>gi|383863685|ref|XP_003707310.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Megachile
rotundata]
Length = 610
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 99/220 (45%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ + + P LD G++++Q LR +LHCK F WFM++VA+D+ +P + P+
Sbjct: 401 WMDEYAEYIYRRRPHLRSLDPGNLTKQKNLRARLHCKPFKWFMENVAFDLVDVYPPIEPD 460
Query: 120 LFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLGHGER- 172
F GE +N+G + CLD+ R ++ C S Q F+L + +R
Sbjct: 461 DFASGEIRNMGVPELCLDAKKRKKDELVVVDICMKDNSKIIGEQEFKLTWHKDIRPKDRT 520
Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+D + + K PV+ L+ C G + WR++
Sbjct: 521 ECLDVSRGDA-----KAPVS-------------------LYPCHGGQG--NQLWRYNVEK 554
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ L H RCL P + ++ + CD + Q+WR ++V
Sbjct: 555 QWLMHGYSSRCLDTDPASKKVFVTNCDSTSPTQKWRIEKV 594
>gi|149698080|ref|XP_001498934.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Equus
caballus]
Length = 601
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKG-- 538
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ W H R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWMFEHI 587
>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Oryzias latipes]
Length = 557
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP M +D GD+S + ALR+ L CK F+W+++++ D ++ P +
Sbjct: 373 WMDEFKDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPD--SQIPRRYYS 430
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +C+D+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 431 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 488
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 489 NGPVVMLK 496
>gi|24656262|ref|NP_647749.2| polypeptide GalNAc transferase 6, isoform A [Drosophila
melanogaster]
gi|24656265|ref|NP_728779.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
melanogaster]
gi|442629817|ref|NP_001261342.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
melanogaster]
gi|51315873|sp|Q6WV16.2|GALT6_DROME RecName: Full=N-acetylgalactosaminyltransferase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|7292281|gb|AAF47689.1| polypeptide GalNAc transferase 6, isoform A [Drosophila
melanogaster]
gi|7292282|gb|AAF47690.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
melanogaster]
gi|440215219|gb|AGB94037.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
melanogaster]
Length = 666
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 106/224 (47%), Gaps = 36/224 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K + S G L +D GD++EQ A+R KL+CK F WFM+ VA+D+ +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCKSFKWFMEEVAFDLMKTY 509
Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
P + PP+ G +N+G Q CLD++GR +G C + +Q + L+ +
Sbjct: 510 PPVDPPSYAMGALQNVGNQNLCLDTLGRKKHNKMGMYACADNIKTPQRTQFWELSWKRDL 569
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+L + C+D + PV L++C + G + W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603
Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ +QL+H +RCL + P + ++ +CD +N +QQW F
Sbjct: 604 YYDYRHKQLKHGTEGRRCLELLPFSQEVVANKCDTDNRFQQWNF 647
>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Oryzias latipes]
Length = 556
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP M +D GD+S + ALR+ L CK F+W+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +C+D+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVVMLK 495
>gi|34042986|gb|AAQ56703.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 666
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 106/224 (47%), Gaps = 36/224 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K + S G L +D GD++EQ A+R KL+CK F WFM+ VA+D+ +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCKSFKWFMEEVAFDLMKTY 509
Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
P + PP+ G +N+G Q CLD++GR +G C + +Q + L+ +
Sbjct: 510 PPVDPPSYAMGALQNVGNQNLCLDTLGRKKHNKMGMYACADNIKTPQRTQFWELSWKRDL 569
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+L + C+D + PV L++C + G + W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603
Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ +QL+H +RCL + P + ++ +CD +N +QQW F
Sbjct: 604 YYDYRHKQLKHGTEGRRCLELLPFSQEVVANKCDTDNRFQQWNF 647
>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Danio rerio]
Length = 557
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/129 (34%), Positives = 70/129 (54%), Gaps = 6/129 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP +D GDIS + +LR +L CK FSW++++V D ++P +
Sbjct: 371 WMDEFKNFFYIISPGVTKVDYGDISTRTSLRQRLQCKPFSWYLENVYPD-----SQIPRH 425
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
+ GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+D K
Sbjct: 426 YYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSK 485
Query: 179 QEKLVEMLK 187
V MLK
Sbjct: 486 LNGPVMMLK 494
>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Oryzias latipes]
Length = 558
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP M +D GD+S + ALR+ L CK F+W+++++ D ++ P +
Sbjct: 374 WMDEFKDFFYIISPGVMRVDYGDVSSRKALREALKCKPFAWYLENIYPD--SQIPRRYYS 431
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +C+D+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 432 L--GEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 489
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 490 NGPVVMLK 497
>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
Length = 670
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 94/215 (43%), Gaps = 35/215 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P AMF +G+IS+++ALR +L CK F W++K++ +P
Sbjct: 383 RTAEVWMDEYKEFYYKMTPSAMFAPLGNISDRIALRKRLGCKSFKWYLKNI-------YP 435
Query: 115 E--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +PP + G KN CLD+MGR A CH SG +Q+ + G
Sbjct: 436 ESNIPPTYYSIGYIKNEKNDLCLDTMGRKASGSPALLTCHNSGGNQVLFMKVWSYTGTLN 495
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
+ AD+ C+ A K + + L C N + W + +T
Sbjct: 496 --IRADE-------------LCLQASRK-----ADSPIFLQQCNNDESQI---WDYDPNT 532
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
Q H + CL+ P N + M C + N QQW
Sbjct: 533 YQFRHRQSRACLT--PLNNMIVMAECYKENLLQQW 565
>gi|157113705|ref|XP_001652065.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108877647|gb|EAT41872.1| AAEL006558-PA [Aedes aegypti]
Length = 368
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 74/136 (54%), Gaps = 10/136 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R WL +K+ SP A GD+SE+ LR++L CK F W+++++ +P
Sbjct: 168 RVAEVWLDEWKEFYYQMSPGARKASAGDVSERKELRERLKCKSFRWYLENI-------YP 220
Query: 115 E--LPPNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + F GE +N+ T CLD+MGR + IG+SYCHG G +Q+F Q+ +
Sbjct: 221 ESQMPLDYYFLGEIRNVETGNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDD 280
Query: 172 RCVDADKQEKLVEMLK 187
C+DA V +++
Sbjct: 281 NCLDASNALGPVNLVR 296
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR + IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 246 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDA 285
>gi|345790655|ref|XP_543189.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Canis lupus
familiaris]
Length = 601
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKG-- 538
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ W H R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
Length = 632
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 72/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ ALRD+L CK F W++++V +PE +P
Sbjct: 446 WLDEWRDFYYAMSTGARKASAGDVSDRKALRDRLQCKSFRWYLENV-------YPESLMP 498
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G+SYCHG G +Q+F Q+ + C+DA
Sbjct: 499 LDYYYLGEIRNAETETCLDTMGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 558
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 559 ASSSGPVNMVR 569
Score = 38.1 bits (87), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 23/40 (57%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G+SYCHG G +Q+F Q+ + C+DA
Sbjct: 519 MGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 558
>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Ornithorhynchus anatinus]
Length = 556
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + ALR L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKALRQNLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
Length = 592
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 95/211 (45%), Gaps = 39/211 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ SP A GD+SE+ LR++L CK F W+++++ +PE +P
Sbjct: 397 WLDEWKEFYYQMSPGARKASAGDVSERKELRERLKCKSFRWYLENI-------YPESQMP 449
Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ F GE +N+ T CLD+MGR + IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 450 LDYYFLGEIRNVETGNCLDTMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDA 509
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
LG V NL C G + W + + ++H
Sbjct: 510 SNA-------------------------LGPV-NLVRCHGMGGNQE--WVYDEEDKTIKH 541
Query: 237 AILKRCLSVHPETN-QLAMLRCDENNSYQQW 266
CL+ + + +LR + QQW
Sbjct: 542 VNSGNCLTRPTQDDPSTPLLRPCKYTKGQQW 572
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/40 (42%), Positives = 24/40 (60%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR + IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 470 MGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDDNCLDA 509
>gi|380024969|ref|XP_003696257.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
florea]
Length = 598
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P LD G++ Q LR KLHCK F WFM+++A+D+ +P
Sbjct: 384 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 443
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
+ P+ F GE +N+G + CLDS R ++ C + Q F+L +
Sbjct: 444 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 503
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C+D K G K ++L+ C G + WR
Sbjct: 504 PKDRTECLDVSK-----------------GGAK-------ASVSLYPCHGGQG--NQLWR 537
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + L H RCL P + ++ + +CD ++ Q+WR ++V
Sbjct: 538 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 582
>gi|296195172|ref|XP_002745263.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Callithrix jacchus]
Length = 601
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNIAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H++ C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHSVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|403295707|ref|XP_003938772.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Saimiri boliviensis boliviensis]
Length = 601
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNIAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H++ C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 FWGYR---KDRTLFHSVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|328781649|ref|XP_003250010.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
mellifera]
Length = 598
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P LD G++ Q LR KLHCK F WFM+++A+D+ +P
Sbjct: 384 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 443
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
+ P+ F GE +N+G + CLDS R ++ C + Q F+L +
Sbjct: 444 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 503
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C+D K G K ++L+ C G + WR
Sbjct: 504 PKDRTECLDVSK-----------------GGTK-------ASVSLYPCHGGQG--NQLWR 537
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + L H RCL P + ++ + +CD ++ Q+WR ++V
Sbjct: 538 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 582
>gi|380024971|ref|XP_003696258.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 3 [Apis
florea]
Length = 590
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P LD G++ Q LR KLHCK F WFM+++A+D+ +P
Sbjct: 376 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 435
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
+ P+ F GE +N+G + CLDS R ++ C + Q F+L +
Sbjct: 436 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 495
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C+D K G K ++L+ C G + WR
Sbjct: 496 PKDRTECLDVSK-----------------GGAK-------ASVSLYPCHGGQG--NQLWR 529
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + L H RCL P + ++ + +CD ++ Q+WR ++V
Sbjct: 530 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 574
>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Oreochromis niloticus]
Length = 556
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP M ++ GD+S + ALR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDDFKDFFYIISPGVMRVEYGDVSSRKALREALKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +C+D+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCMDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVVMLK 495
>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
Length = 559
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 71/133 (53%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL C+ FSW+++++ D ++ P
Sbjct: 368 RLAEVWMDEFKTFFYIISPGVTKVDYGDISSRLGLRHKLQCRPFSWYLENIYPD--SQIP 425
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C+
Sbjct: 426 RHYSSL--GEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCL 483
Query: 175 DADKQEKLVEMLK 187
D K V MLK
Sbjct: 484 DVSKLNGPVTMLK 496
>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 559
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 69/134 (51%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE + T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRKEETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|328781647|ref|XP_003250009.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
mellifera]
Length = 611
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P LD G++ Q LR KLHCK F WFM+++A+D+ +P
Sbjct: 397 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 456
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
+ P+ F GE +N+G + CLDS R ++ C + Q F+L +
Sbjct: 457 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 516
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C+D K G K ++L+ C G + WR
Sbjct: 517 PKDRTECLDVSK-----------------GGTK-------ASVSLYPCHGGQG--NQLWR 550
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + L H RCL P + ++ + +CD ++ Q+WR ++V
Sbjct: 551 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 595
>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
Length = 561
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 424
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 482
Query: 175 DADKQEKLVEMLK 187
D + V MLK
Sbjct: 483 DVSRLNGPVIMLK 495
>gi|194222233|ref|XP_001490001.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Equus
caballus]
Length = 539
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 73/128 (57%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + +LR+ L CK FSW+++++ D ++ P +
Sbjct: 355 WMDEFKDFFYIISPGVVKVDYGDVSVRKSLRENLKCKPFSWYLENIYPD--SQIPRRYYS 412
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 413 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 470
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 471 NGPVIMLK 478
>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Saimiri boliviensis boliviensis]
Length = 556
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLQCKPFSWYLENIYPD--SQIP 424
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 482
Query: 175 DADKQEKLVEMLK 187
D + V MLK
Sbjct: 483 DVSRLNGPVIMLK 495
>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
Length = 675
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 72/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ P A GD+S++ ALR++L CK F W++++V +PE +P
Sbjct: 489 WLDDWKEFYYSFYPGARKASAGDVSDRKALRERLKCKSFRWYLENV-------YPESLMP 541
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 542 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 602 SSSNGPVNMVR 612
Score = 37.4 bits (85), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 562 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 601
>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
Length = 559
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 70/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS +V LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRVGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T + LD+M R +G CHG G +Q+F A ++ + C
Sbjct: 423 QIPRHYFSLGEIRNVETNQFLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 482
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 483 LDVSKLNGPVTMLK 496
>gi|380024967|ref|XP_003696256.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
florea]
Length = 611
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P LD G++ Q LR KLHCK F WFM+++A+D+ +P
Sbjct: 397 RVAEVWMDEYAEYIYRRRPHLRSLDPGNLKSQRDLRAKLHCKPFKWFMENIAFDLVDVYP 456
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSS----QLFRLNAEGQLG 168
+ P+ F GE +N+G + CLDS R ++ C + Q F+L +
Sbjct: 457 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDICIKDNPNIVGEQEFKLTWHKDIR 516
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C+D K G K ++L+ C G + WR
Sbjct: 517 PKDRTECLDVSK-----------------GGAK-------ASVSLYPCHGGQG--NQLWR 550
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + L H RCL P + ++ + +CD ++ Q+WR ++V
Sbjct: 551 YNVEKQWLMHGYSSRCLDTDPASKKVFVTKCDPSSPTQKWRIEKV 595
>gi|340713833|ref|XP_003395440.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like [Bombus
terrestris]
Length = 610
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P L+ G++ EQ LR +LHCK F WFM+++A+D+ +P
Sbjct: 396 RVAEVWMDEYAEYIYTRRPHLRSLNPGNLKEQRDLRARLHCKPFKWFMENIAFDLVDVYP 455
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLG 168
+ P+ F GE +N+G + CLDS R ++ C Q F+L +
Sbjct: 456 PIEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDTCIKDNPKIIGEQEFKLTWHKDIR 515
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C+D + K PVT L+ C G + WR
Sbjct: 516 PKDRTECLDVSRGGA-----KAPVT-------------------LYPCHGGQG--NQLWR 549
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + L H RCL P + ++ +CD +++ Q+WR ++V
Sbjct: 550 YNVEKQWLMHGYTSRCLDTDPASKKVFAAKCDSSSATQKWRIEKV 594
>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
partial [Papio anubis]
Length = 444
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 306 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 363
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 364 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 421
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 422 NGPVIMLK 429
>gi|350409603|ref|XP_003488790.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Bombus
impatiens]
Length = 610
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 99/225 (44%), Gaps = 34/225 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P L+ G++ EQ LR +LHCK F WFM+++A+D+ +P
Sbjct: 396 RVAEVWMDEYAEYIYTRRPHLRSLNPGNLKEQRDLRARLHCKPFKWFMENIAFDLVDVYP 455
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLG 168
+ P+ F GE +N+G + CLDS R ++ C Q F+L +
Sbjct: 456 PVEPDDFASGEIRNMGVPELCLDSKKRKKDELVVVDTCIKDNPKIIGEQEFKLTWHKDIR 515
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C+D + K PVT L+ C G + WR
Sbjct: 516 PKDRTECLDVSRGGA-----KAPVT-------------------LYPCHGGQG--NQLWR 549
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + L H RCL P + ++ +CD +++ Q+WR ++V
Sbjct: 550 YNVEKQWLMHGYTSRCLDTDPASKKVFAAKCDSSSATQKWRIEKV 594
>gi|344237432|gb|EGV93535.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Cricetulus
griseus]
Length = 413
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/134 (34%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D G+IS ++ LR KL CK FSW+++++ D
Sbjct: 222 RLAEVWMDEFKNFFYIISPGFTKVDYGEISSRLGLRHKLQCKPFSWYLENIYPD-----S 276
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ T +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 277 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLC 336
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 337 LDVSKLNGPVTMLK 350
>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
Length = 516
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 327 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 384
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 385 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 442
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 443 SGPVIMLK 450
>gi|426337430|ref|XP_004032709.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like,
partial [Gorilla gorilla gorilla]
Length = 231
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 47 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 104
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 105 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 162
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 163 NGPVIMLK 170
>gi|410956556|ref|XP_003984908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Felis catus]
Length = 601
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF L +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTLGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCVDCNPAEKKIFMARCDPLSETQQWVFEHL 587
>gi|348566779|ref|XP_003469179.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Cavia porcellus]
Length = 601
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCIDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 FWGYR---KDRTLFHPVSNSCMDSNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cricetulus griseus]
Length = 497
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 313 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 370
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 371 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 428
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 429 NGPVIMLK 436
>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
Length = 519
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 97/219 (44%), Gaps = 35/219 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 330 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 387
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 388 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 445
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
D + V MLK C + G + W + L
Sbjct: 446 DVSRLNGPVIMLK--------------------------CHHMRG--NQLWEYDAERLTL 477
Query: 235 EHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
H +CL E +++ M C N S QQW + +
Sbjct: 478 RHVNSNQCLDEPSEEDKMVPTMQDCSGNRS-QQWLLRNM 515
>gi|397506054|ref|XP_003823551.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Pan paniscus]
Length = 518
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 304 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 363
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 364 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 423
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 424 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 457
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 458 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 504
>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pan troglodytes]
gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Pongo abelii]
gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pongo abelii]
gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
paniscus]
gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
Length = 556
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Callithrix jacchus]
Length = 561
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPD--SQIP 424
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 482
Query: 175 DADKQEKLVEMLK 187
D + V MLK
Sbjct: 483 DVSRLNGPVIMLK 495
>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Nomascus leucogenys]
Length = 557
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 425
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 426 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 483
Query: 175 DADKQEKLVEMLK 187
D + V MLK
Sbjct: 484 DVSRLNGPVIMLK 496
>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
taurus]
gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
Length = 556
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
Length = 488
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/217 (26%), Positives = 94/217 (43%), Gaps = 40/217 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD--VYTKFPELP 117
W+ FK + SP +D GD++ + LRDKL+CK F W+++++ D + T + L
Sbjct: 299 WMDNFKDFFYIISPGVTKVDYGDVTGRKELRDKLNCKPFKWYLENIYPDSQIPTSYHSL- 357
Query: 118 PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 177
GE +N+ + +C+D+M R +G CHG G +Q+F E +L + C+D
Sbjct: 358 -----GEIRNVDSNQCIDNMARKENEKVGIFSCHGMGGNQVFSYTKEKELRTDDLCLDVS 412
Query: 178 KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
K G + LF C + G + W + + L+H
Sbjct: 413 KP--------------------------GGPVMLFKCHHLGG--NQLWEYDEKIQLLKHV 444
Query: 238 ILKRCLSVHPET---NQLAMLRCDENNSYQQWRFKEV 271
+CL P + + +M CD S Q W+ V
Sbjct: 445 NSNQCLD-KPHSGSPQEPSMQPCDPGKSSQHWQLTNV 480
>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
aries]
Length = 556
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|402870849|ref|XP_003899412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Papio anubis]
Length = 584
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 370 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 429
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 430 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 489
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 490 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 523
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 524 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 570
>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Otolemur garnettii]
Length = 556
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|426346013|ref|XP_004040685.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Gorilla gorilla gorilla]
Length = 555
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 341 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 400
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 401 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 460
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 461 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 494
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 495 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 541
>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
Length = 556
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Oryctolagus cuniculus]
Length = 557
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 425
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 426 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 483
Query: 175 DADKQEKLVEMLK 187
D + V MLK
Sbjct: 484 DVSRLNGPVIMLK 496
>gi|109076173|ref|XP_001084905.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 2
[Macaca mulatta]
Length = 584
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 370 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 429
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 430 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 489
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 490 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 523
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 524 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 570
>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
norvegicus]
gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
Length = 556
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 SGPVIMLK 495
>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
Length = 516
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 332 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 389
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 390 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 447
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 448 NGPVIMLK 455
>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Ailuropoda melanoleuca]
Length = 557
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 73/133 (54%), Gaps = 4/133 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 368 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 425
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+
Sbjct: 426 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCL 483
Query: 175 DADKQEKLVEMLK 187
D + V MLK
Sbjct: 484 DVSRLSGPVIMLK 496
>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
catus]
Length = 546
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 362 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 419
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 420 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 477
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 478 SGPVIMLK 485
>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Ailuropoda melanoleuca]
Length = 556
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 SGPVIMLK 495
>gi|351699506|gb|EHB02425.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 246
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/134 (33%), Positives = 70/134 (52%), Gaps = 6/134 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GD+S ++ LR KL CK FSW+++++ D
Sbjct: 56 RLAEVWMDEFKNFFYIISPGVTKVDYGDMSSRLGLRHKLQCKPFSWYLENIYPD-----S 110
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
++P + F GE +N+ +CLD+M R +G CHG G +Q+F A ++ + C
Sbjct: 111 QIPCHYFSLGEIRNVEANQCLDNMARKENEKVGIFNCHGMGRNQVFSYTANKEIRTDDLC 170
Query: 174 VDADKQEKLVEMLK 187
+D K V MLK
Sbjct: 171 LDVSKLNGPVTMLK 184
>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
Length = 556
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 SGPVIMLK 495
>gi|402870847|ref|XP_003899411.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Papio anubis]
Length = 601
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|332217746|ref|XP_003258022.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Nomascus leucogenys]
Length = 601
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Otolemur garnettii]
Length = 558
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 374 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 431
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 432 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 489
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 490 NGPVIMLK 497
>gi|86475571|emb|CAF25036.1| pp-GalNAc-transferase 17 [Homo sapiens]
Length = 584
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 370 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 429
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 430 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 489
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 490 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 523
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 524 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 570
>gi|194018457|ref|NP_001030017.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
gi|296434516|sp|Q49A17.2|GLTL6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase-like 6;
AltName: Full=Polypeptide GalNAc transferase 17;
Short=GalNAc-T17; Short=pp-GaNTase 17; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 17;
AltName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 17; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 17
gi|311103007|gb|ADP69004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Homo sapiens]
Length = 601
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
Length = 556
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFKCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 SGPVIMLK 495
>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
3 [Canis lupus familiaris]
Length = 556
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 SGPVIMLK 495
>gi|109076171|ref|XP_001084788.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1
[Macaca mulatta]
gi|355687723|gb|EHH26307.1| hypothetical protein EGK_16237 [Macaca mulatta]
Length = 601
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|195020976|ref|XP_001985304.1| GH16989 [Drosophila grimshawi]
gi|193898786|gb|EDV97652.1| GH16989 [Drosophila grimshawi]
Length = 682
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/219 (28%), Positives = 104/219 (47%), Gaps = 36/219 (16%)
Query: 60 WLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-P 117
W+ +K + + G + +D GD++ Q A+R KL CK F WFM++VA+D+ +P P
Sbjct: 470 WMDEYKNYLYANGEGIYERVDAGDLTAQKAIRTKLKCKSFKWFMENVAFDLMKSYPPFDP 529
Query: 118 PNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHG 170
P+ G +NLG Q KC+D++GR+ +G +C SQ + L+ + +L
Sbjct: 530 PDYASGAIQNLGDQTKCVDTLGRSRHNRLGIYHCADDLVRPQRSQYWSLSWKRDLRLRRK 589
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
+ C+D I+ V L++C G + W +
Sbjct: 590 KDCMDVQ-----------------------IWGENAPVW-LWDCHGQQG--NQYWYYDYQ 623
Query: 231 TRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
T+ + H KRC+ + P T +L + +CD +NSY +W F
Sbjct: 624 TKLIMHGKEGKRCMELLPSTEELVVNKCDVSNSYMRWSF 662
>gi|114596861|ref|XP_001155128.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1 [Pan
troglodytes]
Length = 601
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
Length = 594
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 73/131 (55%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL + + +P A GD+SE+ LR++L CK F W+++++ +PE +P
Sbjct: 452 WLDGWSEFYYNINPGARKASAGDVSERRELRERLKCKSFRWYLENI-------YPESQMP 504
Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ F GE +N+ +Q CLD+MGR A IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 505 LDYYFLGEIRNVESQNCLDTMGRKANEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 564
Query: 177 DKQEKLVEMLK 187
V +++
Sbjct: 565 SNALGPVNLVR 575
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR A IG+SYCHG G +Q+F Q+ + C+DA
Sbjct: 525 MGRKANEKIGSSYCHGLGGNQVFAYTKRHQIMSDDNCLDA 564
>gi|297264099|ref|XP_002798960.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Macaca mulatta]
Length = 375
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 191 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 248
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 249 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 306
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 307 NGPVIMLK 314
>gi|291385920|ref|XP_002709516.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Oryctolagus cuniculus]
Length = 601
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCVDCNPAEKKIFMARCDPFSETQQWIFEHI 587
>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Callithrix jacchus]
Length = 556
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 72/128 (56%), Gaps = 4/128 (3%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 372 WMDEFKDFFYIISPGVVKVDYGDVSVRKILRENLKCKPFSWYLENIYPD--SQIPRRYYS 429
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQ 179
L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C+D +
Sbjct: 430 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRL 487
Query: 180 EKLVEMLK 187
V MLK
Sbjct: 488 NGPVIMLK 495
>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
Length = 630
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 71/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ ALRD+L CK F W++++V +PE +P
Sbjct: 444 WLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 496
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 557 SSSNGPVNMVR 567
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
>gi|440908502|gb|ELR58511.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial [Bos
grunniens mutus]
Length = 254
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 89/218 (40%), Gaps = 18/218 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GD+S Q LR +L CKDF WFM VA+DV +P
Sbjct: 40 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFKWFMAAVAWDVPKYYP 99
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ PP WGE +N+ C+DS + C GS + + G E
Sbjct: 100 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWRE-- 157
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
D E L FC A + + L++C G + W H R
Sbjct: 158 -DIRPGEPLHTR-----KFCFDA------VSHSSPVTLYDCHGMKG--NQLWG-HRKDRT 202
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 203 LFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWVFEHI 240
>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 617
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 72/136 (52%), Gaps = 10/136 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R WL ++ S A GD+S++ ALRD+L CK F W++++V +P
Sbjct: 426 RVAEVWLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YP 478
Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ +
Sbjct: 479 ESLMPLDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDD 538
Query: 172 RCVDADKQEKLVEMLK 187
C+DA V M++
Sbjct: 539 LCLDASSSNGPVNMVR 554
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 504 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 543
>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
Length = 630
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 71/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ ALRD+L CK F W++++V +PE +P
Sbjct: 444 WLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 496
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 557 SSSNGPVNMVR 567
Score = 37.4 bits (85), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
Length = 536
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 71/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ ALRD+L CK F W++++V +PE +P
Sbjct: 350 WLDEWRDFYYSMSTGARKASAGDVSDRKALRDRLKCKSFRWYLENV-------YPESLMP 402
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 403 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 462
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 463 SSSNGPVNMVR 473
Score = 37.4 bits (85), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 423 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 462
>gi|431918329|gb|ELK17556.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Pteropus
alecto]
Length = 371
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 157 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKFYP 216
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 217 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 276
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 277 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 310
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 311 LWGYR---KDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 357
>gi|395840006|ref|XP_003792861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Otolemur garnettii]
Length = 601
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHLVSNSCVDCNPAEKKIFMARCDPLSETQQWVFEHI 587
>gi|395840008|ref|XP_003792862.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Otolemur garnettii]
Length = 600
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 386 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 445
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 446 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 505
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 506 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 539
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 540 LWGYR---KDRTLFHLVSNSCVDCNPAEKKIFMARCDPLSETQQWVFEHI 586
>gi|300796651|ref|NP_001178227.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
Length = 601
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GD+S Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAVSHSS-------PVT-------------------LYDCHGMKG-- 538
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ W H R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWVFEHI 587
>gi|426220611|ref|XP_004004508.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Ovis aries]
Length = 601
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 89/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GD+S Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFRWFMAAVAWDVPRHYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAVSHSS-------PVT-------------------LYDCHGMKG-- 538
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ W H R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 539 NQLWG-HRKDRTLFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWVFEHI 587
>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
Length = 633
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 72/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ ALR++L CK F W++++V +PE +P
Sbjct: 447 WLDEWRDFYYAMSTGARKASAGDVSDRKALRERLQCKSFRWYLENV-------YPESLMP 499
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G+SYCHG G +Q+F Q+ + C+DA
Sbjct: 500 LDYYYLGEIRNAETETCLDTMGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 559
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 560 ASSNGPVNMVR 570
Score = 38.1 bits (87), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 23/40 (57%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G+SYCHG G +Q+F Q+ + C+DA
Sbjct: 520 MGRKYNEKVGSSYCHGLGGNQVFAYTKRQQIMSDDLCLDA 559
>gi|351699379|gb|EHB02298.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Heterocephalus glaber]
Length = 522
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 88/218 (40%), Gaps = 15/218 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 305 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 364
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ PP WGE +N+ C+DS + C GS + + +L
Sbjct: 365 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQACELFTFGWR 424
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
D E L FC A + + L++C G +R R
Sbjct: 425 EDIRPGEPLHTR-----KFCFDA------ISHNSPVTLYDCHGMKGNQLWGYR---KDRT 470
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 471 LFHPVSNSCMDSNPAEKKIFMARCDPLSETQQWIFEHI 508
>gi|344288241|ref|XP_003415859.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Loxodonta africana]
Length = 601
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 87/218 (39%), Gaps = 18/218 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ PP WGE +N+ C+DS + C GS + + G E
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWRE-- 504
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
D E L FC A + L++C G +R R
Sbjct: 505 -DIRPGEPLHTR-----KFCFDA------ISHNSPATLYDCHGMKGNQLWGYR---KDRT 549
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 550 LFHPVSNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 587
>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Saccoglossus kowalevskii]
Length = 1082
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 90/221 (40%), Gaps = 38/221 (17%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+ P A GD+SE++ L+ +L CKDF W++ +V +
Sbjct: 892 IRVAEVWMDDYKRFFYATRPDAQGKTYGDLSERLKLKSRLKCKDFKWYLDNV-------Y 944
Query: 114 PELP-PN---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
PEL PN WGE +N + CLD++ R A +G CHG G +Q+F G++ H
Sbjct: 945 PELSVPNENAYAWGECQNAASNVCLDTLMREAGQPVGLYICHGGGGNQVFSYTKLGEVRH 1004
Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
E C+D + ++G C G R H
Sbjct: 1005 EELCLDVSTK------------------------KVGETPVFEQCHALGGNQMWEHRKHG 1040
Query: 230 STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
R + CL + + L M+ C+ + Q W F +
Sbjct: 1041 FIRHKSSGL---CLDRSGDNDGLKMMSCNSKKTTQIWEFSQ 1078
>gi|354484373|ref|XP_003504363.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Cricetulus griseus]
Length = 555
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 87/230 (37%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 341 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 400
Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ P WGE +NL C+DS + C GS QLF +
Sbjct: 401 PVEPLPAAWGEIRNLAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 460
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 461 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 494
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 495 LWGYR---KDRTLFHPVTNSCMDCNPAEKKIFMARCDPLSETQQWIFEHI 541
>gi|334331052|ref|XP_001372346.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Monodelphis domestica]
Length = 573
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 87/218 (39%), Gaps = 18/218 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR L CKDF WFM VA+DV +P
Sbjct: 359 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 418
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ PP WGE +N+ C+DS + C GS + + G E
Sbjct: 419 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWRE-- 476
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
D E L FC A + + L++C G +R +
Sbjct: 477 -DVRPGEPLHTR-----KFCFDA------ISHSSPVTLYDCHGMKGNQHWGYR---KDKT 521
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 522 LFHPVSNSCIDCNPSEKKIFMNRCDPLSETQQWIFEHI 559
>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Nasonia vitripennis]
Length = 635
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 97/224 (43%), Gaps = 47/224 (20%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A + +GD+SE+V LR++L CK F W+++++ +P
Sbjct: 436 RVAEVWMDEWRDFYYAMNPGARNVPVGDVSERVKLREQLKCKSFRWYLENI-------YP 488
Query: 115 ELPPNL---FWGEAKNLGTQK-------CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAE 164
E P L + G+ KN CLD+MGR +G SYCHG G +Q+F
Sbjct: 489 ESPMPLDYYYLGDIKNADPNNPEKVQNYCLDTMGRRTGENVGMSYCHGLGGNQIFAYTKR 548
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
Q+ + C+DA + V++++ C G +
Sbjct: 549 QQIMSDDMCLDAASPQGPVKIVR--------------------------CHGMGG--NQA 580
Query: 225 WRFHNSTRQLEHAILKRCLSV--HPETNQLAMLRCDENNSYQQW 266
W ++ T+ ++H K CLS + Q + CD + Q+W
Sbjct: 581 WIYNEETKMIKHVNTKHCLSKPRPSDPTQPVLAPCDSKSVGQKW 624
Score = 38.1 bits (87), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 521 MGRRTGENVGMSYCHGLGGNQIFAYTKRQQIMSDDMCLDA 560
>gi|395542399|ref|XP_003773120.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Sarcophilus harrisii]
Length = 436
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/218 (27%), Positives = 87/218 (39%), Gaps = 18/218 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR L CKDF WFM VA+DV +P
Sbjct: 222 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 281
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ PP WGE +N+ C+DS + C GS + + G E
Sbjct: 282 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWRE-- 339
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
D E L FC A + + L++C G +R +
Sbjct: 340 -DVRPGEPLHTR-----KFCFDA------ISHSSPVTLYDCHGMKGNQHWGYR---KDKT 384
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 385 LFHPVSNSCIDCNPSEKKIFMNRCDPLSETQQWIFEHI 422
>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
Length = 682
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 100/220 (45%), Gaps = 39/220 (17%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R WL +K+I +D GD+SE+ LR+ L C F W++ +V D++
Sbjct: 481 IRLAEVWLDDYKEI-YYNRINHKLVDFGDVSERKKLREHLKCHSFKWYLDNVFPDLF--- 536
Query: 114 PELPPN-LFWGEAKNLGTQK-CLD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGH 169
LP + GE +NLG QK C+D +GR A Y CH G +Q + L+ G++
Sbjct: 537 --LPSEAIASGEIRNLGNQKYCVDHDVGRNAVNDSVIPYPCHLQGGNQFWMLSKSGEIRR 594
Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
E C+D + PVT + C S G + W +++
Sbjct: 595 DEYCIDYTGRGS-------PVT--------------------YECHGSKG--NQLWDYNH 625
Query: 230 STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
T +L H + + CL++ + L M CD+ N Q+WRF+
Sbjct: 626 ETGRLYHPVSRYCLTLSGDDTMLVMRICDDKNERQRWRFQ 665
>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
Length = 621
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ +LRD+L CK F W++++V +PE +P
Sbjct: 435 WLDEWRDFYYAMSTGARKASAGDVSDRKSLRDRLQCKSFRWYLENV-------YPESLMP 487
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 488 LDYYYLGEIRNSETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 547
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 548 ASSNGPVNMVR 558
Score = 37.4 bits (85), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 508 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 547
>gi|194749276|ref|XP_001957065.1| GF24250 [Drosophila ananassae]
gi|190624347|gb|EDV39871.1| GF24250 [Drosophila ananassae]
Length = 662
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 102/222 (45%), Gaps = 32/222 (14%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K + S G + +D GD++ Q A+R KL CK F WFM+ VA+D+ +
Sbjct: 446 RVAEVWMDEYKNYLYSHGDGIYERVDAGDLTAQKAIRTKLKCKSFRWFMEEVAFDLMKNY 505
Query: 114 PEL-PPNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQL 167
P + PPN G +++G Q CLD+MGR +G C SQ + L+ + L
Sbjct: 506 PPVDPPNYAMGAIQSVGNPQLCLDTMGRKKHNRMGMFACADDLKVPQKSQFWELSWKRDL 565
Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
+Q + E L + + + L++C G + +
Sbjct: 566 ----------RQRRKKECLDVQI------------WEANAPVWLWDCHGQGGNQYWYYDY 603
Query: 228 HNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
HN + L+H + +RCL + P ++++ + CD N Y QW F
Sbjct: 604 HN--KLLKHGMEGRRCLELLPFSHEVVVNNCDPKNHYMQWNF 643
>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
Length = 677
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 71/131 (54%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ P A GD+S++ LR++L CK F W++++V +PE +P
Sbjct: 491 WLDDWKEFYYSFYPGARKASAGDVSDRKNLRERLKCKSFRWYLENV-------YPESLMP 543
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 544 LDYYYLGEIRNSETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 603
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 604 SSSNGPVNMVR 614
Score = 37.4 bits (85), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 564 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 603
>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
Length = 626
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 99/217 (45%), Gaps = 41/217 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + +P A+ + +GD+SE+ ALR++L CK F W+++++ +P
Sbjct: 432 RVAEVWMDEWGEFYYAMNPGALNVPVGDVSERKALRERLKCKSFRWYLENI-------YP 484
Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + ++ GE +N T CLD++G A +G YCHG G +Q+F Q+ +
Sbjct: 485 ESQMPLDYYYLGEIRNAETSNCLDTLGGKAGQPLGMGYCHGMGGNQVFAYTKRKQIMSDD 544
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C+DA +++++ C G + W + +
Sbjct: 545 NCLDAAHPRGPIKLIR--------------------------CHGMRGNQE--WTYDTKS 576
Query: 232 RQLEHAILKRCLSVHPETNQL--AMLRCDENNSYQQW 266
R ++H CL PE+ + +LR + + QQW
Sbjct: 577 RTIKHTNTGMCLD-KPESTDVWKPVLRSCDRSRGQQW 612
>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 2 [Xenopus (Silurana) tropicalis]
Length = 601
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 88/226 (38%), Gaps = 34/226 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ + + P L GDIS Q LR L CKDF W+M VA+DV +P
Sbjct: 387 RVAETWMDEYAEYIYQRRPEYRHLSTGDISSQKELRKHLKCKDFKWYMSEVAWDVPKFYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ + C+DS + C GS QLF +
Sbjct: 447 PVEPPPASWGEIRNVASNLCIDSKHGATGTELRLDTCVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL--GTVLNLFNCFNSSGTVDGPW 225
GE L K F + + + L++C G +
Sbjct: 507 RPGE----------------------PLHTRKFCFDSISHSSPVTLYDCHGMKGNQQWSY 544
Query: 226 RFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
R + L H + C+ +P +L M RCD + QQW F+ V
Sbjct: 545 R---KDKTLFHPVTNSCIDCNPAEKKLFMNRCDPLSETQQWIFEHV 587
>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 1 [Xenopus (Silurana) tropicalis]
Length = 600
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 66/251 (26%), Positives = 96/251 (38%), Gaps = 34/251 (13%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVAL 89
++GH R K T + R +W+ + + P L GDIS Q L
Sbjct: 361 RVGHIYRKYVPYKVPTGTSLARNLKRVAETWMDEYAEYIYQRRPEYRHLSTGDISSQKEL 420
Query: 90 RDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTS 148
R L CKDF W+M VA+DV +P + PP WGE +N+ + C+DS +
Sbjct: 421 RKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVASNLCIDSKHGATGTELRLD 480
Query: 149 YCHGSGS------SQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIF 202
C GS QLF + GE L K F
Sbjct: 481 TCVKDGSERTWSHEQLFTFGWREDIRPGE----------------------PLHTRKFCF 518
Query: 203 CRL--GTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
+ + + L++C G +R + L H + C+ +P +L M RCD
Sbjct: 519 DSISHSSPVTLYDCHGMKGNQQWSYR---KDKTLFHPVTNSCIDCNPAEKKLFMNRCDPL 575
Query: 261 NSYQQWRFKEV 271
+ QQW F+ V
Sbjct: 576 SETQQWIFEHV 586
>gi|118090108|ref|XP_420520.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Gallus gallus]
Length = 601
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 87/230 (37%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R + L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWSYR---KDKTLFHPVSSSCIDCNPSEKKIFMNRCDPLSETQQWVFEHI 587
>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 580
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 71/133 (53%), Gaps = 12/133 (9%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ F+ SP + GD+S++ LRD+L C F W+++++ +PE
Sbjct: 393 WMDDFRHFYYRISPGVRKTEFGDVSQRKKLRDRLKCHTFEWYLENI-------YPESQFR 445
Query: 120 LFW---GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD- 175
L + GE +N+ T KCLD+MGR +G CHG G +Q+F L + ++ H + C+D
Sbjct: 446 LDFKTIGEIRNIETHKCLDNMGRKENEKVGIFSCHGQGGNQIFALTKQNEIKHDDLCLDA 505
Query: 176 -ADKQEKLVEMLK 187
A+ K V M+K
Sbjct: 506 SANSHYKDVVMIK 518
Score = 38.1 bits (87), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G CHG G +Q+F L + ++ H + C+DA
Sbjct: 466 MGRKENEKVGIFSCHGQGGNQIFALTKQNEIKHDDLCLDA 505
>gi|449685123|ref|XP_002167708.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like,
partial [Hydra magnipapillata]
Length = 411
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 107/232 (46%), Gaps = 35/232 (15%)
Query: 42 KQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
K V T +L R W+ +K+ PL + G+I++++ LR KL CK F W+
Sbjct: 196 KGVTETLSKNLN-RLAEVWMDEYKEYYYRSRPLFRGKEYGNITQRLELRQKLQCKSFKWY 254
Query: 102 MKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAV---IGTSYCHGSGSSQL 158
M+++ D+ + P+L P GE +N + C+DSMG V +G CHG G +Q
Sbjct: 255 MENIYSDM--EIPDLYPPAE-GEIRNGASNLCIDSMGVVKENVKHQVGLYPCHGEGGAQH 311
Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
F+L+ +G++ + DK FC+ V G + F C
Sbjct: 312 FQLSLKGEI------IFQDK-------------FCLDVAV----ASPGAFIEFFKCHKQR 348
Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLS--VHPETNQLAMLRCDENNSYQQWRF 268
G + W+ + + ++ H + ++CL V ++ M RCD S Q+W+F
Sbjct: 349 G--NQLWQHNIDSGEIVHFVTQQCLDLGVSDHKDKAVMNRCDGRAS-QKWKF 397
>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 612
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 101/220 (45%), Gaps = 35/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ +K+ + P L+ GD++ Q LR +L+CK F WFM++VA+D +K+P + PP
Sbjct: 412 WMDEYKEYLYMRRPHYRNLEPGDLTAQKELRKRLNCKSFKWFMENVAFDQPSKYPAIEPP 471
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
+ WGE ++ + C+D+ + G ++ F L E+C+ +
Sbjct: 472 DYAWGEIRHEKSSLCIDTQFK--------------GQNERFSL---------EKCIRDHR 508
Query: 179 QEKLVEMLKL-------PVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
+ + L L P V V R V L++C G + +++ +T
Sbjct: 509 DQSGEQHLVLTWHKDIRPQKRTVCFDVSSSEPRAPVV--LWSCHGMHG--NQLFKYDTTT 564
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+Q+ H I CL E ++ M CD N+ Q+W F+ V
Sbjct: 565 KQISHPITASCLDCDAERREIFMNSCDPNSVSQRWLFEHV 604
>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
Length = 630
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 70/131 (53%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ LRD+L CK F W++++V +PE +P
Sbjct: 444 WLDEWRDFYYAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENV-------YPESLMP 496
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 557 ASSNGPVNMVR 567
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
>gi|209364560|ref|NP_001129228.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Rattus
norvegicus]
Length = 601
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ P WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPLPAAWGEIRNVAANLCVDSKHGAMGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C+DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCLDAISHSS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---EDRTLFHPVSNSCMDCNPAEKKVFMARCDPLSETQQWMFEHI 587
>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
Length = 630
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 70/131 (53%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL ++ S A GD+S++ LRD+L CK F W++++V +PE +P
Sbjct: 444 WLDEWRDFYYAMSTGARKASAGDVSDRKDLRDRLKCKSFRWYLENV-------YPESLMP 496
Query: 118 PNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ ++ GE +N T+ CLD+MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 497 LDYYYLGEIRNAETETCLDTMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
Query: 177 DKQEKLVEMLK 187
V M++
Sbjct: 557 ASSNGPVNMVR 567
Score = 37.4 bits (85), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 16/40 (40%), Positives = 22/40 (55%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 40
MGR +G SYCHG G +Q+F Q+ + C+DA
Sbjct: 517 MGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDDLCLDA 556
>gi|324520154|gb|ADY47570.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Ascaris suum]
Length = 286
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 80/149 (53%), Gaps = 10/149 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A +D+GD++++ ALR+ L C+ F W+++++ +P
Sbjct: 94 RTAEVWMDEYKEFFYKMVPAARSVDVGDLADRKALRENLQCRSFRWYLENI-------YP 146
Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P P F G+ KN T KC+D++GR+A G + CHG G +Q + L ++G++ E
Sbjct: 147 EAPIPRGFKSIGQIKNPSTTKCVDTLGRSAGEAAGVTVCHGIGGNQAWSLTSDGEVRSDE 206
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKL 200
C+ AD+ + + + + C V +
Sbjct: 207 TCLAADRAADKAKKIDVKLEKCSTTSVNV 235
>gi|224049734|ref|XP_002187605.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
1 [Taeniopygia guttata]
gi|449500484|ref|XP_004176221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
2 [Taeniopygia guttata]
Length = 601
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/228 (28%), Positives = 86/228 (37%), Gaps = 42/228 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+R + L H + C+ +P ++ M RCD + QQW F+
Sbjct: 541 HWSYR---KDKTLFHPVSSSCIDCNPAEKKIFMNRCDPLSETQQWIFE 585
>gi|158300139|ref|XP_320141.4| AGAP012414-PA [Anopheles gambiae str. PEST]
gi|157013013|gb|EAA00190.4| AGAP012414-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 91/198 (45%), Gaps = 34/198 (17%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
D+GDIS Q+A+R+KL CK F WFM VA+D+ K+P + PP+ G +++ C+D+
Sbjct: 418 DVGDISRQLAIREKLQCKPFKWFMTQVAFDLIEKYPPIEPPDFANGAIQSVANAALCVDT 477
Query: 137 MGRTAPAVIGTSYCHGSGS----SQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
+ IG C +Q F+L + + ++ GE C D + ++L
Sbjct: 478 LNHGEKQTIGLYSCAEDKKQPQPNQFFQLSWHRDLRIKFGELCWDVSESVPNAKIL---- 533
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
L++C G + WR+ T+ L+ RCL ++P
Sbjct: 534 --------------------LYHCHGGQG--NQLWRYEPETQMLKQGKNNRCLDMNPNNR 571
Query: 251 QLAMLRCDENNSYQQWRF 268
++ + C+ N Q+WR+
Sbjct: 572 EVFVNPCEPTNPRQKWRW 589
>gi|449270895|gb|EMC81541.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Columba livia]
Length = 440
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/228 (28%), Positives = 86/228 (37%), Gaps = 42/228 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR L CKDF WFM VA+DV +P
Sbjct: 226 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 285
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 286 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 345
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 346 RPGEPLHTRKFCFDAISHSS-------PVT-------------------LYDCHGMKGNQ 379
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+R + L H + C+ +P ++ M RCD + QQW F+
Sbjct: 380 HWSYR---KDKTLFHPVSGSCIDCNPAEKKIFMNRCDPLSETQQWIFE 424
>gi|324507788|gb|ADY43296.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Ascaris suum]
Length = 580
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 111/223 (49%), Gaps = 40/223 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R WL +K + P A F+D GDISE++ L+++LHCK+F+W++K V ++ K P
Sbjct: 388 RAAEVWLDDYKMLYLKQVPSARFVDFGDISERLKLKEQLHCKNFTWYLKEVYPEL--KIP 445
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
E L+ K G C+DS+G+ TA + +G CHG+G +Q +
Sbjct: 446 EREDGLYL-TFKQAGL--CIDSLGKQTAHSPVGVYSCHGTGGNQEWVF------------ 490
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
DKQ+ LK P T ++ + V++L C T DG W + S +
Sbjct: 491 ---DKQKG---TLKNPFTKLCMSDSDI------GVVSLQKC----ETADGKWTMNESEGR 534
Query: 234 LEH-----AILKRC-LSVHPETNQLAMLRCDENNSYQQWRFKE 270
L++ A+++R +++ + L ++ CD ++ Q+W F++
Sbjct: 535 LKYKDSCVALIRRTPVNMQKDETVLMLMPCDNSDHRQRWVFEK 577
>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
Length = 622
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 57/199 (28%), Positives = 90/199 (45%), Gaps = 44/199 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNL--GTQKC 133
D GD+S + ALR+KL CK F W++ +V +PEL P + GE +N G++ C
Sbjct: 449 DFGDVSSRKALREKLQCKSFKWYLDNV-------YPELFVPGDAIGKGEIRNRGGGSKNC 501
Query: 134 LD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD S GR G +CH G +Q + L+ +G++ E C+D
Sbjct: 502 LDWASHGRQRSVNAGLYWCHKKGGNQYWMLSKDGEIRRDESCID---------------- 545
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
AGV ++ ++ C G + W++ Q H + +CL + + +
Sbjct: 546 ---YAGVDVM---------VYPCHGMKGNQE--WKYLLYQSQFLHVVTNKCLEMSRDGTR 591
Query: 252 LAMLRCDENNSYQQWRFKE 270
L + CD N YQQW +E
Sbjct: 592 LMVNVCDRTNPYQQWLIQE 610
>gi|194865210|ref|XP_001971316.1| GG14889 [Drosophila erecta]
gi|190653099|gb|EDV50342.1| GG14889 [Drosophila erecta]
Length = 666
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 103/224 (45%), Gaps = 36/224 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K + S G + +D GD++EQ A+R KL CK F WFM+ VA+D+ +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGVYESVDPGDLTEQKAIRTKLKCKSFKWFMEAVAFDLMKTY 509
Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
P + PP G +N+G Q CLD+MG+ +G C SQ + L+ +
Sbjct: 510 PPVDPPAYAMGALQNVGNQNLCLDTMGKKKHNRMGMYSCASDIKVPQRSQFWELSWKRDL 569
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+L + C+D + PV L++C + G + W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHSQGG--NQYW 603
Query: 226 RFHNSTRQLEHAI-LKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ + L+H +RCL + P + ++ + +CD +N +QQW F
Sbjct: 604 YYDYHKKMLKHGEGSRRCLELLPFSQEVVVNKCDPDNRFQQWNF 647
>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 641
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 98/213 (46%), Gaps = 33/213 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
WL +K++ P + D GDIS+++ALR+KL+CK F W+++++ DV + P+L P
Sbjct: 444 WLDEYKELYYQKRPQSKGKDYGDISQRLALRNKLNCKSFKWYIENIYPDV--QLPDLYPP 501
Query: 120 LFWGEAKNLGTQKCLDSMGR---TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER-CVD 175
GE KN + CLDSMG +G CHG G +Q F + +G++ E C+D
Sbjct: 502 AR-GEIKNPASSYCLDSMGDMKGNNVKKLGIFPCHGQGGNQNFVFSRKGEIVFDEEYCLD 560
Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLE 235
+ G ++++ C N G + W +T ++
Sbjct: 561 VSS------------------------SKPGVLIDIMKCHNFGG--NQQWIHKINTGEIM 594
Query: 236 HAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
H ++C+ TN+ ++ + Q+W F
Sbjct: 595 HYPTRQCIDRGSNTNKSPQMQPCNGQTSQKWVF 627
>gi|195492881|ref|XP_002094181.1| GE20340 [Drosophila yakuba]
gi|194180282|gb|EDW93893.1| GE20340 [Drosophila yakuba]
Length = 666
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 101/224 (45%), Gaps = 36/224 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K + S G L +D GD++EQ A+R KL CK F WFM+ VA+D+ +
Sbjct: 450 RVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLKCKSFKWFMEEVAFDLMKTY 509
Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEG 165
P + PP+ G +N+G Q CLD+MGR +G C +Q + L+ +
Sbjct: 510 PPVDPPSYARGALQNVGNQNLCLDTMGRKKHNKMGMYACANDIKIPQRTQFWELSWKRDL 569
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+L + C+D + PV L++C G + W
Sbjct: 570 RLRRKKECLDVQ-----IWDANAPVW-------------------LWDCHMQGG--NQYW 603
Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ + L H +RCL + P + ++ +CD +N +QQW F
Sbjct: 604 YYDYRKKLLLHGKEGRRCLELLPFSQEVVANKCDPDNRFQQWNF 647
>gi|71297071|gb|AAH47551.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
Length = 601
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 87/230 (37%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAISHNS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ + ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---KDRTLFHPVSNSCMDCNHAEKKIFMARCDPLSETQQWIFEHI 587
>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 88/215 (40%), Gaps = 31/215 (14%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K + P A D GDIS+++ LR +L CK F W++ ++ DV + PEL P
Sbjct: 325 WMDEYKDLYYNKKPQAKNSDYGDISKRLELRKRLKCKSFKWYINNIYPDV--QMPELDPP 382
Query: 120 LFWGEAKNLGTQKCLDSMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD 177
GE +N + +CLDS+G A +G CHG G +Q+ + + E C D
Sbjct: 383 AR-GEVRNPSSNQCLDSLGAKPEHNARVGIYTCHGQGGNQVSKYMPRELIFEEENCFDVS 441
Query: 178 KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
K G + L C G + W+ L H
Sbjct: 442 KTHP------------------------GAPVELMKCHGMRGNQE--WKHDREKGTLMHF 475
Query: 238 ILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
++CL ++Q A++ + Q+W F R
Sbjct: 476 TTQQCLDRGSPSDQYAVMNPCDGRESQRWVFSHYR 510
>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
Length = 504
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 66/124 (53%), Gaps = 15/124 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ PLA + GDI +++ LR +L CK F W+++HV
Sbjct: 317 RAAEVWMDEYKKYYYAAVPLATNIPFGDIDDRLQLRKELQCKPFRWYLEHVY-------- 368
Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
P L E +N G+ + CLDS+G A AV+G CHG+G +Q + LN +G+L H
Sbjct: 369 ---PQLGIPERRNNGSIRQGVYCLDSLGNVAGAVVGLYSCHGNGGNQNWILNRKGELKHH 425
Query: 171 ERCV 174
+ C+
Sbjct: 426 DLCL 429
Score = 41.6 bits (96), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 20/55 (36%), Positives = 34/55 (61%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELR 55
+G A AV+G CHG+G +Q + LN +G+L H + C+ K IS + +S+ ++
Sbjct: 392 LGNVAGAVVGLYSCHGNGGNQNWILNRKGELKHHDLCLTLVKFTISARYNSVLMK 446
>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 641
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 65/231 (28%), Positives = 101/231 (43%), Gaps = 45/231 (19%)
Query: 42 KQVISTQKSSLELRKRSSWLKVFKQ--IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
K V +K+S+ L K W+ + Q +G L D GDISE+V LR L+CK F
Sbjct: 441 KGVNVLKKNSVRLAK--VWMDEYAQYYFDRIGPDLG---DYGDISERVELRRSLNCKSFD 495
Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK--CLDSMGRTAPAVIGTSYCHGSGSSQ 157
W++K++ D++ GE +N G ++ CLDS T A + CHG G +Q
Sbjct: 496 WYVKNIYPDLFIPGDAAAS----GEVRNSGFERKWCLDSAA-TVHATVSVYPCHGQGGNQ 550
Query: 158 LFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNS 217
+ + G++ E C+D G + L++C S
Sbjct: 551 YWLFSKTGEIRRDELCLDYS----------------------------GGDVVLYSCHGS 582
Query: 218 SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
G + WR+ T+QL H ++CL+++ + M C S QQW F
Sbjct: 583 KG--NQYWRYDPMTKQLRHGSSEKCLAINEYKTGVTMETCGP-TSRQQWNF 630
>gi|261244898|ref|NP_778197.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Mus musculus]
gi|311103009|gb|ADP69005.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Mus musculus]
Length = 601
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 88/230 (38%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 387 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 446
Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ P WGE +++ C+DS + C GS QLF +
Sbjct: 447 PVEPLPAAWGEIRSVAANLCVDSKHGATGTELRLDVCVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C+DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCLDAISHSS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R R L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 LWGYR---EDRTLFHPVSNSCMDCNPSEKKIFMARCDPLSETQQWIFEHI 587
>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
1 [Ciona intestinalis]
Length = 573
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 96/216 (44%), Gaps = 38/216 (17%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + +P + + GDISE+V LR+KL CK F W+++++ D ++P
Sbjct: 386 WMDSFKNFFYIITPGVLKQEYGDISERVRLREKLQCKSFKWYLENIYPD-----SQIPGE 440
Query: 120 LF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
+ GE +N CLD+MGR +G CH G +Q++ +L + C+DA K
Sbjct: 441 YYSLGEIRNEEGGLCLDTMGRKENDKVGIFNCHEMGGNQVWAYTGNQELRCDDICLDASK 500
Query: 179 QEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI 238
+G + + C + G + W + TR L+H
Sbjct: 501 --------------------------VGGPIMMVKCHHMRG--NQLWEYDEDTRLLKHTN 532
Query: 239 LKRCLSVHP--ETNQLAMLR-CDENNSYQQWRFKEV 271
+CL P +++QL +L CD Q+W K +
Sbjct: 533 THQCLD-RPVDQSSQLPLLTDCDVTKRSQRWLLKNM 567
>gi|170056941|ref|XP_001864259.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876546|gb|EDS39929.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 606
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 97/198 (48%), Gaps = 34/198 (17%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
D GD+S+Q+A+R+KL CK F WF++HVA+D+ K+P + PP+ G +++ + C+D+
Sbjct: 415 DAGDLSKQLAIREKLQCKPFKWFIEHVAFDLIEKYPPVEPPDFANGAIQSVSSPNLCVDT 474
Query: 137 MGRTAPAVIGTSYCHGSG----SSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
+ IG C ++Q F+L + + ++ +GE C D +
Sbjct: 475 LSHGEKETIGLFGCAADKAQPQANQFFQLSWHRDLRIKYGELCWDVSES----------- 523
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
+A K++ L++C G + WR+ ++Q+ RCL + +
Sbjct: 524 ----IANAKIL---------LYHCHGGQG--NQLWRYDLDSQQIRQGKNNRCLDMDADKR 568
Query: 251 QLAMLRCDENNSYQQWRF 268
+ + C E+N Q+W++
Sbjct: 569 AVFVNPCSESNPNQRWKW 586
>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
Length = 545
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 69/131 (52%), Gaps = 12/131 (9%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
LR WL +K + +P GD+SE+ +R++L CK F WF++++ F
Sbjct: 354 LRVVEVWLDSYKDLFYALNPHMQGEPYGDVSERKRIRERLQCKSFDWFLENI-------F 406
Query: 114 PELP-PNLFW---GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
PELP P+ GE KNLG KC+D+MG AP G CHG G +Q+F + + +
Sbjct: 407 PELPIPDKNVQGRGELKNLGGNKCMDTMGEHAPYT-GLYSCHGMGGNQVFSYTWKNVISY 465
Query: 170 GERCVDADKQE 180
ERC+ + +
Sbjct: 466 QERCLAVSRNK 476
>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
Length = 561
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 97/225 (43%), Gaps = 37/225 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P DMGDIS Q+ LR++L CK F WFMK VA D+ +P
Sbjct: 347 RVAEVWMDEYKEHLYKRRPHLRKTDMGDISGQLQLRERLKCKPFKWFMKVVAPDIILHYP 406
Query: 115 ELPPNL-----FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLN---AEGQ 166
+ P W +A NL C+DS HG G +++ RL+ G
Sbjct: 407 PVEPEPAASGEIWNKASNL----CIDSK-------------HGGGQAEV-RLDQCVKGGG 448
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+ +GE+ + + TFC A +K G L LF C G W
Sbjct: 449 IMNGEQNFHMSWHNDIRPKGR---TFCFDAQMK------GGTLILFACHQMLGNQH--WL 497
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ +QL H + CL + ++ L + C+ N QQW +++
Sbjct: 498 YYEEKQQLLHKVTGSCLEGNSDSKSLFLSECNHKNPMQQWSWEKT 542
>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
[Tribolium castaneum]
gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
Length = 602
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 76/136 (55%), Gaps = 10/136 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ ++ +P A + +GD+S + LR++L CK F W++++V +P
Sbjct: 405 RVAEVWMDEWRDFYYAMNPGARSVPVGDVSARRELRERLKCKSFRWYLENV-------YP 457
Query: 115 E--LPPNLFW-GEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P ++ G+ +N+ T+ CLD+MGR + +G +YCH G +Q+F Q+ +
Sbjct: 458 ESQMPLEYYYLGDIRNVETKNCLDTMGRKSGENLGMTYCHNLGGNQVFAYTKRQQIMSDD 517
Query: 172 RCVDADKQEKLVEMLK 187
C+DA ++ V++++
Sbjct: 518 NCLDASNKKGPVKLVR 533
>gi|125977364|ref|XP_001352715.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
gi|54641464|gb|EAL30214.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 96/200 (48%), Gaps = 35/200 (17%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLD 135
+D GD++EQ+A+R KL CK F WFM+ VA+D+ +P + PP G +N+G ++ C+D
Sbjct: 483 IDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFALGAIQNVGDKRLCID 542
Query: 136 SMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLP 189
+MGR +G C +Q + L+ + +L + C+D + + P
Sbjct: 543 TMGRRKHKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKECLDVQ-----IWTVNAP 597
Query: 190 VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPE 248
V L++C G + W + T+ ++H +RCL + P
Sbjct: 598 VW-------------------LWDCHLQGG--NQYWSYDYHTQTIKHGRDGRRCLELLPF 636
Query: 249 TNQLAMLRCDENNSYQQWRF 268
+L + +C+ +N++ +W F
Sbjct: 637 AQELVVNKCNTSNTFMKWNF 656
>gi|195167889|ref|XP_002024765.1| GL22638 [Drosophila persimilis]
gi|194108170|gb|EDW30213.1| GL22638 [Drosophila persimilis]
Length = 676
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 96/200 (48%), Gaps = 35/200 (17%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLD 135
+D GD++EQ+A+R KL CK F WFM+ VA+D+ +P + PP G +N+G ++ C+D
Sbjct: 483 IDAGDLTEQMAIRKKLKCKSFKWFMEEVAFDLINSYPPVDPPTFALGAIQNVGDKRLCID 542
Query: 136 SMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLP 189
+MGR +G C +Q + L+ + +L + C+D + + P
Sbjct: 543 TMGRRKHKRMGVYACAEDLKVPQKTQFWELSWKRDLRLRRKKECLDVQ-----IWTVNAP 597
Query: 190 VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPE 248
V L++C G + W + T+ ++H +RCL + P
Sbjct: 598 VW-------------------LWDCHLQGG--NQYWSYDYHTQTIKHGRDGRRCLELLPF 636
Query: 249 TNQLAMLRCDENNSYQQWRF 268
+L + +C+ +N++ +W F
Sbjct: 637 AQELVVNKCNTSNTFMKWNF 656
>gi|344268426|ref|XP_003406061.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Loxodonta africana]
Length = 560
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 75/138 (54%), Gaps = 9/138 (6%)
Query: 55 RKRSSWLKV-----FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDV 109
+++ W +V K + SP + +D GD+S + LR+ L CK FSW+++++ D
Sbjct: 366 KQQEDWQEVWDGMNLKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD- 424
Query: 110 YTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
++ P +L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++
Sbjct: 425 -SQIPRRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRT 481
Query: 170 GERCVDADKQEKLVEMLK 187
+ C+D + V MLK
Sbjct: 482 DDLCLDVSRLNGPVIMLK 499
>gi|51490860|emb|CAD44535.1| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
Length = 170
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 61/108 (56%), Gaps = 6/108 (5%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQKCLDSMGR 139
GDIS +V LR KL CK FSW+++++ D ++P + F GE +N+ T +CLD+M R
Sbjct: 5 GDISSRVGLRHKLQCKPFSWYLENIYPD-----SQIPRHYFSLGEIRNVETNQCLDNMAR 59
Query: 140 TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
+G CHG G +Q+F A ++ + C+D K V MLK
Sbjct: 60 KENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLK 107
>gi|47225457|emb|CAG11940.1| unnamed protein product [Tetraodon nigroviridis]
Length = 534
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 40/111 (36%), Positives = 62/111 (55%), Gaps = 6/111 (5%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQKCLDS 136
+D GDIS + LR KL CK FSW+++++ D ++P + + GE +N+ T +CLD+
Sbjct: 366 VDYGDISSRTTLRQKLQCKPFSWYLENIYPD-----SQIPRHYYSLGEIRNVETNQCLDN 420
Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
M R +G CHG G +Q+F A ++ + C+D K V MLK
Sbjct: 421 MARKENEKVGIFNCHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVMMLK 471
>gi|170039452|ref|XP_001847548.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863025|gb|EDS26408.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 606
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 96/198 (48%), Gaps = 34/198 (17%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
D GD+S+Q+A+R+KL CK F WF++HVA+D+ K+P + PP+ G ++ + C+D+
Sbjct: 415 DAGDLSKQLAIREKLQCKPFKWFIEHVAFDLIEKYPPVEPPDFANGAIQSASSPNLCVDT 474
Query: 137 MGRTAPAVIGTSYCHGSG----SSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
+ IG C ++Q F+L + + ++ +GE C D +
Sbjct: 475 LSHGEKETIGLFGCAADKAQPQANQFFQLSWHRDLRIKYGELCWDVSES----------- 523
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
+A K++ L++C G + WR+ ++Q+ RCL + +
Sbjct: 524 ----IANAKIL---------LYHCHGGQG--NQLWRYDLDSQQIRQGKNNRCLDMDADKR 568
Query: 251 QLAMLRCDENNSYQQWRF 268
+ + C E+N Q+W++
Sbjct: 569 AVFVNPCSESNPNQRWKW 586
>gi|389610347|dbj|BAM18785.1| polypeptide GalNAc transferase 2 [Papilio xuthus]
Length = 256
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 72/135 (53%), Gaps = 11/135 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K++ PLA +D GDISE+V LR+KLHCK F W++K+V +P
Sbjct: 79 RAAEVWMDDYKELYYRSQPLAKQVDFGDISERVRLREKLHCKPFRWYLKNV-------YP 131
Query: 115 ELP-PNLFWGEAKNLGTQ--KCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
EL P L ++ Q +CLD+MG +G CH +G +Q +R + G + H
Sbjct: 132 ELQVPTLENSGDGHVIRQGPRCLDTMGHLVDGTLGMYPCHNTGGNQEWRFD-NGLIRHHS 190
Query: 172 RCVDADKQEKLVEML 186
C+ ++++ +L
Sbjct: 191 LCISLSPEDRVTAVL 205
>gi|327278031|ref|XP_003223766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Anolis carolinensis]
Length = 602
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 86/230 (37%), Gaps = 42/230 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GD+S Q LR L CKDF WFM VA+DV FP
Sbjct: 387 RVAETWMDEFAEYVYQRRPEYRHLSTGDLSAQKELRKHLKCKDFKWFMAAVAWDVPKYFP 446
Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ P WGE +N+ C+DS + C GS QLF +
Sbjct: 447 PVEPLPASWGEIRNVAANLCVDSKHGGTGTELRLDICVKDGSERTWSHEQLFTFGWREDI 506
Query: 168 GHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
GE C DA PVT L++C G
Sbjct: 507 RPGEPLHTRKFCFDAVSHSS-------PVT-------------------LYDCHGMKGNQ 540
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R + L H + C+ +P ++ M RCD + QQW F+ +
Sbjct: 541 HWSYR---KDKTLFHPVSSSCIDCNPAEKKIFMNRCDPLSETQQWIFEHI 587
>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
Length = 604
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 102/229 (44%), Gaps = 41/229 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K++ P + GDIS+++ LR KL+CK F W++ +V D+ P
Sbjct: 405 RVAEVWMDEYKELYYGVRPHHRAIGTGDISDRLELRKKLNCKPFKWYLDNVFPDMMVPLP 464
Query: 115 ELPPNLFW-GEAKNLGTQKCLDSM-GRTAPAVIGTSYC-HGSGSSQLFRLNAE-GQL--- 167
E NL G KN T CLDS+ R G C +G +Q+F + G++
Sbjct: 465 E---NLLGKGAVKNAATNMCLDSLSSREVDMKAGLYPCANGKSENQMFYFTTKYGEIRRE 521
Query: 168 -GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
G RC+D AG K G+ L+++ C G + W+
Sbjct: 522 GTFGARCLD-------------------FAGGK-----PGSTLSMYGCHLMKGNQE--WK 555
Query: 227 FHNSTRQLEHAILKRCL--SVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
S +Q+ HA K CL +V+ +L + CD NN+ Q+W F E P
Sbjct: 556 --RSGKQIVHAASKLCLEAAVNGNDRKLIVNTCDSNNANQEWTFTEYDP 602
>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus impatiens]
Length = 602
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 86/199 (43%), Gaps = 42/199 (21%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
GD+SE+ ALR KL CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492
Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
R A PA G CH G +Q + L+ G++ E C+D
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQYWMLSKTGEIRRDESCLDYS----------------- 533
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
GT + L+ C S G W +++ T+Q+ H +CL++ +L M
Sbjct: 534 -----------GTDVILYPCHGSKGNQQ--WIYNHQTKQIRHGSSDKCLAITESKQRLIM 580
Query: 255 LRCDENNSYQQWRFKEVRP 273
C + Q+W F+ P
Sbjct: 581 EECSAAAARQRWSFENYDP 599
>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus terrestris]
gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus terrestris]
Length = 602
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 86/199 (43%), Gaps = 42/199 (21%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
GD+SE+ ALR KL CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492
Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
R A PA G CH G +Q + L+ G++ E C+D
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQYWMLSKTGEIRRDESCLDYS----------------- 533
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
GT + L+ C S G W +++ T+Q+ H +CL++ +L M
Sbjct: 534 -----------GTDVILYPCHGSKGNQQ--WIYNHQTKQIRHGSSDKCLAITESKQRLIM 580
Query: 255 LRCDENNSYQQWRFKEVRP 273
C + Q+W F+ P
Sbjct: 581 EECSAAAARQRWSFENYDP 599
>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
Length = 685
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 102/211 (48%), Gaps = 33/211 (15%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQ--KC 133
D GD+S + LR L CK F W++ ++ +PEL ++ GE +N Q +C
Sbjct: 474 DFGDVSARKKLRSDLGCKSFKWYLDNI-------YPELFVPGESVAKGEVRNSAVQPARC 526
Query: 134 LDSM-GR-TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD M GR +GT CHG G +QL NA+ +C+D+ +++ P
Sbjct: 527 LDCMVGRHEKNRPVGTYQCHGQGGNQL--RNAQTS-----QCLDSAVGDEVENKAITPYP 579
Query: 192 FCVLAGVKL-IFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAIL 239
G + + + G + +C + +GT + WR+++ T +L+HA+
Sbjct: 580 CHEQGGNQYWMLSKDGEIRRDESCVDYAGTDVMVFPCHGMKGNQEWRYNHDTGRLQHAVS 639
Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
++CL + + +L M+ C ++ YQ W+FKE
Sbjct: 640 QKCLGMTKDGAKLEMVACHYDDPYQHWKFKE 670
>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
Length = 481
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/117 (39%), Positives = 62/117 (52%), Gaps = 3/117 (2%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K I P GDIS+++ LR KL CK F W++K+V DV ++P++
Sbjct: 332 WMDNYKDIFYNRFPNLRQHSYGDISKRIELRKKLKCKSFDWYLKNVFTDV--QYPDMI-F 388
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
L GE +N T CLDSMG A IG CHG G +QL +L E C+DA
Sbjct: 389 LAKGELRNPSTGYCLDSMGNKEYADIGIYPCHGQGGNQLLTYTIRKELEMDEVCLDA 445
>gi|355689619|gb|AER98893.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mustela putorius
furo]
Length = 256
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 103/224 (45%), Gaps = 38/224 (16%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC +FSWF+ ++ +
Sbjct: 63 QVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCHNFSWFLNNIYPE 122
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F+G +NLGT +CLD +G P ++ T CHG G +Q F +
Sbjct: 123 MFV--PDLKPT-FYGAIRNLGTNQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 176
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
L H + C+ AG LG F NS D
Sbjct: 177 RDLRH-----------------NIAKQLCLHAGAST----LGLRSCHFTGKNSQVPKDEE 215
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W R + + CL+ E + AM C+ ++ +QQW F
Sbjct: 216 WEL-TQDRLIRNLGSGTCLT--SEDKKPAMAPCNPSDPHQQWLF 256
>gi|358341053|dbj|GAA48824.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 424
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 97/218 (44%), Gaps = 34/218 (15%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ K SP A+ D GDISE+ ALR +L CK F W++++V DV + F
Sbjct: 222 MRTALVWMDDHKLFYLKSSPDAVHTDYGDISERQALRKRLRCKSFRWYLENV--DVESVF 279
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P + GE ++ + CLD++G+ +G S CHG G +QLF +G++ C
Sbjct: 280 PVDFHGI--GEIRHESSGLCLDTLGQKQHGPVGLSSCHGQGGNQLFVWTTKGEIQAEVGC 337
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
V + K C+ RL T GP F R+
Sbjct: 338 VSPTDDGDTPLLFKP----CL---------RLDT---------------GPQLFDYQDRK 369
Query: 234 LEHAILKRCLSV-HPETNQLAMLRCDENNSYQQWRFKE 270
L H +CL+V + ++ QL + C++ S+ W E
Sbjct: 370 LIHRATAKCLTVSNLKSAQLTLRPCEDTPSF-HWILPE 406
>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Metaseiulus occidentalis]
Length = 583
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 58/213 (27%), Positives = 92/213 (43%), Gaps = 42/213 (19%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K +P+A +D GD S + LR L CK F W+++++ +PE +P
Sbjct: 393 WLDEWKDFYFAINPVAKKVDRGDTSGRHKLRQDLQCKSFRWYLENI-------YPESHMP 445
Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ + GE KN CLD+ G+ + V+ CHG G +Q+F Q+ + C+DA
Sbjct: 446 LDYYHLGEIKNADGNLCLDTYGKKSGDVLYMGKCHGLGGNQVFAYTKRQQIMADDSCLDA 505
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
V+ LF C N G + W + +T+Q+ H
Sbjct: 506 SSPSGPVK--------------------------LFRCHNMGG--NQMWTYDGNTKQIRH 537
Query: 237 AILKRCLSVHPETNQ--LAMLR-CDENNSYQQW 266
RCL P N+ +A L+ C+ Q+W
Sbjct: 538 INTNRCLD-KPNANRSNVATLKECNSGKESQKW 569
>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
Length = 586
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 87/195 (44%), Gaps = 38/195 (19%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
D GDI+ + ALR+KL CK F W++ ++ +++ + GE +NLG + CLDS
Sbjct: 419 DFGDITSRKALREKLGCKSFKWYLDNIYPELFIPGEAVAS----GEIRNLGIGGKTCLDS 474
Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
R + +G CH G +Q + + G++ E C+D QE +
Sbjct: 475 PARRSDLHKPVGLYPCHRQGGNQFWMYSKSGEIRRDEACLDYSGQEVI------------ 522
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
L+ C S G + W ++ +++ L H +CL+++ N+L M
Sbjct: 523 ----------------LYPCHGSKG--NQFWDYNANSKLLRHGSSDKCLAINEAKNKLLM 564
Query: 255 LRCDENNSYQQWRFK 269
CDE + Q W +
Sbjct: 565 EPCDEEATRQHWSLE 579
>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
Length = 671
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 90/202 (44%), Gaps = 44/202 (21%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNL--GTQKC 133
D GD+S + LR++LHCK F W++ ++ +PEL P + + GE +N+ G + C
Sbjct: 504 DFGDVSSRKKLREELHCKPFRWYLDNI-------YPELFVPGDAVASGEVRNMGYGNRTC 556
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD+ R +G CH G +Q + L+ G++ E C+D
Sbjct: 557 LDAPAGKRNLRKAVGLYPCHNQGGNQYWMLSKTGEIRRDEACLD---------------- 600
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
AG ++ L+ C S G + W + + T L H RCL+++ N+
Sbjct: 601 ---YAGDDVV---------LYPCHGSRG--NQYWNYSDDTHLLRHGSSDRCLAINEAKNK 646
Query: 252 LAMLRCDENNSYQQWRFKEVRP 273
L M C+ Q+W F+ P
Sbjct: 647 LIMQDCNPLVEAQRWSFQNYDP 668
>gi|357629476|gb|EHJ78219.1| hypothetical protein KGM_03405 [Danaus plexippus]
Length = 353
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 67/134 (50%), Gaps = 8/134 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K++ PLA +D GDISE+V++R +LHCK F W+++HV +P
Sbjct: 179 RAAEVWMDDYKELYYRSQPLAKQVDFGDISERVSIRQRLHCKPFRWYLEHV-------YP 231
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
EL F +CLD+MG + CH +G +Q + + G + H C+
Sbjct: 232 ELRVPTFGNSIAIKQGPRCLDTMGHQVDGTVAMYPCHNTGGNQEWSFD-NGLIRHQSLCL 290
Query: 175 DADKQEKLVEMLKL 188
+++ + +L +
Sbjct: 291 GLSQEDSVTVVLAV 304
>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
Length = 675
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 95/215 (44%), Gaps = 26/215 (12%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD + Q LR +L+CK F WFM VA+D+ +P + PP
Sbjct: 469 WMDEYAEYIYQRRPEYRHLSAGDTAAQKDLRSQLNCKSFKWFMTKVAWDLSKHYPPVEPP 528
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG--HGERCVDA 176
WGE +N+G+ CL++ + + + C L G++G HG+
Sbjct: 529 AAAWGEIRNVGSSMCLETKHFVSGSPVWMESC----------LKGRGEVGWNHGQVFTFG 578
Query: 177 DKQEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
+++ V M V F ++ + + L++C G + WR+ + L
Sbjct: 579 WREDIRVGDPMHTKKVCFDAVSN--------NSPVTLYDCHGMKG--NQLWRYRKD-KSL 627
Query: 235 EHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
H + C+ P+ ++ M CD ++ QQW F+
Sbjct: 628 YHPVSNSCVDSSPKEGRVFMNTCDPSSPNQQWVFE 662
>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oreochromis niloticus]
Length = 587
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 95/215 (44%), Gaps = 22/215 (10%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR++L+CK+F WFM VA+D+ +P + PP
Sbjct: 379 WMDEYAEYIYQRRPEYRHLSAGDMTVQKELRNRLNCKNFKWFMSEVAWDLPKHYPPVEPP 438
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
WGE +N+G+ C++S + + I C + + HG+ +
Sbjct: 439 AAAWGEIRNVGSSMCMESKHFVSGSPIRLENC--------VKGRGDVSWSHGQVFTFGWR 490
Query: 179 QEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
++ V M V F ++ + + L++C G + WR+ + L H
Sbjct: 491 EDIRVGDPMHTKKVCFDAISH--------NSPVTLYDCHGMKG--NQLWRYRKD-KSLYH 539
Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ C+ P ++ M CD ++ QQW F++
Sbjct: 540 PVSNSCIDSSPNERRVFMNTCDPSSPTQQWLFEKT 574
>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 624
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 42/131 (32%), Positives = 67/131 (51%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
W+ +K +P GD+SE+ ALR+KLHCK F W++++V +PE +P
Sbjct: 432 WMDEYKDFFYKINPGVRSTSYGDVSERKALREKLHCKSFKWYLQNV-------YPESQMP 484
Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
GE +N T +C+DSMGR + +G CHG G +Q+F + L + C+D
Sbjct: 485 VEYHALGEIRNKATGQCIDSMGRKSGEKVGMVQCHGMGGNQIFSYTKKQALQTDDVCLDV 544
Query: 177 DKQEKLVEMLK 187
V++ +
Sbjct: 545 SSLHGPVKLFQ 555
>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
Length = 669
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 67/124 (54%), Gaps = 15/124 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ PLA + GDI +++ LR++L CK F W++++V
Sbjct: 480 RAAEVWMDEYKRYYYAAVPLATNIPFGDIEDRLRLREELQCKPFRWYLENVY-------- 531
Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
P L E +N G+ + CLDS+G A A++G CHG+G +Q + LN +G++ H
Sbjct: 532 ---PQLSVPERRNNGSIRQGAFCLDSLGNVAGAIVGLYSCHGNGGNQNWILNRKGEVKHH 588
Query: 171 ERCV 174
+ C+
Sbjct: 589 DLCL 592
>gi|405973911|gb|EKC38600.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Crassostrea gigas]
Length = 581
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 88/217 (40%), Gaps = 36/217 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A ++ GDISE++ LR +L CK F WF++HV +P
Sbjct: 392 RAAEVWMDNYKEFYYAAVPSAKMVNFGDISERMDLRKRLSCKPFKWFLEHV-------YP 444
Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P + C+D++G A ++G CH +G +Q F L EG + H +
Sbjct: 445 ELKVPGHQDQAFGSIQQDNNCMDTLGNFADGILGIFPCHFAGGNQEFSLTKEGFIRHLDL 504
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C V L GTV+ LF C G W+ +
Sbjct: 505 C------------------------VTLTGSMPGTVVKLFQC--QEGNTLQMWQRTSRDT 538
Query: 233 QLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
L+ CL S + L + C E + Q+W F
Sbjct: 539 MLKSHNYDLCLDSQEVQMKGLIVNACQEGSPTQKWSF 575
>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
Length = 584
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 66/123 (53%), Gaps = 10/123 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K I P A +D+GD++E+ LR+ L CK F W+++ T +P
Sbjct: 398 RTAEVWMDEYKDIFYSMVPAARNVDVGDLTERKILRENLQCKSFRWYLE-------TIYP 450
Query: 115 E--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + F G+ +N+G +CLD+ GR+A CHG G +QL+ +G++ E
Sbjct: 451 ESPIPIDFFSLGQVQNMGVMECLDTAGRSAGDSPAMLPCHGQGGNQLWTYTGKGEIRSDE 510
Query: 172 RCV 174
C+
Sbjct: 511 LCL 513
>gi|350593489|ref|XP_003133459.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Sus scrofa]
Length = 321
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 64/110 (58%), Gaps = 4/110 (3%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSM 137
+D GD+S + LR+ L CK FSW+++++ D ++ P +L GE +N+ T +CLD+M
Sbjct: 155 VDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYSL--GEIRNVETNQCLDNM 210
Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
GR +G CHG G +Q+F A+ ++ + C+D + V MLK
Sbjct: 211 GRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLK 260
>gi|312372346|gb|EFR20327.1| hypothetical protein AND_20267 [Anopheles darlingi]
Length = 616
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/201 (26%), Positives = 92/201 (45%), Gaps = 37/201 (18%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
D+GDIS+Q+A+R+KL CK F WFM HVA+D+ K+P + PP+ G ++ C+D+
Sbjct: 422 DVGDISKQIAIREKLQCKPFKWFMTHVAFDLMEKYPPIEPPDFANGAIQSAARPALCVDT 481
Query: 137 MGRTAPAVIGTSYC----HGSGSSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
+ IG C +Q F+L + + ++ GE C D + ++L
Sbjct: 482 LNHGEKQTIGLYSCAEDKRQPQPNQFFQLSWHRDLRIKFGELCWDVSESVPNAKIL---- 537
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHP--- 247
L++C G + WR+ ++ L+ RCL + P
Sbjct: 538 --------------------LYHCHGGQG--NQLWRYDPDSQMLKQGKNNRCLDMDPSLG 575
Query: 248 ETNQLAMLRCDENNSYQQWRF 268
+ Q+ + C+ N +Q+W +
Sbjct: 576 DGRQVFVNPCNLTNPHQRWHW 596
>gi|209875919|ref|XP_002139402.1| glycosyl transferase [Cryptosporidium muris RN66]
gi|209555008|gb|EEA05053.1| glycosyl transferase, group 2 family protein [Cryptosporidium muris
RN66]
Length = 743
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 65/230 (28%), Positives = 105/230 (45%), Gaps = 41/230 (17%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
LR +WL F SP +D+GD +E + L+ +L CK F WF+ +VA + Y K
Sbjct: 540 LRTARAWLDEFYIFTKALSP-NKSIDLGDFNEVIQLKQRLQCKPFKWFLDNVAPETYLK- 597
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
++ + GE KN C+D+M ++ + + IG +CHG +Q F
Sbjct: 598 -DINEVQYLGEIKNPNLNMCIDTMSKSESGSKIGVFFCHGEKGTQGF------------- 643
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
+ EK+ M + +FC+ L + NC + W+++ +T
Sbjct: 644 -IFLKGSEKIRLMSR--ESFCI-----------SDRLTMENC----DEITTRWKYNENTL 685
Query: 233 QLEHAILK---RCLSVHPETNQLAMLRCDENNSY--QQWRF-KEVRPDWS 276
L H L +CL+V P +N+ L+ + N Y QQW F K VR ++S
Sbjct: 686 ALTHKGLDNSTKCLTVLPNSNKDLNLQLLDCNYYKEQQWIFEKFVRSEYS 735
>gi|339239855|ref|XP_003375853.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316975462|gb|EFV58902.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 625
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/200 (29%), Positives = 91/200 (45%), Gaps = 44/200 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGTQK--C 133
+ GD+S+++ALR +L+C F W++K+V +PEL P N + GE + +G K C
Sbjct: 459 EFGDVSDRIALRQRLNCSSFEWYIKNV-------YPELFVPGNSIAKGEIRCMGQNKRHC 511
Query: 134 LD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD + GR + Y CHG G +Q + L+ G++ E CVD
Sbjct: 512 LDFASGRKEHNKPISMYPCHGEGGNQYWMLSPTGEIRRDESCVD---------------- 555
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
AG K+ L C G + W+++ + L H I CL ++ +
Sbjct: 556 ---YAGQKVF---------LSGCHGLKGNQE--WKYNFKDKTLTHVISGLCLEMNHDATG 601
Query: 252 LAMLRCDENNSYQQWRFKEV 271
+ M CD N QQW F E+
Sbjct: 602 VIMASCDSQNLNQQWTFGEI 621
>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
malayi]
Length = 584
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 66/123 (53%), Gaps = 10/123 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K I P A +D+GD++E+ LR+ L CK F W+++ T +P
Sbjct: 398 RTAEVWMDEYKDIFYGMVPAAKNVDVGDLTERKILRENLQCKSFRWYLE-------TIYP 450
Query: 115 E--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E +P + F G+ +N+G +CLD+ GR+A CHG G +QL+ +G++ E
Sbjct: 451 ESPIPIDFFSLGQVQNMGVMECLDTAGRSAGDSPAMLPCHGKGGNQLWTYTGKGEIRSDE 510
Query: 172 RCV 174
C+
Sbjct: 511 LCL 513
>gi|326428492|gb|EGD74062.1| polypeptide N-acetylgalactosaminyltransferase 1 [Salpingoeca sp.
ATCC 50818]
Length = 531
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/125 (40%), Positives = 71/125 (56%), Gaps = 8/125 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMF--LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
LR WL +K + G + F +D GD+S Q+ALR +L CK FSW+++ V ++
Sbjct: 348 LRIVEVWLDEYKP-QFYGHRASHFTGVDFGDVSAQIALRSRLGCKPFSWYVQTVYPQLHV 406
Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
PE NL WGE + GT CLD+MG +GT CHG G +Q + L A+G+L HG
Sbjct: 407 --PEHH-NLAWGELR-CGT-VCLDTMGHGDGEEVGTFGCHGLGGNQYWVLTAQGELMHGV 461
Query: 172 RCVDA 176
C+ A
Sbjct: 462 LCLAA 466
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 20/53 (37%), Positives = 29/53 (54%)
Query: 1 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLE 53
MG +GT CHG G +Q + L A+G+L HG C+ A +S K +L+
Sbjct: 427 MGHGDGEEVGTFGCHGLGGNQYWVLTAQGELMHGVLCLAATPTGVSLAKCALD 479
>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
17 [Ciona intestinalis]
Length = 870
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 91/218 (41%), Gaps = 33/218 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P + GD+S+Q ALR+ L C+ F WFMK VA D+ +P
Sbjct: 659 RVAEVWMDEYAEYFYKKRPHVRGIHPGDLSKQKALRELLECRSFDWFMKEVAPDIIKHYP 718
Query: 115 E-LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSS-QLFRLNAEGQLGHGER 172
+P WG N G+++CLD + + A + C G++ Q F L + + G
Sbjct: 719 PVMPEPAAWGMLSNEGSKRCLDGLYKKEGAPLSLMPCREEGTADQSFILTWKEDIRPG-- 776
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLN----LFNCFNSSGTVDGPWRFH 228
DK K FC G LN L+ C G + W++
Sbjct: 777 -TSMDKARK--------------------FCLDGQGLNSPVVLWQCHGQYG--NQLWKYK 813
Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ T L H I C + E ++ +L C N+ Q+W
Sbjct: 814 DDT--LIHGITGHCATSDAEAKKVILLPCKANDINQRW 849
>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
2, partial [Ciona intestinalis]
Length = 582
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 95/219 (43%), Gaps = 41/219 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A + G+I ++ +R + CK F W++++V +P
Sbjct: 393 RAAEVWMDDYKEYYYAAVPSAKLIPFGNIENRLQIRVRNQCKPFKWYLENV-------YP 445
Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G K G KC+D++G IG CH SG +Q F +N E Q+ H
Sbjct: 446 ELRVPSKESVAFGSIKQ-GVNKCIDTLGHVQEGSIGLYECHDSGGNQEFSMNKEMQIRHQ 504
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
+ C A + R G+++ L +C + + T+ +F S
Sbjct: 505 DLCFTAGEG-----------------------AREGSIIKLRHC-DENNTMQ---KFEQS 537
Query: 231 TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
QL C+ S H L + +CD + S QQWRF
Sbjct: 538 KSQLRLYQSSYCIDSRHERDMGLILSKCDHSRS-QQWRF 575
>gi|195114158|ref|XP_002001634.1| GI15842 [Drosophila mojavensis]
gi|193912209|gb|EDW11076.1| GI15842 [Drosophila mojavensis]
Length = 628
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KL CK F W+++HV D+ T P
Sbjct: 444 RAAEVWMDEYKQHYYNAVPLAKNIPFGNIDDRLALKEKLQCKPFKWYLEHVYPDLQTPDP 503
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + + G++ H E C+
Sbjct: 504 QDV-----GQFRQDATE-CLDTMGHIVDGTVGLFPCHNTGGNQEWTFSKRGEIKHDELCL 557
>gi|167534921|ref|XP_001749135.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772288|gb|EDQ85941.1| predicted protein [Monosiga brevicollis MX1]
Length = 576
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/229 (27%), Positives = 103/229 (44%), Gaps = 48/229 (20%)
Query: 53 ELRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD-V 109
+LR + W+ + I +LG MGDIS+ + LRD+L CK+F W++K+V + +
Sbjct: 385 KLRTAAVWMDDYAFIVHEALGIKPEQLEQMGDISDMLELRDRLQCKNFDWYLKNVYPESI 444
Query: 110 YTKFPELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEGQL- 167
T ++ G KN + +CLD++ ++ A + +G CHG GS + L+ ++
Sbjct: 445 ITDRADIKA---LGMIKNPASNQCLDTLQKSWADSPVGVYGCHGQGSQRFLTLSKTHEIR 501
Query: 168 --GHGERCVDADKQEKLVEMLKLPVTFCVLA-GVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
+ E C+ + +L +T+C + K F G V N N
Sbjct: 502 PVANLELCLTS----------RLTITWCEASRDAKFEFTPAGLVQNKHN----------- 540
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
RCL V ET +LA CDEN++ Q+W F + P
Sbjct: 541 ---------------GRCLRVD-ETPKLAFADCDENDARQKWIFPDESP 573
>gi|6624102|gb|AAF19246.1|AC009227_1 similar to polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]; similar to JC4223 (PID:g2135942), partial
[Homo sapiens]
Length = 166
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/109 (35%), Positives = 63/109 (57%), Gaps = 4/109 (3%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
D GD+S + LR+ L CK FSW+++++ D ++ P +L GE +N+ T +CLD+MG
Sbjct: 1 DYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYSL--GEIRNVETNQCLDNMG 56
Query: 139 RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLK 187
R +G CHG G +Q+F A+ ++ + C+D + V MLK
Sbjct: 57 RKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLK 105
>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Callithrix jacchus]
Length = 622
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/124 (37%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ FK+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSFKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|345488662|ref|XP_003425959.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Nasonia vitripennis]
Length = 572
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 92/215 (42%), Gaps = 34/215 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ KLHCK FSW++KHV ++
Sbjct: 386 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKHVYPELIIPTS 445
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E P G + GT CLDSMG +G CH +G +Q + + +G + H + C
Sbjct: 446 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGMTNDGLIKHHDLC- 499
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
L LPV GT L + C G+ + WR H +
Sbjct: 500 -----------LTLPVY------------AKGTSLLMQIC---DGSENQKWR-HLEGGLI 532
Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
HA + C+ S + + +CD N Q+W
Sbjct: 533 RHARIPVCIDSRYHAQRGVTAEKCDSNAESQRWHL 567
>gi|170589103|ref|XP_001899313.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158593526|gb|EDP32121.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 636
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 98/251 (39%), Gaps = 35/251 (13%)
Query: 30 QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
++GH RC + I S R W+ + + P + +D GD+SEQ
Sbjct: 399 RVGHIYRCKYIPFSNPGIGDFISRNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSEQK 458
Query: 88 ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
A+R +L CK F WFMK +A+D +P + P + GE +N+ KC+D+ + A G
Sbjct: 459 AIRKRLKCKSFDWFMKKIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTDFKNAEKKFG 518
Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKL 200
C + N H R C D V + K P+
Sbjct: 519 LRKCVSDDPTNHGEQNLRLTFWHDVRLQTRTMCFDVS-----VSVNKAPIV--------- 564
Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
LF+C G + +++ +Q+ H I C+ E ++ M CD
Sbjct: 565 ----------LFSCHGMQG--NQHFKYLPDKQQMYHPISDLCMDCDVEQGEIFMNPCDSE 612
Query: 261 NSYQQWRFKEV 271
Q+W + ++
Sbjct: 613 RRSQKWTWTKI 623
>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
Length = 584
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/220 (27%), Positives = 96/220 (43%), Gaps = 46/220 (20%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K I P A +D+GD++E+ LR+ L CK F W+++ T +P
Sbjct: 398 RTAEVWMDEYKDIFYKMVPAAKNVDIGDLTERKVLRENLQCKSFRWYLE-------TIYP 450
Query: 115 ELPPN---LFWGEAKNLGTQKCLDSMGRTA---PAVIGTSYCHGSGSSQLFRLNAEGQLG 168
E P L G+ +N+G CLD+ GR+A PA++ CHG G +QL+ +G++
Sbjct: 451 ESPIPIDFLSLGQIQNMGVVGCLDTAGRSAGDSPAILP---CHGKGGNQLWAYTGKGEIR 507
Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
E C + F V G+ + C +L+ + +
Sbjct: 508 ADELC----------------LAFTV-KGISMEKCADWVLLSKVT-----------FEYV 539
Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
T L+H CL + T QL+ C N+ Q+WR
Sbjct: 540 KKTLMLKHRETGLCLQANESTLQLST--CLHNDMNQKWRL 577
>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Saimiri boliviensis boliviensis]
gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Saimiri boliviensis boliviensis]
Length = 622
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/135 (35%), Positives = 75/135 (55%), Gaps = 13/135 (9%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLKLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG---ERCVDADK 178
L H + C+ A K
Sbjct: 544 LRHNIAKQLCLHASK 558
>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
Length = 670
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/195 (28%), Positives = 87/195 (44%), Gaps = 44/195 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLGT--QKC 133
D GD+SE+ LR+ L CK F W++ +V +PEL P + GE +NLG + C
Sbjct: 503 DFGDVSERKKLREDLQCKSFKWYLDNV-------YPELFIPGDAVANGEMRNLGYGGRTC 555
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LDS R +G CH G +Q + + G++ + C+D
Sbjct: 556 LDSPSGKRYLKKPVGLYPCHRQGGNQYWMFSKTGEIRRDQACLD---------------- 599
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
AG +I LF C S G + W +H +T+ L H +CL+++ ++
Sbjct: 600 ---YAGKDVI---------LFGCHGSKG--NQFWTYHENTKLLHHGSSGKCLAINESKDK 645
Query: 252 LAMLRCDENNSYQQW 266
L M CD ++ Q W
Sbjct: 646 LIMEECDASHLRQHW 660
>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 621
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/212 (26%), Positives = 90/212 (42%), Gaps = 41/212 (19%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K +P A +D GD+S + LR KL C F W+++++ +PE +P
Sbjct: 435 WLDEWKDFYFAINPAAKNVDKGDLSYRKQLRTKLKCNTFRWYLENI-------YPESHMP 487
Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ + GE K+ T CLD+ GR + + S CHG G +Q+F Q+ + C+DA
Sbjct: 488 LDYYHLGEIKHADTSDCLDTFGRKSGENVAVSKCHGMGGNQVFAYTKRQQIMSDDNCLDA 547
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
V++L+ C G + W ++ + +H
Sbjct: 548 SSPRGPVKLLR--------------------------CHGMGG--NQLWIYNKEEQSFKH 579
Query: 237 AILKRCLSVHPETNQ--LAMLRCDENNSYQQW 266
RCL PE L +LR + S Q+W
Sbjct: 580 VNTARCLD-QPEAKDPSLPVLRECDGRSSQRW 610
>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
Length = 613
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 94/222 (42%), Gaps = 36/222 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ +P D GD+++Q ALR++L CK F WF++ VA D+ ++P
Sbjct: 400 RVAEVWMDEYKQFLYERNPQFDQTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLVVRYP 459
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
P F G ++ K CLDSM A IG C + + +Q F L + +
Sbjct: 460 LRDPKPFASGRVQSAANPKLCLDSMNHKAKEPIGVFSCAANRTYPQNNQFFTLTYYRDIR 519
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+ ++C+DA G+ + LFNC S G + W+
Sbjct: 520 VSSVDKCLDASSD--------------------------GSEVILFNCHESQG--NQLWQ 551
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ T+ + H R + ++ + +CD Q+W +
Sbjct: 552 YDTETQMIRHGKPTRNQCLDLVERKVVVSKCDHRKKTQRWEW 593
>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 573
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 98/219 (44%), Gaps = 20/219 (9%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P +D GD++ Q A+R++L+CK F WF++++A+D+ K+P
Sbjct: 370 RVAEVWMDEYAEYLYKRRPHYRNIDPGDLTVQKAVRERLNCKPFKWFIENIAFDLPLKYP 429
Query: 115 EL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
+ PP+L GE +++ C+D+ + G C + S+ R L E
Sbjct: 430 PIEPPDLAEGEIRSIADPGLCVDTERKEPEDTFGLKPCEKNFKSKNTRTEQYFILTWHED 489
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
P V V I + +NL+ C G + W + ++
Sbjct: 490 I--------------RPKGRNVCWDVSSIDNKAS--VNLYKCHGMKG--NQYWHYDSNEN 531
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H RCL+ + E+ L + +CD+N Q+W+ +++
Sbjct: 532 WLVHGTKNRCLTANVESQTLYVTKCDKNIENQKWKIEKI 570
>gi|324506451|gb|ADY42754.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Ascaris suum]
Length = 618
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 92/194 (47%), Gaps = 31/194 (15%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSM 137
D GD+++Q A+R++L CK F WFMK +A+D +P + P + GE +N+ Q C+D+
Sbjct: 431 DEGDLTKQKAVRERLKCKSFDWFMKEIAFDQDKFYPAVEPEDSANGELRNVKAQTCVDTQ 490
Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA- 196
R G S+ F L +C+ D + ++L +F V
Sbjct: 491 FR--------------GGSERFGLR---------KCISDDPSGGGEQNMRLTFSFDVRPK 527
Query: 197 GVKLIFCRLGTV----LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
G L F +V + LF+C G + +++ +++Q+ H + +C+ E ++
Sbjct: 528 GRTLCFDVSSSVRKAPVVLFSCHGMQG--NQLFKYLPNSKQMYHPVSNQCMDCDAERGEI 585
Query: 253 AMLRCDENNSYQQW 266
M +CDE QQW
Sbjct: 586 FMDKCDEGKPTQQW 599
>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 561
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 95/217 (43%), Gaps = 39/217 (17%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE---L 116
W+ +K P A + GDI ++ LR+KL C F W+++++ FP+ L
Sbjct: 371 WMDEYKSFYYKTVPNARNAEYGDIEDRKQLREKLQCNSFRWYLENI-------FPDSQFL 423
Query: 117 PPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
N F + E +N+ T++CLD+MG+ + S CHG G Q++ + +L H + C+D
Sbjct: 424 LDNYFRFCEVRNMETKQCLDNMGQKEKSKAALSRCHGQGGHQIYAWSKLNELKHDDLCLD 483
Query: 176 ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLE 235
A P F + + C + GT + WR++ + +
Sbjct: 484 ASA----------PSGFKDVEQSR--------------CNSHGGTQE--WRYNEELKSIM 517
Query: 236 HAILKRCLS-VHPETNQLAMLRCDENNSYQQWRFKEV 271
H + CL ++ + +CD N Q+W E+
Sbjct: 518 HVVSGLCLDKADADSTTPTLHQCD-NRLSQKWELVEI 553
>gi|194761420|ref|XP_001962927.1| GF15680 [Drosophila ananassae]
gi|190616624|gb|EDV32148.1| GF15680 [Drosophila ananassae]
Length = 630
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 65/120 (54%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 446 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 505
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + GT+ CLD+MG +G CH +G +Q + + G++ H + C+
Sbjct: 506 QE-----IGQFRQDGTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFSKRGEIKHDDLCL 559
>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Nomascus leucogenys]
Length = 622
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
Length = 603
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 409 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 468
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 469 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 524
Query: 167 LGHG 170
L H
Sbjct: 525 LRHN 528
>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
gorilla gorilla]
Length = 622
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Pan paniscus]
gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pan paniscus]
Length = 622
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 6 [Homo sapiens]
Length = 622
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
Length = 622
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pongo abelii]
gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
3 [Pongo abelii]
Length = 622
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Nasonia vitripennis]
Length = 610
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 83/197 (42%), Gaps = 36/197 (18%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
+ GD+S++ ALR L CK F W++ ++ +++ + GE ++L ++ C+DS G
Sbjct: 445 NYGDVSDRKALRKNLGCKSFKWYLDNIYPELFIPGEAVAS----GEIRHLASRLCIDSPG 500
Query: 139 RTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
+G CH G +Q + L+ G++ E C+D
Sbjct: 501 NPEDLHQAVGFYECHNQGGNQYWMLSKTGEIRRDESCLDYS------------------- 541
Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLR 256
GT + L+ C S G W ++ T+Q+ H +CL++ +L M
Sbjct: 542 ---------GTDVILYPCHGSKGNQQ--WTYNTQTKQIRHESSGKCLAITESKQRLLMEE 590
Query: 257 CDENNSYQQWRFKEVRP 273
C Q+W F+ P
Sbjct: 591 CSPAAPRQRWAFENYDP 607
>gi|195129477|ref|XP_002009182.1| GI11401 [Drosophila mojavensis]
gi|193920791|gb|EDW19658.1| GI11401 [Drosophila mojavensis]
Length = 673
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 100/219 (45%), Gaps = 36/219 (16%)
Query: 60 WLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-P 117
W+ +KQ G+ + +D GD++ Q A+R KL CK F WFM++VA+D+ +P + P
Sbjct: 459 WMDEYKQYLYNGADGVYERIDAGDLTAQKAIRTKLKCKSFKWFMENVAFDLIKNYPPIDP 518
Query: 118 PNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLN--AEGQLGHG 170
P+ G +N+G C+D++ + +G C + +Q + L+ + +L
Sbjct: 519 PDYASGAIQNVGDPTLCVDTLSKPRHNRVGIYSCARNLVKPQRTQYWSLSWKRDLRLHRK 578
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
+ C+D + PV L++C G + W +
Sbjct: 579 KDCLDVQ-----IWDANAPVW-------------------LWDCHGQQG--NQYWYYDYR 612
Query: 231 TRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
T+ ++H KRC+ + P+T +L + CDE N QW F
Sbjct: 613 TKLIKHGKEGKRCMELLPDTEELVVNACDEKNRLMQWNF 651
>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
Length = 592
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 99/229 (43%), Gaps = 19/229 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 367 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 424
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG-QLGHG-ER 172
+L GE +N+ T +CLD+MGR +G CHG G +Q+ L LG G E
Sbjct: 425 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEE 482
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLI--------FCRLGTVLNLFNCFNSSGTVDGP 224
C LV + K I RL + + C + G +
Sbjct: 483 CCSNHPLYGLVYTPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRG--NQL 540
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
W + L HA +CL E +++ M C + S QQW + +
Sbjct: 541 WEYDAERLTLRHANSNQCLDEPSEEDKMVPTMQDCSGSRS-QQWLLRNM 588
>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
musculus]
Length = 594
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 64/229 (27%), Positives = 99/229 (43%), Gaps = 19/229 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P
Sbjct: 369 RLAEVWMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIP 426
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG-QLGHG-ER 172
+L GE +N+ T +CLD+MGR +G CHG G +Q+ L LG G E
Sbjct: 427 RRYYSL--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVHDLCLSAPSLGVGAEE 484
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLI--------FCRLGTVLNLFNCFNSSGTVDGP 224
C LV + K I RL + + C + G +
Sbjct: 485 CCSNHPLYGLVYTPTINEQVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRG--NQL 542
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLA--MLRCDENNSYQQWRFKEV 271
W + L HA +CL E +++ M C + S QQW + +
Sbjct: 543 WEYDAERLTLRHANSNQCLDEPSEEDKMVPTMQDCSGSRS-QQWLLRNM 590
>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Hydra magnipapillata]
Length = 548
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 69/136 (50%), Gaps = 8/136 (5%)
Query: 55 RKRSSWLKVFKQIR-SLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K++ P + GDISE+V LR KL CK F W++ +V D+
Sbjct: 398 RLAEVWMDEYKELYYRRKPPEDKLVKYGDISERVELRKKLGCKSFKWYIDNVIPDMIGAD 457
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAP-AVIGTSYCHGSGSSQLFRLNAEGQLGHG-E 171
P P + GE +N+ + CLDSMG A I CH G +Q F L+ G++ H E
Sbjct: 458 PNPPAH---GEVRNVASNMCLDSMGNKGNRAQIKVFPCHRLGGNQFFVLSKRGEIIHNDE 514
Query: 172 RCVD--ADKQEKLVEM 185
C+D + +E V+M
Sbjct: 515 SCLDYSLENEENKVDM 530
>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
Length = 495
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 88/211 (41%), Gaps = 39/211 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
W+ ++ +P D GD+S ++ LR+KL CK F W+++++ +PE +P
Sbjct: 314 WMDEWRSFYYKINPGVKQTDYGDLSPRIQLREKLECKSFRWYLQNI-------YPESQMP 366
Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ + GE +N T +CLDSMGR A +G CHG G +Q+F + + + C+D
Sbjct: 367 LDYYSLGEIRNKETNQCLDSMGRKAGEKVGIVGCHGMGGNQIFSYSKKKAFQTDDLCLD- 425
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
V L PV L+ C G + W T + H
Sbjct: 426 ------VSALTGPV-------------------KLYQCHGLGG--NQLWEHDQETLVIRH 458
Query: 237 AILKRCLS-VHPETNQLAMLRCDENNSYQQW 266
+CL P+ L +S QQW
Sbjct: 459 VSSNQCLDKPTPQDRDTPALGACSGSSSQQW 489
>gi|195435185|ref|XP_002065582.1| GK14594 [Drosophila willistoni]
gi|194161667|gb|EDW76568.1| GK14594 [Drosophila willistoni]
Length = 635
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 65/120 (54%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 451 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEP 510
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + GT+ CLD+MG +G CH +G +Q + + G++ H + C+
Sbjct: 511 QE-----IGQFRQDGTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCL 564
>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Cavia porcellus]
Length = 622
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSWF+ ++ +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKIAQEKSFGDISERLQLRERLHCHNFSWFLSNIYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLG +CLD R +I S CHG G +Q F + +
Sbjct: 488 MFV--PDLSPT-FYGAIKNLGINQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRE 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|198437817|ref|XP_002130165.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
intestinalis]
Length = 647
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 51/213 (23%), Positives = 89/213 (41%), Gaps = 21/213 (9%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W + + P +D GD+S ++ + KL C+ F WFMK +AYD+ +P
Sbjct: 429 RVIETWWDDYSKYFYTRRPEVKSIDTGDLSVPLSYKKKLQCRSFDWFMKEIAYDIPKHYP 488
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ PP+ G N+GT+ CLD + + + S C G + L + G
Sbjct: 489 MVEPPSGASGHISNIGTKLCLDGKFGGSGSPLTLSNCEQHGGELNWWLTWLEDIRPGGNT 548
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
D ++ V L +V++ + C + G + W++ +Q
Sbjct: 549 PDTARK------------------VCLDCVSRDSVVSFWECHHMRG--NQLWKYIFDNKQ 588
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
L H++ + CL P ++ C+ NN Q W
Sbjct: 589 LYHSVSRSCLEASPGEGKVYARECNHNNLNQLW 621
>gi|321455342|gb|EFX66478.1| hypothetical protein DAPPUDRAFT_302681 [Daphnia pulex]
Length = 613
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 92/223 (41%), Gaps = 34/223 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P L++GD+S Q +R++L CK F WFM ++A+D+ K+P
Sbjct: 399 RVAEVWMDEYKEYLYKRRPQYRNLEVGDLSSQTEIRERLQCKSFKWFMTNIAFDLPKKYP 458
Query: 115 EL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYC--HGSGSS--QLFRLNAEGQLG 168
+ PP+ GE ++ C+D+ + C GSG S Q F L +
Sbjct: 459 PIEPPDFASGEIRSKADSTLCVDTQFKKENERFNLEQCVKDGSGRSGEQQFALTWHKDIR 518
Query: 169 HGER--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+R C D + +L L+ C S G + WR
Sbjct: 519 PHKRTMCWDVSSLDAQAPVL------------------------LWGCHGSQG--NQLWR 552
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ T QL H RCL P +L + C+ + Q+W F+
Sbjct: 553 YDPETLQLIHGGNPRCLDCDPGRKELFVATCNSESPTQRWTFE 595
>gi|195377912|ref|XP_002047731.1| GJ13596 [Drosophila virilis]
gi|194154889|gb|EDW70073.1| GJ13596 [Drosophila virilis]
Length = 675
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/224 (25%), Positives = 102/224 (45%), Gaps = 36/224 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K + + G + +D GD++ Q A+R KL CK F WFM++VA+D+ +
Sbjct: 459 RVAEVWMDEYKNYLYNHGDGIYDNVDPGDLTAQKAIRTKLKCKSFKWFMENVAFDLMKSY 518
Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCH----GSGSSQLFRLN--AEG 165
P + PP+ G +N+G C+D++GR +G C +Q + L+ +
Sbjct: 519 PPVDPPDYASGAIQNVGDNTLCIDTLGRVRHNRVGMYRCAIDLVKPQRTQYWSLSWKRDL 578
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+L + C+D + PV L++C G + W
Sbjct: 579 RLRRKKDCLDVQIWDA-----NAPVW-------------------LWDCHGQQG--NQYW 612
Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ T+ ++H KRC+ + P + +L + C+E+N Y +W F
Sbjct: 613 FYDYHTKMIKHGKEGKRCMELLPFSEELVVNNCNESNEYMRWNF 656
>gi|332020473|gb|EGI60888.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Acromyrmex
echinatior]
Length = 442
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 91/215 (42%), Gaps = 34/215 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ KLHCK FSW++K+V ++
Sbjct: 256 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTS 315
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E P G + GT CLDSMG +G CH +G +Q + L +G + H + C
Sbjct: 316 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 369
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
L LPV GT L + C G+ + WR H +
Sbjct: 370 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HMEGGLI 402
Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
H + C+ S + + +CD N Q+W
Sbjct: 403 RHTRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 437
>gi|241622516|ref|XP_002407424.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
gi|215500988|gb|EEC10482.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
Length = 471
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 98/223 (43%), Gaps = 31/223 (13%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P LD GD++ Q ALR +L+CK F WFM+ VA+D +K+P
Sbjct: 266 RVAEVWMDEYKEHLYHRRPHYRHLDPGDLTAQKALRKRLNCKSFKWFMEQVAFDQPSKYP 325
Query: 115 E-LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL-----NAEGQLG 168
L P W + +N + C+D+ + G ++ F L + G+ G
Sbjct: 326 ALLTPVAHWPQVRNEESGLCIDTQFK--------------GQNERFSLAPCLKDQRGRSG 371
Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
+ + K V K V F V + + + L++C G + W++
Sbjct: 372 EQQLVLTWHKD---VRPAKRSVCFDVSSS------DVHAPVMLWSCHGMHG--NQLWKYD 420
Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
T+ L H I CL +++++ M CD + Q+W F+ V
Sbjct: 421 TVTKHLFHPITANCLDCDAKSHEVFMSICDSDVRTQRWLFEHV 463
>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 460
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 68/131 (51%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ +P A +D GD+S + LR KL C F W+++++ +PE +P
Sbjct: 274 WLDEWKEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENI-------YPESHMP 326
Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ + GE K+ + CLD+ GR + + S CHG G +Q+F Q+ + C+DA
Sbjct: 327 LDYYHLGEIKHADSPVCLDTFGRKSGENVAVSTCHGQGGNQVFAYTKRQQIMSDDNCLDA 386
Query: 177 DKQEKLVEMLK 187
V++L+
Sbjct: 387 SSPRGPVKLLR 397
>gi|10438640|dbj|BAB15297.1| unnamed protein product [Homo sapiens]
gi|193787354|dbj|BAG52560.1| unnamed protein product [Homo sapiens]
Length = 187
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/113 (37%), Positives = 64/113 (56%), Gaps = 6/113 (5%)
Query: 59 SWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPP 118
S+ K+F + + +A GDISE++ LR++LHC +FSW++ +V +++ P+L P
Sbjct: 3 SYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPEMFV--PDLTP 60
Query: 119 NLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
F+G KNLGT +CLD R +I S CHG G +Q F + L H
Sbjct: 61 T-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRDLRH 111
>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
Length = 622
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLMPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Takifugu rubripes]
Length = 600
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 95/215 (44%), Gaps = 26/215 (12%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR +L CK+F WFM +VA+D+ +P + PP
Sbjct: 392 WMDEYAEYVYQRRPEYRHLSAGDMTPQKELRSRLGCKNFKWFMSNVAWDLPKHYPPVEPP 451
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG--HGERCVDA 176
WGE +N+G+ C++ + + I C + + G++G HG+
Sbjct: 452 AAAWGEIQNVGSGLCMEIKHFVSGSPIRLENC----------VKSRGEVGWSHGQVLTFG 501
Query: 177 DKQEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
+++ V M V F ++ + + L++C G + WR+ R L
Sbjct: 502 WREDIRVGDPMHTRKVCFDAVSH--------NSPVTLYDCHGMKG--NQLWRYRKDKR-L 550
Query: 235 EHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
H + C+ P ++ M CD + QQW F+
Sbjct: 551 YHPVSNSCIDSSPGERRVFMNTCDPASLSQQWLFE 585
>gi|351697576|gb|EHB00495.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Heterocephalus
glaber]
Length = 622
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 70/124 (56%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC++FSWF+ ++ +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKIAQEKSFGDISERLQLREQLHCRNFSWFLHNIYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLG +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGINQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|109096689|ref|XP_001083664.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
mulatta]
Length = 641
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|350583955|ref|XP_003126192.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Sus scrofa]
Length = 384
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 69/124 (55%), Gaps = 12/124 (9%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSWF+ ++ +
Sbjct: 190 QVRLAEVWMDSYKEIFYRRNLQAAKMAQEKSFGDISERLQLRERLHCHNFSWFLDNIYPE 249
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
++ P+L P F+G KNLG CLD S P ++ T CHG G +Q F +
Sbjct: 250 MFV--PDLKPT-FYGAIKNLGMDHCLDVGESNNGGKPLIMYT--CHGLGGNQYFEYTTQR 304
Query: 166 QLGH 169
L H
Sbjct: 305 DLRH 308
>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
Length = 650
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 85/195 (43%), Gaps = 44/195 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
D GD+S++ LR+ L CK F W++ ++ +PEL ++ GE +NLG + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD+ + +GT CH G +Q + L+ G++ + C+D
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 580
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G + LF C G + W + +T+QL H +CL++ ++
Sbjct: 581 -------------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDK 625
Query: 252 LAMLRCDENNSYQQW 266
L M C + S QQW
Sbjct: 626 LLMEECSASLSRQQW 640
>gi|355564239|gb|EHH20739.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
gi|355762987|gb|EHH62101.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
fascicularis]
gi|380809242|gb|AFE76496.1| polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
Length = 622
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
Length = 650
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 85/195 (43%), Gaps = 44/195 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
D GD+S++ LR+ L CK F W++ ++ +PEL ++ GE +NLG + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD+ + +GT CH G +Q + L+ G++ + C+D
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 580
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G + LF C G + W + +T+QL H +CL++ ++
Sbjct: 581 -------------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDK 625
Query: 252 LAMLRCDENNSYQQW 266
L M C + S QQW
Sbjct: 626 LLMEECSASLSRQQW 640
>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
Length = 585
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 99/227 (43%), Gaps = 38/227 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + Q +P D GD+S Q LR++L CK F WF++ VA D+ ++P
Sbjct: 371 RVAEVWMDEYSQFLYERNPQFAKTDPGDLSAQRELRERLQCKPFKWFLEVVAPDLLVRYP 430
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
P F G +++ + CLDS+ A IG C + + +Q F L+ + +
Sbjct: 431 PRDPQPFASGRVQSVANPRLCLDSLNHQAKEPIGLYACAFNKTHPQNNQFFTLSYHRDIR 490
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+ ++C+DA K L + LF+C S G + WR
Sbjct: 491 VRSNDKCLDAAK--------------------------LNDEIVLFSCHESQG--NQMWR 522
Query: 227 FHNSTRQLEHAILK--RCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ ++ + H RC+ TN+L + CD++ + Q+W + V
Sbjct: 523 YDYESKMIIHGKDHHGRCMEADLHTNKLFVRSCDKDKASQKWNWGYV 569
>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
Length = 650
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/195 (26%), Positives = 85/195 (43%), Gaps = 44/195 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
D GD+S++ LR+ L CK F W++ ++ +PEL ++ GE +NLG + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD+ + +GT CH G +Q + L+ G++ + C+D
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 580
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G + LF C G + W + +T+QL H +CL++ ++
Sbjct: 581 -------------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDK 625
Query: 252 LAMLRCDENNSYQQW 266
L M C + S QQW
Sbjct: 626 LLMEECSASLSRQQW 640
>gi|402886019|ref|XP_003906439.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Papio anubis]
gi|402886021|ref|XP_003906440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Papio anubis]
Length = 622
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHSFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
Length = 647
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 84/195 (43%), Gaps = 41/195 (21%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
D GD+S++ LR+ L CK F W++ ++ +PEL ++ GE N+ CLD
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIANVPNGMCLD 535
Query: 136 SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
+ ++ + Y CHG G +Q + L+ G++ + C+D
Sbjct: 536 AKEKSEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDY------------------ 577
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
G + LF C G + W + +T+QL H +CL++ ++L M
Sbjct: 578 ----------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLLM 625
Query: 255 LRCDENNSYQQWRFK 269
C + S QQW +
Sbjct: 626 EECSASLSRQQWTLE 640
>gi|350409232|ref|XP_003488663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus impatiens]
Length = 571
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 94/215 (43%), Gaps = 34/215 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ KLHCK FSW++K+V ++
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTS 444
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E P G + GT CLDSMG +G CH +G +Q + L +G + H + C
Sbjct: 445 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 498
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
L LPV + + GT L + C G+ + WR H +
Sbjct: 499 -----------LTLPV-----------YAK-GTTLLMQIC---DGSENQKWR-HLEGGLI 531
Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
H+ + C+ S + + +CD N Q+W
Sbjct: 532 RHSRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 566
>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Meleagris gallopavo]
Length = 562
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 95/222 (42%), Gaps = 39/222 (17%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR+ L+CK F WFM VA+D+ +P + PP
Sbjct: 352 WMDEYAEYIYQRRPEYRHLSAGDVTAQKELRNNLNCKSFKWFMSEVAWDLPKFYPPVEPP 411
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
WGE +N+GT C+D+ HGS S L RL E CV D+
Sbjct: 412 AAAWGEIRNVGTGLCVDTK-------------HGSLGSPL-RL---------ESCV-KDR 447
Query: 179 QEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDGPWRF 227
E +++ TF ++ FC + + L++C G + WR+
Sbjct: 448 GEAAWNNVQVTXTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQLWRY 505
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ ++ M C+ ++ QQW F+
Sbjct: 506 RKD-KTLYHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 546
>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 10/120 (8%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K SP D GDIS+++ LR L CK F W+++++ +PE P
Sbjct: 312 WLDEYKNFYYRISPSVAKTDPGDISDRLNLRKSLSCKSFKWYLENI-------YPESSWP 364
Query: 118 PNL-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
N F GE +N CLD+M + A +G CHG G +Q++ +L H + C+D
Sbjct: 365 VNYQFMGEVRNTEAHVCLDTMMKEAGNKVGLYGCHGQGGNQIWAFTKNNELRHDDLCLDV 424
>gi|195454523|ref|XP_002074278.1| GK18434 [Drosophila willistoni]
gi|194170363|gb|EDW85264.1| GK18434 [Drosophila willistoni]
Length = 646
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 99/229 (43%), Gaps = 38/229 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMF-LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K+ P +D GD+S Q+A+R++L CK F W+MK +A D K+
Sbjct: 432 RVAEVWMDKYKEYVYTRDPETYEKIDAGDLSRQLAIRERLQCKSFDWYMKEIAPDFLIKY 491
Query: 114 PEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNA--EG 165
P + PP G +++ CLDS+G+T+ +G C + ++Q F L + E
Sbjct: 492 PPIEPPKFASGAVQSVAYPLFCLDSLGQTSNNAVGLYSCAANLTFPQANQNFGLTSFREM 551
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+LG C+D + ++NC N G + W
Sbjct: 552 RLGEEGDCLDVQDNHP------------------------NATIWMWNCHNQGG--NQFW 585
Query: 226 RFHNSTRQLEHAIL-KRCLSVHPETNQLAMLR--CDENNSYQQWRFKEV 271
+ + + + H RC+ + ++ ++ C ENN Q+W+F V
Sbjct: 586 YYDRNHQWIVHGKNGHRCMEAFVDDSERSVYTNDCQENNPRQRWKFANV 634
>gi|161077160|ref|NP_001097343.1| CG30463, isoform E [Drosophila melanogaster]
gi|157400368|gb|ABV53824.1| CG30463, isoform E [Drosophila melanogaster]
Length = 264
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/192 (25%), Positives = 83/192 (43%), Gaps = 41/192 (21%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCLD 135
D GD+S++ LR+ L CK F W++ ++ +PEL ++ GE N+ CLD
Sbjct: 100 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIANVPNGMCLD 152
Query: 136 SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
+ ++ + Y CHG G +Q + L+ G++ + C+D
Sbjct: 153 AKEKSEEETPVSIYECHGQGGNQYWMLSKAGEIRRDDSCLDY------------------ 194
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
G + LF C G + W + +T+QL H +CL++ ++L M
Sbjct: 195 ----------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLLM 242
Query: 255 LRCDENNSYQQW 266
C + S QQW
Sbjct: 243 EECSASLSRQQW 254
>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
Length = 647
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/198 (26%), Positives = 86/198 (43%), Gaps = 44/198 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNL--GTQKC 133
D GD+SE+ LR+ L CK F W++ ++ FPEL + GE +N+ G + C
Sbjct: 480 DYGDVSERKQLRENLGCKPFRWYLDNI-------FPELFIPGEAVASGEVRNMGYGNRTC 532
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD+ G + +G CH G +Q + L+ G++ E C+D
Sbjct: 533 LDAPGGKKNLRKPVGLYPCHNQGGNQYWMLSKTGEIRRDEACLD---------------- 576
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
AG +I L+ C S G + W + + L H +CL+++ +
Sbjct: 577 ---YAGQDVI---------LYPCHGSKG--NQYWNYSPDSHLLRHGSSDKCLAINESKQK 622
Query: 252 LAMLRCDENNSYQQWRFK 269
L M CD Q+W+F+
Sbjct: 623 LIMADCDSTVDAQKWQFQ 640
>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
Length = 600
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/218 (23%), Positives = 85/218 (38%), Gaps = 18/218 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ + + P L GD++ Q LR L CKDF W+M VA+D+ +P
Sbjct: 386 RVAETWMDEYTEYIYQRRPEYRHLSTGDLTAQKELRKHLKCKDFKWYMNTVAWDLPKYYP 445
Query: 115 ELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ P WGE +N + C+DS + + C G+ + + G E
Sbjct: 446 PVEPLPAAWGEIRNAASGLCVDSKHGSTGTELRLDNCLKEGAERTWAHEQIFTFGWREDI 505
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
D FC A + + L++C G +R +
Sbjct: 506 RPGDPLHTR--------KFCFDA------ISQNSPITLYDCHGMKGNQHWSYR---KDKS 548
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ P ++ M +CD + QQW F++V
Sbjct: 549 LYHLVSSGCMDCSPNDKRIFMNKCDPKSETQQWIFQKV 586
>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
Length = 622
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 69/124 (55%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE+ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERPQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|402585386|gb|EJW79326.1| polypeptide N-acetylgalactosaminyltransferase 10 [Wuchereria
bancrofti]
Length = 288
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/251 (23%), Positives = 98/251 (39%), Gaps = 35/251 (13%)
Query: 30 QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
++GH RC + I S R W+ + + P + +D GD+SEQ
Sbjct: 51 RVGHIYRCKYIPFSNPGIGDFISRNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSEQK 110
Query: 88 ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
A+R +L CK F WFMK +A+D +P + P + GE +N+ KC+D+ + A G
Sbjct: 111 AVRKRLKCKSFDWFMKEIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTDFKNAEKKFG 170
Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKL 200
C + N H R C D V + K P+
Sbjct: 171 LRKCVSDDPANHGEQNLRLTFWHDVRLQTRTMCFDVS-----VSVNKAPIV--------- 216
Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
LF+C G + +++ +Q+ H I C+ E ++ M C+
Sbjct: 217 ----------LFSCHGMQG--NQHFKYLPDKQQMYHPISDLCMDCDVEQGEIFMNPCNSE 264
Query: 261 NSYQQWRFKEV 271
Q+W + ++
Sbjct: 265 RRSQKWTWTKI 275
>gi|324520233|gb|ADY47590.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 267
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/221 (24%), Positives = 97/221 (43%), Gaps = 41/221 (18%)
Query: 54 LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+R WL +K+I + L + GD+SE+ LR++L C F W++ +V D++
Sbjct: 76 VRLAEVWLDEYKEIYYERINHKLGEY---GDVSERKRLRERLKCHSFKWYLDNVFPDLFI 132
Query: 112 KFPELPPNLFWGEAKNLGTQK-CLD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLG 168
+ GE +N G K C+D +GR Y CH G +Q + L+ EG++
Sbjct: 133 PSKAIGK----GEIRNRGNPKFCVDHEVGRNVVNDAVIPYPCHLMGGNQFWLLSKEGEIR 188
Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
E C+D + +V + C S G + W ++
Sbjct: 189 RDEYCIDYPGRGSVV---------------------------TYECHGSKG--NQLWEYN 219
Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ + ++ H I ++CL++ + +L M CD+ N Q W F+
Sbjct: 220 HESGRIYHPISRKCLALSDDDKKLIMSACDKANDRQSWAFQ 260
>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
Length = 644
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/198 (25%), Positives = 89/198 (44%), Gaps = 44/198 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGT--QKC 133
D GD+S++ LR+ L CK F W++ ++ +PEL P + + GE KNLG + C
Sbjct: 477 DWGDVSDRKKLREDLQCKSFRWYLDNI-------YPELFIPGDAVAHGEIKNLGYGGRTC 529
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
+D+ + +GT CH G +Q + L+ G++ + C+D
Sbjct: 530 MDAPAGKKHLKKSVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY--------------- 574
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
G + L+ C S G + W + +T+QL H +CL++ ++
Sbjct: 575 -------------AGKDVTLYACHGSKG--NQFWTYRENTKQLHHGTSGKCLAISEAKDK 619
Query: 252 LAMLRCDENNSYQQWRFK 269
L M C+ + + QQW +
Sbjct: 620 LLMEECNSSLARQQWMLE 637
>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
Length = 658
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 91/195 (46%), Gaps = 44/195 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGT--QKC 133
D GD+S++ LR+ L CK F W++ ++ +PEL P + + GE +NLG + C
Sbjct: 491 DWGDVSDRKKLREDLQCKSFKWYLDNI-------YPELFIPGDAVAHGEIRNLGYGGRTC 543
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LDS + +G CH G +Q + L+ G++ + C+D
Sbjct: 544 LDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDDYCLD---------------- 587
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
AG ++I L++C G + W + +T+QL H +CLS+ + +Q
Sbjct: 588 ---YAGKEVI---------LYSCHGGKG--NQFWTYRENTKQLHHGTSGKCLSISEKKDQ 633
Query: 252 LAMLRCDENNSYQQW 266
L M C ++ + Q+W
Sbjct: 634 LLMEECSQSLTRQEW 648
>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 573
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 69/126 (54%), Gaps = 11/126 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R WL + + + +P A+ +D GD++++ LR +L+CK F W+++H+ +
Sbjct: 378 VRTALVWLDQYSRFYFMLNPSALSVDYGDVTKRKKLRQQLNCKSFRWYLEHI-------Y 430
Query: 114 PE--LPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
PE +P ++ GE ++ Q CLDS+G +G ++CHG G +Q+F + G +
Sbjct: 431 PESSIPIDVIRLGEIRHKSGQ-CLDSLGHKLGETVGVTHCHGQGGNQVFAITESGTIRVH 489
Query: 171 ERCVDA 176
C+D
Sbjct: 490 AGCMDG 495
>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Canis lupus familiaris]
Length = 622
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 102/224 (45%), Gaps = 38/224 (16%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC +FSWF+ ++ +
Sbjct: 428 QVRLAEVWMDNYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCHNFSWFLHNIYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P L+ G +NLG +CLD +G P ++ T CHG G +Q F +
Sbjct: 488 MFV--PDLKPTLY-GAIRNLGINQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 541
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
L H + KQ C+ A LG F NS D
Sbjct: 542 RDLRH-----NISKQ------------LCLHASAGT----LGLRSCHFTGKNSQVPKDEE 580
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W F + + + CL+ E + AM C+ + +Q W F
Sbjct: 581 WEF-TQDQLIRNRASGTCLTS--EDKKPAMAACNPRDPHQHWVF 621
>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
Length = 622
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/124 (35%), Positives = 69/124 (55%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + + GDISE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDSYKKIFYRRNLQAAKMTQEKSFGDISERLQLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLTPT-FYGAIKNLGTNQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQRD 543
Query: 167 LGHG 170
L H
Sbjct: 544 LRHN 547
>gi|326670821|ref|XP_003199296.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Danio rerio]
Length = 435
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 100/236 (42%), Gaps = 45/236 (19%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + +A GD+S +V LR++L CK FSW++
Sbjct: 234 QVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKEHSFGDVSRRVDLRERLQCKSFSWYL 293
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLF 159
K+V +V+ P+L P L +G +N+G + CLD S P ++ CHG G +Q F
Sbjct: 294 KNVYPEVF--MPDLNP-LQFGAIRNMGKEACLDVGESNEGGKPLIMYP--CHGMGGNQYF 348
Query: 160 RLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC-F 215
+ ++ H E C+D G ++ L C +
Sbjct: 349 EYSTHHEIRHNIQKELCLDGTD---------------------------GAMVRLEECQY 381
Query: 216 NSSGTVDGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
T+ P W A K+CLS PE L + C+ ++ YQ W F
Sbjct: 382 KGHNTITSPQQKWELREDQLFYNKAS-KQCLSARPENPSL--VPCNPDDKYQLWLF 434
>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
Length = 658
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/195 (27%), Positives = 91/195 (46%), Gaps = 44/195 (22%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNLGT--QKC 133
D GD+S++ LR+ L CK F W++ ++ +PEL P + + GE +NLG + C
Sbjct: 491 DWGDVSDRKKLREDLQCKSFKWYLDNI-------YPELFIPGDAVAHGEIRNLGYGGRTC 543
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LDS + +G CH G +Q + L+ G++ + C+D
Sbjct: 544 LDSPTGKKHQKKAVGLYPCHRQGGNQYWMLSKVGEIRRDDYCLD---------------- 587
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ 251
AG ++I L++C G + W + +T+QL H +CLS+ + +Q
Sbjct: 588 ---YAGKEVI---------LYSCHGGKG--NQFWTYRENTKQLHHGTSGKCLSISEKKDQ 633
Query: 252 LAMLRCDENNSYQQW 266
L M C ++ + Q+W
Sbjct: 634 LLMEECSQSLTRQEW 648
>gi|307183874|gb|EFN70488.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Camponotus
floridanus]
Length = 451
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 91/215 (42%), Gaps = 34/215 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ +LHCK FSW++K+V ++
Sbjct: 265 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTS 324
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E P G + GT CLDSMG +G CH +G +Q + L +G + H + C
Sbjct: 325 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHNTGGNQEWGLTKDGLIKHHDLC- 378
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
L LPV GT L + C G+ + WR H +
Sbjct: 379 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HLEGGLI 411
Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
H + C+ S + + +CD N Q+W
Sbjct: 412 RHTRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 446
>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Saccoglossus kowalevskii]
Length = 524
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 100/220 (45%), Gaps = 42/220 (19%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K I P D GD++E+ LRD+L C DF W+++++ +
Sbjct: 325 MRVAEVWMDEYKDIFYALKPQLKGEDYGDVTERKELRDRLQCHDFKWYLQNI-------Y 377
Query: 114 PELP-PNL---FWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG 168
PELP P+L GE +NLG C+D+MG A + CHG G +Q+F + + ++
Sbjct: 378 PELPIPDLKVQARGELRNLGKIGYCMDTMGANA---MCAHPCHGIGHNQMFSFSWQQKIW 434
Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
GE C+ + ++ A ++ IF N NS + W+ H
Sbjct: 435 FGELCLTSSPYQR--------------ASIQQIFA---------NQCNSPRVIH--WQ-H 468
Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
++ +CL + + L + C EN++ Q W++
Sbjct: 469 TVNGKMMDTKTGKCLDIGTDQKFLILKTC-ENSATQNWKW 507
>gi|307214182|gb|EFN89299.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Harpegnathos
saltator]
Length = 442
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 92/215 (42%), Gaps = 34/215 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ +LHCK FSW++K+V ++
Sbjct: 256 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTS 315
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E P G + GT CLDSMG +G CH +G +Q + L +G + H + C
Sbjct: 316 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 369
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
L LPV GT L + C G+ + WR H +
Sbjct: 370 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HLEGGLI 402
Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
H+ + C+ S + + +CD N Q+W
Sbjct: 403 RHSRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 437
>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
Length = 499
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/192 (25%), Positives = 87/192 (45%), Gaps = 33/192 (17%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN--LFWGEAKNLGTQKCLD 135
+D+GDIS++VALR +L CK F W++ ++ D+ K LPP L+ + +N + CLD
Sbjct: 338 VDIGDISDRVALRKRLGCKSFKWYLDNIYPDMTNK---LPPKSYLYSHQIRNKESSLCLD 394
Query: 136 SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ-LGHGERCVDADKQEKLVEMLKLPVTFCV 194
++G +G CHG G +Q F L + L + ++C+D+ +
Sbjct: 395 TLGEKNIKRVGLYTCHGMGGNQFFTLTKSNEILFNDDKCLDSPNGDP------------- 441
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAM 254
G+ + + C G + W+ + + H + CL + + + +
Sbjct: 442 -----------GSYVEMITCHGLKGNQE--WKHNKRMGTIVHVWTENCLDISQDGQFIIV 488
Query: 255 LRCDENNSYQQW 266
CD +S Q+W
Sbjct: 489 NPCD-GSSTQRW 499
>gi|167523942|ref|XP_001746307.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775069|gb|EDQ88694.1| predicted protein [Monosiga brevicollis MX1]
Length = 2376
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 10/124 (8%)
Query: 54 LRKRSSWLKVFKQI-RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
LR W+ + + +S A +D+GDISE+ LR+ LHCK F W++ +V
Sbjct: 1275 LRLAEVWMDEYAEFFKSRKGSAARKIDIGDISERQKLREDLHCKPFKWYLDNV------- 1327
Query: 113 FPEL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
+PEL P GE + CLDS G++ + CHG G +QL+ L+ G+L H
Sbjct: 1328 YPELRVPDPNPVGEGQVQSGGFCLDSAGKSVGHAVALYRCHGLGGNQLWTLSHNGELAHE 1387
Query: 171 ERCV 174
+ CV
Sbjct: 1388 DACV 1391
>gi|432112638|gb|ELK35354.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Myotis davidii]
Length = 416
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/122 (33%), Positives = 68/122 (55%), Gaps = 8/122 (6%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC++FSWF+ ++ +
Sbjct: 222 QVRLAEVWMDSYKEIFYRRNMEAAKMAQEKTFGDISERLQLREQLHCRNFSWFLHNIYPE 281
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQL 167
++ P+L P F+G KNLG +CLD + Y CHG G +Q F + L
Sbjct: 282 LF--IPDLKPT-FYGAIKNLGINQCLDVGEKNHGGKPLIMYACHGLGGNQYFEYTTQRDL 338
Query: 168 GH 169
H
Sbjct: 339 RH 340
>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
Length = 614
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/126 (35%), Positives = 70/126 (55%), Gaps = 14/126 (11%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +KQI R+L + +A GDISE++ LR+ LHC++FSWF+ +V +
Sbjct: 420 QVRLAEVWMDSYKQIFYRRNLQAAKMAQEKSFGDISERLKLRELLHCRNFSWFLHNVYPE 479
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F+G KNLG +CLD +G P ++ CHG G +Q F +
Sbjct: 480 MFV--PDLKPT-FYGAIKNLGINQCLD-VGENNHGGKPLIMYA--CHGLGGNQYFEYTTQ 533
Query: 165 GQLGHG 170
L H
Sbjct: 534 RDLRHN 539
>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
catus]
Length = 622
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 38/224 (16%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ L+++LHC++FSWF+ ++ +
Sbjct: 428 QVRLAEVWMDSYKEIFYRRNLQAAKMAQEKSFGDISERLQLKERLHCRNFSWFLHNIYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F+G +NLG +CLD +G P ++ T CHG G +Q F +
Sbjct: 488 MFV--PDLKPT-FYGAIRNLGVDQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 541
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
L H + KQ C+ A LG F NS D
Sbjct: 542 RDLRH-----NIAKQ------------LCLHASAGT----LGLRSCHFTGQNSQVPKDEE 580
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + ++ CL+ E + AM C+ ++ +Q W F
Sbjct: 581 WEL-TQDQLIRNSGSGTCLTS--ENKKPAMASCNPSDPHQHWLF 621
>gi|383847543|ref|XP_003699412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Megachile rotundata]
Length = 571
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 94/217 (43%), Gaps = 38/217 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ KLHCK FSW++K+V +P
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNV-------YP 437
Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P G +L CLDSMG +G CH +G +Q + L +G + H +
Sbjct: 438 ELVIPTSEGGPGGSLKQGPACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDL 497
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C L LPV + + GT L + C G+ + WR H
Sbjct: 498 C------------LTLPV-----------YAK-GTTLLMQIC---DGSENQKWR-HLEGG 529
Query: 233 QLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
+ H+ + C+ S + + +CD N Q+W
Sbjct: 530 LIRHSRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 566
>gi|322785490|gb|EFZ12159.1| hypothetical protein SINV_06585 [Solenopsis invicta]
Length = 466
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 91/215 (42%), Gaps = 34/215 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ +LHCK FSW++K+V ++
Sbjct: 280 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTS 339
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E P G + GT CLDSMG +G CH +G +Q + L +G + H + C
Sbjct: 340 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLC- 393
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
L LPV GT L + C G+ + WR H +
Sbjct: 394 -----------LTLPVY------------AKGTTLLMQIC---DGSENQKWR-HLEGGLI 426
Query: 235 EHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
H + C+ S + + +CD N Q+W
Sbjct: 427 RHTRIPVCVDSRYHAQRGITAEKCDSNAETQRWHL 461
>gi|449679600|ref|XP_004209371.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Hydra magnipapillata]
Length = 565
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 100/225 (44%), Gaps = 38/225 (16%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD-VYTKFPELPP 118
W+ FK+ G+P+ + G+ISE++ +R++ CK F W++ +VA D V T++ P
Sbjct: 358 WMDEFKEFLYRGNPMVRSQNAGNISERIKVRERNKCKSFKWYLLNVANDTVRTRY---EP 414
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL-GHGERCVDAD 177
+ GE +N T+ CLD+ G A I S C S+Q+FR +L + E C+DA
Sbjct: 415 DRASGEIENTHTKLCLDTYGANAGRKIKLSKCGQRNSNQIFRWTYIYELHQYPEECLDA- 473
Query: 178 KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
+ +M + + C G F + T Q++H+
Sbjct: 474 ---RYADMDNVYIEKCHEMGGNQKFL-----------------------YDKDTNQIKHS 507
Query: 238 ILKRCLSVHPETN--QLAMLRCDENNSYQQWRFKEVR----PDWS 276
CLSV E + + CD + Q W+ + + P+W+
Sbjct: 508 GTGLCLSVPNEDSPTHPVIEACDLKSVRQLWKIQLSKDSTIPEWA 552
>gi|339249613|ref|XP_003373794.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
spiralis]
gi|316970007|gb|EFV54023.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
spiralis]
Length = 587
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/223 (23%), Positives = 93/223 (41%), Gaps = 30/223 (13%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ + P LD GDIS+Q +R +L CK F WFM+ +A+D ++P
Sbjct: 386 RVAEVWMDEYKEYVYMRRPHYRKLDPGDISKQKEIRKRLGCKPFKWFMQVIAFDQSVRYP 445
Query: 115 ELP-PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCH-----GSGSSQLFRLNAEGQLG 168
+ ++ GE +++ + C+ V+G + C + Q F+ ++
Sbjct: 446 PVEMVDVREGEIRSIQSGMCVTVQYVVEHGVVGMADCKKGHDDAAVGEQFFKYTTRKEIK 505
Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
R + D E V K PV L+ C + G + WR++
Sbjct: 506 FRNRRLCFDVPENKV---KAPVI-------------------LYPCHDMQG--NQAWRYN 541
Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ T Q+ H + CL ++ M C+ N Q+W F +
Sbjct: 542 SKTMQIVHPVTNMCLDADFSRREVFMQSCNLNLLSQRWSFGAI 584
>gi|195471053|ref|XP_002087820.1| GE14879 [Drosophila yakuba]
gi|194173921|gb|EDW87532.1| GE14879 [Drosophila yakuba]
Length = 634
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 450 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 509
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + G++ H + C+
Sbjct: 510 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 563
>gi|194855488|ref|XP_001968556.1| GG24441 [Drosophila erecta]
gi|190660423|gb|EDV57615.1| GG24441 [Drosophila erecta]
Length = 631
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 447 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 506
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + G++ H + C+
Sbjct: 507 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 560
>gi|62484229|ref|NP_608773.2| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|320594323|ref|NP_995625.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
gi|195576320|ref|XP_002078024.1| GD22759 [Drosophila simulans]
gi|51315875|sp|Q6WV19.2|GALT2_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
Short=pp-GaNTase 2; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 2; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2
gi|61678274|gb|AAF51113.3| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|194190033|gb|EDX03609.1| GD22759 [Drosophila simulans]
gi|318068299|gb|AAS64620.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
Length = 633
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 449 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 508
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + G++ H + C+
Sbjct: 509 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 562
>gi|34042922|gb|AAQ56700.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 615
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 431 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 490
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + G++ H + C+
Sbjct: 491 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 544
>gi|195342262|ref|XP_002037720.1| GM18147 [Drosophila sechellia]
gi|194132570|gb|EDW54138.1| GM18147 [Drosophila sechellia]
Length = 606
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 422 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 481
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + G++ H + C+
Sbjct: 482 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 535
>gi|33589464|gb|AAQ22499.1| RE02655p [Drosophila melanogaster]
Length = 633
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 63/120 (52%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 449 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 508
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + G++ H + C+
Sbjct: 509 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAFTKRGEIKHDDLCL 562
>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 749
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/114 (35%), Positives = 62/114 (54%), Gaps = 6/114 (5%)
Query: 73 PLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK 132
P M + GD+S+++ LR++L C DF W++ +V + K P+ G+ +N T
Sbjct: 579 PQLMGQEYGDVSDRIKLREELKCHDFQWYLDNVYPAL--KVPDTKVRAR-GDVRNAATSM 635
Query: 133 CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEML 186
CLDSMG+ V+G CHG G++Q F L + QL H +C+ K + ML
Sbjct: 636 CLDSMGK---GVLGMFPCHGEGNNQAFTLTWDDQLKHKNKCLPKIKTPGQLRML 686
>gi|405975887|gb|EKC40420.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 653
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 65/140 (46%), Gaps = 9/140 (6%)
Query: 54 LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+R W+ +K I L L D GD+SE+ LR++L CK F W++K + D+
Sbjct: 491 VRTAEVWMDEYKHIYYERLNYDLG---DYGDVSERKDLRNRLGCKSFGWYLKTMLPDM-- 545
Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
K PE L+ GE +N+ CLD+MG TA CH G +Q FR G +
Sbjct: 546 KLPETA--LYSGEVRNMEKGMCLDTMGTTAGNKFQAIPCHHQGGNQFFRFTVNGHIERDS 603
Query: 172 RCVDADKQEKLVEMLKLPVT 191
C+ L + K VT
Sbjct: 604 ACLSDQDGSLLYVLCKRNVT 623
>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Oryctolagus cuniculus]
Length = 622
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/123 (36%), Positives = 69/123 (56%), Gaps = 10/123 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++LHC +FSWF+ +V +
Sbjct: 428 QVRLAEVWMDNYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHCHNFSWFLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLG +CLD R +I S CHG G +Q F +
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGLGQCLDVGENNRGGKPLIMYS-CHGLGGNQYFEYTTQKD 543
Query: 167 LGH 169
L H
Sbjct: 544 LRH 546
>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
Length = 650
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 50/193 (25%), Positives = 84/193 (43%), Gaps = 44/193 (22%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKCLD 135
GD+S++ LR+ L CK F W++ ++ +PEL ++ GE +NLG + CLD
Sbjct: 485 GDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTCLD 537
Query: 136 SMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
+ + +GT CH G +Q + L+ G++ + C+D
Sbjct: 538 APAGKKHQKKAVGTYPCHRQGGNQYWMLSKAGEIRRDDSCLDY----------------- 580
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA 253
G + LF C G + W + +T+QL H +CL++ ++L
Sbjct: 581 -----------AGKDVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLL 627
Query: 254 MLRCDENNSYQQW 266
M C + S QQW
Sbjct: 628 MEECSASLSRQQW 640
>gi|431918071|gb|ELK17299.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Pteropus alecto]
Length = 582
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 91/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ F + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 376 WMDEFAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 435
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ +++ C G G + Q+F + G
Sbjct: 436 AAAWGEIRNVGTGLCTDTKHGAVGSLLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 493
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ R
Sbjct: 494 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-R 532
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M CD ++ QQW F+
Sbjct: 533 TLYHPVSSSCMDCSESDHRIFMNACDPSSPTQQWLFE 569
>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oreochromis niloticus]
Length = 600
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 90/213 (42%), Gaps = 22/213 (10%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR +L+CK F WFM VA+D+ +P + PP
Sbjct: 392 WMDEYAEYVYQRRPEYRHLSAGDMTAQKELRTRLNCKSFKWFMNEVAWDLPKHYPPVEPP 451
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLG--HGERCVDA 176
WGE +N+G+ C++ + + I C + G++G HG+
Sbjct: 452 AAAWGEIQNVGSGMCMEVKHFVSGSPIRLENC----------VKGRGEVGWSHGQVLTFG 501
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
+++ V C A + + L++C G + WR+ + L H
Sbjct: 502 WREDIRVGDPMHTRKLCFDA------VSHSSPVTLYDCHGMKG--NQLWRYRQD-KSLYH 552
Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ C+ P ++ M CD + QQW F+
Sbjct: 553 PVSNSCIDSSPIERRVFMNTCDPASLSQQWLFE 585
>gi|195386226|ref|XP_002051805.1| GJ10330 [Drosophila virilis]
gi|194148262|gb|EDW63960.1| GJ10330 [Drosophila virilis]
Length = 631
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 447 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPEP 506
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T +CLD+MG +G CH +G +Q + + G++ H + C+
Sbjct: 507 QEV-----GQFRQ-DTTECLDTMGHVIDGTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCL 560
>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
Length = 684
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 57/213 (26%), Positives = 97/213 (45%), Gaps = 40/213 (18%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
D GD+S++ LR+ L CK F W++ ++ +PEL ++ GE +NLG + C
Sbjct: 483 DWGDVSDRRKLRNDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 535
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD+ + +GT CH G +Q+ + HG C+DA +EK E + +
Sbjct: 536 LDAPAGKKHQKKAVGTYPCHRQGGNQI------ANMQHG-MCLDA--KEKSEEETPVSIY 586
Query: 192 FCVLAGVK--LIFCRLGTV-------------LNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
C G + + G + + LF C G + W + +T+QL H
Sbjct: 587 ECHGQGGNQYWMLSKAGEIRRDDSCLDYAGKDVTLFGCHGGKG--NQFWTYRENTKQLHH 644
Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+CL++ ++L M C+++ S QQW +
Sbjct: 645 GTSGKCLAISESKDKLLMEECNQSLSRQQWTLE 677
>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
caballus]
Length = 622
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 101/224 (45%), Gaps = 38/224 (16%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC +FSWF++++ +
Sbjct: 428 QVRLAEVWMDGYKEIFYRRNMQAAKMAQEKSFGDISERLQLRERLHCHNFSWFLQNIYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F+G KNLG CLD +G P ++ T CHG G +Q F +
Sbjct: 488 MFV--PDLKPT-FYGAIKNLGIDHCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTQ 541
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
L H + C+ A LG F NS D
Sbjct: 542 RDLRHN-----------------IAKQLCLHASAGT----LGLRSCHFTGKNSQVPKDEE 580
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + ++ CL+ E + AM C+ ++ +Q W F
Sbjct: 581 WEL-TQDQLIRNSGSGTCLTS--EDKKPAMASCNPSDPHQHWLF 621
>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
Length = 693
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 92/207 (44%), Gaps = 35/207 (16%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLGT--QKC 133
D GD+SE+ LR+ L CK F W++ ++ +PEL P + GE +NLG + C
Sbjct: 493 DYGDVSERKKLREDLQCKSFKWYLDNI-------YPELFIPGDAVANGEIRNLGYGGRTC 545
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV- 190
LDS R +G CH G +Q+ +N + CVDA K + P
Sbjct: 546 LDSPTGKRNMKKPVGLYPCHKQGGNQIKSINT-------DMCVDAPKTGDESPVGVYPCH 598
Query: 191 -----TFCVLAGVKLI------FCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL 239
+ +L+ I G + LF C S G + W +H +T+ L H
Sbjct: 599 GQGGHQYWMLSKAGEIRRDQSCLDYAGKDVILFGCHGSKG--NQFWTYHENTKLLHHGSS 656
Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQW 266
+CL+++ ++L M CD ++ Q W
Sbjct: 657 GKCLAINEGKDKLIMEECDASHLRQHW 683
>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
Length = 575
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 55/216 (25%), Positives = 90/216 (41%), Gaps = 31/216 (14%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A GDI E+V LR KL C+ F W++++V ++ K P
Sbjct: 387 RAVEVWMDDYKRYYYAAVPYAKNTPYGDIEERVELRRKLRCRPFKWYVQNVYPEL--KLP 444
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+GE K G Q C+D++G IG CHG+G +Q++ L L H C+
Sbjct: 445 SDESTKSFGEIKQ-GNQ-CVDTLGHMRGQTIGLFECHGAGGNQMWSLTKSSLLKHETMCL 502
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
+ + V+L+ C + W + +T +L
Sbjct: 503 GVNDGK-------------ATEPVQLLDCDENNSMQH-------------WEYEKATSRL 536
Query: 235 EHAILKRCLSVHP-ETNQLAMLRCDENNSYQQWRFK 269
H CLS +T+ L + +C+ + Q W F+
Sbjct: 537 RHKPTSLCLSSDKHKTSGLTLEQCNGSAFSQHWAFE 572
>gi|198474621|ref|XP_001356764.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
gi|198138471|gb|EAL33829.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
Length = 639
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 455 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 514
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + + G++ H + C+
Sbjct: 515 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCL 568
>gi|195148230|ref|XP_002015077.1| GL19517 [Drosophila persimilis]
gi|194107030|gb|EDW29073.1| GL19517 [Drosophila persimilis]
Length = 638
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 454 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 513
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + + G++ H + C+
Sbjct: 514 QEV-----GQFRQDSTE-CLDTMGHLIDGTVGIFPCHNTGGNQEWAYSKRGEIKHDDLCL 567
>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Ailuropoda melanoleuca]
Length = 622
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 14/126 (11%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC++FSWF+ ++ +
Sbjct: 428 QVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFLTNIYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F+G +NLG +CLD +G P ++ T CHG G +Q F
Sbjct: 488 MFV--PDLKPT-FYGAIRNLGINQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTR 541
Query: 165 GQLGHG 170
L H
Sbjct: 542 RDLRHN 547
>gi|395834931|ref|XP_003790440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Otolemur garnettii]
gi|395834933|ref|XP_003790441.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Otolemur garnettii]
Length = 622
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 68/126 (53%), Gaps = 14/126 (11%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K I + +A GDISE++ LR++LHC++FSWF+ +V +
Sbjct: 428 QVRLAEVWMDSYKMIFYRRNQQAAKMAQEKSFGDISERLQLRERLHCRNFSWFLNNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F+G KNLG +CLD +G P ++ + CHG G +Q F +
Sbjct: 488 MFV--PDLMPT-FYGAIKNLGINQCLD-VGENNHGEKPLIMYS--CHGLGGNQYFEYTTQ 541
Query: 165 GQLGHG 170
L H
Sbjct: 542 RDLRHN 547
>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
Length = 621
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 14/126 (11%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC++FSWF+ ++ +
Sbjct: 428 QVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQLHCRNFSWFLTNIYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F+G +NLG +CLD +G P ++ T CHG G +Q F
Sbjct: 488 MFV--PDLKPT-FYGAIRNLGINQCLD-VGENNHGGKPLIMYT--CHGLGGNQYFEYTTR 541
Query: 165 GQLGHG 170
L H
Sbjct: 542 RDLRHN 547
>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Takifugu rubripes]
Length = 552
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 93/213 (43%), Gaps = 22/213 (10%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR +L+CK F WFM VA+D+ +P + PP
Sbjct: 344 WMDEYAEYIYQRRPEYRHLAAGDMAVQKDLRSQLNCKSFKWFMTKVAWDLPKHYPPVEPP 403
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
WGE +N+ + CL++ + + + C + EG HG+ +
Sbjct: 404 AAAWGEIRNVASGMCLETKHFASGSPVRMESC--------LKGRGEGGWSHGQVFTFGWR 455
Query: 179 QEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
++ V M V F ++ + + L++C G + W + + L H
Sbjct: 456 EDIRVGDPMHTKKVCFDAVSN--------NSPVTLYDCHGMKG--NQFWHYRED-KSLYH 504
Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ C+ +P+ ++ M C+ ++ QQW F+
Sbjct: 505 PVSNSCVDSNPKEGRVFMNTCNPSSPSQQWAFE 537
>gi|157107416|ref|XP_001649767.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884053|gb|EAT48278.1| AAEL000654-PA [Aedes aegypti]
Length = 607
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 93/201 (46%), Gaps = 34/201 (16%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDS 136
D GD+S+Q+A+R+KL CK F WFM++VA+D+ K+P + PP+ G +++ C+D+
Sbjct: 416 DAGDLSKQLAIREKLQCKPFKWFMENVAFDLVEKYPPIEPPDFANGAIQSVANPALCVDT 475
Query: 137 MGRTAPAVIGTSYCHGSG----SSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLPV 190
+ +G C ++Q F+L + + ++ GE C D + ++L
Sbjct: 476 LNHGEKQTVGLFSCASDKVQPQANQYFQLSWHRDLRIKFGEMCWDVSESVPNAKIL---- 531
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN 250
L++C G + W + ++ L+ RCL +
Sbjct: 532 --------------------LYHCHGGQG--NQLWSYDVESQMLKQGKNNRCLDMDSSRR 569
Query: 251 QLAMLRCDENNSYQQWRFKEV 271
++ + C ++N Q+W++ V
Sbjct: 570 EVFVNPCSDDNPNQKWKWGYV 590
>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
Length = 612
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 78/162 (48%), Gaps = 13/162 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A ++ GD+SE+ LR+ L CK F W+++++ +P
Sbjct: 422 RTAEVWMDEYKAFFYKMVPAARNVEAGDVSERKKLRETLQCKSFKWYLENI-------YP 474
Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P P F G N T+KC+D+ G+ G CHG+G +Q + L +G++ +
Sbjct: 475 EAPLPADFRSLGAIVNRFTEKCVDTNGKKDGQAPGIQACHGAGGNQAWSLTGKGEIRSDD 534
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFC---RLGTVLN 210
C+ + ++ LKL VK +F + GT+L+
Sbjct: 535 LCLSSGHVYQIGSELKLERCSVSKINVKHVFVFDDQAGTLLH 576
>gi|34528626|dbj|BAC85542.1| unnamed protein product [Homo sapiens]
Length = 213
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 57/98 (58%), Gaps = 4/98 (4%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ FK + SP + +D GD+S + LR+ L CK FSW+++++ D ++ P +
Sbjct: 99 WMDEFKDFFYIISPGVVKVDYGDVSVRKTLRENLKCKPFSWYLENIYPD--SQIPRRYYS 156
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
L GE +N+ T +CLD+MGR +G CHG G +Q
Sbjct: 157 L--GEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQ 192
>gi|224613432|gb|ACN60295.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Salmo salar]
Length = 185
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 95/212 (44%), Gaps = 36/212 (16%)
Query: 63 VFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW 122
++ + S + +A GDISE+V LR++L CK+F+W++ + +V+ P+L P F
Sbjct: 3 IYYRRNSNAAKMAKEKGFGDISERVNLRERLQCKNFTWYLNTIYPEVFV--PDLTPTKF- 59
Query: 123 GEAKNLGTQKCLDSMGRT---APAVIGTSYCHGSGSSQLFRLNAEGQLGHG---ERCVDA 176
G KN G Q CLD + P ++ T CH G +Q F + +L H + C+ A
Sbjct: 60 GAIKNSGAQSCLDVGEKNEGGKPLIMYT--CHNMGGNQYFEYTSHKELRHNIGKQLCLQA 117
Query: 177 DKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
+ Q V+ + C L G LGT S W F L++
Sbjct: 118 NPQPDQVK-----IELCTLKG-------LGT----------SVAPQQEWIF-TEENLLKN 154
Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ ++CL + + Q+ M C + Q W F
Sbjct: 155 PLSRKCLLL--KGGQIVMDDCKAADLNQHWAF 184
>gi|326434666|gb|EGD80236.1| polypeptide N-acetylgalactosaminyltransferase 13 [Salpingoeca sp.
ATCC 50818]
Length = 641
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 55/218 (25%), Positives = 91/218 (41%), Gaps = 37/218 (16%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +KQ P +D+GD++++ ALR++L CK F W++KHV D++
Sbjct: 453 MRLAEVWMDDYKQFFYDTKPKRENIDIGDLTKRKALRERLKCKPFKWYLKHVLPDLFVPD 512
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ--LFRLNAEGQLGHGE 171
E ++ A G CLD MG A G CHG G +Q ++ +N E
Sbjct: 513 SE---HVLHKGALRAGNGLCLDKMGHRAGGQAGVFSCHGEGGNQGWMYTVNDE------- 562
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
++ + C + + + ++L C G + W++ N+
Sbjct: 563 --------------IRTADSLC----LDVYSSKFPAPIHLQRCHQKQG--NQAWKYENNV 602
Query: 232 RQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQW 266
H CLS N +L + C E +S QQW
Sbjct: 603 --FRHVASNGCLSSVVSGNGQHELKLDTCKEGSSGQQW 638
>gi|312065523|ref|XP_003135832.1| glycosyl transferase [Loa loa]
gi|307769015|gb|EFO28249.1| glycosyl transferase [Loa loa]
Length = 614
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 101/251 (40%), Gaps = 35/251 (13%)
Query: 30 QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
++GH RC V I S R W+ + + P + +D GD+S+Q
Sbjct: 378 RVGHIYRCKYVPFPDPGIGDFISKNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSKQK 437
Query: 88 ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
A+R +L CK F WFMK +A+D +P + P + GE +N+ KC+D+ + A G
Sbjct: 438 AVRKRLKCKSFDWFMKEIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTHFKNAEEKFG 497
Query: 147 TSYCHGSGSS----QLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKL 200
C G + Q RL + +L C D V + P+
Sbjct: 498 LRKCVSDGPANDGEQNLRLTFWRDVRLHTRTMCFDVS-----VSGNRAPIV--------- 543
Query: 201 IFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDEN 260
LF+C G + +++ +Q+ H + C+ E ++ M CD
Sbjct: 544 ----------LFSCHGMQG--NQHFKYLPDKQQMYHPVSDLCMDCDVERGEIFMNPCDSK 591
Query: 261 NSYQQWRFKEV 271
Q+W + ++
Sbjct: 592 RWSQKWTWAKI 602
>gi|148696679|gb|EDL28626.1| mCG6366 [Mus musculus]
Length = 431
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/245 (24%), Positives = 89/245 (36%), Gaps = 57/245 (23%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 202 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKRLKCKDFKWFMAAVAWDVPKYYP 261
Query: 115 ELPP----------------NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS--- 155
+ P ++F + +++ C+DS + C GS
Sbjct: 262 PVEPLPAAWGLSGIVTQPVLDVFLIQIRSVAANLCVDSKHGATGTELRLDVCVKDGSERT 321
Query: 156 ---SQLFRLNAEGQLGHGER------CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLG 206
QLF + GE C+DA PVT
Sbjct: 322 WSHEQLFTFGWREDIRPGEPLHTRKFCLDAISHSS-------PVT--------------- 359
Query: 207 TVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
L++C G +R R L H + C+ +P ++ M RCD + QQW
Sbjct: 360 ----LYDCHGMKGNQLWGYR---EDRTLFHPVSNSCMDCNPSEKKIFMARCDPLSETQQW 412
Query: 267 RFKEV 271
F+ +
Sbjct: 413 IFEHI 417
>gi|312370886|gb|EFR19191.1| hypothetical protein AND_22918 [Anopheles darlingi]
Length = 1204
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 85/200 (42%), Gaps = 47/200 (23%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNL--------FWGEAKNLGT 130
D GD++E+ LR LHCK F W+++ T FPEL P L F EA ++
Sbjct: 1029 DFGDVTERKQLRRALHCKSFRWYLE-------TVFPELAPALDKRPGHGRFENEALSMEG 1081
Query: 131 QKCLDSMGRTAPAVIGTSYCH-GSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLP 189
Q +++ + C GS + Q + N G+L + RC+D D
Sbjct: 1082 QPKHCLTAQSSAGLPTMEPCQAGSDARQHWLHNLFGELSNENRCLDYD------------ 1129
Query: 190 VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILK-RCLSVHPE 248
G+ L ++ C + G + WR++ +T Q EH +CL+V P
Sbjct: 1130 ----------------GSALRVYACHKARGNQE--WRYNGTTYQFEHCKHSAKCLAVEPT 1171
Query: 249 TNQLAMLRCDENNSYQQWRF 268
T +L + CD Q+W +
Sbjct: 1172 TKRLRIDGCDAAQESQRWHY 1191
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 46/173 (26%), Positives = 71/173 (41%), Gaps = 53/173 (30%)
Query: 80 MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPPN--------LFWGEAKNLGT 130
G+++E+ LR +L CK F W+++ T FPE P+ F +AK T
Sbjct: 407 FGNLTERHELRQRLACKPFRWYLE-------TVFPEQFDPSKAVARGEIRFADDAK--AT 457
Query: 131 QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPV 190
CLD P+++ CHG G QL+ L A+G++ + C+D D
Sbjct: 458 PLCLD-----WPSLLSLVTCHGYGGHQLWYLTAKGEVTREDHCLDYD------------- 499
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCL 243
G +L++ C G + W + TRQL+ RCL
Sbjct: 500 ---------------GELLSVVRCHGLGG--NQRWVWLPDTRQLKKLTYDRCL 535
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 20/66 (30%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 6 PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQK-SSLELRKRSSWLKVF 64
P+++ CHG G QL+ L A+G++ + C+D D +++S + L +R WL
Sbjct: 464 PSLLSLVTCHGYGGHQLWYLTAKGEVTREDHCLDYDGELLSVVRCHGLGGNQRWVWLPDT 523
Query: 65 KQIRSL 70
+Q++ L
Sbjct: 524 RQLKKL 529
>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus impatiens]
Length = 637
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 92/211 (43%), Gaps = 31/211 (14%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
GD+SE+ ALR KL CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492
Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD-ADKQEKLVEMLKLPVTFC 193
R A PA G CH G +Q+ L + C+D A K E L + + L
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSAGKPEDLHQPVGLYPCHR 543
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
+ + G + +C + SGT + W +++ T+Q+ H +C
Sbjct: 544 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNHQTKQIRHGSSDKC 603
Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
L++ +L M C + Q+W F+ P
Sbjct: 604 LAITESKQRLIMEECSAAAARQRWSFENYDP 634
>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 3 [Bombus terrestris]
Length = 637
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 92/211 (43%), Gaps = 31/211 (14%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
GD+SE+ ALR KL CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 437 GDVSERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 492
Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD-ADKQEKLVEMLKLPVTFC 193
R A PA G CH G +Q+ L + C+D A K E L + + L
Sbjct: 493 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSAGKPEDLHQPVGLYPCHR 543
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
+ + G + +C + SGT + W +++ T+Q+ H +C
Sbjct: 544 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNHQTKQIRHGSSDKC 603
Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
L++ +L M C + Q+W F+ P
Sbjct: 604 LAITESKQRLIMEECSAAAARQRWSFENYDP 634
>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
ricinus]
Length = 582
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 67/131 (51%), Gaps = 10/131 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K+ +P A +D GD+S + LR KL C F W+++++ +PE +P
Sbjct: 396 WLDEWKEFYFAINPAAKNVDKGDLSHRRNLRKKLKCNSFRWYLENI-------YPESHMP 448
Query: 118 PNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ + GE K+ + CLD+ GR + + S CHG +Q+F Q+ + C+DA
Sbjct: 449 LDYYHLGEIKHADSPVCLDTFGRKSGENVAVSTCHGXXXNQVFAYTKRQQIMSDDNCLDA 508
Query: 177 DKQEKLVEMLK 187
V++L+
Sbjct: 509 SSPRGPVKLLR 519
>gi|242024227|ref|XP_002432530.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212517982|gb|EEB19792.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 603
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 93/217 (42%), Gaps = 37/217 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ PLA + G+I +++ L+ KLHCK F W++++V +P
Sbjct: 415 RAAEVWMDDYKKYYYAAVPLAKSIPFGNIDDRLELKRKLHCKSFKWYLENV-------YP 467
Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P + Q CLD++G + +G CH +G +Q + + + + H +
Sbjct: 468 ELSIPHSTSPAFGSIRQRQLCLDTLGHSIEQTVGLYVCHDTGGNQEWGMEDDSYIKHHDL 527
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C L +P + G L+ RL C ++ + WR N+
Sbjct: 528 C------------LTIP---NYVPGA-LVLMRL--------CEDAD---NQKWRLVNNGG 560
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRF 268
++H CL T + L + +CD + QQW F
Sbjct: 561 LVKHETYSLCLDSRDATTRGLIVEKCDSRSPTQQWNF 597
>gi|66507571|ref|XP_394527.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis mellifera]
gi|380015445|ref|XP_003691712.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis florea]
Length = 571
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 93/217 (42%), Gaps = 38/217 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ KLHCK FSW++K+V +P
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNV-------YP 437
Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P G +L CLDSMG +G CH +G +Q + L +G + H
Sbjct: 438 ELVIPTSEGGPGGSLKQGSACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIRHHGL 497
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C L LPV + + GT L + C G+ + WR H
Sbjct: 498 C------------LTLPV-----------YAK-GTTLLMQIC---DGSENQKWR-HLEGG 529
Query: 233 QLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
+ H+ + C+ S + + +CD N Q+W
Sbjct: 530 LIRHSRIPVCVDSRYHAQKGITAEKCDSNAETQRWHL 566
>gi|410910894|ref|XP_003968925.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Takifugu rubripes]
Length = 577
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 65/133 (48%), Gaps = 12/133 (9%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+I +P A GD++E+ LR+KL CK F WF+ +V D++
Sbjct: 375 VRAAEVWMDEYKEIYYHRNPHARLEAYGDVTERRKLREKLGCKGFRWFLDNVYPDLH--I 432
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G KN G T C D AV+G CHG G +Q F + EG+L
Sbjct: 433 PEDKPGRF-GMLKNRGKTNHCFDYNPTDENAVVGERVIVYPCHGMGQNQFFEYSMEGELR 491
Query: 169 HGER----CVDAD 177
+ R C+ D
Sbjct: 492 YNTRSPAGCIVGD 504
>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 637
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 95/221 (42%), Gaps = 46/221 (20%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LP 117
WL +K P +D GD+SE++ALR++L CKDF W+++++ +PE +P
Sbjct: 449 WLDKYKDFVYAIMPELKNVDAGDVSERLALRERLQCKDFRWYLQNI-------YPESSMP 501
Query: 118 PNLFWGEAKNLGTQKCLDSMGRTAPAVI----GTSYCHGSGSSQLFRLNAEGQLGHGERC 173
+ A C DS+G + + G CH G +Q+ + G+L + C
Sbjct: 502 VDFHHVGALRNQDHGCADSLGYDSENGVNQNAGIFPCHNQGGNQIVVFSKSGELKFDDLC 561
Query: 174 VDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ 233
++ K + VKL C + G W ++ ++Q
Sbjct: 562 MEGSKN----------------SAVKLQKC-------------TEGNQKQVWEYNKESKQ 592
Query: 234 LEHAILKRCLSVHP--ETNQLAMLRCDE--NNSYQQWRFKE 270
++H +CL+V P + +L + C + + + QQW ++
Sbjct: 593 MKHLYSSQCLTVEPTKDDGKLRLSSCADGGHQTNQQWHLEQ 633
>gi|340712798|ref|XP_003394942.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus terrestris]
Length = 571
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 40/120 (33%), Positives = 62/120 (51%), Gaps = 5/120 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++ L+ KLHCK FSW++K+V ++
Sbjct: 385 RAAEVWMDDYKQFYYNAVPLARNIPYGNIQDRMELKRKLHCKPFSWYLKNVYPELVIPTS 444
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E P G + GT CLDSMG +G CH +G +Q + L +G + H + C+
Sbjct: 445 EGGP----GGSLKQGT-ACLDSMGHLLDGNVGLYPCHDTGGNQEWGLTKDGLIKHHDLCL 499
>gi|195032291|ref|XP_001988471.1| GH11183 [Drosophila grimshawi]
gi|193904471|gb|EDW03338.1| GH11183 [Drosophila grimshawi]
Length = 640
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 64/120 (53%), Gaps = 6/120 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I +++AL++KLHCK F W++++V D+ P
Sbjct: 456 RAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCKPFKWYLENVYPDLQAPDP 515
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
+ G+ + T+ CLD+MG +G CH +G +Q + + G++ H + C+
Sbjct: 516 QEV-----GQFRQDMTE-CLDTMGHLVDGTVGLFPCHNTGGNQEWAYSKRGEIKHDDLCL 569
>gi|118404262|ref|NP_001072444.1| polypeptide N-acetylgalactosaminyltransferase 10 [Xenopus
(Silurana) tropicalis]
gi|113197915|gb|AAI21701.1| GalNAc transferase 10 [Xenopus (Silurana) tropicalis]
Length = 603
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 56/226 (24%), Positives = 92/226 (40%), Gaps = 44/226 (19%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP--ELP 117
W+ + + P L +GD++ Q LR KL C+DF WFM +A+D+ +P ELP
Sbjct: 394 WMDEYAEYIYQRRPDYRHLSVGDVAAQKELRRKLQCRDFKWFMNEIAWDLPKYYPPVELP 453
Query: 118 PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS------GSSQLFRLNAEGQLGHG- 170
P WGE +N+ C+ T+ I C+ G+ Q+F L + G
Sbjct: 454 PAA-WGEIRNVRMDMCVGVKHVTSGTEIRLEACNKGRNSDTWGARQVFTLGWREDIRPGI 512
Query: 171 -----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+ C DA Q PVT LF+C G + W
Sbjct: 513 PQHTMKSCFDAVSQTS-------PVT-------------------LFDCHGMKG--NQLW 544
Query: 226 RFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
++ + + H C+ + ++ M +C+ ++ QQW F+ +
Sbjct: 545 KYRRD-KTVYHPTSNSCMDCNEGEYKIFMNKCNPSSPTQQWTFEHI 589
>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 576
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 59/239 (24%), Positives = 91/239 (38%), Gaps = 84/239 (35%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLG------ 129
D GD+S + ALR KL CK F W++ +V +PEL P + GE +N G
Sbjct: 363 DFGDVSSRKALRKKLQCKSFKWYLDNV-------YPELFVPGDAIGKGEIRNKGEVAGDV 415
Query: 130 TQKCLD--------------------------------------SMGRTAPAVIGTSYCH 151
Q CLD S GR A +G +CH
Sbjct: 416 VQHCLDSEVGEDIQKVVIAYPCHKSGGNQIRNRGGRSKNCLDWASHGRQRSANVGLYWCH 475
Query: 152 GSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNL 211
G +Q + L+ +G++ E C+D AG ++ +
Sbjct: 476 KKGGNQYWMLSKDGEIRRDESCID-------------------YAGADVM---------V 507
Query: 212 FNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ C G + W++ Q+ H I +CL + + ++L M CD+ N YQ W +E
Sbjct: 508 YPCHGMKGNQE--WKYLPDESQILHVITSKCLEMSRDGSKLMMNVCDQANPYQHWLIQE 564
>gi|269115411|gb|ACZ26277.1| N-acetyl galactosaminyl transferase-like protein [Mayetiola
destructor]
Length = 638
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/202 (26%), Positives = 91/202 (45%), Gaps = 35/202 (17%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK--CLD 135
+D GD+++Q A+R+KL CK F WFMK +A+D+ K+P + P F A T C+D
Sbjct: 445 IDAGDLTKQKAIREKLKCKSFDWFMKEIAFDLPKKYPPIEPADFASGAIQSITNPNICID 504
Query: 136 SMGRTAPAVIGTSYCHGS----GSSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLP 189
+M R +G C + ++Q F L + + ++ C D + P
Sbjct: 505 TMNRGKGQDVGLFSCADNLQHPQANQFFVLTWHKDIRMFGSTMCWDLPNSYPMS-----P 559
Query: 190 VTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET 249
+ F + C G + W+++ T+Q+ H C+ + +T
Sbjct: 560 ILFHI-------------------CHGQQG--NQFWKYNLKTKQITHD-KNVCVECNAKT 597
Query: 250 NQLAMLRCDENNSYQQWRFKEV 271
+L + +CD NN Q W+F +V
Sbjct: 598 MKLYVNKCDPNNPNQLWKFGKV 619
>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12 [Taeniopygia
guttata]
Length = 594
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 94/226 (41%), Gaps = 34/226 (15%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +KQ+ +P A GD++E+ LR+KL CKDF WF+++V +++
Sbjct: 392 VRAAEVWMDEYKQLYYHRNPHARLEPYGDVTERRLLREKLKCKDFKWFLENVYPELHV-- 449
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G KN G + C D + G CHG G +Q F + ++
Sbjct: 450 PEDRPGFF-GMLKNRGMENFCFDYNPTNEHQITGQRVILYPCHGMGQNQFFEYTSHNEIR 508
Query: 169 HGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF- 227
+ R + V V +G L ++ C ++ +V +F
Sbjct: 509 YNTRQPE--------------VCAAVDSGTDY--------LTMYLCQENAHSVPENQKFI 546
Query: 228 HNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWRFKE 270
L H ++C+ N +LR N+ YQ+W FKE
Sbjct: 547 FREDGALFHPQTEKCIQAEANAYNGNPAPLLRPCSNSDYQKWFFKE 592
>gi|354505207|ref|XP_003514663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like,
partial [Cricetulus griseus]
Length = 233
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 69/123 (56%), Gaps = 10/123 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + + + GDISE++ LR++LHC +FSW++ +V +
Sbjct: 39 QVRLAEVWMDSYKKIFYRRNLQAAKIVQENNFGDISERLQLREQLHCHNFSWYLHNVYPE 98
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 99 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 154
Query: 167 LGH 169
L H
Sbjct: 155 LRH 157
>gi|195327857|ref|XP_002030634.1| GM25555 [Drosophila sechellia]
gi|194119577|gb|EDW41620.1| GM25555 [Drosophila sechellia]
Length = 590
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 108/257 (42%), Gaps = 43/257 (16%)
Query: 21 QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
LFR + Q+ + + +++K++IS R WL +K P + +
Sbjct: 350 HLFR-DGNFQVRYTNKDKNSEKKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD--- 135
G+++EQ AL+++LHCK F WF+ ++A D +P L P + G +++ + K CLD
Sbjct: 405 GNLAEQRALKNRLHCKPFKWFLDNLATDFLHLYPILDPAEYASGVLQSIASPKLCLDRKD 464
Query: 136 -SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
S G+ A + + S Q + L +L G C++
Sbjct: 465 PSHGQPKLAPCSSDHVFPS-PQQYWSLTNHRELRSGFYCLEVRNH--------------- 508
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQ 251
G ++++ C SG + W F + T Q+ + + CL PE N
Sbjct: 509 -----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQQQNSRHCLEAQPELNA 555
Query: 252 LAMLRCDENNSYQQWRF 268
+ CD N QQW+F
Sbjct: 556 VTSSVCDPKNHKQQWKF 572
>gi|313226886|emb|CBY22031.1| unnamed protein product [Oikopleura dioica]
Length = 685
Score = 69.3 bits (168), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/124 (38%), Positives = 62/124 (50%), Gaps = 17/124 (13%)
Query: 58 SSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAY---DVYTKF 113
W K F R P L GDISEQV R HC +F+W M+ +AY D+Y
Sbjct: 496 DDWTKYFYWNR----PELQKLPYGDISEQVKFRHD-HCPYNFTWMMEEIAYGITDMYDAP 550
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGR---TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL L WGE + G+ C+DSMG PA YCH G +QLFR++A G +
Sbjct: 551 NEL---LAWGEIRGKGSNICVDSMGHKDNDGPAE--AWYCHKQGGNQLFRIHAGGFIAQN 605
Query: 171 ERCV 174
+C+
Sbjct: 606 LQCI 609
>gi|21358257|ref|NP_648800.1| polypeptide GalNAc transferase 8 [Drosophila melanogaster]
gi|51316119|sp|Q9VUT6.1|GALT8_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 8;
Short=pp-GaNTase 8; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 8; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 8
gi|7294236|gb|AAF49587.1| polypeptide GalNAc transferase 8 [Drosophila melanogaster]
gi|17945053|gb|AAL48588.1| RE06471p [Drosophila melanogaster]
gi|220947814|gb|ACL86450.1| pgant8-PA [synthetic construct]
Length = 590
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/257 (24%), Positives = 107/257 (41%), Gaps = 43/257 (16%)
Query: 21 QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
LFR + Q+ + + +++K++IS R WL +K P + +
Sbjct: 350 HLFR-DGNFQIRYTNKDKNSEKKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD--- 135
G+++EQ L+++LHCK F WF+ ++A D +P L P + G +++ + K CLD
Sbjct: 405 GNLAEQRDLKNRLHCKPFKWFLDNLATDFLNLYPILDPAEYASGVLQSISSPKLCLDRKD 464
Query: 136 -SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
S G+ A + + S Q + L +L G C++
Sbjct: 465 PSHGQPKLAPCSSDHVFPS-PEQYWSLTNHRELRSGFYCLEVRNH--------------- 508
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQ 251
G ++++ C SG + W F + T Q+ + + CL PE N
Sbjct: 509 -----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQQQNFRHCLEAQPELNA 555
Query: 252 LAMLRCDENNSYQQWRF 268
+ CD N QQW+F
Sbjct: 556 VTSSVCDPKNHKQQWKF 572
>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
Length = 564
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 40/136 (29%), Positives = 66/136 (48%), Gaps = 13/136 (9%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K PLA + GDISE++ LR L CK F W+++HV +P
Sbjct: 378 RAAEVWMDDYKHFYYAAVPLAKNIPFGDISERLELRRNLQCKPFKWYLQHV-------YP 430
Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL + + ++G + CLD+MG + CH +G +Q + L + G + H
Sbjct: 431 ELA--IPQATSAHVGELRQGMYCLDTMGHLIDGTVALYQCHHTGGNQEWGLTSGGLIKHH 488
Query: 171 ERCVDADKQEKLVEML 186
+ C+ D K V+++
Sbjct: 489 DLCLTLDDYMKGVQVV 504
>gi|357625888|gb|EHJ76177.1| hypothetical protein KGM_07902 [Danaus plexippus]
Length = 535
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 88/222 (39%), Gaps = 32/222 (14%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + Q P + +D GDIS+Q ALR+KL CK F WFM +A+D+ K+P
Sbjct: 320 RVAEVWMDEYAQYLYKRRPHYLKIDTGDISKQKALREKLQCKPFKWFMTQIAFDLTAKYP 379
Query: 115 ELPPNLFWGEAKNLGTQK--CLDSMGRTAPAVIGTSYCHGSGSS-QLFRLNAEGQLGHGE 171
+ P F T C+D+ + C S S+ Q F L+ +
Sbjct: 380 PVEPKPFAEGRIRPATYPHLCVDAHHGNQMDKLHLKSCTASTSAEQNFMLSWHKDIKSKT 439
Query: 172 R--CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
R C D +L L C LG L WR+H
Sbjct: 440 RNMCWDLPDSSPRSPIL-------------LYSCHLGGGNQL-------------WRYHP 473
Query: 230 STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+R+L+H CL T + + +C ++ + Q+W +V
Sbjct: 474 ESRRLKHGTNDNCLDFEISTRSVFIKQCSDSET-QEWIIDKV 514
>gi|344255628|gb|EGW11732.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Cricetulus
griseus]
Length = 386
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 69/123 (56%), Gaps = 10/123 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + + + GDISE++ LR++LHC +FSW++ +V +
Sbjct: 42 QVRLAEVWMDSYKKIFYRRNLQAAKIVQENNFGDISERLQLREQLHCHNFSWYLHNVYPE 101
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 102 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 157
Query: 167 LGH 169
L H
Sbjct: 158 LRH 160
>gi|195429102|ref|XP_002062603.1| GK16570 [Drosophila willistoni]
gi|194158688|gb|EDW73589.1| GK16570 [Drosophila willistoni]
Length = 679
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 98/220 (44%), Gaps = 38/220 (17%)
Query: 60 WLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-P 117
W+ +K+ + G + +D GD++ Q A+R KL CK F WFM+ VA+D+ +P + P
Sbjct: 464 WMDEYKKYLYDHGDGIYDRVDAGDLTAQKAIRTKLKCKSFKWFMEEVAFDLMKSYPPIDP 523
Query: 118 PNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER 172
P+ G +N+G C+D+ G +G C +Q F+L+ + L +
Sbjct: 524 PDYASGAIQNVGDSSLCVDTHGLRKHNRMGVYSCAEDLQKPQRNQFFQLSWKRDLRQRRK 583
Query: 173 --CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
C+D + PV L++C G + F++
Sbjct: 584 KDCLDVQ-----IWDANAPVW-------------------LWDCHGQQGNQ---YWFYDY 616
Query: 231 TRQL-EHAIL-KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+QL H +RCL + P + L + +C E+NSY +W F
Sbjct: 617 RKQLIRHGKEGRRCLELLPFSKTLVVNQCSESNSYMKWNF 656
>gi|332019618|gb|EGI60096.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Acromyrmex echinatior]
Length = 566
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 92/214 (42%), Gaps = 33/214 (15%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
+ GDISE+ ALR KL CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 364 NYGDISERKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 419
Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF-- 192
R A G CH G +Q+ ++ + C+D+ + +E L PV
Sbjct: 420 PARKADLHKPCGLYPCHRQGGNQIRQVTS-------GMCIDSSGK---IEDLHQPVGMYP 469
Query: 193 CVLAGVK--LIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAIL 239
C G + + G + +C + SG+ + W ++ T + H
Sbjct: 470 CHRQGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWIYNPQTNHIRHGSS 529
Query: 240 KRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
+CL++ QL M C N+ Q+W F+ P
Sbjct: 530 DKCLAITESKQQLVMEECSSNSLRQRWSFENYDP 563
>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
Length = 536
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/130 (28%), Positives = 62/130 (47%), Gaps = 9/130 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+ P + G+I+E++ LR KL CK F W++ HV DV
Sbjct: 341 VRTAEVWMDGYKEFVYQRQPYMRNIHFGNITERLELRKKLQCKSFKWYLDHVFTDV---- 396
Query: 114 PELPPN---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
+ PN + G+ +N ++ CL+++GR A +G S C G + + L +L
Sbjct: 397 --ILPNESAIAKGKVRNPESEMCLNTLGRPKHAFLGLSPCAHEGKTMIISLTVLNELAMD 454
Query: 171 ERCVDADKQE 180
E C D +
Sbjct: 455 EVCFDVSDHQ 464
>gi|326922813|ref|XP_003207639.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Meleagris
gallopavo]
Length = 632
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 106/235 (45%), Gaps = 44/235 (18%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GDIS+++ LR +L CK+F+W++
Sbjct: 429 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLNLRQRLQCKNFTWYL 488
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+V +VY P+L P LF G KN+G CLD +G P ++ + CHG G +Q
Sbjct: 489 NNVYPEVYV--PDLNP-LFSGYLKNIGNHMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 542
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFN 213
F +A ++ H E C+ A K PV C G K+
Sbjct: 543 FEYSAHHEIRHNIQKELCLHASKG---------PVQLRECSYKGQKIF------------ 581
Query: 214 CFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ W+ H + L A L CL+ + E +++ C+ ++ +Q+W F
Sbjct: 582 -----AFGEEQWQ-HQKDQTLYSAALHMCLTGNGE--HPSLVSCNPSDPFQKWIF 628
>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Taeniopygia guttata]
Length = 555
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 95/222 (42%), Gaps = 40/222 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR+ L+CK F WFM VA+D+ +P + PP
Sbjct: 346 WMDEYAEFIYQRRPEYRHLSAGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPP 405
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
WGE +N+GT C+D+ HG+ S L RL E CV D+
Sbjct: 406 AAAWGEIRNVGTGLCVDTK-------------HGALGSPL-RL---------ENCV-KDR 441
Query: 179 QEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDGPWRF 227
E +++ TF ++ FC + + L++C G + WR+
Sbjct: 442 GEAAWNNVQV-FTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQLWRY 498
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ ++ M C+ ++ QQW F+
Sbjct: 499 RKD-KTLYHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 539
>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 624
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/228 (27%), Positives = 98/228 (42%), Gaps = 46/228 (20%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC++FSW++ ++ +
Sbjct: 430 QVRLAEVWMDEYKEIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPE 489
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLG +CLD R +I CH G +Q F +
Sbjct: 490 MFV--PDLKPT-FYGAIKNLGIDQCLDVGENNRGGKPLIMYP-CHSLGGNQYFEYTTQRD 545
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
L H + C+ A LG F NS D W
Sbjct: 546 LRHN-----------------IAKQLCLHASAGT----LGLRGCQFTVKNSQVPKDEEW- 583
Query: 227 FHNSTRQLEHAILKR------CLSVHPETNQLAMLRCDENNSYQQWRF 268
+L +L R CL+ + + AM CD N+ Q W F
Sbjct: 584 ------ELTQDLLIRNSGSGTCLTA--KDKKPAMASCDPNDPRQHWLF 623
>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
Length = 622
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 100/226 (44%), Gaps = 42/226 (18%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A + GD+SE++ LR++LHC +FSW++ +V +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 488 MFV--PDLNPT-FSGAIKNLGTSQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 543
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
L H G +L G+ L L NC NS D
Sbjct: 544 LRHN-------------------------IGKQLCLHASGSTLGLRNCQFIGKNSEVPKD 578
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + + CL+ + + AM C+ ++ YQ W F
Sbjct: 579 EEWEL-TQDQLIRNLGSGTCLTS--KDKKPAMAPCNPSDPYQLWLF 621
>gi|157128332|ref|XP_001661405.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108872614|gb|EAT36839.1| AAEL011095-PA [Aedes aegypti]
Length = 573
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 15/124 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + GDI E++ L+++L CK F W++ +V
Sbjct: 388 RAAEVWMDDYKQYYYAAVPLAKNIPFGDIEERMELKERLQCKPFKWYLANVY-------- 439
Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
P L E + G+ + C+D++G ++G CH SG +Q + + +GQ+ H
Sbjct: 440 ---PQLTIPEQQTKGSLRQGPYCMDTLGHLVDGIVGLYQCHDSGGNQDWAITKKGQIKHL 496
Query: 171 ERCV 174
+ C+
Sbjct: 497 DLCL 500
>gi|432092278|gb|ELK24901.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Myotis
davidii]
Length = 363
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 42/124 (33%), Positives = 54/124 (43%), Gaps = 7/124 (5%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 193 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 252
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 253 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTWSHEQLFTFGWREDI 312
Query: 168 GHGE 171
GE
Sbjct: 313 RPGE 316
>gi|194749861|ref|XP_001957354.1| GF24092 [Drosophila ananassae]
gi|190624636|gb|EDV40160.1| GF24092 [Drosophila ananassae]
Length = 591
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 105/240 (43%), Gaps = 44/240 (18%)
Query: 39 DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDF 98
+++K+VIS R WL +K P + +G ++E+ AL+ +LHCK F
Sbjct: 368 ESEKKVISRNYR----RVAEVWLDDYKDKLFEKMPHLTVIQVGSLTEEKALKKRLHCKPF 423
Query: 99 SWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQK-CLD-SMGRTAPAVIGTSYC---HG 152
WF+ + D +P + P F +G ++L + K CLD S G+ P S C H
Sbjct: 424 KWFLDQLGKDFLNLYPVVEPVDFAFGVLQSLASPKLCLDRSEGK--PGQPKLSECPDDHT 481
Query: 153 SGSSQL-FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNL 211
S+L + L +L G C++ Q+ G + +
Sbjct: 482 FPESELKWTLTNHRELRSGPVCLEKRNQQ-------------------------GDIY-V 515
Query: 212 FNCFNSSGTVDGPWRFHNSTRQLEHAIL---KRCLSVHPETNQLAMLRCDENNSYQQWRF 268
F C G + W + +++Q+ H + +RCL + P+ + +CD NN+ Q+W+F
Sbjct: 516 FQCHGQMG--NQFWSLNTTSQQVVHGQMGSSRRCLEIQPDAKGVTTRKCDSNNAKQRWKF 573
>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6-like [Hydra
magnipapillata]
Length = 591
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 94/219 (42%), Gaps = 37/219 (16%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK+I GD+SE+ LR+ L CK F W+++++ ++
Sbjct: 391 IRVAEVWMDEFKEIFYSFRANLKPEQGGDVSERKKLREDLKCKSFKWYLQNIIPEL---- 446
Query: 114 PELPPNLFW--GEAKNLGTQKCLDSMGRTAP-AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E+P + G+ KNLGT CLD++ + G CH G++Q F + ++ H
Sbjct: 447 -EIPDKYPYGRGDVKNLGTLSCLDTLAQNNQGGKPGLYPCHKMGTNQYFIFTKKFEIWHD 505
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
C+D + L + L+ C G + W+ H
Sbjct: 506 GLCLDLSDSD------------------------LNAKVKLWPCHKQGG--NQKWK-HTK 538
Query: 231 TRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ + H K+CL + +Q+ + CD NN+ Q+W F+
Sbjct: 539 SGLIMHESRKKCL--EGQGDQILIRACDTNNANQRWLFE 575
>gi|170046214|ref|XP_001850669.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
gi|167869055|gb|EDS32438.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
Length = 576
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 62/124 (50%), Gaps = 15/124 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ PLA + G+I E++ L+++L CK+F W++ +V
Sbjct: 391 RAAEVWMDDYKQYYYAAVPLAKNIPFGNIDERLQLKEQLECKNFKWYLDNVY-------- 442
Query: 115 ELPPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
P L E + G+ + C+D++G ++G +CH SG +Q + + GQ+ H
Sbjct: 443 ---PQLTIPEQQTKGSLRQGPYCIDTLGHLVDGIVGLYHCHNSGGNQDWAITKSGQIKHL 499
Query: 171 ERCV 174
+ C+
Sbjct: 500 DLCL 503
>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mustela putorius
furo]
Length = 461
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 54/99 (54%), Gaps = 6/99 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK + SP +D GDIS ++ LR KL CK FSW+++++ D
Sbjct: 368 RLAEVWMDEFKNFFYIISPGVTKVDYGDISSRLGLRHKLQCKPFSWYLENIYPD-----S 422
Query: 115 ELPPNLF-WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHG 152
++P + F GE +N+ T +CLD+M R +G CHG
Sbjct: 423 QIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHG 461
>gi|149032012|gb|EDL86924.1| rCG50623 [Rattus norvegicus]
Length = 431
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 100/226 (44%), Gaps = 42/226 (18%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A + GD+SE++ LR++LHC +FSW++ +V +
Sbjct: 237 QVRLAEVWMDDYKKIFYRRNLQAAKMAKENNFGDVSERLRLREQLHCHNFSWYLHNVYPE 296
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 297 MFV--PDLNPT-FSGAIKNLGTSQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 352
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
L H G +L G+ L L NC NS D
Sbjct: 353 LRHN-------------------------IGKQLCLHASGSTLGLRNCQFIGKNSEVPKD 387
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + + CL+ + + AM C+ ++ YQ W F
Sbjct: 388 EEWEL-TQDQLIRNLGSGTCLT--SKDKKPAMAPCNPSDPYQLWLF 430
>gi|444727225|gb|ELW67727.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Tupaia
chinensis]
Length = 406
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 51/106 (48%), Gaps = 1/106 (0%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 232 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 291
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
+ PP+ WGE +N+ C+DS + C GS + +
Sbjct: 292 PVEPPSAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSERTW 337
>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
gallus]
Length = 611
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 95/222 (42%), Gaps = 40/222 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR+ L+CK F WFM VA+D+ +P + PP
Sbjct: 402 WMDEYAEYIYQRRPEYRHLSAGDVTAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPP 461
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
WGE +N+GT C+D+ HGS S L RL E CV D+
Sbjct: 462 AAAWGEIRNVGTGLCVDTK-------------HGSLGSPL-RL---------ESCV-KDR 497
Query: 179 QEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDGPWRF 227
E +++ TF ++ FC + + L++C G + WR+
Sbjct: 498 GEAAWNNVQV-FTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQLWRY 554
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ ++ M C+ ++ QQW F+
Sbjct: 555 RKD-KTLYHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 595
>gi|380805795|gb|AFE74773.1| polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Macaca mulatta]
Length = 336
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 49/104 (47%), Gaps = 1/104 (0%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 233 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 292
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
+ PP WGE +N+ C+DS + C GS +
Sbjct: 293 PVEPPPAAWGEIRNVAANLCVDSKHGATGTELRLDICVKDGSER 336
>gi|194873027|ref|XP_001973126.1| GG15924 [Drosophila erecta]
gi|190654909|gb|EDV52152.1| GG15924 [Drosophila erecta]
Length = 589
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 65/258 (25%), Positives = 104/258 (40%), Gaps = 49/258 (18%)
Query: 29 GQLGHGERCVDADKQVISTQKSSLELRKRSS---------WLKVFKQIRSLGSPLAMFLD 79
++GH R D + QV T K RK S WL +K P +
Sbjct: 346 SRVGHLFR--DGNFQVRYTNKDKNSERKLISRNYRRVAEIWLDEYKDKLFANMPHLTVIP 403
Query: 80 MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD-- 135
+G+++EQ L+ +LHCK F WF+ ++A D +P L P + G +++ + K CLD
Sbjct: 404 VGNLAEQRELKRRLHCKPFKWFLDNLAADFLNLYPILDPAEYASGVLQSISSPKLCLDRK 463
Query: 136 --SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFC 193
+GR A + + S Q + L +L G C++ Q
Sbjct: 464 EPRLGRPQLAPCSSDHVFPS-PEQYWSLTNHRELRSGFYCLEVRNQ-------------- 508
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETN 250
G +N++ C SG + W F + T Q+ + + CL P +N
Sbjct: 509 ------------GANVNIYQCHGQSG--NQFWSFDSKTHQVISGQQKNSRHCLEAQPASN 554
Query: 251 QLAMLRCDENNSYQQWRF 268
+ CD N Q+W+F
Sbjct: 555 SVTSSVCDPKNHRQKWKF 572
>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 677
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 92/215 (42%), Gaps = 22/215 (10%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFMK VA+D+ +P + PP
Sbjct: 469 WMDEYAEYVYQRRPEYRHLSAGDVAAQKELRSTLNCKSFKWFMKEVAWDLPKHYPPVEPP 528
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
WGE ++ + CL+S + I C + A+ GHG+ +
Sbjct: 529 AAAWGEVRSAASGLCLESKHFVSGTPIRLESC--------VKGRADVSWGHGQVFTFGWR 580
Query: 179 QEKLV--EMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEH 236
++ V M V F ++ + + L++C G + WR+ + L H
Sbjct: 581 EDIRVGDPMHTKKVCFDAVSH--------HSPVTLYDCHGMRG--NQLWRYRKD-KSLYH 629
Query: 237 AILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ C+ + ++ M C+ ++ QQW F++
Sbjct: 630 PVSNSCVDSNASERRVFMNVCNPASASQQWLFEKT 664
>gi|118093614|ref|XP_422023.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Gallus
gallus]
Length = 632
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 106/235 (45%), Gaps = 44/235 (18%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GDIS+++ LR +L CK+F+W++
Sbjct: 429 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYL 488
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+V +VY P+L P LF G KN+G CLD +G P ++ + CHG G +Q
Sbjct: 489 NNVYPEVYV--PDLNP-LFSGYLKNVGNHMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 542
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFN 213
F +A ++ H E C+ A K PV C G K+
Sbjct: 543 FEYSAHHEIRHNIQKELCLHASKG---------PVQLRECSYKGQKIF------------ 581
Query: 214 CFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ W+ H + L A L CL+ + E +++ C+ ++ +Q+W F
Sbjct: 582 -----AFGEEQWQ-HQKDQTLYSAALHMCLTGNGE--HPSLVSCNPSDPFQKWIF 628
>gi|260793003|ref|XP_002591503.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
gi|229276709|gb|EEN47514.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
Length = 618
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 83/197 (42%), Gaps = 40/197 (20%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT 140
GD+S ++ L+DKLHCK F WFM+ + D+Y PE P G +N + C DS G
Sbjct: 451 GDVSARLDLKDKLHCKPFKWFMQTIMPDMYV--PEDRPGRS-GALRNSASNLCFDSEGAE 507
Query: 141 APAVIGTSY-CHGSGSSQLFRLNAEGQLGHG----ERCVDADKQEKLVEMLKLPVTFCVL 195
T + CHG G +Q F LN+ + H E CV+A E V ++ V
Sbjct: 508 NAGKRPTMWGCHGMGGNQYFELNSREEFRHNTGGKEMCVEAQGGE-FVVLMHCTSGNNVP 566
Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ---- 251
A K G+V N ++ +CL PE Q
Sbjct: 567 AQQKWQIREDGSVYNAYH--------------------------GKCLHFDPEERQEKKE 600
Query: 252 LAMLRCDENNSYQQWRF 268
+ +RC +S Q+WRF
Sbjct: 601 VKAVRCTGQDS-QRWRF 616
>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
Length = 615
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 10/151 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A ++ GD++E+ LR+ L CK F W+++++ +P
Sbjct: 424 RTAEVWMDEYKAFFYKMVPAARNVEAGDVTERKKLRETLQCKSFKWYLENI-------YP 476
Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P P F G N T+KC+D+ G+ G CHGSG +Q + L +G++ +
Sbjct: 477 EAPLPADFKSLGAIVNRFTEKCIDTNGKKDGQSPGLQGCHGSGGNQAWSLTGKGEIRSDD 536
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIF 202
C+ + ++ LKL +K +F
Sbjct: 537 LCLSSGHVYQIGSELKLERCSVSKINIKHVF 567
>gi|313231736|emb|CBY08849.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 97/230 (42%), Gaps = 39/230 (16%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R WL +K+ +PL+ GDISE+V LR+ L CK F W+++++ D+
Sbjct: 400 VRAVEVWLDDYKRHFYARNPLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDL--PV 457
Query: 114 PELPPNLFWGEAKNLGT-QKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
PE P F G N G+ +CLD V+GT CHG G +Q F N++G L +
Sbjct: 458 PEDTPGQF-GALHNKGSPSRCLDYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRY 516
Query: 170 G---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW- 225
E C+ A K + E + + C N N + W
Sbjct: 517 TSQFELCI-AKKDDNSGE-------------IAAVMC---------NGKNVNPPARAIWI 553
Query: 226 ---RFHNSTRQLEHAILKRCLSVHPETN--QLAMLRCDENNSYQQWRFKE 270
+ T QL++ C++ T ++ C + YQQW F+E
Sbjct: 554 KIPQNDGKTFQLKNKSNGLCIATEKTTGNPKIVFESCKSTSRYQQWYFQE 603
>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Anolis carolinensis]
Length = 612
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 89/219 (40%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM VA+D+ +P + PP
Sbjct: 403 WMDEYAEYIYQRRPEYRHLSAGDVATQKELRSNLNCKSFRWFMNEVAWDLRKFYPPVEPP 462
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS------GSSQLFRLNAEGQLGHGER 172
WGE N+GT C+D+ + + C S + Q+F + + G
Sbjct: 463 AAAWGEIHNVGTSLCVDTKHGALGSPLKIETCVKSRGEAAWNNVQVFTFSWREDIRPG-- 520
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C G + W++ +
Sbjct: 521 --DPQHTKK----------FCFDA------VSHNSPVTLYDCHGMKG--NQLWKYRKD-K 559
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H I C+ ++ M C+ ++ QQW F+ +
Sbjct: 560 TLYHPISGSCMDCTENDRKIFMNTCNPSSPTQQWIFEHI 598
>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis mellifera]
Length = 635
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 92/211 (43%), Gaps = 31/211 (14%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
GD+SE+ ALR +L CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 435 GDVSERKALRKRLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 490
Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
R A PA G CH G +Q+ L + C+D+ K E L + + L
Sbjct: 491 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSPGKPEDLHQPVGLYPCHR 541
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
+ + G + +C + SGT + W ++ T+Q+ H +C
Sbjct: 542 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNPQTKQIRHGSSDKC 601
Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
L++ +L M C + + Q+W F+ P
Sbjct: 602 LAITESKQRLIMEECSSSAARQRWSFENYDP 632
>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
gallus]
Length = 590
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 65/245 (26%), Positives = 100/245 (40%), Gaps = 46/245 (18%)
Query: 42 KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
KQ ++ +L +R W+ +K++ +P A GD+SE+ LR+KL CKDF
Sbjct: 374 KQAPYSRSKALANSVRAAEVWMDEYKELYYHRNPHARLEPYGDVSERRLLREKLKCKDFK 433
Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
WF+++V +++ PE P F+G KN G C D + G CHG G
Sbjct: 434 WFLENVYPELHV--PEDRPG-FFGMLKNRGMANFCFDYNPSNEHEITGHRVILYPCHGMG 490
Query: 155 SSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL------GTV 208
+Q F + ++ + R +A V+AG + L
Sbjct: 491 QNQFFEYTSHNEIRYNTRQPEA--------------CAAVIAGTDYLTMNLCQENIHRVP 536
Query: 209 LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQ 265
N F GT+ FH T ++CL + N +LR N+ YQ+
Sbjct: 537 ENQKFAFREDGTL-----FHMQT--------QKCLQAERNSYNGNPAPVLRPCTNSDYQK 583
Query: 266 WRFKE 270
W FKE
Sbjct: 584 WFFKE 588
>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 97/230 (42%), Gaps = 39/230 (16%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R WL +K+ +PL+ GDISE+V LR+ L CK F W+++++ D+
Sbjct: 400 VRAVEVWLDDYKRHFYARNPLSKDEKYGDISERVNLRNGLECKSFQWYLENIYPDL--PV 457
Query: 114 PELPPNLFWGEAKNLGT-QKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
PE P F G N G+ +CLD V+GT CHG G +Q F N++G L +
Sbjct: 458 PEDTPGQF-GALHNKGSPSRCLDYNPPENDLTHGVVGTFGCHGQGGNQFFEFNSKGHLRY 516
Query: 170 G---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW- 225
E C+ A K + E + + C N N + W
Sbjct: 517 TSQFELCI-AKKDDNSGE-------------IAAVMC---------NGKNVNPPARAIWI 553
Query: 226 ---RFHNSTRQLEHAILKRCLSVHPETN--QLAMLRCDENNSYQQWRFKE 270
+ T QL++ C++ T ++ C + YQQW F+E
Sbjct: 554 KIPQDDGKTFQLKNKSNGLCIATEKTTGNPKIVFESCKNTSRYQQWYFQE 603
>gi|326917280|ref|XP_003204928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Meleagris gallopavo]
Length = 528
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 100/245 (40%), Gaps = 46/245 (18%)
Query: 42 KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
KQ ++ +L +R W+ +K++ +P A GD++E+ LR+KL CKDF
Sbjct: 312 KQAPYSRSKALANSVRAAEVWMDEYKELYYHRNPHARLEPYGDVTERRLLREKLKCKDFK 371
Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
WF+++V +++ PE P F G KN G C D + G CHG G
Sbjct: 372 WFLENVYPELHV--PEDRPGFF-GMLKNRGMANFCFDYNPPNEHEITGHRVILYPCHGMG 428
Query: 155 SSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL------GTV 208
+Q F + ++ + R +A V+AG + + L
Sbjct: 429 QNQFFEYTSHNEIRYNTRQPEA--------------CAAVIAGTEYLTMNLCQENIHHVP 474
Query: 209 LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQ 265
N F GT+ FH T ++CL + N +LR N+ YQ+
Sbjct: 475 ENQKFAFREDGTL-----FHMQT--------QKCLQAESNSYNGNPAPVLRPCSNSDYQK 521
Query: 266 WRFKE 270
W FKE
Sbjct: 522 WFFKE 526
>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Takifugu rubripes]
Length = 618
Score = 68.2 bits (165), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 80/150 (53%), Gaps = 13/150 (8%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
+VI+ + L + K+F + + +A + GDISE++ LR++LHCK+FSW++
Sbjct: 416 EVITRNQVRLAEVWMDDYKKIFYRRNKNAAKMAKENNYGDISERLNLRERLHCKNFSWYL 475
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLD----SMGRTAPAVIGTSYCHGSGSSQL 158
V + + P+L P+ F G KN G++ CLD ++G P ++ T CH G +Q
Sbjct: 476 NTVYPEAFV--PDLTPDRF-GAIKNQGSKTCLDVGENNLG-GKPVMMYT--CHNMGGNQY 529
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEM 185
F ++ +L H E C+ A + V++
Sbjct: 530 FEYSSHKELRHNIGKELCLHAIPHPEAVKI 559
>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Megachile rotundata]
Length = 638
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/213 (26%), Positives = 93/213 (43%), Gaps = 31/213 (14%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
+ GD+S++ ALR KL CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 436 NYGDVSDRKALRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 491
Query: 137 MGRTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVT 191
R A PA G CH G +Q+ L + C+D+ K E L + + L
Sbjct: 492 PARKADLHKPA--GLYPCHRQGGNQIRHLVS-------SICIDSPGKPEDLHQPVGLYPC 542
Query: 192 FCVLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILK 240
+ + G + +C + SGT + W ++ T+Q+ H
Sbjct: 543 HRQGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNPQTKQIRHGSSD 602
Query: 241 RCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
+CL++ +L M C + + Q+W F+ P
Sbjct: 603 KCLAITESKQRLIMEECSPSAARQRWSFENYDP 635
>gi|62751482|ref|NP_001015534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
gi|75057892|sp|Q5EA41.1|GALT6_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|59857821|gb|AAX08745.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 69/124 (55%), Gaps = 12/124 (9%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++L+C +FSWF+ +V +
Sbjct: 428 QVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
++ P+L P F+G KNLG CLD + P ++ T CHG G +Q F +
Sbjct: 488 MFV--PDLKPT-FFGALKNLGVDHCLDVGENNNGGKPLILYT--CHGLGGNQYFEYTTQR 542
Query: 166 QLGH 169
L H
Sbjct: 543 DLRH 546
>gi|344251833|gb|EGW07937.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Cricetulus
griseus]
Length = 457
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 70/141 (49%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF+K V +++
Sbjct: 257 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKKLRDKLQCKDFRWFLKTVYPELHV-- 314
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G ++ CLD V G CHG G +Q F ++ ++
Sbjct: 315 PEDRPGFF-GMLQNKGLREYCLDYNPPNENQVDGHQVLLYLCHGMGQNQFFEYTSQKEIR 373
Query: 169 HGER----CVDADKQEKLVEM 185
+ R CV ++ + ++ M
Sbjct: 374 YNTRQPEACVAVEEGKDVLVM 394
>gi|148878418|gb|AAI46056.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Bos
taurus]
gi|296487792|tpg|DAA29905.1| TPA: polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 43/124 (34%), Positives = 69/124 (55%), Gaps = 12/124 (9%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++L+C +FSWF+ +V +
Sbjct: 428 QVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERLNCHNFSWFLDNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
++ P+L P F+G KNLG CLD + P ++ T CHG G +Q F +
Sbjct: 488 MFV--PDLKPT-FFGALKNLGVDHCLDVGENNNGGKPLILYT--CHGLGGNQYFEYTTQR 542
Query: 166 QLGH 169
L H
Sbjct: 543 DLRH 546
>gi|354475881|ref|XP_003500155.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Cricetulus griseus]
Length = 559
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 70/141 (49%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF+K V +++
Sbjct: 359 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKKLRDKLQCKDFRWFLKTVYPELHV-- 416
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G ++ CLD V G CHG G +Q F ++ ++
Sbjct: 417 PEDRPGFF-GMLQNKGLREYCLDYNPPNENQVDGHQVLLYLCHGMGQNQFFEYTSQKEIR 475
Query: 169 HGER----CVDADKQEKLVEM 185
+ R CV ++ + ++ M
Sbjct: 476 YNTRQPEACVAVEEGKDVLVM 496
>gi|313233395|emb|CBY24510.1| unnamed protein product [Oikopleura dioica]
Length = 679
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 18/142 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A + GDIS++ +R++L CK FSWF+++V +P
Sbjct: 489 RAAEVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENV-------YP 541
Query: 115 ELP-PN---LFWGEAK--NLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEG-Q 166
EL PN + WG N G ++C+ G T +G CHG G +Q F L EG +
Sbjct: 542 ELRIPNKDAIGWGAVSQTNKGLEECI---GNTHGGGTLGMYRCHGDGGNQEFTLTKEGKE 598
Query: 167 LGHGERCVDADKQEKLVEMLKL 188
H + C+ + +E + +K
Sbjct: 599 FRHNDLCIGYNAKEPVGNPVKF 620
>gi|195590539|ref|XP_002085003.1| GD14569 [Drosophila simulans]
gi|194197012|gb|EDX10588.1| GD14569 [Drosophila simulans]
Length = 590
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 63/257 (24%), Positives = 106/257 (41%), Gaps = 43/257 (16%)
Query: 21 QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
LFR + Q+ + + +++K++IS R WL +K P + +
Sbjct: 350 HLFR-DGNFQVRYTNKDKNSEKKLISRNYR----RVAEIWLDEYKDKLYANMPHLTVIPV 404
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD--- 135
G+++EQ AL+++LHCK F WF+ ++A D +P L P + G ++ K CLD
Sbjct: 405 GNLAEQRALKNRLHCKPFKWFLDNLATDFLNLYPILDPAEYASGVLQSRALPKLCLDRKD 464
Query: 136 -SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCV 194
S G+ A + + S Q + L +L G C++
Sbjct: 465 PSHGQPKLAPCSSDHVFPS-PEQYWSLTNHRELRSGFYCLEVRNH--------------- 508
Query: 195 LAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQ 251
G ++++ C SG + W F + T Q+ + + CL PE N
Sbjct: 509 -----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQEQNSRHCLEAQPELNA 555
Query: 252 LAMLRCDENNSYQQWRF 268
+ CD N QQW+F
Sbjct: 556 VTSSVCDPKNHKQQWKF 572
>gi|339234661|ref|XP_003378885.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
gi|316978493|gb|EFV61475.1| putative RecF/RecN/SMC N domain protein [Trichinella spiralis]
Length = 1819
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 61/125 (48%), Gaps = 3/125 (2%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ +K+ L P +D GD+++Q +R +L C F WFMK+V +D +P
Sbjct: 1613 RVAETWMDEYKEYLYLRMPRLRNVDPGDLTKQREVRQRLKCNGFDWFMKNVMFDQPKHYP 1672
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH--GE 171
+ PP++ G KN T +C+D + TS C Q F L ++ GE
Sbjct: 1673 LVEPPDVMSGRIKNKRTGRCMDVKNAGLQTELTTSQCKDYLKRQEFVLTWRSEIFQRLGE 1732
Query: 172 RCVDA 176
C+D+
Sbjct: 1733 NCLDS 1737
>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 619
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/220 (25%), Positives = 91/220 (41%), Gaps = 36/220 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ +P + GD+++Q ALR+KL CK F WF++ VA D+ ++P
Sbjct: 406 RVAEVWMDEYKQFLYERNPQFDKTNPGDLTKQKALREKLKCKPFKWFLEEVAPDLLVRYP 465
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
P F G +++ K CLD++ A IG C + + +Q F L + +
Sbjct: 466 LREPLPFASGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQNNQFFTLTYYRDIR 525
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
E+C+DA V LFNC S G + WR
Sbjct: 526 AASVEKCLDASSDNAEV--------------------------ILFNCHESQG--NQLWR 557
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ T+ + H R + ++ + +CD Q+W
Sbjct: 558 YDQETQMIMHGKPSRDQCLDLVGKKVVVSKCDNRKKTQRW 597
>gi|157107408|ref|XP_001649763.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884049|gb|EAT48274.1| AAEL000646-PA [Aedes aegypti]
Length = 582
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 94/222 (42%), Gaps = 40/222 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ +P D GD+++Q ALR++L CK F WF++ VA D+ K+P
Sbjct: 369 RVAEVWMDEYKQFLFERNPEYDKTDAGDLTKQKALRERLQCKPFKWFLEEVAPDLLLKYP 428
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSG----SSQLFRLNA--EGQ 166
P F G ++L CLDS+ IG C + S+Q F L+ + +
Sbjct: 429 LRDPKPFASGRVQSLANPILCLDSLNHKEKEPIGVFSCAANKTHPQSNQFFTLSYFRDIR 488
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+ ++C+DA + G+ + LFNC G + W+
Sbjct: 489 VASVDKCLDAASE--------------------------GSEVRLFNCHEIQG--NQLWQ 520
Query: 227 FHNSTRQLEHAI--LKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ + + H + +CL + ++ + +CD Q W
Sbjct: 521 YDMKNQMIRHGMPSWNQCLDL--VKQKITISKCDPTKKTQHW 560
>gi|313246954|emb|CBY35800.1| unnamed protein product [Oikopleura dioica]
Length = 696
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 18/142 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A + GDIS++ +R++L CK FSWF+++V +P
Sbjct: 506 RAAEVWMDEYKEFYFAAVPSAKMVKFGDISKRTEVRERLQCKSFSWFLENV-------YP 558
Query: 115 ELP-PN---LFWGEAK--NLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEG-Q 166
EL PN + WG N G ++C+ G T +G CHG G +Q F L EG +
Sbjct: 559 ELRIPNKDAIGWGAVSQTNKGLEECI---GNTHGGGTLGMYRCHGDGGNQEFTLTKEGKE 615
Query: 167 LGHGERCVDADKQEKLVEMLKL 188
H + C+ + +E + +K
Sbjct: 616 FRHNDLCIGYNAKEPVGNPVKF 637
>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Taeniopygia guttata]
Length = 631
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 62/235 (26%), Positives = 103/235 (43%), Gaps = 44/235 (18%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GDIS+++ LR +L CK+F+W++
Sbjct: 428 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRIDLRQRLQCKNFTWYL 487
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+V + Y P+L P LF G KN+G + CLD +G P ++ + CHG G +Q
Sbjct: 488 SNVYPEAYV--PDLNP-LFSGYLKNIGNRMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 541
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFN 213
F +A ++ H E C+ A K PV C G K
Sbjct: 542 FEYSAHHEIRHNIQKELCLHASKG---------PVQLRECTYKGQKTF------------ 580
Query: 214 CFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ W H + L + L CL+ + E LA C+ ++ +Q+W F
Sbjct: 581 -----AVGEEQW-LHQKDQTLYNEALHMCLTGNGEHPSLA--SCNPSDPFQKWIF 627
>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
Length = 619
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 61/226 (26%), Positives = 102/226 (45%), Gaps = 40/226 (17%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K I RS + +A GDIS+++ LR+ L CK+FSW++ +V +
Sbjct: 423 QVRLAEVWMDDYKLIFYRRSQSAAKMAKEKGFGDISDRLKLREDLQCKNFSWYLSNVYPE 482
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
+ P+L P F G KN G Q+CLD S P ++ T CH G +Q F +
Sbjct: 483 AFV--PDLSPVKF-GALKNRGAQQCLDVGESNNGGKPVIMYT--CHNMGGNQYFEYTSHK 537
Query: 166 QLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
+L H + C+ A + V+ + C L G SS +
Sbjct: 538 ELRHNIGKQLCLQASPYPEPVK-----IELCTLKGK-----------------GSSLAPE 575
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + L++ +CL H + +++ M C+ ++ +QQW F
Sbjct: 576 QEWLI-TEEQLLKNPSSGKCL--HKKGDKILMSPCNPDDHHQQWAF 618
>gi|328712307|ref|XP_001942933.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
10-like [Acyrthosiphon pisum]
Length = 592
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 51/222 (22%), Positives = 95/222 (42%), Gaps = 32/222 (14%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + L P ++ GDI++Q +R+ L CK F WF+ VA+D+ K+P + PP
Sbjct: 395 WMDEYAEYLYLRRPHYRNINTGDITKQKKIRENLKCKPFKWFIHEVAFDLVEKYPPIEPP 454
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
++F G+ + TAP + C + S L + C+D D
Sbjct: 455 DVFKGKIRTF-----------TAPEL-----CLDATSENPLNL---------KECIDNDD 489
Query: 179 Q-EKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHA 237
+ +K + + + + + L ++L+ C N G + W + + +L
Sbjct: 490 ENQKFILSWRNDIKTRNNMCLDISDSSLKAKISLYGCHNGGG--NQLWHYDHGKMKLVQG 547
Query: 238 ILKRCLSVHPETNQLAMLRCDENNSYQQW---RFKEVRPDWS 276
RC+ T ++ + +C +NN Q+W F +WS
Sbjct: 548 KNPRCMEHDISTRKVYVTKCKQNNDDQKWIIENFNSKSTNWS 589
>gi|395824312|ref|XP_003785413.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Otolemur garnettii]
Length = 508
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 94/226 (41%), Gaps = 37/226 (16%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K++ +P A GD++E+ LR+KL CKDF WF++ V +++
Sbjct: 308 VRAAEVWMDDYKELYYHRNPRARLEPFGDVTERRQLREKLQCKDFKWFLETVFPELHV-- 365
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL- 167
PE P F G +N G +K C D V G CHG G +Q F ++ ++
Sbjct: 366 PEDRPGFF-GMLQNKGLKKYCFDYNPPDENQVAGRQVILYLCHGLGQNQFFEYTSQYEIR 424
Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
H E CV ++ ++ M C+ N G++
Sbjct: 425 YNTHHPEACVAVEEDSDILVMH---------------LCKEVAPENQKFILQEDGSL--- 466
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
FH +++ A K N +LR N+ YQ+W F+E
Sbjct: 467 --FHKQSKKCVQAAKKES-----SDNYEPLLRDCTNSDYQRWFFQE 505
>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oreochromis niloticus]
Length = 575
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 63/133 (47%), Gaps = 12/133 (9%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R WL FK+I +P A GD++E+ LR+KL CK F W++ ++ D++
Sbjct: 373 VRAAEVWLDEFKEIYYHRNPHARLEAFGDVTERRMLREKLGCKSFKWYLDNIYPDIHV-- 430
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
P P +F G KN G T C D V+G CHG G +Q F + G++
Sbjct: 431 PHDRPGMF-GMLKNRGKTNYCFDYNPTDENVVVGQRVILYLCHGMGQNQFFEYSVNGEIC 489
Query: 169 HGER----CVDAD 177
+ R C+ D
Sbjct: 490 YNTREPAGCIAGD 502
>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 603
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 56/98 (57%), Gaps = 11/98 (11%)
Query: 82 DISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW--GEAKNLGTQKCLDSM 137
D++ +V LR KL CK F W++++V +PEL P + F GE +N + CLD++
Sbjct: 446 DVTARVKLRSKLQCKSFQWYLENV-------YPELEIPDDKFLAAGEIRNPESGICLDTL 498
Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
G+ A +G CHG G +Q + N +G++ + C+D
Sbjct: 499 GKQEGAPVGLYACHGQGGNQYYTYNNKGEIKAEDNCMD 536
>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
Length = 621
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 27/207 (13%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
D GDI+ + ALR+KL CK F W++ ++ +++ + GE +NLG + CLDS
Sbjct: 419 DFGDITSRKALREKLGCKSFKWYLDNIYPELFIPGEAVAS----GEIRNLGIGGKTCLDS 474
Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
R + +G CH G +Q+ L+ E C+D+ K E L + L
Sbjct: 475 PARRSDLHKPVGLYPCHRQGGNQISVLDR-------ELCIDSPCKPEDLHNPIGLWPCHK 527
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSG--TVDGP---------WRFHNSTRQLEHAILKRC 242
++ + G + C + SG + P W ++ +++ L H +C
Sbjct: 528 QGGNQFWMYSKSGEIRRDEACLDYSGQEVILYPCHGSKGNQFWDYNANSKLLRHGSSDKC 587
Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFK 269
L+++ N+L M CDE + Q W +
Sbjct: 588 LAINEAKNKLLMEPCDEEATRQHWSLE 614
>gi|345307494|ref|XP_001507147.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Ornithorhynchus anatinus]
Length = 361
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 59/144 (40%), Gaps = 13/144 (9%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR L CKDF WFM VA+DV +P
Sbjct: 142 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 201
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGS------SQLFRLNAEGQL 167
+ PP WGE +N+ C+DS + C GS QLF +
Sbjct: 202 PVEPPPAAWGEIRNVVANLCVDSKHGATGTEVRLDVCVKDGSERTWSHEQLFTFGWREDI 261
Query: 168 GHGER------CVDADKQEKLVEM 185
GE C+DA V +
Sbjct: 262 RPGEPLHTRKFCLDAISHSSPVTL 285
>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
Length = 613
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 68/137 (49%), Gaps = 10/137 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A ++ GD++E+ LR+ L CK F W+++++ +P
Sbjct: 423 RTAEVWMDEYKAFFYKMVPAARNVEAGDVTERKKLRETLQCKSFKWYLENI-------YP 475
Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P P F G N T+KC+D+ G+ G CHG+G +Q + L +G++ +
Sbjct: 476 EAPLPADFRSLGAIVNRFTEKCVDTNGKKDGQPPGMQACHGAGGNQAWSLTGKGEIRSDD 535
Query: 172 RCVDADKQEKLVEMLKL 188
C+ + ++ LKL
Sbjct: 536 LCLSSGHVYQIGSELKL 552
>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 599
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 68/124 (54%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R++ + +A GDISE++ LR++LHC++FSW++ ++ +
Sbjct: 420 QVRLAEVWMDEYKEIFYRRNIQAAKMAREKSFGDISERLQLREQLHCRNFSWYLHNIYPE 479
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLG +CLD R +I CH G +Q F +
Sbjct: 480 MFV--PDLKPT-FYGAIKNLGIDQCLDVGENNRGGKPLIMYP-CHSLGGNQYFEYTTQRD 535
Query: 167 LGHG 170
L H
Sbjct: 536 LRHN 539
>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
Length = 611
Score = 67.8 bits (164), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 68/137 (49%), Gaps = 10/137 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A ++ GD++++ LR+ L CK F W+++++ +P
Sbjct: 423 RTAEVWMDEYKAFFYKMVPAAKNVEAGDVTDRKKLRETLQCKSFKWYLENI-------YP 475
Query: 115 ELP-PNLF--WGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
E P P F G N T+KC+D+ G+ G CHG+G +Q + L +G++ +
Sbjct: 476 EAPLPADFRSLGSIVNRFTEKCIDTNGKKDGQAPGMQACHGAGGNQAWSLTGKGEIRSDD 535
Query: 172 RCVDADKQEKLVEMLKL 188
C+ + ++ LKL
Sbjct: 536 LCLSSGHVYQIGSELKL 552
>gi|313220438|emb|CBY31291.1| unnamed protein product [Oikopleura dioica]
Length = 354
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 47/124 (37%), Positives = 61/124 (49%), Gaps = 17/124 (13%)
Query: 58 SSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCK-DFSWFMKHVAY---DVYTKF 113
W K F R P L GDISEQV R HC +F+W M+ +AY D+Y
Sbjct: 165 DDWTKYFYWNR----PELQKLPYGDISEQVKFRHD-HCPYNFTWMMEEIAYGITDMYDAP 219
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG---RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL L WGE + G+ C+DSM PA YCH G +QLFR++A G +
Sbjct: 220 NEL---LAWGEIRGKGSNICVDSMAHKDNDGPA--EAWYCHKQGGNQLFRIHAGGFIAQN 274
Query: 171 ERCV 174
+C+
Sbjct: 275 LQCI 278
>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 533
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 100/217 (46%), Gaps = 38/217 (17%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQK-CL 134
D GDIS + LR+KL CK F W++K++ +PEL ++ GE +N+G ++ CL
Sbjct: 325 DYGDISGRKELREKLKCKSFDWYLKNI-------YPELFIPGESVAHGEIRNIGFERTCL 377
Query: 135 DSMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
DS R + +G CH G +Q+ ++ + CVD+ E +K PV
Sbjct: 378 DSPTRKSDHHKPVGLYPCHRQGGNQIANPSS-------DMCVDSAAGP---EDMKKPVNP 427
Query: 193 CVLAGV----KLIFCRLGTVLNLFNCFNSSG--TVDGP---------WRFHNSTRQLEHA 237
G ++ + G + C + SG V P W + +T+ ++H
Sbjct: 428 WPCHGEYGNQYWMYSKNGEIRRDETCLDYSGHDVVLYPCHGAKGNQLWLYDPTTKLIKHG 487
Query: 238 ILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPD 274
++C+++ + +++ M C+E + QQW + D
Sbjct: 488 SSEKCMAISRKKDKIVMETCNERENRQQWNMENFNAD 524
>gi|167520406|ref|XP_001744542.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776873|gb|EDQ90491.1| predicted protein [Monosiga brevicollis MX1]
Length = 702
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 70/123 (56%), Gaps = 5/123 (4%)
Query: 53 ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
+LR +W+ +KQI + + +D GD+S++ A+RD+L CK F W++ +V D+Y
Sbjct: 421 DLRVAEAWMDEYKQI-VMDTRGFEGVDPGDLSDRKAIRDRLQCKSFKWYLDNVHPDLYV- 478
Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
P+L P + G + ++C+D+M +G CHG ++Q +RL +G + +G
Sbjct: 479 -PDLHPKMA-GVIADPQKRQCIDNMQHDRGGPVGVYGCHGL-TTQRWRLMEDGVIANGVS 535
Query: 173 CVD 175
C++
Sbjct: 536 CLN 538
>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 608
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 92/220 (41%), Gaps = 36/220 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +KQ +P + GD+++Q ALR++L CK F WF++ VA D+ ++P
Sbjct: 395 RVAEVWMDEYKQFLYERNPQFDKTNPGDLTKQKALRERLKCKPFKWFLEEVAPDLLVRYP 454
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNA--EGQ 166
P F G +++ K CLD++ A IG C + + +Q F L + +
Sbjct: 455 LREPLPFASGRVQSVANPKLCLDTLNHKAKEPIGVFGCAPNKTHPQNNQFFTLTYYRDIR 514
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
E+C+DA + V LFNC S G + WR
Sbjct: 515 AASVEKCLDASSDDAEV--------------------------ILFNCHESQG--NQLWR 546
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ T+ + H R + ++ + +CD Q+W
Sbjct: 547 YDQETQMIMHGKPSRDQCLDLVGKKVVVSKCDNRKKTQRW 586
>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
Length = 454
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 8/118 (6%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD-VYTKFPELPP 118
W+ +K P +D GDIS+++ALR +L CK F W++++VA D V TK
Sbjct: 313 WMDEYKDWLYDKRPEIKSVDYGDISDRIALRKRLKCKSFKWYLENVANDTVKTKLCAC-- 370
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ +N G CLDSMGR +G + CH G +Q F+ +L E C D
Sbjct: 371 ----FQVRNQGKNMCLDSMGRKD-GHVGLASCHNMGGNQAFQYTYIRELRTDETCFDV 423
>gi|313230492|emb|CBY18708.1| unnamed protein product [Oikopleura dioica]
Length = 644
Score = 67.4 bits (163), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 50/222 (22%), Positives = 91/222 (40%), Gaps = 32/222 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W + + P A +D GD++E A+R++L CK F WFMK + YD+ F
Sbjct: 416 VRVVEVWWDEYSKYFYERKPNAKHVDPGDLTEVKAVRERLQCKSFDWFMKEIGYDLPKHF 475
Query: 114 PELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
P + P N G N+GT C+D H +G L+ + G+
Sbjct: 476 PLVEPSNAARGHITNVGTNLCVDG--------------HHAGQGDQLALSECASVREGQW 521
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCR--------LGTVLNLFNCFNSSGTVDGP 224
+ + + P++ + K R +G + + C G +
Sbjct: 522 MLTWHEDIR-------PLSGSNVENKKASDARKVCWDGTGVGNPIGFWECHTQGG--NQL 572
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+++ +Q H+ + C + H E ++ + CD++N Q+W
Sbjct: 573 FKYVPDKKQFIHSPSRACATAHEEQKKVYLKACDDSNDNQKW 614
>gi|72000997|ref|NP_001024216.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
gi|51316004|sp|Q8I136.2|GALT4_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
Short=pp-GaNTase 4; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 4; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|3047189|gb|AAC13670.1| GLY4 [Caenorhabditis elegans]
gi|11064525|emb|CAC14394.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
Length = 589
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 72/144 (50%), Gaps = 22/144 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K I P A F++ GDI++++A+RD+L CK F W++++V +P
Sbjct: 398 RAAEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLENV-------YP 450
Query: 115 EL------PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF---RLNAEG 165
+L P F + NL CLDSM R G CHG+G +Q + +L
Sbjct: 451 QLEIPRKTPGKSFQMKIGNL----CLDSMARKESEAPGLFGCHGTGGNQEWVFDQLTKTF 506
Query: 166 QLGHGERCVD--ADKQEKLVEMLK 187
+ + C+D ++ + K V M+K
Sbjct: 507 KNAISQLCLDFSSNTENKTVTMVK 530
>gi|350592356|ref|XP_003359108.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Sus scrofa]
Length = 209
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 50/106 (47%), Gaps = 1/106 (0%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GD+S Q LR +L CKDF WFM VA+DV +P
Sbjct: 76 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 135
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
+ PP WGE +N+ C+DS + C GS + +
Sbjct: 136 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDICVKDGSERTW 181
>gi|260836359|ref|XP_002613173.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
gi|229298558|gb|EEN69182.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
Length = 539
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 69/138 (50%), Gaps = 14/138 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A + G+I ++ LR KL CK F+W+++HV +P
Sbjct: 350 RAAEVWMDEYKEYYYAAVPSARNVPFGNIQSRLELRKKLSCKPFAWYLEHV-------YP 402
Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT C+D++G A +G CHGSG +Q + L + + H
Sbjct: 403 ELRIPDKKDVAFG-ALQQGTL-CMDTLGHFADGTVGVYECHGSGGNQEWALTKDKSIRHS 460
Query: 171 ERCVDADKQEKLVEMLKL 188
+ C+ Q E+LKL
Sbjct: 461 DLCLTVVNQNP-GELLKL 477
>gi|221130543|ref|XP_002162500.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Hydra magnipapillata]
Length = 578
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 69/138 (50%), Gaps = 13/138 (9%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A + GDI +++ LR L+CK F W+++++ +P
Sbjct: 389 RAAEVWMDDYKKYYFAAVPSARYSLFGDIRDRLQLRKDLNCKSFKWYLENI-------YP 441
Query: 115 ELP-PN---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL P+ + +G+ K ++ CLD+MG IG CHG G +Q + Q+ H
Sbjct: 442 ELKVPDDDVIKYGQIKYKVSEDCLDTMGHIKGEGIGLFPCHGQGGNQDWSWTKSNQIKHE 501
Query: 171 ERCVD--ADKQEKLVEML 186
C+ + K E++V M+
Sbjct: 502 SLCLSGISKKSEEIVRMV 519
>gi|432934600|ref|XP_004081948.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Oryzias
latipes]
Length = 600
Score = 67.4 bits (163), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 61/231 (26%), Positives = 102/231 (44%), Gaps = 36/231 (15%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + +A GDIS++ LR++L CK+FSW++
Sbjct: 400 QVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKEETFGDISKRKDLRERLQCKNFSWYL 459
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
K++ +++ P+L P LF G KN+G CLD+ Y CHG G +Q F
Sbjct: 460 KNIYPEIF--MPDLNPLLF-GSVKNVGKASCLDAGENNEGGKELIMYPCHGLGGNQYFEY 516
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC-FNSSGT 220
+ ++ H + Q++L C+ G V+ L C + T
Sbjct: 517 STHREVRH-------NIQKEL----------CLHGA--------GGVVKLEECQYKGRNT 551
Query: 221 VDGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
G W ++ + CL+ H + LA RC+ ++ YQQW F
Sbjct: 552 FVGAEQKWELKDN-QLFYFPGGNMCLTAHQDHPSLA--RCNPSDRYQQWSF 599
>gi|380021258|ref|XP_003694487.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis florea]
Length = 537
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 55/211 (26%), Positives = 92/211 (43%), Gaps = 31/211 (14%)
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDSMG 138
GD+SE+ ALR +L CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 337 GDVSERKALRKRLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDSPA 392
Query: 139 RTA----PAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
R A PA G CH G +Q+ L + C+D+ K E L + + L
Sbjct: 393 RKADLHKPA--GLYPCHRQGGNQIRHLVS-------SMCIDSPGKPEDLHQPVGLYPCHR 443
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
+ + G + +C + SGT + W ++ ++Q+ H +C
Sbjct: 444 QGGNQYWMLSKTGEIRRDESCLDYSGTDVILYPCHGSKGNQQWIYNPQSKQIRHGSSDKC 503
Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
L++ +L M C + + Q+W F+ P
Sbjct: 504 LAITESKQRLIMEECSPSAARQRWSFENYDP 534
>gi|344271584|ref|XP_003407617.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12-like [Loxodonta
africana]
Length = 576
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 96/229 (41%), Gaps = 43/229 (18%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 376 VRAAEVWMDEYKELYYHRNPHARLEPFGDVTERRQLRAKLQCKDFRWFLETVYPELHV-- 433
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G ++ C D + G +CHG G +Q F ++ ++
Sbjct: 434 PEDRPG-FFGMLQNKGLKEYCFDYNPPKDDQISGHQVLLYHCHGMGQNQFFEYTSQNEIR 492
Query: 169 HGER----CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
+ R CV A + ++ M C+ N F G+
Sbjct: 493 YNTRQPEGCVSAPAESDILFME---------------LCKETAPENQKFIFREDGS---- 533
Query: 225 WRFHNSTRQLEHAILKRCLSVH---PETNQLAMLRCDENNSYQQWRFKE 270
L H + K+C+ P + + +LR N+ Q+W FKE
Sbjct: 534 ---------LFHVLSKKCVQAERKAPGDSVVPLLRDCTNSERQKWFFKE 573
>gi|296484977|tpg|DAA27092.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
Length = 320
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 36/106 (33%), Positives = 50/106 (47%), Gaps = 1/106 (0%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GD+S Q LR +L CKDF WFM VA+DV +P
Sbjct: 211 RVAETWMDEFAEYIYQRRPEYRHLSTGDLSAQKELRRQLKCKDFKWFMAAVAWDVPKYYP 270
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
+ PP WGE +N+ C+DS + C GS + +
Sbjct: 271 PVEPPPAAWGELRNVAANLCVDSKHGATGTELRLDVCVKDGSERTW 316
>gi|395515411|ref|XP_003761898.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Sarcophilus harrisii]
Length = 590
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 64/232 (27%), Positives = 90/232 (38%), Gaps = 49/232 (21%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK+I + A GDI+E+ LRDKL CKDF WF+++V +++
Sbjct: 391 VRAAEVWMDEFKEIYYHRNMHARKEPYGDITERKELRDKLKCKDFRWFLENVYPELH--I 448
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G N G C D + + G CHG G +Q F + +L
Sbjct: 449 PEDRPGYF-GMLVNRGMADYCFDYNPPSESEITGNQVILYLCHGMGQNQFFEYTSHNELR 507
Query: 169 HGER----CVDADKQEKLVEMLKLPVTFC---VLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
+ R C D V L + C V K IF GT+
Sbjct: 508 YNTRQPEACAAVD-----VGTDHLTMHLCYKNVPENQKFIFKEDGTLF------------ 550
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWRFKE 270
H K+C+ + N +LR N+ YQ+W FKE
Sbjct: 551 --------------HVQSKKCVQAEANSSNGNPAPLLRTCTNSEYQKWFFKE 588
>gi|449275388|gb|EMC84260.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Columba livia]
Length = 632
Score = 67.0 bits (162), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 104/233 (44%), Gaps = 40/233 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GDIS+++ LR +L CK+F+W++
Sbjct: 429 QVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLDLRQRLQCKNFTWYL 488
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+V + Y P+L P LF G KN G + CLD +G P ++ + CHG G +Q
Sbjct: 489 SNVYPEAYV--PDLNP-LFSGYLKNTGNRMCLD-VGENNHGGKPLIMYS--CHGLGGNQY 542
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A ++ H E C+ A K V+L C +
Sbjct: 543 FEYSAHHEIRHNIQKELCLHASK-----------------GPVQLRECH-------YKGQ 578
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ + W H + L +A L CL+ + E LA C+ ++ +Q+W F
Sbjct: 579 KTFAVGEEQW-LHQKDQTLYNAALHMCLTGNGEHPSLA--SCNPSDPFQKWIF 628
>gi|307203928|gb|EFN82835.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Harpegnathos saltator]
Length = 482
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/211 (26%), Positives = 88/211 (41%), Gaps = 27/211 (12%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
+ GD+SE+ LR KL CK F W++ +V +++ + GE +NLG CLDS
Sbjct: 280 NYGDVSERKTLRKKLGCKSFKWYLDNVYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 335
Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
R A G CH G +Q+ R A G C+D+ K E L + + L
Sbjct: 336 PARKADLHKPCGLYPCHRQGGNQI-RQVASGI------CIDSPGKSEDLHQPVGLYPCHR 388
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
+ + G + +C + SG+ + W ++ T + H +C
Sbjct: 389 QGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWIYNPQTNHIRHGSSDKC 448
Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
L++ QL M C N Q+W F+ P
Sbjct: 449 LAITESKQQLVMEECSPNAPRQKWSFENYDP 479
>gi|432098843|gb|ELK28338.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Myotis davidii]
Length = 428
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 91/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 222 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTQIAWDLPKFYPPVEPP 281
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +++GT C D+ A + + C G G + Q+F + G
Sbjct: 282 AAAWGEIRSVGTGLCADTKHGAAGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 339
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + WR+ +
Sbjct: 340 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWRYRKD-K 378
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 379 SLYHPVSGSCMDCSESDHRIFMNICNPSSPTQQWLFEHT 417
>gi|47226381|emb|CAG09349.1| unnamed protein product [Tetraodon nigroviridis]
Length = 631
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 65/231 (28%), Positives = 97/231 (41%), Gaps = 10/231 (4%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVIS + L + ++F + + LA GDISE++ R +L CK FSW++
Sbjct: 405 QVISRNQVRLAEVWMDDYKEIFYRRNQQAAQLARDKAFGDISERLDFRVRLRCKSFSWYL 464
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
K++ + + P+L P L +G KN+G CLD+ Y CHG G +Q F
Sbjct: 465 KNIYPEAF--IPDLNP-LSFGSVKNVGKDSCLDAGENNEGGKKLIMYPCHGLGGNQYFEY 521
Query: 162 NAEGQLGHG---ERCVD-ADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNS 217
+ ++ H E C+ A +L E V A K + V L +
Sbjct: 522 STHHEIRHNIQKELCLHGAAGAVRLEECQYKGRNTLVGAEQKWQLKDVSLVSLLLERSHF 581
Query: 218 SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
S W F R + L +CLS + LA C+ + YQ W F
Sbjct: 582 SFKTSICWLFCTQNRLVYIPALNKCLSARQDHPFLAA--CNPMDRYQLWVF 630
>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 512
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 57/215 (26%), Positives = 83/215 (38%), Gaps = 42/215 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A DMG I +VALR +L+C F W+MK+V +P
Sbjct: 333 RAAEVWMDNYKDYYYSARPSAKGRDMGSIKSRVALRKRLNCTTFDWYMKNV-------YP 385
Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSS-QLFRLNAEGQLGHGE 171
EL P + K CLD++G A +G C S Q + + +G + H
Sbjct: 386 ELSVPSSTNNKHGKLKQNNLCLDTLGHQAGEPVGLQDCQQSRQGYQDWSIAMKGLIRHLN 445
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C++A Q V L +C + N W H
Sbjct: 446 LCLEARGQI-----------------VHLQYCSKDHIQN--------------WE-HTKN 473
Query: 232 RQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ H K CLS N+L + CD+ ++ QQW
Sbjct: 474 NHIVHLSTKLCLSSRHSLNELTLETCDDKSNNQQW 508
>gi|393910975|gb|EJD76111.1| glycosyl transferase, variant [Loa loa]
Length = 549
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 39/131 (29%), Positives = 61/131 (46%), Gaps = 3/131 (2%)
Query: 30 QLGHGERC--VDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
++GH RC V I S R W+ + + P + +D GD+S+Q
Sbjct: 378 RVGHIYRCKYVPFPDPGIGDFISKNYRRVAEVWMDEYAKFLYKRRPPLLTVDFGDLSKQK 437
Query: 88 ALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
A+R +L CK F WFMK +A+D +P + P + GE +N+ KC+D+ + A G
Sbjct: 438 AVRKRLKCKSFDWFMKEIAFDQEKFYPAIEPEDSAEGELRNVAANKCVDTHFKNAEEKFG 497
Query: 147 TSYCHGSGSSQ 157
C G +
Sbjct: 498 LRKCVSDGPAN 508
>gi|426226648|ref|XP_004007451.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6 [Ovis aries]
Length = 792
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 68/124 (54%), Gaps = 12/124 (9%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ LR++L+C++FSWF+ ++ +
Sbjct: 598 QVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERLNCRNFSWFLDNIYPE 657
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
++ P+L P F G KNLG CLD + P ++ CHG G +Q F +
Sbjct: 658 MFV--PDLKPTFF-GALKNLGVDHCLDVGENNNGGKPLIL--YACHGLGGNQYFEYTTQR 712
Query: 166 QLGH 169
L H
Sbjct: 713 DLRH 716
>gi|432882425|ref|XP_004074024.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Oryzias latipes]
Length = 549
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 94/230 (40%), Gaps = 42/230 (18%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+I +P A GD+SE++ LR+KL CK+F W++ +V +++
Sbjct: 347 VRAAEVWMDDYKEIYYHRNPHARLEAFGDVSERLKLREKLGCKNFQWYLDNVYPEIHV-- 404
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
P P +F G KN G C D ++G CHG G +Q F + +G++
Sbjct: 405 PHDRPGMF-GMLKNRGKVNYCFDYNPADENTIVGQRVILYPCHGMGQNQFFEYSVDGEIC 463
Query: 169 HGER----CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
+ R C+ D + T L + C V
Sbjct: 464 YNTREPAGCIVGDS--------------------------VNTYLTMQLCKKRGQPVPME 497
Query: 225 WRF-HNSTRQLEHAILKRCLSVHPETNQ---LAMLRCDENNSYQQWRFKE 270
+F L HA+ ++C+ T+ L+ ++ YQQW F+E
Sbjct: 498 QKFVFRKDGSLYHALSQKCVQAMDRTDNGTPAPALQPCSDSVYQQWFFEE 547
>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
norvegicus]
gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
norvegicus]
gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
Length = 603
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ T + + C G G S Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCTDTKHGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590
>gi|308456556|ref|XP_003090710.1| hypothetical protein CRE_13795 [Caenorhabditis remanei]
gi|308260992|gb|EFP04945.1| hypothetical protein CRE_13795 [Caenorhabditis remanei]
Length = 487
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 69/143 (48%), Gaps = 22/143 (15%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K I P A F++ GDIS+++A+RD+L CK F W++ V +P
Sbjct: 180 RAAEVWMDEYKAIYLKNVPSARFVNYGDISDRLAIRDRLQCKSFKWYLDTV-------YP 232
Query: 115 EL-PPNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLFRL--------NAE 164
+L P G++ + CLDSM R G CHG+G +Q + NA
Sbjct: 233 QLVVPKKTPGKSLQMKMGHLCLDSMARKEDDAPGLFACHGTGGNQEWVFDPLTKTFKNAI 292
Query: 165 GQLGHGERCVDADKQEKLVEMLK 187
QL C+D ++K V M+K
Sbjct: 293 TQL-----CLDFAAEKKAVLMVK 310
>gi|148672125|gb|EDL04072.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 436
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 42/226 (18%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + + + GDISE++ LR++L C +FSW++ +V +
Sbjct: 242 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPE 301
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 302 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 357
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
L H G +L G+ L L +C NS D
Sbjct: 358 LRHN-------------------------IGKQLCLHASGSTLGLRSCQFVGKNSRVPKD 392
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + ++ CL+ + + AM C+ + YQ W F
Sbjct: 393 EEWEL-TQDQLIRNSGSGTCLT--SQDKKPAMAPCNPRDPYQLWLF 435
>gi|115533032|ref|NP_001041036.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
gi|182676440|sp|O45947.3|GLT10_CAEEL RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
10; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 10; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|3880991|emb|CAA16378.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
Length = 684
Score = 66.2 bits (160), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 89/220 (40%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P D GD+ +R+KL CK F WFMK +A+D +P + P
Sbjct: 474 WMDDYKETLYKHRPGVGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 533
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
GE +N+GT C+D+ + G C G Q RL + R
Sbjct: 534 ASAEGEIRNVGTNFCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDIRPKGRKI 593
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C D + K PV LF+C + G + +++ + +
Sbjct: 594 CFDCS-----TSVDKAPVI-------------------LFDCHSMKG--NQLFKYRVAQK 627
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
Q+ H I +CL+ L M +CD ++ Q+W ++ V
Sbjct: 628 QIYHPISGQCLTADENGKGFLHMKKCDSSSDLQKWAWQTV 667
>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 622
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 42/226 (18%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + + + GDISE++ LR++L C +FSW++ +V +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 543
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
L H G +L G+ L L +C NS D
Sbjct: 544 LRHN-------------------------IGKQLCLHASGSTLGLRSCQFVGKNSRVPKD 578
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + ++ CL+ + + AM C+ + YQ W F
Sbjct: 579 EEWEL-TQDQLIRNSGSGTCLTS--QDKKPAMAPCNPRDPYQLWLF 621
>gi|307172175|gb|EFN63700.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Camponotus floridanus]
Length = 433
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 54/211 (25%), Positives = 88/211 (41%), Gaps = 27/211 (12%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
+ GD+SE+ LR KL CK F W++ ++ +++ + GE +NLG CLDS
Sbjct: 231 NYGDVSERKTLRKKLGCKSFKWYLDNIYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 286
Query: 137 MGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKLPVTFC 193
R A G CH G +Q+ ++ A G C+D+ K E L + + L
Sbjct: 287 PARKADLHKPCGLYPCHRQGGNQIRQI-ASGI------CIDSPGKSEDLHQPVGLYPCHR 339
Query: 194 VLAGVKLIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAILKRC 242
+ + G + +C + SG+ + W + T + H +C
Sbjct: 340 QGGNQYWMLSKTGEIRRDESCLDYSGSDVILYPCHGSKGNQQWIYSPQTNHIRHGSSDKC 399
Query: 243 LSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
L++ QL M C N Q+W F+ P
Sbjct: 400 LAITESKQQLVMEECSPNAPRQRWSFENYDP 430
>gi|15530299|gb|AAH13945.1| GALNT12 protein [Homo sapiens]
Length = 272
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 72 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 129
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 130 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 187
>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
Length = 622
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/226 (26%), Positives = 99/226 (43%), Gaps = 42/226 (18%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + + + GDISE++ LR++L C +FSW++ +V +
Sbjct: 428 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQLRCHNFSWYLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 488 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 543
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNC----FNSSGTVD 222
L H G +L G+ L L +C NS D
Sbjct: 544 LRHN-------------------------IGKQLCLHASGSTLGLRSCQFVGKNSRVPKD 578
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + ++ CL+ + + AM C+ + YQ W F
Sbjct: 579 EEWEL-TQDQLIRNSGSGTCLTS--QDKKPAMAPCNPRDPYQLWLF 621
>gi|10437274|dbj|BAB15027.1| unnamed protein product [Homo sapiens]
Length = 284
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 84 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 141
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F+G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 142 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 199
>gi|115533034|ref|NP_001041037.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
gi|87251651|emb|CAJ76949.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
Length = 622
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 89/220 (40%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P D GD+ +R+KL CK F WFMK +A+D +P + P
Sbjct: 412 WMDDYKETLYKHRPGVGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 471
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
GE +N+GT C+D+ + G C G Q RL + R
Sbjct: 472 ASAEGEIRNVGTNFCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDIRPKGRKI 531
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C D + K PV LF+C + G + +++ + +
Sbjct: 532 CFDCS-----TSVDKAPVI-------------------LFDCHSMKG--NQLFKYRVAQK 565
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
Q+ H I +CL+ L M +CD ++ Q+W ++ V
Sbjct: 566 QIYHPISGQCLTADENGKGFLHMKKCDSSSDLQKWAWQTV 605
>gi|62148924|dbj|BAD93346.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 581
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 381 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 438
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 439 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 496
>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
Length = 693
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 89/229 (38%), Gaps = 78/229 (34%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW-GEAKNLG------ 129
D GD+S++ LR+ L CK F W++ T +PEL P N GE +NLG
Sbjct: 492 DFGDVSDRKKLREDLQCKSFQWYLD-------TIYPELFIPGNAVANGEIRNLGYGGRTC 544
Query: 130 -------------------------------TQKCLDSMGRTA-PAVIGTSYCHGSGSSQ 157
T CLD+M + + +G CHG G Q
Sbjct: 545 LDSPSGKRNLKKAVGLYPCHKQGGNQIRNINTNMCLDAMLKNEDESPVGVYECHGQGGHQ 604
Query: 158 LFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNS 217
+ L+ G++ + C+D AG +I LF C S
Sbjct: 605 YWMLSKAGEIRRDQACLD-------------------YAGKDVI---------LFGCHGS 636
Query: 218 SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
G + W++ +T+ L H +CL+++ ++L M CD ++ Q W
Sbjct: 637 RG--NQFWQYRENTKLLHHGSSGKCLAINESRDKLIMEECDASHLRQHW 683
>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
Length = 398
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 58/105 (55%), Gaps = 4/105 (3%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+I P D GD+S + ALR++L CK F W++ +V ++
Sbjct: 297 MRLAEVWMDDYKKIFYAMRPQLKGKDYGDVSGRKALRERLMCKSFKWYLDNVISEL--AI 354
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQ 157
P+L P + GE +NLGT CLD++ + A G CHG G++Q
Sbjct: 355 PDLYP-IGRGEIRNLGTNTCLDTLAKNEAGGEPGMYMCHGMGNNQ 398
>gi|119579301|gb|EAW58897.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12) [Homo
sapiens]
Length = 517
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 61/122 (50%), Gaps = 8/122 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 317 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 374
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 375 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 433
Query: 169 HG 170
+
Sbjct: 434 YN 435
>gi|114625882|ref|XP_001157326.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
troglodytes]
Length = 483
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 61/122 (50%), Gaps = 8/122 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 283 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 340
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 341 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 399
Query: 169 HG 170
+
Sbjct: 400 YN 401
>gi|341894191|gb|EGT50126.1| CBN-GLY-4 protein [Caenorhabditis brenneri]
Length = 584
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 68/138 (49%), Gaps = 12/138 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K I P A F++ GDI++++A+RD+L CK F W++ V +P
Sbjct: 395 RAAEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCKSFKWYLDTV-------YP 447
Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGRTAPAVIGTSYCHGSGSSQLF---RLNAEGQLGH 169
+L P G++ + CLDSM R G CHG+G +Q + L +
Sbjct: 448 QLEVPKKAAGKSVQVKMGHHCLDSMARKENEAPGLFACHGTGGNQEWVFDHLTKTFKNAI 507
Query: 170 GERCVDADKQEKLVEMLK 187
+ C+D + +K V M+K
Sbjct: 508 TQLCLDFAEDKKPVMMVK 525
>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 683
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 100/225 (44%), Gaps = 50/225 (22%)
Query: 54 LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+R WL +KQ + +G L D GD+S + LRD L C+ F W+++ T
Sbjct: 485 VRLAEVWLDEYKQYYYQRIGDDLG---DFGDVSARKRLRDNLKCRSFDWYVR-------T 534
Query: 112 KFPEL--PPN-LFWGEAKNLGT--QKCLDS-MGR-TAPAVIGTSYCHGSGSSQLFRLNAE 164
+PEL P + + GE +N G CLDS GR +G CHG G +Q + L+ E
Sbjct: 535 IYPELFVPGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKE 594
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
G++ E C+D AG +I L+ C S G +
Sbjct: 595 GEIRRDEACLD-------------------YAGSDVI---------LYPCHGSKG--NQL 624
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
W + +++ ++H ++CL + + +++M C + Q W F+
Sbjct: 625 WIYDDASGTVQHGSSRKCLEMSEDRQKVSMEIC-SGSLRQSWIFQ 668
>gi|397499979|ref|XP_003820707.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
paniscus]
Length = 568
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/119 (31%), Positives = 60/119 (50%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 368 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 425
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F+G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 426 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 483
>gi|112807221|ref|NP_078918.3| polypeptide N-acetylgalactosaminyltransferase 12 [Homo sapiens]
gi|84028209|sp|Q8IXK2.3|GLT12_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
AltName: Full=Polypeptide GalNAc transferase 12;
Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 12;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 12
gi|25815116|emb|CAC80100.2| UDP-GalNAc-transferase 12 [Homo sapiens]
gi|151555601|gb|AAI48740.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
[synthetic construct]
gi|151556464|gb|AAI48410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
[synthetic construct]
gi|261858036|dbj|BAI45540.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [synthetic
construct]
Length = 581
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 8/122 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 381 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 438
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 439 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 497
Query: 169 HG 170
+
Sbjct: 498 YN 499
>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 607
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/223 (26%), Positives = 91/223 (40%), Gaps = 42/223 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+S Q LR L+CK+F WFM+ VA+D+ +P + PP
Sbjct: 399 WMDEYAEYVYQRRPEYRHLSAGDMSAQKELRSHLNCKNFRWFMEEVAWDLPKHYPPVEPP 458
Query: 119 NLFWGEAKNLGTQKCLD-------SMGRTAPAVIG---TSYCHGSGSSQLFRLNAE-GQL 167
WGE +++G+ C++ S R V G S+ HG + +R + G
Sbjct: 459 AAAWGEIRSVGSGMCMEIKHFVSGSPIRLESCVKGRGDVSWSHGQVLTFGWREDIRVGDP 518
Query: 168 GHGER-CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
H + C DA PVT L++C G + WR
Sbjct: 519 MHTRKVCFDAVSHHS-------PVT-------------------LYDCHGMKG--NQLWR 550
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ + L H + C+ ++ M CD + QQW F+
Sbjct: 551 YRKD-KSLFHPVSNSCIDSSASERRVFMNTCDPTSLSQQWLFE 592
>gi|5834643|emb|CAB55352.1| N-acetylgalactosaminyltransferase T-6 [Mus musculus]
Length = 623
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 68/124 (54%), Gaps = 10/124 (8%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + + + GDISE++ LR++L C +FSW++ +V +
Sbjct: 429 QVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLQLREQLRCHNFSWYLHNVYPE 488
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
++ P+L P F+G KNLGT +CLD R +I CH G +Q F ++
Sbjct: 489 MFV--PDLNPT-FYGAIKNLGTNQCLDVGENNRGGKPLI-MYVCHNLGGNQYFEYTSQRD 544
Query: 167 LGHG 170
L H
Sbjct: 545 LRHN 548
>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 648
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 100/225 (44%), Gaps = 50/225 (22%)
Query: 54 LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+R WL +KQ + +G L D GD+S + LRD L C+ F W+++ T
Sbjct: 450 VRLAEVWLDEYKQYYYQRIGDDLG---DFGDVSARKRLRDNLKCRSFDWYVR-------T 499
Query: 112 KFPEL--PPN-LFWGEAKNLGT--QKCLDS-MGR-TAPAVIGTSYCHGSGSSQLFRLNAE 164
+PEL P + + GE +N G CLDS GR +G CHG G +Q + L+ E
Sbjct: 500 IYPELFVPGDAVASGEVRNKGQGGSSCLDSPSGRDNMHKPVGMYPCHGQGGNQYWMLSKE 559
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
G++ E C+D AG +I L+ C S G +
Sbjct: 560 GEIRRDEACLD-------------------YAGSDVI---------LYPCHGSKG--NQL 589
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
W + +++ ++H ++CL + + +++M C + Q W F+
Sbjct: 590 WIYDDASGTVQHGSSRKCLEMSEDRQKVSMEIC-SGSLRQSWIFQ 633
>gi|22122074|dbj|BAC07181.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Homo sapiens]
Length = 581
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 38/122 (31%), Positives = 60/122 (49%), Gaps = 8/122 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LRDKL CKDF WF++ V +++
Sbjct: 381 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCKDFKWFLETVYPELHV-- 438
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 439 PEDRPGFF-GMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEIR 497
Query: 169 HG 170
+
Sbjct: 498 YN 499
>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Loxodonta africana]
Length = 622
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 98/224 (43%), Gaps = 38/224 (16%)
Query: 53 ELRKRSSWLKVFKQI---RSL-GSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I R+L + +A GDISE++ L+++LHC +FSWF+ +V +
Sbjct: 428 QVRLAEVWMDDYKEIFYRRNLQAAKMAEEKSFGDISERLKLKEQLHCHNFSWFLHNVYPE 487
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQL 167
++ P+L P F+G K+LGT CLD Y CH G +Q F + L
Sbjct: 488 MFV--PDLKPT-FYGAIKSLGTDHCLDVGENNHGGKPLIMYPCHSLGGNQYFEYTTQRDL 544
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
H + C+ A+ AG LG F NS D
Sbjct: 545 RHNIAKQLCLHAN------------------AGT------LGLRSCHFTGKNSQVPKDEE 580
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + + ++ CL+ + Q AM C+ ++ +Q W F
Sbjct: 581 WEL-TQDQLIRNSGSGTCLTS--QDKQPAMAPCNPSDPHQHWLF 621
>gi|157118275|ref|XP_001653147.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108875773|gb|EAT39998.1| AAEL008252-PA [Aedes aegypti]
Length = 648
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/194 (27%), Positives = 83/194 (42%), Gaps = 48/194 (24%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPPN--LFWGEAKN--LGTQKC 133
D GD+S + LR L+CK F W++++V FPE P+ + GE +N GT +C
Sbjct: 481 DFGDVSSRKQLRQHLNCKSFRWYLENV-------FPEQFDPSRAVGRGEFRNGENGTDRC 533
Query: 134 LD-SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTF 192
LD + R G + CHG G Q++ EG++ + C+D D
Sbjct: 534 LDWPLARNQ---CGVTSCHGRGRHQMWYFTREGEITRKDHCLDYD--------------- 575
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
G L + C G + W + T+Q H + K+CL +L
Sbjct: 576 -------------GKTLEMNRCHQMGG--NQLWEYAEKTQQFRHFLSKKCLEF--SEGKL 618
Query: 253 AMLRCDENNSYQQW 266
M +C ++ S Q+W
Sbjct: 619 NMKKCMKSGSGQKW 632
>gi|443721252|gb|ELU10645.1| hypothetical protein CAPTEDRAFT_228331 [Capitella teleta]
Length = 512
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/219 (24%), Positives = 96/219 (43%), Gaps = 40/219 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A ++ GDI ++ LR+KL C+ FSWF+++V +P
Sbjct: 324 RAAEVWMDEYKSYYYAEVPSAKSVNFGDIRSRLELREKLKCRPFSWFLQNV-------YP 376
Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
L ++ +G + C+D++ +G CH +G +Q + L ++ H
Sbjct: 377 SLIVPSEQDVQFGYIQQ--GSMCVDTLSNALGGKVGMFQCHNTGGNQEWVLTKNQKIKHL 434
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
+ C+ +P A + + RL T N G+ + WR N+
Sbjct: 435 DLCI------------TMP------ASARHMALRLETCAN--------GSRNQNWRLTNN 468
Query: 231 TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
L+ + C+ SV ET+ L + +CD ++ QQ++F
Sbjct: 469 GHMLKSSRADLCIDSVDHETHGLTLDKCDSGSATQQFQF 507
>gi|195478397|ref|XP_002086495.1| GE23160 [Drosophila yakuba]
gi|194186285|gb|EDW99896.1| GE23160 [Drosophila yakuba]
Length = 590
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 108/256 (42%), Gaps = 41/256 (16%)
Query: 21 QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
LFR + + Q+ + + ++++++IS R WL +K P + +
Sbjct: 350 HLFR-DGKFQVRYTNKDKNSERKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLDSM- 137
G+++EQ L+ +LHCK F WF+ ++A D +P L P + G +++ + K CLD
Sbjct: 405 GNLAEQRDLKKRLHCKPFKWFLDNLAADFLNLYPILDPAEYASGVLQSVSSPKLCLDRKE 464
Query: 138 GRTAPAVIG--TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
R +G +S Q + L +L G C++ Q
Sbjct: 465 PRNGQPKLGPCSSDHIFPSPEQYWSLTNRRELRSGFYCLEVRNQ---------------- 508
Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQL 252
G ++++ C SG + W F + T Q+ + ++ CL PE+N +
Sbjct: 509 ----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVVSGQQQNIRHCLEAQPESNSV 556
Query: 253 AMLRCDENNSYQQWRF 268
CD N Q+W+F
Sbjct: 557 TSSVCDLKNHRQRWKF 572
>gi|195495309|ref|XP_002095212.1| GE22271 [Drosophila yakuba]
gi|194181313|gb|EDW94924.1| GE22271 [Drosophila yakuba]
Length = 590
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/256 (23%), Positives = 108/256 (42%), Gaps = 41/256 (16%)
Query: 21 QLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDM 80
LFR + + Q+ + + ++++++IS R WL +K P + +
Sbjct: 350 HLFR-DGKFQVRYTNKDKNSERKLISRNYR----RVAEIWLDEYKDKLFANMPHLTVIPV 404
Query: 81 GDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLD-SM 137
G+++EQ L+ +LHCK F WF+ ++A D +P L P + G +++ + K CLD
Sbjct: 405 GNLAEQRDLKKRLHCKPFKWFLDNLAADFLNLYPILDPAEYASGVLQSVSSPKLCLDRKE 464
Query: 138 GRTAPAVIG--TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
R +G +S Q + L +L G C++ Q
Sbjct: 465 PRNGQPKLGPCSSDHVFPSPEQYWSLTNRRELRSGFYCLEVRNQ---------------- 508
Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL---EHAILKRCLSVHPETNQL 252
G ++++ C SG + W F + T Q+ + ++ CL PE+N +
Sbjct: 509 ----------GVNVHIYQCHGQSG--NQFWSFDSKTHQVISGQQQNIRHCLEAQPESNSV 556
Query: 253 AMLRCDENNSYQQWRF 268
CD N Q+W+F
Sbjct: 557 TSSVCDLKNHRQRWKF 572
>gi|348519859|ref|XP_003447447.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oreochromis niloticus]
Length = 624
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 41/128 (32%), Positives = 66/128 (51%), Gaps = 4/128 (3%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + +A GDIS++V LR+KL CK FSW++
Sbjct: 424 QVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKDGAFGDISKRVELREKLQCKSFSWYL 483
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
++V +V+ P+L P L +G KN+G CLD+ Y CHG G +Q F
Sbjct: 484 QNVYPEVF--MPDLNP-LRFGSVKNVGKDSCLDAGENNEGGKQLIMYPCHGLGGNQYFEY 540
Query: 162 NAEGQLGH 169
+ ++ H
Sbjct: 541 STHHEIRH 548
>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Acyrthosiphon pisum]
Length = 674
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/211 (27%), Positives = 98/211 (46%), Gaps = 38/211 (18%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPN-LFWGEAKNL--GTQKC 133
D GDI+ + LR KL CK F W+++++ +PEL P + + GE +NL G + C
Sbjct: 467 DYGDITSRKDLRRKLKCKSFKWYLENI-------YPELFIPGDAVASGEVRNLGYGNKTC 519
Query: 134 LDSMGR----TAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLKL 188
LDS R PA G CH G +Q+ + + CVD+ K V++ +
Sbjct: 520 LDSPARKTDLNKPA--GLYPCHKMGGNQIKNIVS-------NMCVDSKGDANKPVDLWQC 570
Query: 189 PV----TFCVLAGVKLI------FCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI 238
+ +L+ + I G + L+ C S G + W + + T+QL H
Sbjct: 571 HQQGGNQYWMLSKIGEIRRDESCLDYAGNDVILYPCHGSKG--NQYWNYDHKTKQLRHGS 628
Query: 239 LKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
++CL+++ ++L M CD Q W+F+
Sbjct: 629 SRKCLTINSSKDKLLMEECDLMLPKQMWQFE 659
>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Anolis carolinensis]
Length = 634
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 103/236 (43%), Gaps = 46/236 (19%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + +F + + + + GDIS++ L+ +L CKDF W++
Sbjct: 431 QVITRNQVRLAEVWMDEYKNIFYRRNTEAAKIVKQQTFGDISKRHELKQRLQCKDFKWYL 490
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+V + Y P+L P L G KN+G + CLD +G P ++ T CHG G +Q
Sbjct: 491 SNVYPEAYV--PDLNPPLS-GFLKNVGRRACLD-VGENNHGGKPLIMYT--CHGLGGNQY 544
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A ++ H E C+ A + V+L C +N
Sbjct: 545 FEYSARHEIRHNIQKELCLHASQ-----------------GAVQLNECS-------YNGK 580
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLS---VHPETNQLAMLRCDENNSYQQWRF 268
S + + W + L +A L CL+ HP +M+ CD ++ +Q+W F
Sbjct: 581 KSRTSKEEKWEIRED-QLLYNAELNMCLTGNWEHP-----SMVSCDPSDHFQKWIF 630
>gi|432865221|ref|XP_004070476.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oryzias latipes]
Length = 621
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 59/230 (25%), Positives = 100/230 (43%), Gaps = 48/230 (20%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + +A GDIS+++ LR+ LHCK+FSW++ + +
Sbjct: 425 QVRLAEVWMDDYKKIYYRRNKNAAIMAQEKKYGDISDRLKLREDLHCKNFSWYLNTIYPE 484
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
++ P+L P F G KNLG+ CLD +G P ++ CH G +Q F ++
Sbjct: 485 IFV--PDLTPEKF-GAIKNLGSDTCLD-VGENNQGGKPVIM--YMCHNMGGNQYFEYSSH 538
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
+L H + C+ A + V + C L G GT
Sbjct: 539 KELRHNIGKQLCLQAFDHPEQVR-----IELCQLKG--------------------KGTD 573
Query: 222 DGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
P W F L++ +CL + + ++ M C+ + +QQW F
Sbjct: 574 VAPQQQWIF-TEENLLKNPSSGKCLQL--KGGKVYMDSCNSADMHQQWAF 620
>gi|195576344|ref|XP_002078036.1| GD23236 [Drosophila simulans]
gi|194190045|gb|EDX03621.1| GD23236 [Drosophila simulans]
Length = 674
Score = 65.1 bits (157), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 96/228 (42%), Gaps = 41/228 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +KQ P LD GD++ Q +R++L CK F WFM VA D KF
Sbjct: 458 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 517
Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
P + PP+ G +N+ CLD+MG++ +G C + + +Q + L+ L
Sbjct: 518 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSCADNRTHPQPNQFWELSIFRDL 577
Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
G C+D + + +++C + G +
Sbjct: 578 RMKGFDSVCLDVHEGPP------------------------NATVWMWSCHSQGG--NQF 611
Query: 225 WRFHNSTRQLEHA-ILKRCLSVHPETNQLAML---RCDENNSYQQWRF 268
W + T++L H KRCL E N +AM+ C++ N Q+W F
Sbjct: 612 WYYDRQTQRLVHGENNKRCLEGFVE-NGIAMVVANSCEDGNDRQRWEF 658
>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
scrofa]
Length = 506
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 91/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 300 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 359
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ G + S G G + Q+F + G
Sbjct: 360 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 417
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 418 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-Q 456
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ H + C+ +++ M C+ ++ QQW F+ +
Sbjct: 457 TMYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHI 495
>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
Length = 652
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 446 WMDEYAEHIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPP 505
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 506 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPG-- 563
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 564 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-Q 602
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 603 TLYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHT 641
>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
Length = 528
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 322 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 381
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ T + + C G G S Q+F + G
Sbjct: 382 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 439
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 440 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 478
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 479 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 515
>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
Length = 556
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 60/120 (50%), Gaps = 9/120 (7%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P GDISE + LR +L C+ F W++++V DV + PN
Sbjct: 367 WMDDYKKFFYQRHPYLTDQSFGDISENLKLRQRLKCRSFRWYLQNVFTDV------ILPN 420
Query: 120 ---LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
+ G+ +N + CLD+ GRT+ +G S C+ + LF + ++ + C+DA
Sbjct: 421 ETAIATGKVRNPISNMCLDTFGRTSNTFLGLSPCNIQRDTMLFAYTSRKEISWNDACLDA 480
>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
scrofa]
Length = 624
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 91/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 418 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 477
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ G + S G G + Q+F + G
Sbjct: 478 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 535
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 536 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-Q 574
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ H + C+ +++ M C+ ++ QQW F+ +
Sbjct: 575 TMYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHI 613
>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
Length = 603
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ T + + C G G S Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 590
>gi|410978730|ref|XP_003995741.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Felis catus]
Length = 469
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 12/133 (9%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+++V +++
Sbjct: 270 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHV-- 327
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D ++G +CHG G +Q F + ++
Sbjct: 328 PEDRPGFF-GMLQNKGLRDYCFDYNPPNENQIVGHQVLLYHCHGMGQNQFFEYTSRNEIR 386
Query: 169 HG----ERCVDAD 177
+ E C+ D
Sbjct: 387 YNTHQPEACIAVD 399
>gi|167519663|ref|XP_001744171.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777257|gb|EDQ90874.1| predicted protein [Monosiga brevicollis MX1]
Length = 607
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 94/229 (41%), Gaps = 32/229 (13%)
Query: 53 ELRKRSSWLKVFKQI-RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+LR W+ +K++ + + PL +D+GD+S +R K CK F WF+K+V +++
Sbjct: 403 KLRAAEVWMDEYKEVVQRVMPPLPRGMDLGDLSAMQEIRRKFQCKPFKWFLKNVYPEMFV 462
Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRT-APAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E GE +N T C D++G + A IG CH S +Q F L+ G
Sbjct: 463 PNDEESIEAS-GEIRNPQTNACFDTLGASHQGAKIGVYPCHHSHGTQEFVLSKAGD---- 517
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
V + + C+ G + ++ C + G + W++ +
Sbjct: 518 ------------VRVAAMDFDNCLDRG------NGDGSVGIWPCHQTGG--NQAWKWDRT 557
Query: 231 TRQLEHAILKRCLSVHPETN-----QLAMLRCDENNSYQQWRFKEVRPD 274
T A C+ V E L + +CD + QQW + + D
Sbjct: 558 TGLFGDATGDFCVEVRREQTANSPFALHLAKCDPKDPMQQWHVGQPKSD 606
>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Saimiri boliviensis boliviensis]
Length = 682
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 476 WMDEYAEYIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 535
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 536 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 593
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 594 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 632
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 633 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 671
>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
anubis]
Length = 637
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 431 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 490
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 491 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 548
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 549 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 587
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 588 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 624
>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
Length = 634
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 428 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 487
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ T + + C G G S Q+F + G
Sbjct: 488 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 545
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 546 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 584
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 585 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 621
>gi|68392893|ref|XP_688194.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Danio
rerio]
Length = 578
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 65/141 (46%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ SP A GD++++ LR +L CKDF WF+ ++ D+ +
Sbjct: 376 VRAAEVWMDDFKEVYYHRSPHARLEAYGDVTDRRKLRMRLRCKDFRWFLDNIYPDI--QV 433
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P +F G KN G T C D + G CHG G +Q F + ++
Sbjct: 434 PEDKPGMF-GMLKNKGMTNYCFDYNPPDEHKIAGHRVILYPCHGMGQNQFFEYSTLQEIR 492
Query: 169 HGER----CVDADKQEKLVEM 185
+ R CV AD + M
Sbjct: 493 YNTRDPAGCVVADPVSTFLTM 513
>gi|359320847|ref|XP_532008.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Canis
lupus familiaris]
Length = 578
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 67/141 (47%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+++V +++
Sbjct: 379 VRAAEVWMDDFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHV-- 436
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D V+G CHG G +Q F ++ ++
Sbjct: 437 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQVVGYQVLLYICHGMGQNQFFEYTSQNEIR 495
Query: 169 HG----ERCVDADKQEKLVEM 185
+ E C+ D ++ M
Sbjct: 496 YNTHQPEACIAVDAGTDVLVM 516
>gi|313242250|emb|CBY34413.1| unnamed protein product [Oikopleura dioica]
Length = 644
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 49/222 (22%), Positives = 91/222 (40%), Gaps = 32/222 (14%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W + + P A +D GD++E A+R++L CK F WFMK + YD+ F
Sbjct: 416 VRVVEVWWDEYSKYFYERKPNAKHVDPGDLTEVKAVRERLQCKSFDWFMKEIGYDLPKHF 475
Query: 114 PELPP-NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
P + P N G N+GT C+D H +G L+ + G+
Sbjct: 476 PLVEPSNAARGHITNVGTNLCVDG--------------HHAGQGDQLALSECASVREGQW 521
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCR--------LGTVLNLFNCFNSSGTVDGP 224
+ + + P++ + K R +G + + C G +
Sbjct: 522 MLTWHEDIR-------PLSGSNVENKKASDARKVCWDGTGVGNPIGFWECHTQGG--NQL 572
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+++ +Q H+ + C + + E ++ + CD++N Q+W
Sbjct: 573 FKYVPDKKQFIHSPSRACATANEEQKKVYLKACDDSNDNQKW 614
>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 5 [Macaca mulatta]
Length = 603
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590
>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
taurus]
Length = 620
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 414 WMDEYAEHIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPP 473
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 474 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPG-- 531
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 532 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-Q 570
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 571 TLYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFE 607
>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
Length = 603
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ T + + C G G S Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 590
>gi|410897032|ref|XP_003962003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Takifugu rubripes]
Length = 624
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 96/230 (41%), Gaps = 34/230 (14%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVIS + L + ++F + + L GDIS+++ LR +L CK FSW++
Sbjct: 424 QVISRNQVRLAEVWMDDYKEIFYRRNQQAAQLVRDKAFGDISQRMDLRARLKCKSFSWYL 483
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
K++ + + P+L P L +G KN+G CLD+ Y CHG G +Q F
Sbjct: 484 KNIYPEAF--IPDLNP-LGFGSVKNVGKDSCLDAGENNEGGKRVIMYPCHGLGGNQYFEY 540
Query: 162 NAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
+ ++ H E C+ VKL C+ + N+
Sbjct: 541 STRHEIRHNIQKELCLHG-----------------AAGAVKLEECQ-------YKGRNTL 576
Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
+ W+ ++ R + L +CLS E LA C+ + YQ W F
Sbjct: 577 VGAEQKWQLKDN-RLIYMPELNKCLSARQERPFLAA--CNPMDRYQLWVF 623
>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
Length = 603
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ T + + C G G S Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 590
>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 599
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 88/217 (40%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ F + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 393 WMDEFAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 452
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ G + S G G + Q+F + G
Sbjct: 453 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 510
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 511 --DPQHTKK----------FCFDA------VSHSSPVTLYDCHSMKG--NQLWKYRKD-K 549
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ ++ M C ++ QQW F+
Sbjct: 550 TLYHPVSGSCMDCSESDRRVFMNACRPSSPTQQWLFE 586
>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Sarcophilus harrisii]
Length = 651
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 58/220 (26%), Positives = 95/220 (43%), Gaps = 36/220 (16%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR+ L+CK F WFM +A+D+ +P + P
Sbjct: 442 WMDEYAEYIYQRLPEYRHLSTGDVTAQKDLRNHLNCKSFKWFMTEIAWDLPRYYPPVEPA 501
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD--A 176
WGE +N+GTQ C IGT HG+ S L RL E CV A
Sbjct: 502 AAAWGEIRNVGTQLC-----------IGTK--HGAPGSPL-RL---------ESCVKGRA 538
Query: 177 DKQEKLVEMLKLPVTFCVLAG-----VKLIFCRL--GTVLNLFNCFNSSGTVDGPWRFHN 229
+ V++ + G K F + + + L++C G + W++ N
Sbjct: 539 EAAWSNVQVFTFSWREDIRPGDPQHTKKFCFDTISHSSPVTLYDCHGMKG--NQLWKYVN 596
Query: 230 STRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ +++ M C+ ++ QQW F+
Sbjct: 597 G-KTLYHPVSGSCMDCTESDHRVFMNTCNPSSPTQQWIFQ 635
>gi|109079473|ref|XP_001111560.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 4 [Macaca mulatta]
Length = 602
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 396 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 455
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 456 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 513
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 514 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 552
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 553 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 589
>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Otolemur garnettii]
Length = 603
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAAQKRLRTSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ G + S G G + Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNACNPSSLTQQWLFE 590
>gi|16741554|gb|AAH16585.1| Galnt10 protein [Mus musculus]
Length = 339
Score = 64.7 bits (156), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 55/222 (24%), Positives = 91/222 (40%), Gaps = 30/222 (13%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P
Sbjct: 128 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVVAQKKLRVSLNCKSFKWFMTKIAWDLPKFYP 187
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQL 167
+ PP WGE +N+GT C D+ T + + C G G S Q+F +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCTDTKLGTLGSPLRLETCIRGRGEAAWNSMQVFTFTWREDI 247
Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
G D +K FC A + + L++C + G + W++
Sbjct: 248 RPG----DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKY 285
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ +++ M C+ ++ QQW F+
Sbjct: 286 RKD-KTLYHPVSGSCMDCSESDHRVFMNTCNPSSLTQQWLFE 326
>gi|268569766|ref|XP_002648333.1| C. briggsae CBR-GLY-4 protein [Caenorhabditis briggsae]
Length = 523
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 68/144 (47%), Gaps = 18/144 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K I P A F++ GDI +++A+RD+L CK F W++ VY +
Sbjct: 326 RAAEVWMDEYKAIYLKNVPSARFVNYGDIGDRLAIRDRLQCKSFKWYLD----TVYPQLA 381
Query: 115 ELPPNL-----FWGEAKNLGTQKCLDSMGRT---APAVIGTSYCHGSGSSQLF---RLNA 163
L N+ W A CLDSM R APA+ CHG+G +Q + L
Sbjct: 382 SLTRNVSSQKDAWQIAPMKIGHLCLDSMARKENEAPALFA---CHGTGGNQEWIFDDLTK 438
Query: 164 EGQLGHGERCVDADKQEKLVEMLK 187
+ + C+D ++K V M+K
Sbjct: 439 TFKNAISQMCLDFSAEKKDVVMVK 462
>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
Length = 551
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 345 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 404
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 405 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 462
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 463 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 501
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 502 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 538
>gi|301763571|ref|XP_002917213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Ailuropoda melanoleuca]
Length = 598
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 392 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKTFKWFMTKIAWDLPKFYPPVEPP 451
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE N+GT C D+ G + S G G + Q+F + G
Sbjct: 452 AAAWGEIHNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 509
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 510 --DPQHTKK----------FCFDA------ISSSSPVTLYDCHSMKG--NQLWKYRRD-K 548
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M CD ++ QQW F+
Sbjct: 549 TLYHPVSGGCMDCSESDHRIFMNTCDPSSLTQQWLFE 585
>gi|426229966|ref|XP_004009054.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Ovis
aries]
Length = 339
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/224 (23%), Positives = 91/224 (40%), Gaps = 30/224 (13%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P
Sbjct: 128 RVAEVWMDEYAEHIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMTKIAWDLPQFYP 187
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
+ PP WGE +N+GT C D+ + + C G G + Q+F +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLESCIRGRGEAAWNNMQVFTFTWREDI 247
Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
G D +K FC A + + L++C + G + W++
Sbjct: 248 RPG----DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKY 285
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
+ L H + C+ +++ M C+ ++ QQW F+
Sbjct: 286 RKD-QTLYHPVSGSCMDCSESDHRIFMNTCNPSSPTQQWLFEHT 328
>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Callithrix jacchus]
Length = 667
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 461 WMDEYAEYIYQRRPEYRHLSAGDVTAQKKLRSSLNCKSFKWFMMKIAWDLPKFYPPVEPP 520
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 521 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 578
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 579 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 617
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 618 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 654
>gi|281345023|gb|EFB20607.1| hypothetical protein PANDA_005411 [Ailuropoda melanoleuca]
Length = 551
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 345 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKTFKWFMTKIAWDLPKFYPPVEPP 404
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE N+GT C D+ G + S G G + Q+F + G
Sbjct: 405 AAAWGEIHNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 462
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 463 --DPQHTKK----------FCFDA------ISSSSPVTLYDCHSMKG--NQLWKYRRD-K 501
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M CD ++ QQW F+
Sbjct: 502 TLYHPVSGGCMDCSESDHRIFMNTCDPSSLTQQWLFE 538
>gi|149726707|ref|XP_001501206.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Equus
caballus]
Length = 561
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 355 WMDEYAEHIYQRRPEYRHLSAGDVAAQKQLRSSLNCKTFRWFMTQIAWDLPKFYPPVEPP 414
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ G + S G G + Q+F + G
Sbjct: 415 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 472
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 473 --DPQHTKK----------FCFDA------VSHTSPVTLYDCHSMKG--NQLWKYRKD-K 511
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 512 TLYHPVSGSCVDCSESDHRIFMNTCNPSSRTQQWLFE 548
>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 756
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 13/123 (10%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE---L 116
WL +K+ SP A+ ++ G+ +++V LRD+L C+ F W++++V FPE
Sbjct: 564 WLDGYKEFFYFMSPSALKVNAGNYTDRVELRDRLGCRSFQWYLENV-------FPEGGWP 616
Query: 117 PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCH--GSGSSQLFRLNAEGQLGHGERCV 174
N +GE ++ T CLD+ GRT P + H + +Q++ E ++ H C+
Sbjct: 617 GRNKIYGEVRHTATNWCLDTGGRTTP-ITEPMVAHRCDNNVNQIWMYTEEQEIKHSSLCL 675
Query: 175 DAD 177
D D
Sbjct: 676 DYD 678
>gi|32425807|gb|AAH07224.2| GALNT10 protein, partial [Homo sapiens]
Length = 247
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 41 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 100
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 101 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 158
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 159 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 197
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 198 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 236
>gi|167537119|ref|XP_001750229.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771219|gb|EDQ84888.1| predicted protein [Monosiga brevicollis MX1]
Length = 587
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 91/224 (40%), Gaps = 45/224 (20%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+I P L+ GD+SE+ ALR++L CK F W++ +V D+ P+
Sbjct: 390 WMDEYKEIYYEFRPSHRQLETGDVSERKALRERLKCKPFKWYLDNVFPDMMV--PDRHNL 447
Query: 120 LFWGEAKNLGTQKCLDSMG-RTAPAVIGTSYCHGSGSS--QLF-------RLNAEGQLGH 169
G+ +N T CLD++ R A G C S S Q+F + EG
Sbjct: 448 YAKGQLRNANTNMCLDTLTPREADMKAGVYACASSPKSENQMFFYTKRFKEIRREGTF-- 505
Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
G RC+D AG +++ C G + W H
Sbjct: 506 GSRCLD-------------------FAGGP------SGSPSMYGCHLMKGNQE--W-IHT 537
Query: 230 STRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
+ H K CL + P+++ L + C E N Q+WRF E
Sbjct: 538 PDHHILHMASKLCLEIEPKSDHGYHLVLNPCFEANERQKWRFSE 581
>gi|195359229|ref|XP_002045319.1| GM11142 [Drosophila sechellia]
gi|194122575|gb|EDW44618.1| GM11142 [Drosophila sechellia]
Length = 658
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 58/227 (25%), Positives = 93/227 (40%), Gaps = 39/227 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +KQ P LD GD++ Q +R++L CK F WFM VA D KF
Sbjct: 443 RVAEVWMDEYKQYVYKRDPKTYDSLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 502
Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
P + PP+ G +N+ CLD+MG++ +G C + + +Q + L+ L
Sbjct: 503 PPVEPPSYSAGIIQNVANPVYCLDNMGKSTEEAVGMFSCADNRTHPQPNQFWELSIFRDL 562
Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
G C+D + + ++NC + G +
Sbjct: 563 RMKGFDSVCLDVHEGPP------------------------NATVWMWNCHSQGG--NQF 596
Query: 225 WRFHNSTRQLEHA-ILKRCLS--VHPETNQLAMLRCDENNSYQQWRF 268
W + T++L H KRCL V ++ C++ N Q+W F
Sbjct: 597 WYYDRQTQRLVHGENNKRCLEGFVENGIAKVVANACEDGNDRQRWEF 643
>gi|5419872|emb|CAB46378.1| hypothetical protein [Homo sapiens]
Length = 276
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 70 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 129
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 130 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 187
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 188 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 226
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 227 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 265
>gi|334333371|ref|XP_001365881.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Monodelphis domestica]
Length = 757
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 58/229 (25%), Positives = 89/229 (38%), Gaps = 43/229 (18%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+I + A GD++E++ LR KL CK+F WF+ +V +++
Sbjct: 404 VRAAEVWMDEYKEIYYHRNMHARKEPYGDVTERLQLRAKLKCKNFKWFLDNVYPELHV-- 461
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G KN G T C D + + G+ CHG G +Q F ++
Sbjct: 462 PEDRPGFF-GMLKNRGMTDYCFDYNPPSENEITGSQVILYPCHGMGQNQFFEYTKHNEIR 520
Query: 169 HGER----CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
+ R CV D + M C N F G+
Sbjct: 521 YNTRQPEGCVAVDAGTDHLTMY---------------LCSKNAPENQKFIFKEDGS---- 561
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQ---LAMLRCDENNSYQQWRFKE 270
L H K+CL + + +LR ++ YQQW F+E
Sbjct: 562 ---------LYHVQSKKCLQAEAKASNGSPAPLLRTCTDSKYQQWFFQE 601
>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
Length = 658
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 87/228 (38%), Gaps = 48/228 (21%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +KQ P +D GD+SE+ LR +L C DF W++ V
Sbjct: 467 MRVAEVWMDHYKQYFLDTRPGQNIIDAGDVSERKELRQRLQCHDFKWYLNTVL------- 519
Query: 114 PELPPNLFWGEAKNLGTQK--------CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
P+LF +A ++ Q C+D MG+ V G CHG G++Q + +
Sbjct: 520 ----PDLFIPDANHIQHQGTLHTPDNICVDKMGQRNGGVAGVYPCHGQGTNQAWMYSITN 575
Query: 166 QL-GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
++ H C+DA L + ++L C G +
Sbjct: 576 EIRTHDSLCLDA------------------------WGSTLPSPVHLGRCHGMRGNQE-- 609
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETNQ--LAMLRCDENNSYQQWRFKE 270
WR+ + H L CL +Q L + +C N Q W + E
Sbjct: 610 WRYDPLRHTMVHVPLNACLEASSVDSQKKLQINKCLSGNQAQVWFWDE 657
>gi|119582045|gb|EAW61641.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_c [Homo sapiens]
Length = 274
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 68 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 127
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 128 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 185
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 186 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 224
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 225 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 263
>gi|410039926|ref|XP_518048.4| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
troglodytes]
Length = 551
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 345 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 404
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 405 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNVQVFTFTWREDIRPG-- 462
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 463 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 501
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 502 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 538
>gi|48734736|gb|AAH72450.1| GALNT10 protein, partial [Homo sapiens]
Length = 413
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 91/222 (40%), Gaps = 30/222 (13%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P
Sbjct: 202 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYP 261
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
+ PP WGE +N+GT C D+ + + C G G + Q+F +
Sbjct: 262 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDI 321
Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
G D +K FC A + + L++C + G + W++
Sbjct: 322 RPG----DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKY 359
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ +++ M C+ ++ QQW F+
Sbjct: 360 RKD-KTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 400
>gi|194759472|ref|XP_001961971.1| GF15238 [Drosophila ananassae]
gi|190615668|gb|EDV31192.1| GF15238 [Drosophila ananassae]
Length = 663
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 38/109 (34%), Positives = 58/109 (53%), Gaps = 6/109 (5%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQ-KCLD 135
+D GD++ Q +R++L CK F WFM VA D K+P + PP G +N+ CLD
Sbjct: 472 IDAGDLTRQKGVRERLKCKSFHWFMTEVAPDFLEKYPPVEPPAYASGAIQNVAYPIYCLD 531
Query: 136 SMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQLGHGERCVDADKQE 180
SMG+ IG C + S +Q ++L++ +L H E + D Q+
Sbjct: 532 SMGKIINDAIGLFSCADNRSHPQANQNWKLSSSRELRHKEESICLDVQD 580
>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cavia porcellus]
Length = 602
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 88/217 (40%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 396 WMDDYAEYIYQRRPEYRHLSAGDVVAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 455
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ A + C G G + Q+F + G
Sbjct: 456 AAAWGEIRNVGTGLCADTKHGALGAPLRLESCIRGRGEAAWNNMQVFTFTWREDIRPG-- 513
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 514 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRQD-K 552
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ + QQW F+
Sbjct: 553 TLFHPVSGSCMDCSESDHRIFMNTCNPASVTQQWLFE 589
>gi|313227738|emb|CBY22887.1| unnamed protein product [Oikopleura dioica]
Length = 1030
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 30/78 (38%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPP 118
W +K+ P +D+GD++ +A+++KL CKDFSW+M+ +AYD+ +P +P
Sbjct: 818 WDDNYKKYYYERRPENKNVDVGDLTAVLAIKEKLQCKDFSWYMREIAYDIPKYWPMVMPK 877
Query: 119 NLFWGEAKNLGTQKCLDS 136
N WG KN GT C +S
Sbjct: 878 NQAWGILKNEGTGLCPNS 895
>gi|260790280|ref|XP_002590171.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
gi|229275360|gb|EEN46182.1| hypothetical protein BRAFLDRAFT_90906 [Branchiostoma floridae]
Length = 1466
Score = 64.3 bits (155), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 41/149 (27%), Positives = 68/149 (45%), Gaps = 6/149 (4%)
Query: 30 QLGH--GERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQV 87
++GH G++ + S +R W+ +K + P M GDIS++
Sbjct: 1243 RVGHVWGKKTYSTGNMTLHDWASRNNMRVAEVWMDHYKVHYYIRRPYLMKRKFGDISDRR 1302
Query: 88 ALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGT 147
LR++L CKDF W++ + D+Y P+ P + G+ +N GT CLD + +
Sbjct: 1303 RLRERLQCKDFRWYLDNAFPDLY--IPDDIPGRY-GQVRNNGTNTCLDWTSKPQRE-LEM 1358
Query: 148 SYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
CH +Q F L + QL C++A
Sbjct: 1359 FPCHHGLGTQFFELTGQNQLRDERSCLEA 1387
>gi|260817709|ref|XP_002603728.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
gi|229289050|gb|EEN59739.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
Length = 501
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 62/138 (44%), Gaps = 10/138 (7%)
Query: 30 QLGHGERCVDADKQVISTQKSSLEL-----RKRSSWLKVFKQIRSLGSPLAMFLDMGDIS 84
++GH ER V S + + R W+ +K+ P + GDIS
Sbjct: 273 RVGHIERTDKPYLYVRSNDTKDINIEVNKARVAEVWMDEYKRYLYAREPQLKNISYGDIS 332
Query: 85 EQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSM-GRTAPA 143
E+ ALR +L C+ F W+M++V D + E WGE +NL CLD M GR
Sbjct: 333 ERQALRKRLGCQSFQWYMENVYPDRLEQTVENGYYRAWGELRNLQAGLCLDLMDGRG--- 389
Query: 144 VIGTSYCHGSGSSQLFRL 161
+G CHG G Q F L
Sbjct: 390 -VGLWDCHGQGGQQFFAL 406
>gi|38195091|ref|NP_938080.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|51315962|sp|Q86SR1.2|GLT10_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|25809274|emb|CAD44532.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|151556534|gb|AAI48616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|157169754|gb|AAI53182.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|193785288|dbj|BAG54441.1| unnamed protein product [Homo sapiens]
gi|261858046|dbj|BAI45545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [synthetic
construct]
Length = 603
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590
>gi|410255362|gb|JAA15648.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410303020|gb|JAA30110.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410355291|gb|JAA44249.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
Length = 603
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNVQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590
>gi|397517643|ref|XP_003829017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
paniscus]
Length = 444
Score = 63.9 bits (154), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 238 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 297
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 298 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNVQVFTFTWREDIRPG-- 355
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 356 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 394
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 395 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 431
>gi|426350728|ref|XP_004042920.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Gorilla gorilla gorilla]
Length = 444
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 238 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 297
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 298 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 355
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 356 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 394
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 395 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 431
>gi|441596034|ref|XP_003276624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Nomascus leucogenys]
gi|119582046|gb|EAW61642.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
gi|119582047|gb|EAW61643.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
Length = 506
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 300 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 359
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 360 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 417
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 418 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 456
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 457 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 493
>gi|380800197|gb|AFE71974.1| polypeptide N-acetylgalactosaminyltransferase 10, partial [Macaca
mulatta]
Length = 565
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 359 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 418
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 419 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 476
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 477 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 515
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 516 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 552
>gi|119389148|pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And
Mn2+
gi|119389151|pdb|2D7R|A Chain A, Crystal Structure Of Pp-galnac-t10 Complexed With
Galnac-ser On Lectin Domain
Length = 570
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 364 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 423
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 424 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 481
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 482 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 520
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 521 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 557
>gi|348521382|ref|XP_003448205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oreochromis niloticus]
Length = 620
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 62/231 (26%), Positives = 100/231 (43%), Gaps = 50/231 (21%)
Query: 53 ELRKRSSWLKVFKQIRSLGSPLAMFLD----MGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + A + GDIS ++ LR++LHCK+FSW++ V +
Sbjct: 424 QVRLAEVWMDDYKKIYYRRNKNAAIMASEHRFGDISARLNLRERLHCKNFSWYLNTVYPE 483
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT---APAVIGTSYCHGSGSSQLFRLNAEG 165
++ P+L P G KNLG+ CLD+ P ++ +CH G +Q F +
Sbjct: 484 IF--IPDLNPEKS-GSIKNLGSNMCLDAGENNQGGKPLIM--YHCHNMGGNQYFEYTSHK 538
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAG-----VKLIFCRLGTVLNLFNCFNSSGT 220
+L H + KQ C+ A VK+ C+L GT
Sbjct: 539 ELRH-----NIGKQ------------LCLHAAVGSDPVKIELCQL----------KGKGT 571
Query: 221 VDGP---WRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
P W F L++ +CL + + QL M C+ + +QQW F
Sbjct: 572 SVAPEQEWVF-TEENLLKNPSSGKCLVL--KGGQLLMDYCNPSELHQQWTF 619
>gi|358331987|dbj|GAA50722.1| putative polypeptide N-acetylgalactosaminyltransferase 10
[Clonorchis sinensis]
Length = 738
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 59/113 (52%), Gaps = 5/113 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ +K+ P +D GD+SEQ A+R++LHCK F WFM+ +A+D+ K+P
Sbjct: 521 RVAETWMDEYKEYIYSRRPHYRHIDAGDLSEQKAVRERLHCKPFKWFMETIAFDLTKKYP 580
Query: 115 ELPP-NLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYC---HGSGSSQLFRLN 162
+ P + G+ + + Q C+D+ G T I C G Q F L+
Sbjct: 581 LVEPVSAATGDVRPVVDQTLCVDAEGATKTEPIKLKKCVEYVGGNGVQKFELS 633
>gi|71051628|gb|AAH98444.1| GALNT10 protein, partial [Homo sapiens]
Length = 310
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 90/219 (41%), Gaps = 30/219 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 104 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 163
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 164 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 221
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 222 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 260
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 261 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFEHT 299
>gi|449507774|ref|XP_004186276.1| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
partial [Taeniopygia guttata]
Length = 402
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 71/134 (52%), Gaps = 10/134 (7%)
Query: 60 WLKVFKQ---IRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
W+ FK I S G+P ++ + G + L+ ++ C+ FSW++++V D ++
Sbjct: 212 WMDDFKDFFYIISPGAPRFVWDKRILYGIVPWCGTLKIRMKCQPFSWYLENVYPD--SQI 269
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P +L GE +N+ T +CLD+MGR +G CHG G +Q+F A+ ++ + C
Sbjct: 270 PRRYYSL--GEIRNVETNQCLDNMGRKENEKVGFFNCHGMGGNQVFSYTADKEIRTDDLC 327
Query: 174 VDADKQEKLVEMLK 187
+D + V MLK
Sbjct: 328 LDVSRLNGPVLMLK 341
>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 657
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 92/215 (42%), Gaps = 43/215 (20%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEA--------KNLGT 130
D GD+S + LR +L+CK F W++ ++ +PEL GEA K LG
Sbjct: 455 DFGDVSARKELRKRLNCKSFKWYLDNI-------YPEL---FIPGEAVAGGEVRNKGLGG 504
Query: 131 QKCLDSMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDAD-KQEKLVEMLK 187
+ CLDS R A +G CH G +Q+ N GQ C+D+ K E + + +
Sbjct: 505 KTCLDSPARKADLHKAVGLFPCHRQGGNQVSN-NWSGQ------CLDSPCKSEDMHKPVG 557
Query: 188 LPVTFCVLAGVKLIFCRLGTV-------------LNLFNCFNSSGTVDGPWRFHNSTRQL 234
L + + G + + L+ C S G + W ++ T +
Sbjct: 558 LWPCHKQGGNQYWMLSKAGEIRRDEACLDYAGQDVILYPCHGSKG--NQYWHYNPDTNLI 615
Query: 235 EHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+H K+CL++ +L M CDE Q+W F+
Sbjct: 616 QHGSSKKCLTMASTKQKLLMEDCDEMEPRQKWIFQ 650
>gi|195587296|ref|XP_002083401.1| GD13712 [Drosophila simulans]
gi|194195410|gb|EDX08986.1| GD13712 [Drosophila simulans]
Length = 631
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 84/186 (45%), Gaps = 35/186 (18%)
Query: 92 KLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY 149
+L CK F WFM+ VA+D+ +P + PP G +N+G Q CLD++GR +G
Sbjct: 453 ELKCKSFKWFMEEVAFDLMKTYPPVDPPAYAMGALQNVGNQNLCLDTLGRKKHNKMGMYA 512
Query: 150 CHGS----GSSQLFRLN--AEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFC 203
C + SQ + L+ + +L + C+D + PV
Sbjct: 513 CADNIKIPQRSQFWELSWKRDVRLRRKKECLDVQ-----IWDANAPVW------------ 555
Query: 204 RLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPETNQLAMLRCDENNS 262
L++C + G + W + + L+H +RCL + P + ++ + +CD +N
Sbjct: 556 -------LWDCHSQGG--NQYWYYDYHHKLLKHGTEGRRCLELLPFSQEVVVNKCDSDNR 606
Query: 263 YQQWRF 268
+QQW F
Sbjct: 607 FQQWNF 612
>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
lupus familiaris]
Length = 603
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSALNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE N+GT C+D+ G + S G G + Q+F + G
Sbjct: 457 AAAWGEIHNVGTGLCVDTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDAISNT------SPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590
>gi|28268676|dbj|BAC56890.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Homo sapiens]
Length = 603
Score = 63.9 bits (154), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 52/217 (23%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C G G + Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQYTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ ++ QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 590
>gi|119582048|gb|EAW61644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_e [Homo sapiens]
Length = 339
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 91/222 (40%), Gaps = 30/222 (13%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P
Sbjct: 128 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYP 187
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
+ PP WGE +N+GT C D+ + + C G G + Q+F +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDI 247
Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
G D +K FC A + + L++C + G + W++
Sbjct: 248 RPG----DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKY 285
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ +++ M C+ ++ QQW F+
Sbjct: 286 RKD-KTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 326
>gi|345317797|ref|XP_001520970.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Ornithorhynchus anatinus]
Length = 467
Score = 63.5 bits (153), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 61/243 (25%), Positives = 93/243 (38%), Gaps = 45/243 (18%)
Query: 42 KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
KQ ++ +L +R W+ +K++ +P A GD++ + LR KL C+DF
Sbjct: 254 KQAPYSRSKALANSVRAAEVWMDGYKELYYHRNPHARLEPYGDVTARRDLRSKLKCRDFK 313
Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
WF+++V +++ PE P F G KN G + C D V G CHG G
Sbjct: 314 WFLENVYPELHV--PEDRPGYF-GMLKNKGMENHCFDYNPPDENEVTGQRLILYPCHGMG 370
Query: 155 SSQLFRLNAEGQL----GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLN 210
+Q F + ++ H E C D V M C+ N
Sbjct: 371 QNQFFEYTSHHEIRYNTRHPEACAAVDVGTDYVTMY---------------LCQENVPEN 415
Query: 211 LFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWR 267
F G+ L H ++CL N +LR N+ +Q+W
Sbjct: 416 QKFIFQEDGS-------------LLHVQSQKCLQAEANAYNGNPAPLLRPCTNSDHQKWF 462
Query: 268 FKE 270
FKE
Sbjct: 463 FKE 465
>gi|195385643|ref|XP_002051514.1| GJ11806 [Drosophila virilis]
gi|194147971|gb|EDW63669.1| GJ11806 [Drosophila virilis]
Length = 653
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 89/228 (39%), Gaps = 37/228 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K+ P +D GD+S Q A+R++L CK F WFMK VA D KF
Sbjct: 440 RVAEVWMDEYKEHVYRRDPATYDNIDAGDLSRQRAVRERLKCKSFDWFMKEVAPDFLIKF 499
Query: 114 PELPPNLFWGEA--KNLGTQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQL 167
P + P + A T CLD M A +G C + +Q + L + +L
Sbjct: 500 PPIDPPAYASGAIQSQAYTDFCLDCMNVGANHPVGMYNCAENLTYPQDNQNWALTSHREL 559
Query: 168 GH-GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+ C+D + +++C + G + W
Sbjct: 560 RRLDDACLDVQDAHP------------------------NATVWMWDCHHQGG--NQFWY 593
Query: 227 FHNSTRQLEHAIL-KRCLSVHPETNQLAML--RCDENNSYQQWRFKEV 271
+ L H + K C+ E +L RCDENN Q+W F +V
Sbjct: 594 YDRQHHWLVHGLRGKNCMEAFVENGVTKVLTNRCDENNVRQRWNFGKV 641
>gi|403298655|ref|XP_003940127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Saimiri boliviensis boliviensis]
Length = 272
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ SP A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 72 VRAAEVWMDEFKELYYHRSPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 129
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F G +N G + C D ++G+ CHG G +Q F ++ ++
Sbjct: 130 PEDRPGFF-GMLQNKGLKDYCFDYNPPDENQIVGSQVILYLCHGMGQNQFFEYTSQKEI 187
>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oryctolagus cuniculus]
Length = 603
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 51/217 (23%), Positives = 88/217 (40%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC------HGSGSSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ + + C S Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCADTKHWALGSPLRLESCVRDRGEAAWNSMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ + +
Sbjct: 515 --DPQHTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKYR-TDK 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ + QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPASLTQQWLFE 590
>gi|170587206|ref|XP_001898369.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158594195|gb|EDP32781.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 582
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 70/135 (51%), Gaps = 13/135 (9%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R WL +K + P A +++ GDI+ ++ L+ +L CKDF W++K + +P
Sbjct: 392 RAAEVWLGDYKYLYLRKVPSARYVNFGDITARLDLKKRLRCKDFDWYLKEI-------YP 444
Query: 115 ELP-PNLFWGEAKNLGT-QKCLDSMGR-TAPAVIGTSYCHGSGSSQLFRLNAE-GQLG-- 168
EL P+ G C+DS+GR TA + +G CHG+G +Q + LN + G L
Sbjct: 445 ELAIPSKEQGRYLTFRQGNVCIDSLGRHTALSSVGIYRCHGTGGNQEWVLNDKFGVLKSP 504
Query: 169 HGERCVDADKQEKLV 183
+ C+ D++ L+
Sbjct: 505 YSNLCITDDEKGTLI 519
>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Nasonia vitripennis]
Length = 646
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 53/234 (22%), Positives = 85/234 (36%), Gaps = 73/234 (31%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGT--QKCLDS 136
+ GD+S++ ALR L CK F W++ ++ +++ + GE +NLG CLDS
Sbjct: 444 NYGDVSDRKALRKNLGCKSFKWYLDNIYPELFIPGEAVAS----GEVRNLGEGGNTCLDS 499
Query: 137 MGRTA----PA---------------------------------VIGTSYCHGSGSSQLF 159
R A PA +G CH G +Q +
Sbjct: 500 PARKADLHKPAGLYPCHRQGGNQIRHLASRLCIDSPGNPEDLHQAVGFYECHNQGGNQYW 559
Query: 160 RLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSG 219
L+ G++ E C+D GT + L+ C S G
Sbjct: 560 MLSKTGEIRRDESCLDYS----------------------------GTDVILYPCHGSKG 591
Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
W ++ T+Q+ H +CL++ +L M C Q+W F+ P
Sbjct: 592 NQQ--WTYNTQTKQIRHESSGKCLAITESKQRLLMEECSPAAPRQRWAFENYDP 643
>gi|195471079|ref|XP_002087833.1| GE18238 [Drosophila yakuba]
gi|194173934|gb|EDW87545.1| GE18238 [Drosophila yakuba]
Length = 659
Score = 63.5 bits (153), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 51/99 (51%), Gaps = 3/99 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K+ P +D GD++ Q A+R++L CK F W+M VA D KF
Sbjct: 443 RVAEVWMDEYKEYVYKRDPATYDNVDAGDLTRQRAVRERLKCKSFHWYMTEVAPDFLIKF 502
Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
P + PP+ G +N+ CLDSMG+ A +G C
Sbjct: 503 PPVEPPSYAAGVIQNVANPVYCLDSMGKNAEEAVGMFSC 541
>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Acyrthosiphon pisum]
Length = 581
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 49/218 (22%), Positives = 87/218 (39%), Gaps = 41/218 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ PL+ + G+I++++AL+ L CK F W++ +V +P
Sbjct: 391 RAAEVWMDQYKRYYYNAVPLSRIVPFGNIADRLALKKNLGCKPFKWYLDNV-------YP 443
Query: 115 E--LPPNL--FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E LP + F G + CLD++ G CH G +Q + G + H
Sbjct: 444 ELKLPATVDEFVGSIRQ--GYMCLDTLENQVGKTAGIFPCHDYGGNQEWTFTIGGSIKHD 501
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
C+ P + ++ + + C T D W+F +
Sbjct: 502 MMCLS-------------PTDYSSMSLIIMKPC--------------DSTTD-EWKFDEN 533
Query: 231 TRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
T+QL + CL + +++ CD NS Q+W +
Sbjct: 534 TKQLRLKAVNLCLDTLGVHDMISINVCDSLNSNQKWEY 571
>gi|260803980|ref|XP_002596867.1| hypothetical protein BRAFLDRAFT_129116 [Branchiostoma floridae]
gi|229282127|gb|EEN52879.1| hypothetical protein BRAFLDRAFT_129116 [Branchiostoma floridae]
Length = 481
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 86/224 (38%), Gaps = 45/224 (20%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A G+I + LR+KL+CK F W++++V +P
Sbjct: 286 RTAEVWMDDYKNNFYAARPSAKGKPYGNIQGRKELREKLNCKTFQWYLENV-------YP 338
Query: 115 ELP---PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCH-----GSGSSQLFRLNAEGQ 166
EL N+ WGE K +CLD++G P +G C+ G S Q + L
Sbjct: 339 ELDIPGSNMKWGELKQ--ADQCLDTLGHKPPGAVGMQACNLHPGSGMASRQQWALTKRNM 396
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+ HG ++C+ A + GT++ L C + R
Sbjct: 397 IQHGS------------------WSWCLAADTAIP----GTLVYLITC----NIQENKMR 430
Query: 227 FHNSTRQLEHAILKRCLSVHPETNQLAML--RCDENNSYQQWRF 268
+ R L H CL L + +CD+ QQW +
Sbjct: 431 WKRQDRLLMHIATGLCLDSVEAAQGLGVTIEQCDKTRVSQQWEY 474
>gi|10435819|dbj|BAB14676.1| unnamed protein product [Homo sapiens]
Length = 339
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 91/222 (40%), Gaps = 30/222 (13%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P
Sbjct: 128 RVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCKSFKWFMTKIAWDLPKFYP 187
Query: 115 EL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSGSS-----QLFRLNAEGQL 167
+ PP WGE +N+GT C D+ + + C G G + Q+F +
Sbjct: 188 PVEPPAAAWGEIRNVGTGLCADTKHGALGSPLRLEGCVRGRGEAAWNNMQVFTFTWREDI 247
Query: 168 GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
G D +K FC A + + L++C + G + W++
Sbjct: 248 RPG----DPQYTKK----------FCFDA------ISHTSPVTLYDCHSMKG--NQLWKY 285
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ L H + C+ +++ M C+ ++ QQW F+
Sbjct: 286 RKD-KTLYHPVSGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 326
>gi|351699369|gb|EHB02288.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Heterocephalus
glaber]
Length = 570
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 65/133 (48%), Gaps = 12/133 (9%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KLHCKDF WF++ V +++
Sbjct: 370 VRAAEVWMDEFKELYYHRTPRARLEPYGDVTERRQLRAKLHCKDFRWFLETVYPELHV-- 427
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G ++ C D V G CHG G +Q F ++ ++
Sbjct: 428 PEDRPG-FFGMLQNKGLKEYCFDYNPPNENQVEGHQVILYACHGMGQNQFFEYTSQKEIR 486
Query: 169 HG----ERCVDAD 177
+ E CV +
Sbjct: 487 YNTHQPETCVAVE 499
>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
catus]
Length = 603
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 397 WMDEYAEHIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 456
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE +N+GT C D+ G + S G G + Q+F + G
Sbjct: 457 AAAWGEIRNVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 514
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 515 --DPQHTKK----------FCFDAISNT------SPVTLYDCHSMKG--NQLWKYRKD-K 553
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H + C+ +++ M C+ + QQW F+
Sbjct: 554 TLYHPVSGSCMDCSESDHRIFMNTCNPASLTQQWLFE 590
>gi|170038563|ref|XP_001847118.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
quinquefasciatus]
gi|167882317|gb|EDS45700.1| polypeptide N-acetylgalactosaminyltransferase 5 [Culex
quinquefasciatus]
Length = 531
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 59/218 (27%), Positives = 95/218 (43%), Gaps = 41/218 (18%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNL--GTQKC 133
D GD+S + LR++L CK F W++ ++ FPEL + GE +N+ G + C
Sbjct: 329 DYGDVSSRKQLREELGCKSFRWYLDNI-------FPELFIPGEAVASGEVRNMGYGNRTC 381
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD-ADKQEKLVEMLKLPV 190
LD+ G + +G CH G +Q+ N L C+D A K E + L +
Sbjct: 382 LDAPGGKKNLRKPVGLYPCHNQGGNQV--ANPWSGL-----CIDSAAKPEDM--HTPLGI 432
Query: 191 TFCVLAGVK--LIFCRLGTV-------------LNLFNCFNSSGTVDGPWRFHNSTRQLE 235
C AG + + G + + L+ C S G + W + S+R L
Sbjct: 433 WPCHQAGGNQYWMLSKTGEIRRDEACLDYAGQDVILYPCHGSKG--NQYWNYSPSSRLLR 490
Query: 236 HAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
H +CL+++ +L M CD + Q+W F+ P
Sbjct: 491 HGSSDKCLAINESKQKLVMADCDASVEAQKWLFQNYDP 528
>gi|194225536|ref|XP_001494993.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Equus
caballus]
Length = 460
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 68/141 (48%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K++ +P A GD++E+ LR+KL CKDF WF+++V +++
Sbjct: 260 VRAAEVWMDGYKELYYHRNPHARLEPFGDVTERKQLREKLRCKDFRWFLENVYPELHV-- 317
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIG---TSY-CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D + G T Y CHG G +Q F + ++
Sbjct: 318 PEDRPGCF-GMLQNKGLKDYCFDYNPPNENQITGHQVTLYLCHGMGQNQFFEYTSRNEIR 376
Query: 169 HGER----CVDADKQEKLVEM 185
+ R CV D ++ M
Sbjct: 377 YNTRQPEGCVAVDAGTDILTM 397
>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Saccoglossus kowalevskii]
Length = 2434
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/219 (23%), Positives = 89/219 (40%), Gaps = 40/219 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P + + G+I ++ LR KL CK F W++++V +P
Sbjct: 2242 RAAEVWMDEYKKYYYSAVPSSKNIAFGNIQSRLDLRKKLQCKSFGWYLENV-------YP 2294
Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G + C+D++G +G CH +G +Q F L + + H
Sbjct: 2295 ELRIPDKKDIAFGALQQ--GHLCMDTLGHFTDGTLGLYECHNTGGNQEFALTKDKAIRHQ 2352
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
+ C+ ++ R V+ L C S ++ W +
Sbjct: 2353 DLCL------------------------TVMDHRPSGVIKLHGC--SESNLNQKWEQIEN 2386
Query: 231 TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
RQL+ C+ S T L + RC++N Q WRF
Sbjct: 2387 NRQLKFRGTDLCIDSKSVATVGLVVERCEKNAITQHWRF 2425
>gi|149639508|ref|XP_001513185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Ornithorhynchus anatinus]
Length = 634
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/134 (30%), Positives = 71/134 (52%), Gaps = 15/134 (11%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ FK+I + + + GD+S+++ LRD+L CK+F+W++ + +
Sbjct: 437 QVRLAEVWMDEFKEIFYRRNTEAAKIVKQKAFGDLSKRLELRDRLQCKNFTWYLNTIYPE 496
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
VY P+L P L G K++G CLD + T P ++ T CHG G +Q F + +
Sbjct: 497 VYV--PDLNPVLS-GYIKSVGRHVCLDVGENNQGTKPLIMYT--CHGLGGNQYFEYSEQH 551
Query: 166 QLGHG---ERCVDA 176
++ H E C+ A
Sbjct: 552 EIRHNIQKELCLHA 565
>gi|344265184|ref|XP_003404666.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 10-like [Loxodonta
africana]
Length = 602
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 89/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 396 WMDEYAEYIYQRRPEYRHLSAGDVAAQKKLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPP 455
Query: 119 NLFWGEAKNLGTQKCLDSM-GRTAPAVIGTSYCHGSGSS-----QLFRLNAEGQLGHGER 172
WGE ++GT C D+ G + S G G + Q+F + G
Sbjct: 456 AAAWGEIHSVGTGLCADTKHGALGSPLRLESCVRGRGEAAWNNMQVFTFTWREDIRPG-- 513
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +K FC A + + L++C + G + W++ +
Sbjct: 514 --DPQHTKK----------FCFDA------ISHNSPVTLYDCHSMKG--NQLWKYRKD-K 552
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L H I C+ +++ M C+ ++ QQW F+
Sbjct: 553 TLYHPISGSCMDCSESDHRIFMNTCNPSSLTQQWLFE 589
>gi|170038567|ref|XP_001847120.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167882319|gb|EDS45702.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 494
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 78/189 (41%), Gaps = 45/189 (23%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPPNLFWGEAKNLGTQKCLDSM 137
D GD+S + LR +L+C+ F W+M+ T FPE P+ G + CLD
Sbjct: 337 DFGDVSSRKKLRARLNCRPFRWYME-------TVFPEQFDPSKAVGRGQFRIGGGCLD-- 387
Query: 138 GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAG 197
T +VIG CHG G QL+ A+G++ + C+D D ++
Sbjct: 388 WPTKLSVIG---CHGLGGHQLWFFTADGEITREDHCMDFDSKK----------------- 427
Query: 198 VKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRC 257
L + C G + W F +HA +RCL + N+L M C
Sbjct: 428 -----------LEMIRCHKQKG--NQMWVFEEKAGHFKHAYYERCLEW--DGNELRMSEC 472
Query: 258 DENNSYQQW 266
+ + Q+W
Sbjct: 473 IDEDMRQKW 481
>gi|432092277|gb|ELK24900.1| N-acetylgalactosaminyltransferase 7 [Myotis davidii]
Length = 802
Score = 62.8 bits (151), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 101/229 (44%), Gaps = 26/229 (11%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 477 VRVVEVWWDEYKDYFYASRPESKALAYGDISELKKFREDHNCKSFKWFMEEIAYDITSHY 536
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P P N+ WGE K T + L + R + H S + L + G
Sbjct: 537 PLPPKNIDWGEGK---THQLLRNSPRRLRERALPPHVHPSILTDLTPQPPCRREG----- 588
Query: 174 VDADKQEKLVEML----KLPVTFCVLAGVKLIFC------RLGTVLNLFNCFNSSGTVDG 223
+ Q+ L+ + L ++ C + G + +C G + L C G +
Sbjct: 589 --PEDQDLLLLLFVSSRALTLSQCSIRGFESAYCIDSMGRTNGGFVELGPCHRMGG--NQ 644
Query: 224 PWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
+R + + + +++ +CL+ P+ +++ + C+ N Y++W++ +VR
Sbjct: 645 LFRINEANQLMQY---DQCLTKGPDGSKIMITHCNL-NEYKEWQYFKVR 689
>gi|355689595|gb|AER98885.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mustela putorius
furo]
Length = 452
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/141 (27%), Positives = 67/141 (47%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+++V +++
Sbjct: 254 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFRWFLENVYPELHV-- 311
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 312 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQIMGHQVLLYLCHGMGQNQFFEYTSQKEIR 370
Query: 169 HG----ERCVDADKQEKLVEM 185
+ E C+ + ++ M
Sbjct: 371 YNTHQPEACIAVEAGTDILTM 391
>gi|348585909|ref|XP_003478713.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Cavia porcellus]
Length = 633
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 60/238 (25%), Positives = 107/238 (44%), Gaps = 46/238 (19%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ A+R +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDLSKRFAIRKRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
V +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NTVYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A ++L+L C G
Sbjct: 544 FEYSAQHEIRHSIQKELCLHATS-----DLLQLKA--CAYKGR----------------- 579
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSV---HPETNQLAMLRCDENNSYQQWRFKE 270
N+ T + W + L + LK CLS HP +++ C+ ++ Q+W F +
Sbjct: 580 NTIVTGEQVWEIQKD-QLLYNPFLKLCLSANGKHP-----SLVSCNPSDPLQKWIFNQ 631
>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
Length = 578
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 97/229 (42%), Gaps = 44/229 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDDYKEHFYNRNPPARKETYGDISERKLLRERLQCKSFDWYLKNVFSNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 494
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
E C + +Q+ V M P L +N+ F GT+
Sbjct: 495 RFNSVTELCAEVPQQKDYVGMQNCPKD------------GLPIPVNIIWHFKEDGTI--- 539
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
FH TR CLS + + + M CD + Q WRF++
Sbjct: 540 --FHPHTRL--------CLSAYRTADGRPSVQMKTCDALDKNQIWRFEK 578
>gi|348544073|ref|XP_003459506.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3-like
[Oreochromis niloticus]
Length = 600
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 62/228 (27%), Positives = 84/228 (36%), Gaps = 45/228 (19%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
LR W+ +K L + M +D GDIS++VALR L CK F W++ +VY
Sbjct: 401 LRVAEVWMDEYKSNVYLAWNIPMENHGIDYGDISQRVALRKSLQCKSFHWYLD----NVY 456
Query: 111 TKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL-- 167
+ LF+GE +N + C+D G CHG G QL R EGQL
Sbjct: 457 PEMRRYNDTLFYGEVRNSKASHLCMDQ-GVKENHTATLHPCHGWG-PQLGRYTKEGQLFL 514
Query: 168 ------GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
G RCV D+ ++L NC +
Sbjct: 515 GPLGSTGEDTRCVVDDQISNFPQLL--------------------------NCEKVTNVK 548
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
W F + + A RCL V P L + S Q+W K
Sbjct: 549 QKTWHFSQNESIINRAT-GRCLEVVPANVYFGHLLVLQPCSGQRWTIK 595
>gi|395740752|ref|XP_003780818.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12, partial [Pongo
abelii]
Length = 538
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 338 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 395
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F+G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 396 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 453
>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
livia]
Length = 560
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 94/226 (41%), Gaps = 37/226 (16%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR+ L+CK F WFM VA+D+ +P + PP
Sbjct: 340 WMDEYAEYIYQRRPEYRHLSAGDVAAQKELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPP 399
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGERCV 174
WGEA++ T +GT C HG+ S L RL E CV
Sbjct: 400 AAAWGEARDSATSLLFQIRN------VGTGLCVDTKHGALGSPL-RL---------ENCV 443
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKL-------IFC----RLGTVLNLFNCFNSSGTVDG 223
D+ E +++ TF ++ FC + + L++C G +
Sbjct: 444 -KDRGEAAWNNVQV-FTFSWREDIRPGDPQHTKKFCFDAISHSSPVTLYDCHGMKG--NQ 499
Query: 224 PWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
WR+ + L H + C+ ++ M C+ ++ QQW F+
Sbjct: 500 LWRYRKD-KTLFHPVSSSCMDCSESDRKIFMNSCNPSSPTQQWIFE 544
>gi|195115611|ref|XP_002002350.1| GI13183 [Drosophila mojavensis]
gi|193912925|gb|EDW11792.1| GI13183 [Drosophila mojavensis]
Length = 655
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 95/231 (41%), Gaps = 37/231 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMF-LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K+ P +D GD++ Q A+R++L CK F W+MK VA D KF
Sbjct: 439 RVAEVWMDEYKEFVYKRDPATYNNIDAGDLTRQRAVRERLQCKSFDWYMKEVAPDFLVKF 498
Query: 114 PEL-PPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
P + PP G ++L +CLD + + +G C + + +Q + L A +L
Sbjct: 499 PPVEPPAYASGAVESLAFKDQCLDCLNLGSNHPVGLYACAHNRTHPQENQNWALTAHREL 558
Query: 168 GH-GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWR 226
+ C+D + +++C + G + W
Sbjct: 559 RRLDDACLDVQDSHP------------------------NATVWMWDCHHQGG--NQFWY 592
Query: 227 FHNSTRQLEHAILKR-CLSVHPE--TNQLAMLRCDENNSYQQWRFKEVRPD 274
+ + L H R CL E +++ RCDE+N Q+WR +V D
Sbjct: 593 YDRKHQWLVHGHHGRNCLEAFVENGVSKVVTNRCDEHNERQRWRIGQVNDD 643
>gi|221330664|ref|NP_001137779.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
melanogaster]
gi|442625712|ref|NP_722910.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
melanogaster]
gi|25987157|gb|AAN75751.1|AF324752_1 N-acetylgalactosaminyltransferase [Drosophila melanogaster]
gi|220901927|gb|ACL82986.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
melanogaster]
gi|440213268|gb|AAN10370.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
melanogaster]
Length = 644
Score = 62.8 bits (151), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 3/99 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +KQ P LD GD++ Q +R++L CK F WFM VA D KF
Sbjct: 428 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 487
Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
P + PP+ G +N+ CLD+MG++ +G C
Sbjct: 488 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSC 526
>gi|354548807|gb|AER27632.1| AT25481p1 [Drosophila melanogaster]
Length = 666
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 3/99 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +KQ P LD GD++ Q +R++L CK F WFM VA D KF
Sbjct: 450 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 509
Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
P + PP+ G +N+ CLD+MG++ +G C
Sbjct: 510 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSC 548
>gi|51316006|sp|Q8IA42.2|GALT4_DROME RecName: Full=N-acetylgalactosaminyltransferase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4; Short=pp-GaNTase 4
gi|34042946|gb|AAQ56701.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 659
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 35/99 (35%), Positives = 50/99 (50%), Gaps = 3/99 (3%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +KQ P LD GD++ Q +R++L CK F WFM VA D KF
Sbjct: 443 RVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLKCKSFHWFMTEVAPDFLVKF 502
Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYC 150
P + PP+ G +N+ CLD+MG++ +G C
Sbjct: 503 PPVEPPSYAAGIIQNVANPVYCLDNMGKSTEEAVGMFSC 541
>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 493
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 60/127 (47%), Gaps = 10/127 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P+A + MG + E++ LR L C F W++ +V +P
Sbjct: 370 RAAEVWMDDYKKYYYATVPVARNVPMGSVEERLNLRKSLGCHSFQWYLDNV-------YP 422
Query: 115 ELPPNLFWGE--AKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL GE A Q CLD++G + +G CHGSG +Q + L A + HG
Sbjct: 423 ELKVPAAGGERLASLRQGQMCLDTLGGSEGNPVGLFTCHGSGGNQQWSL-ASRLIAHGGL 481
Query: 173 CVDADKQ 179
C+ D +
Sbjct: 482 CITVDAE 488
>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
Length = 637
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 98/223 (43%), Gaps = 20/223 (8%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ + + +PL D GD++ Q LR++L C+ F WF++ +A D+ ++P
Sbjct: 402 RVAEVWMDEYAKFLYERNPLFEKTDPGDLTAQRELRERLQCRPFKWFLEEIAPDLLIRYP 461
Query: 115 ELPPNLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGS----SQLFRL--NAEGQ 166
P F G +++ ++ CLDS+ A IG C + + +Q F L + + +
Sbjct: 462 VREPQPFASGRVQSVADRRLCLDSLNHQAKQPIGLYTCASNQTHPQNNQFFTLSFHRDIR 521
Query: 167 LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFN-CFNSSGTVDGPW 225
+ ++C+DA + V + F R V F+ NS+G D P+
Sbjct: 522 VRSNDKCLDASRLNDEV------ILFSCHESQGNQMWRYDYVSTEFSRKQNSTGISDTPF 575
Query: 226 RFHNSTRQLEHAILK--RCLSVHPETNQLAMLRCDENNSYQQW 266
+ ++ + H RCL ++ + CD +N Q W
Sbjct: 576 Q---ESKMIIHGKEHNGRCLEADIVKRKVTVGSCDRSNPAQSW 615
>gi|47221042|emb|CAG12736.1| unnamed protein product [Tetraodon nigroviridis]
Length = 611
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 59/227 (25%), Positives = 82/227 (36%), Gaps = 43/227 (18%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
LR W+ +K L + M +D GDIS++VALR L CK F W++ +VY
Sbjct: 412 LRVAEVWMDQYKSNVYLAWNIPMESHGIDYGDISQRVALRKSLQCKSFEWYLD----NVY 467
Query: 111 TKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL--- 167
+ LF+GE +N G CHG G QL R +GQL
Sbjct: 468 PEMRRYNNTLFYGEIRNSKVSHLCMDQGIKENHTATLHPCHGWG-PQLGRYTKDGQLFLG 526
Query: 168 -----GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
G RCV D+ + +V L NC ++
Sbjct: 527 PLGSTGEDTRCVVDDQ--------------------------ISSVPQLINCDRATDVKQ 560
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
W F + + A RCL V P L + + Q+W K
Sbjct: 561 MTWHFSQNESIVNQAT-GRCLEVVPANVYFGHLLILQPCTGQRWSIK 606
>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
Length = 553
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/127 (30%), Positives = 64/127 (50%), Gaps = 15/127 (11%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P+A + G+I++++ALR+ L+CK F W++++V +P
Sbjct: 362 RAAEVWMDDYKKYYFAAVPMARTVTFGNITDRLALRNSLNCKPFKWYVENV-------YP 414
Query: 115 EL---PPNLFWGEAKNLGTQK----CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL 167
EL P + N G K C D+ GR A + IG CH +G +Q + G+L
Sbjct: 415 ELLKHLPTVRDPSGTNSGAIKYKSLCFDTYGRGAGSHIGLYACHMTGGNQAWTY-LNGRL 473
Query: 168 GHGERCV 174
HG C+
Sbjct: 474 RHGSWCL 480
>gi|426362479|ref|XP_004048391.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Gorilla gorilla gorilla]
Length = 633
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 433 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 490
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F+G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 491 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMGQNQFFEYTSQKEI 548
>gi|300797173|ref|NP_001180032.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Bos
taurus]
gi|296472282|tpg|DAA14397.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Bos
taurus]
Length = 571
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL+CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|441593636|ref|XP_003260599.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 isoform
2 [Nomascus leucogenys]
Length = 495
Score = 62.0 bits (149), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 37/122 (30%), Positives = 60/122 (49%), Gaps = 8/122 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 295 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 352
Query: 114 PELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G T C D ++G CHG G +Q F ++ ++
Sbjct: 353 PEDRPG-FFGMLQNKGLTDYCFDYNPPDENQIMGHQVILYLCHGMGQNQFFEYTSQKEIR 411
Query: 169 HG 170
+
Sbjct: 412 YN 413
>gi|440891991|gb|ELR45390.1| Polypeptide N-acetylgalactosaminyltransferase 2, partial [Bos
grunniens mutus]
Length = 530
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL+CK F W++++V +P
Sbjct: 342 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 394
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 395 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 452
Query: 171 ERCV 174
+ C+
Sbjct: 453 DLCL 456
>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Papio anubis]
Length = 571
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DXCL 497
>gi|148670411|gb|EDL02358.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12, isoform CRA_b [Mus
musculus]
Length = 222
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+ V +++
Sbjct: 22 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 79
Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G + CLD V G CHG G +Q F ++
Sbjct: 80 PEDRPG-FFGMLQNRGLRGYCLDYNPPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIR 138
Query: 169 HGERCVDA 176
+ R +A
Sbjct: 139 YNTRQPEA 146
>gi|296190391|ref|XP_002743190.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Callithrix jacchus]
Length = 571
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 8/121 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 371 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRTKLQCKDFKWFLETVYPELHV-- 428
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 429 PEDRPG-FFGMLQNKGLKDYCFDYNPPDENQIVGYQVILYLCHGMGQNQFFEYTSQKEIR 487
Query: 169 H 169
+
Sbjct: 488 Y 488
>gi|391346326|ref|XP_003747427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Metaseiulus occidentalis]
Length = 622
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 60/123 (48%), Gaps = 13/123 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P A + GDISE++ LR+KL CK F W+MK+V +P
Sbjct: 411 RAAEVWMDDYKKYYYEQVPAAKSVAYGDISERLKLREKLRCKSFEWYMKNV-------YP 463
Query: 115 E--LPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHGE 171
E LP N+ +N +CLD++G + Y CH G +Q FRL L +
Sbjct: 464 ELKLPSNVHGYVRQN---NRCLDTLGAISDGSTVHVYPCHYLGGNQDFRLAKNHLLMVHD 520
Query: 172 RCV 174
CV
Sbjct: 521 MCV 523
>gi|390332984|ref|XP_784677.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Strongylocentrotus purpuratus]
Length = 572
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 33/124 (26%), Positives = 58/124 (46%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ F++ P A + G+I + LR+KLHCK F W++++V +P
Sbjct: 382 RAAEVWMDEFREFYYAAVPSARNVPFGNIQARTELREKLHCKPFKWYLENV-------YP 434
Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G + C+D++G +G CH SG +Q F + + H
Sbjct: 435 ELRVPDKQDIAFGSIQQ--GSMCMDTLGHVHQGTLGLYDCHNSGGNQEFSFTKDKLIKHS 492
Query: 171 ERCV 174
+ C+
Sbjct: 493 DLCL 496
>gi|380795279|gb|AFE69515.1| polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
mulatta]
Length = 243
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 59/119 (49%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 43 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 100
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQL 167
PE P F+G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 101 PEDRPG-FFGMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEI 158
>gi|194855550|ref|XP_001968569.1| GG24947 [Drosophila erecta]
gi|190660436|gb|EDV57628.1| GG24947 [Drosophila erecta]
Length = 659
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 92/227 (40%), Gaps = 39/227 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAM-FLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +KQ P +D GD++ Q +R++L CK F W+M VA D K+
Sbjct: 443 RVAEVWMDEYKQYVYNRDPTTYDNVDAGDLTRQRGIRERLKCKSFHWYMTEVAPDFLVKY 502
Query: 114 PEL-PPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
P + PP+ G +N+ CLD+MG++ +G C + + +Q + L+ L
Sbjct: 503 PPVEPPSYAEGVIQNVANPVYCLDNMGKSTEKEVGMFSCADNRTHPQPNQFWELSIFRDL 562
Query: 168 ---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
C+D + + + ++NC G +
Sbjct: 563 RMKSLDSVCLDVHEPQP------------------------NATVWMWNCHTQGG--NQF 596
Query: 225 WRFHNSTRQLEHA-ILKRCLS--VHPETNQLAMLRCDENNSYQQWRF 268
W + T++L H KRCL V ++ C+E N Q+W F
Sbjct: 597 WYYDRQTQRLVHGETNKRCLEGLVENGVARVVANACEEGNDRQRWEF 643
>gi|296490594|tpg|DAA32707.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Bos
taurus]
gi|440907905|gb|ELR57989.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Bos grunniens
mutus]
Length = 633
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 104/235 (44%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGRPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A L V+L C +
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA-----------------ALGAVQLKAC-------AYKGH 579
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ T + W + L + K CLS E +++ C+ ++S Q+W F +
Sbjct: 580 KTVATGEQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDSLQKWIFNQ 631
>gi|297668747|ref|XP_002812581.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Pongo abelii]
gi|297668749|ref|XP_002812582.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Pongo abelii]
gi|297668751|ref|XP_002812583.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
3 [Pongo abelii]
Length = 633
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 107/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G K +
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV------------- 583
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
T + W + L + LK CLS + E +++ CD ++ Q+W F +
Sbjct: 584 ----TGEQMWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCDPSDPLQKWIFSQ 631
>gi|126331345|ref|XP_001372222.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Monodelphis
domestica]
Length = 585
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 42/85 (49%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W +K P + L GDISE R+ +CK F WFM+ +AYD+ + +
Sbjct: 466 IRVVEVWWDGYKDYFYASRPESKALPYGDISELKKFREDHNCKSFKWFMEEIAYDIISHY 525
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMG 138
P P N+ WGEA L +D G
Sbjct: 526 PLPPKNVEWGEAPELMRMFGMDGQG 550
>gi|431909863|gb|ELK12965.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Pteropus alecto]
Length = 543
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 43/155 (27%), Positives = 73/155 (47%), Gaps = 14/155 (9%)
Query: 42 KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
KQ ++K +L +R W+ FK++ +P A GD++E+ LR KL CKDF
Sbjct: 329 KQAPYSRKKALANSVRAAEVWMDEFKELYYHRNPHARLEPFGDVTERRQLRAKLQCKDFK 388
Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
WF++ V +++ PE P F+G +N G + C D + G CHG G
Sbjct: 389 WFLETVYPELHV--PEDRPG-FFGMLQNRGLKDYCFDYNPPNEHDITGHQVLLYLCHGMG 445
Query: 155 SSQLFRLNAEGQLGHG----ERCVDADKQEKLVEM 185
+Q F ++ ++ + E C+ + ++ M
Sbjct: 446 QNQFFEYTSQREIRYNTHQPEACIAVEAGTDILIM 480
>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cricetulus griseus]
Length = 715
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 53/217 (24%), Positives = 90/217 (41%), Gaps = 30/217 (13%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD+ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 509 WMDEYAEYIYQRRPEYRHLSAGDVVAQKRLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPP 568
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRLNAEGQLGHGER 172
WGE +N+GT C D+ T+ + C G G S Q+F + + G
Sbjct: 569 AAAWGEIRNVGTGLCTDTKHGTSGLPLRLETCIRGRGEAAWNSMQVFTFTWKEDIRPG-- 626
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
D +KL C A + + L++C + G + W++ +
Sbjct: 627 --DPQHTKKL----------CFDA------VSHNSPVTLYDCHSMKG--NQLWKYRKD-K 665
Query: 233 QLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
L + C+ +++ M C+ ++ QQW F+
Sbjct: 666 TLYLPVSGGCMDCSESDHRIFMNTCNPSSLTQQWLFE 702
>gi|355567593|gb|EHH23934.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
mulatta]
Length = 457
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 94/229 (41%), Gaps = 43/229 (18%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 257 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 314
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 315 PEDRPGFF-GMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIR 373
Query: 169 HG----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTV 221
+ E C+ V AG+ ++ C N G++
Sbjct: 374 YNTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL 415
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
FH +++ A K + + +LR N+ +Q+W FKE
Sbjct: 416 -----FHEQSKKCVQAARKES-----SDSFVPLLRDCTNSDHQKWFFKE 454
>gi|302563901|ref|NP_001181506.1| polypeptide N-acetylgalactosaminyltransferase 12 [Macaca mulatta]
Length = 581
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 94/229 (41%), Gaps = 43/229 (18%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 381 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 438
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 439 PEDRPGFF-GMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIR 497
Query: 169 HG----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTV 221
+ E C+ V AG+ ++ C N G++
Sbjct: 498 YNTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL 539
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
FH +++ A K + + +LR N+ +Q+W FKE
Sbjct: 540 -----FHEQSKKCVQAARKES-----SDSFVPLLRDCTNSDHQKWFFKE 578
>gi|426256000|ref|XP_004021634.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Ovis
aries]
Length = 674
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL+CK F W++++V +P
Sbjct: 486 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 538
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 539 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 596
Query: 171 ERCV 174
+ C+
Sbjct: 597 DLCL 600
>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_a [Homo sapiens]
Length = 508
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455
Query: 171 ERCV 174
+ C+
Sbjct: 456 DLCL 459
>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
caballus]
Length = 539
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 351 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 403
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 404 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 461
Query: 171 ERCV 174
+ C+
Sbjct: 462 DLCL 465
>gi|344268030|ref|XP_003405867.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Loxodonta africana]
Length = 633
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 107/232 (46%), Gaps = 34/232 (14%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVRQKSFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
V +VY P+L P + G K+ G CLD +G P ++ T CHG G +Q
Sbjct: 490 NSVYPEVYV--PDLNP-VISGYIKSFGQHLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
F +A+ ++ H + Q++L C+ A + R T + +
Sbjct: 544 FEYSAQHEIRH-------NIQKEL----------CLHAAPGPVQLRTCT----YKGHKTV 582
Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L ++ LK CLS + E +++ C+ ++S+Q+W F +
Sbjct: 583 AAGEQIWEIQKD-QLLYNSFLKMCLSANGE--HPSLVSCNPSDSFQKWIFSQ 631
>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
And Ea2
gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
Length = 501
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 313 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 365
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 366 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 423
Query: 171 ERCV 174
+ C+
Sbjct: 424 DLCL 427
>gi|403258871|ref|XP_003921965.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Saimiri
boliviensis boliviensis]
Length = 633
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 106/229 (46%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + LK CLS + E +++ C+ ++S Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDSLQKWIFSQ 631
>gi|348569970|ref|XP_003470770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Cavia porcellus]
Length = 579
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/128 (29%), Positives = 63/128 (49%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL C+DF WF+++V +++
Sbjct: 379 VRAAEVWMDEFKELYYHRNPRARLEPYGDVTERRELRAKLRCRDFRWFLENVYPELHV-- 436
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G ++ C D V G CHG G +Q F ++ ++
Sbjct: 437 PEDRPGFF-GMLQNKGMKEYCFDYNPPDENQVWGHQVILYNCHGMGQNQFFEYTSQKEIR 495
Query: 169 HGERCVDA 176
+ R +A
Sbjct: 496 YNTRQPEA 503
>gi|281341254|gb|EFB16838.1| hypothetical protein PANDA_002911 [Ailuropoda melanoleuca]
Length = 496
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR +L CKDF WF+++V +++
Sbjct: 297 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHV-- 354
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 355 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIR 413
Query: 169 HG----ERCVDADKQEKLVEM 185
+ E C+ + + ++ M
Sbjct: 414 YNTHQPEACIAVEAGKDVLIM 434
>gi|397508104|ref|XP_003824510.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
paniscus]
Length = 533
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455
Query: 171 ERCV 174
+ C+
Sbjct: 456 DLCL 459
>gi|395728898|ref|XP_002809364.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Pongo abelii]
Length = 532
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 344 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 396
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 397 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 454
Query: 171 ERCV 174
+ C+
Sbjct: 455 DLCL 458
>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
4 [Pan troglodytes]
Length = 533
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455
Query: 171 ERCV 174
+ C+
Sbjct: 456 DLCL 459
>gi|350592744|ref|XP_001927809.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Sus
scrofa]
Length = 571
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|332812181|ref|XP_003308857.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
troglodytes]
gi|410227516|gb|JAA10977.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410264536|gb|JAA20234.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410296424|gb|JAA26812.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 388 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 440
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 441 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 498
Query: 171 ERCV 174
+ C+
Sbjct: 499 DLCL 502
>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455
Query: 171 ERCV 174
+ C+
Sbjct: 456 DLCL 459
>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
Length = 533
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455
Query: 171 ERCV 174
+ C+
Sbjct: 456 DLCL 459
>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
gorilla gorilla]
Length = 533
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 345 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 397
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 398 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 455
Query: 171 ERCV 174
+ C+
Sbjct: 456 DLCL 459
>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 571
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|148670410|gb|EDL02357.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12, isoform CRA_a [Mus
musculus]
Length = 259
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+ V +++
Sbjct: 59 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 116
Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G + CLD V G CHG G +Q F ++
Sbjct: 117 PEDRPG-FFGMLQNRGLRGYCLDYNPPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIR 175
Query: 169 HGERCVDA 176
+ R +A
Sbjct: 176 YNTRQPEA 183
>gi|291402210|ref|XP_002717436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oryctolagus cuniculus]
Length = 571
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|4758412|ref|NP_004472.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Homo
sapiens]
gi|51315838|sp|Q10471.1|GALT2_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|971461|emb|CAA59381.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|26996816|gb|AAH41120.1| GALNT2 protein [Homo sapiens]
gi|119590317|gb|EAW69911.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_c [Homo sapiens]
gi|239740418|gb|ACS13744.1| polypeptide N-acetylgalactosaminyltransferase 2 [Homo sapiens]
gi|307686451|dbj|BAJ21156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [synthetic
construct]
Length = 571
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
[Macaca mulatta]
Length = 554
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 366 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 418
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 419 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 476
Query: 171 ERCV 174
+ C+
Sbjct: 477 DLCL 480
>gi|158261119|dbj|BAF82737.1| unnamed protein product [Homo sapiens]
Length = 571
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
lupus familiaris]
Length = 588
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 400 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 452
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 453 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 510
Query: 171 ERCV 174
+ C+
Sbjct: 511 DLCL 514
>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
griseus]
Length = 494
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 96/228 (42%), Gaps = 32/228 (14%)
Query: 51 SLELRKRSS--WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
SL+ KR + W+ + + P L GD+ Q LR L+CK F WFM +A+D
Sbjct: 277 SLQNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVVAQKRLRGSLNCKSFKWFMTKIAWD 336
Query: 109 VYTKFPEL-PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYC-HGSG-----SSQLFRL 161
+ +P + PP WGE +N+GT C D+ T+ + C G G S Q+F
Sbjct: 337 LPKFYPPVEPPAAAWGEIRNVGTGLCTDTKHGTSGLPLRLETCIRGRGEAAWNSMQVFTF 396
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
+ + G D +KL C A + + L++C + G
Sbjct: 397 TWKEDIRPG----DPQHTKKL----------CFDA------VSHNSPVTLYDCHSMKG-- 434
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFK 269
+ W++ + L + C+ +++ M C+ ++ QQW F+
Sbjct: 435 NQLWKYRKD-KTLYLPVSGGCMDCSESDHRIFMNTCNPSSLTQQWLFE 481
>gi|332265853|ref|XP_003281928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
2 [Nomascus leucogenys]
Length = 571
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|386780726|ref|NP_001248284.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Macaca
mulatta]
gi|384941838|gb|AFI34524.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
gi|387540526|gb|AFJ70890.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
Length = 571
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
Length = 530
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 342 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 394
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 395 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 452
Query: 171 ERCV 174
+ C+
Sbjct: 453 DLCL 456
>gi|410342331|gb|JAA40112.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410342333|gb|JAA40113.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 388 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 440
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 441 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 498
Query: 171 ERCV 174
+ C+
Sbjct: 499 DLCL 502
>gi|332265851|ref|XP_003281927.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 368 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 420
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 421 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 478
Query: 171 ERCV 174
+ C+
Sbjct: 479 DLCL 482
>gi|355767580|gb|EHH62635.1| hypothetical protein EGM_21033, partial [Macaca fascicularis]
Length = 453
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 265 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 317
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 318 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 375
Query: 171 ERCV 174
+ C+
Sbjct: 376 DLCL 379
>gi|260814831|ref|XP_002602117.1| hypothetical protein BRAFLDRAFT_128843 [Branchiostoma floridae]
gi|229287423|gb|EEN58129.1| hypothetical protein BRAFLDRAFT_128843 [Branchiostoma floridae]
Length = 301
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/112 (34%), Positives = 61/112 (54%), Gaps = 9/112 (8%)
Query: 64 FKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWG 123
+K+I P A+ +D G+I+EQ+ R K C+ F W+++++ D+ P++ G
Sbjct: 121 YKKIFYALMPEALHIDYGNITEQLEFRRK--CRPFKWYLENIYPDL--GVPQINLQAV-G 175
Query: 124 EAKNLGTQK-CLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCV 174
E +NLG Q C+D++G A +G CHG G Q F L+ G L G CV
Sbjct: 176 EIRNLGQQDVCVDTLG---GAHLGLWPCHGQGEHQSFSLDKSGTLKSGSLCV 224
>gi|326674124|ref|XP_696189.4| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3-like
[Danio rerio]
Length = 604
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 60/230 (26%), Positives = 84/230 (36%), Gaps = 45/230 (19%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
LR W+ +K L L M +D GD+SE++ALR +L CK F W++ ++Y
Sbjct: 401 LRVAEVWMDQYKSNVYLAWNLPMKNHGIDYGDVSERLALRKRLQCKSFDWYLD----NIY 456
Query: 111 TKFPELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQL-- 167
+ +F+GE +N T C+D G CHG G QL R EG L
Sbjct: 457 PEMRRYNNTVFYGEIRNTNVTHLCVDQ-GIKENHTATLHPCHGWG-PQLGRYTKEGYLFL 514
Query: 168 ------GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
G RCV DK + + L NC S
Sbjct: 515 GPLGSTGEDTRCVVDDK--------------------------ISSYPQLLNCEKVSSVR 548
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
W F + + A RCL V P + + Q+W K +
Sbjct: 549 QKTWHFAQNESIISRAT-GRCLEVVPANVYFGYALILRSCTGQKWNIKNL 597
>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 89/213 (41%), Gaps = 47/213 (22%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE-LPP 118
W+ + + P +D GD+S+ ALR +L CK F W++K+V D+ +P LPP
Sbjct: 405 WMDEYAEFFYRRRPYVRKIDAGDLSKAKALRKELKCKSFDWYIKNVIPDLVQYYPPILPP 464
Query: 119 NLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADK 178
+ WG K++ + C+D + V+ S C EG + C+DA
Sbjct: 465 SAAWGRLKHVVSNLCIDPQVKKGSQVV-VSQCQ----------TPEGAV---RTCLDASY 510
Query: 179 QEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAI 238
+ K ++L ++C N G + W++ +QL H
Sbjct: 511 RSK-------------------------SILT-WDCHNQHG--NQLWKYFE--KQLIHPS 540
Query: 239 LKRCLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
K+C +V + L M+ C + +W ++ +
Sbjct: 541 SKKCATV--ASGALLMMPCSPGDRLFEWEWEHI 571
>gi|402896867|ref|XP_003911504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Papio
anubis]
Length = 581
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 93/229 (40%), Gaps = 43/229 (18%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 381 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 438
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 439 PEDRPGFF-GMLQNQGLKDYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIR 497
Query: 169 HG----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTV 221
+ E C+ V AG+ ++ C N G++
Sbjct: 498 YNTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL 539
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
FH +++ A K + +LR N+ +Q+W FKE
Sbjct: 540 -----FHEQSKKCVQAARKES-----SDGFVPLLRDCTNSDHQKWFFKE 578
>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
Length = 569
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 381 RAAEVWMDEFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 433
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 434 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 491
Query: 171 ERCV 174
+ C+
Sbjct: 492 DLCL 495
>gi|441612314|ref|XP_004088076.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Nomascus leucogenys]
Length = 570
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 382 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 434
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 492
Query: 171 ERCV 174
+ C+
Sbjct: 493 DLCL 496
>gi|149045853|gb|EDL98853.1| rCG55041, isoform CRA_a [Rattus norvegicus]
gi|149045854|gb|EDL98854.1| rCG55041, isoform CRA_a [Rattus norvegicus]
Length = 284
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+ V +++
Sbjct: 84 VRAAEVWMDDFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 141
Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G + CLD + V G CHG G +Q F + ++
Sbjct: 142 PEDRPG-FFGMLENRGLRGYCLDYNPPSENNVEGHQVLLYLCHGMGQNQFFEYTSRQEIR 200
Query: 169 HGERCVDA 176
+ R +A
Sbjct: 201 YNTRQPEA 208
>gi|126307024|ref|XP_001369295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Monodelphis domestica]
Length = 571
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 9/122 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL+CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 435
Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P + CLD++G A V+G CH SG +Q + L + + H +
Sbjct: 436 ELRVPDHQDIAFGALQQGNNCLDTLGHFADGVVGVYECHNSGGNQEWALTKDKSVKHMDL 495
Query: 173 CV 174
C+
Sbjct: 496 CL 497
>gi|392347955|ref|XP_232988.5| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 579
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+ V +++
Sbjct: 376 VRAAEVWMDDFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 433
Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G + CLD + V G CHG G +Q F + ++
Sbjct: 434 PEDRPG-FFGMLENRGLRGYCLDYNPPSENNVEGHQVLLYLCHGMGQNQFFEYTSRQEIR 492
Query: 169 HGERCVDA 176
+ R +A
Sbjct: 493 YNTRQPEA 500
>gi|403300209|ref|XP_003940844.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Saimiri
boliviensis boliviensis]
Length = 724
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 536 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 588
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 589 ELRVPDHQDIAFG-ALQQGT-NCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 646
Query: 171 ERCV 174
+ C+
Sbjct: 647 DLCL 650
>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
Length = 577
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/111 (35%), Positives = 58/111 (52%), Gaps = 6/111 (5%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K++ L P D+GD+SE+ ALR KL CK F W++ +V + KF
Sbjct: 391 WMDDYKRLYYLHRPDLKTKDVGDLSERKALRKKLKCKSFKWYLDNV---IPHKFIPDEGV 447
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVI--GTSYCHGSGSS-QLFRLNAEGQL 167
+ +G +N + CLD++ R + I G C GSS Q+F L GQL
Sbjct: 448 VGYGALRNPNSGLCLDTLQRDEKSTITLGIFACQTGGSSAQVFSLTKSGQL 498
>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 504
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 316 RAAEVWMDEFKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 368
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 369 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 426
Query: 171 ERCV 174
+ C+
Sbjct: 427 DLCL 430
>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
Length = 688
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 50/208 (24%), Positives = 90/208 (43%), Gaps = 34/208 (16%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
D GD++++ LR L CK F W++ ++ +PEL ++ GE +NLG + C
Sbjct: 485 DWGDVTDRKKLRADLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 537
Query: 134 LD--SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVT 191
LD S + +GT CH G +Q+ L C+DA + + + +
Sbjct: 538 LDAPSGKKHQKKAVGTYPCHRQGGNQIANLPT-------GMCLDAKELSTEGDDTSVSIY 590
Query: 192 FCVLAGVK--LIFCRLGTVLNLFNCFNSSGTV-----------DGPWRFHNSTRQLEHAI 238
C G + + G + +C + +G + W + +T+QL H
Sbjct: 591 ECHGQGGNQYWMLSKTGEIRRDDSCLDYAGKEVTLYPCHGGKGNQFWSYRANTKQLHHGT 650
Query: 239 LKRCLSVHPETNQLAMLRCDENNSYQQW 266
+CL++ ++L M C+ + + QQW
Sbjct: 651 SGKCLAISDSKDKLQMEECNVSVARQQW 678
>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
catus]
Length = 653
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 465 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 517
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 518 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 575
Query: 171 ERCV 174
+ C+
Sbjct: 576 DLCL 579
>gi|301758254|ref|XP_002914993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Ailuropoda melanoleuca]
Length = 540
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 38/141 (26%), Positives = 68/141 (48%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR +L CKDF WF+++V +++
Sbjct: 341 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRARLQCKDFRWFLENVYPELHV-- 398
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D ++G CHG G +Q F ++ ++
Sbjct: 399 PEDRPGFF-GMLQNKGLKDYCFDYNPPNENQIVGHQVLLYLCHGLGQNQFFEYTSQEEIR 457
Query: 169 HG----ERCVDADKQEKLVEM 185
+ E C+ + + ++ M
Sbjct: 458 YNTHQPEACIAVEAGKDVLIM 478
>gi|109476381|ref|XP_001066416.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 576
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 62/128 (48%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+ V +++
Sbjct: 376 VRAAEVWMDDFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 433
Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F+G +N G + CLD + V G CHG G +Q F + ++
Sbjct: 434 PEDRPG-FFGMLENRGLRGYCLDYNPPSENNVEGHQVLLYLCHGMGQNQFFEYTSRQEIR 492
Query: 169 HGERCVDA 176
+ R +A
Sbjct: 493 YNTRQPEA 500
>gi|395519661|ref|XP_003763961.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Sarcophilus harrisii]
Length = 631
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 56/227 (24%), Positives = 105/227 (46%), Gaps = 44/227 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ FK+I + + + GDIS+++ ++ +L CK+F+W++ +V +
Sbjct: 434 QVRLAEVWMDEFKEIFYRRNTEAAKIVKQKTFGDISKRLEIKHRLQCKNFTWYLNNVYPE 493
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
+Y P+L P + G +N G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 494 IYV--PDLNP-VISGYIQNKGRHLCLD-VGENNLGGKPLIMYT--CHGLGGNQYFEYSAQ 547
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
++ H + +QE + ++ PV C G K + T+D
Sbjct: 548 HEIRH------SIQQELCLHAVQGPVQLNTCSYKGQKTL------------------TID 583
Query: 223 -GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W + L + L CL+ + E +++ C+ ++ +QQW F
Sbjct: 584 VQKWEIQKD-QLLYNPSLNLCLTGNGE--HPSLVPCNPSDVFQQWDF 627
>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
4 [Ciona intestinalis]
Length = 582
Score = 60.8 bits (146), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 98/227 (43%), Gaps = 47/227 (20%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
LR W+ +K+ + +P A + GDIS + LR+ L C DF W++ +V D++
Sbjct: 386 LRAAEVWMDDYKRHFYIRNPPASKENYGDISARKDLRNSLQCHDFKWYLDNVYPDLHV-- 443
Query: 114 PELPPNLFWGEAKNLGTQK-CLD--------SMGRTAPAVIGTSYCHGSGSSQLFRLNAE 164
PE P ++G +N G CLD + GR ++ G CHG G +Q F ++
Sbjct: 444 PEDRPG-YYGAFRNSGMSSFCLDYAPPQHNPTGGRV--SIFG---CHGQGGNQFFEYTSK 497
Query: 165 GQL---GHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFN--CFNSSG 219
++ E C+ A + + + + +P A + + GT+ ++ N C ++S
Sbjct: 498 REVRFNSEKEMCMSAVEDDTITMLDCVPDQRDSPASQEWVHSHDGTIQSVMNDKCLHASM 557
Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ DG PE L + CD + +QQW
Sbjct: 558 SKDG----------------------KPE---LLLKTCDPGSRFQQW 579
>gi|27370010|ref|NP_766281.1| polypeptide N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|51315979|sp|Q8BGT9.1|GLT12_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
AltName: Full=Polypeptide GalNAc transferase 12;
Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 12;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 12
gi|26329325|dbj|BAC28401.1| unnamed protein product [Mus musculus]
gi|26334957|dbj|BAC31179.1| unnamed protein product [Mus musculus]
gi|33991661|gb|AAH56425.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|52851351|dbj|BAD52068.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74140287|dbj|BAE33836.1| unnamed protein product [Mus musculus]
Length = 576
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/128 (31%), Positives = 59/128 (46%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF+ V +++
Sbjct: 376 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERKKLRAKLQCKDFKWFLDTVYPELHV-- 433
Query: 114 PELPPNLFWGEAKNLGTQ-KCLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + CLD V G CHG G +Q F ++
Sbjct: 434 PEDRPGFF-GMLQNRGLRGYCLDYNPPNENHVEGHQVLLYLCHGMGQNQFFEYTTRKEIR 492
Query: 169 HGERCVDA 176
+ R +A
Sbjct: 493 YNTRQPEA 500
>gi|291382916|ref|XP_002708201.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oryctolagus cuniculus]
Length = 476
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 66/141 (46%), Gaps = 12/141 (8%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 277 VRAAEVWMDEFKELYYHRNPRARLEPFGDVTERRQLRAKLQCKDFKWFLETVYPELHV-- 334
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D + G CHG G +Q F ++ ++
Sbjct: 335 PEDRPGFF-GMLQNKGLKNFCFDYNPPDENQITGHQVILYTCHGMGQNQFFEYTSQMEIR 393
Query: 169 HG----ERCVDADKQEKLVEM 185
+ E CV + + ++ M
Sbjct: 394 YNTHQPEGCVAVEADKDVLVM 414
>gi|301611308|ref|XP_002935181.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Xenopus (Silurana) tropicalis]
Length = 532
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 62/133 (46%), Gaps = 12/133 (9%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R WL +K+I +P A GD+SE++ L+++L CK F WF+ ++ +++
Sbjct: 364 VRAAEVWLDEYKEIYYHRNPHARKEPYGDVSERLQLKERLQCKHFKWFLDNIYPELHV-- 421
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G KN G + C D V G CHG G +Q F + ++
Sbjct: 422 PEDKPGRF-GMLKNKGMENVCFDYNPPNENDVSGQQIILYPCHGMGQNQFFEYTSYNEIR 480
Query: 169 HGER----CVDAD 177
+ R C AD
Sbjct: 481 YNTRQPEGCASAD 493
>gi|395736404|ref|XP_002816158.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pongo
abelii]
Length = 527
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 1/78 (1%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PP 118
W+ + + P L GD++ Q LR L+CK F WFM +A+D+ +P + PP
Sbjct: 262 WMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRTSLNCKSFKWFMTKIAWDLPKFYPPVEPP 321
Query: 119 NLFWGEAKNLGTQKCLDS 136
WGE +N+GT C D+
Sbjct: 322 AAAWGEIRNVGTGLCADT 339
>gi|149730677|ref|XP_001496099.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Equus
caballus]
Length = 633
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 105/229 (45%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ A++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFAIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K+ G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSFGQSLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTV-----------------ATG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + L+ CL+ + E +++ C+ ++ Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLQMCLTANGE--HPSLVSCNPSDPLQKWIFSQ 631
>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
Length = 597
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 77/192 (40%), Gaps = 67/192 (34%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
D GD+S + LR L CK F W++ ++ +PEL
Sbjct: 458 DFGDVSARKKLRSDLGCKSFKWYLDNI-------YPEL---------------------- 488
Query: 139 RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGV 198
V G S G + + L+ +G++ E CVD
Sbjct: 489 ----FVPGESVAKG----EYWMLSKDGEIRRDESCVDY---------------------- 518
Query: 199 KLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCD 258
G+ + +F C G + WR+++ T +L+HA+ ++CL + + +L M+ C
Sbjct: 519 ------AGSDVMVFPCHGMKGNQE--WRYNHDTGRLQHAVSQKCLGMTRDGAKLEMVTCQ 570
Query: 259 ENNSYQQWRFKE 270
++ YQ W+FKE
Sbjct: 571 YDDPYQHWKFKE 582
>gi|432111808|gb|ELK34851.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Myotis davidii]
Length = 539
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 363 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFRWYLENV-------YP 415
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 416 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 473
Query: 171 ERCV 174
+ C+
Sbjct: 474 DLCL 477
>gi|268564602|ref|XP_002647197.1| C. briggsae CBR-GLY-10 protein [Caenorhabditis briggsae]
Length = 623
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/220 (22%), Positives = 87/220 (39%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P D GD+ +R+KL CK F WFMK +A+D +P + P
Sbjct: 413 WMDEYKETLYKHRPGIGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAIEPK 472
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER-- 172
GE +++ + C+D+ + G C G Q RL + R
Sbjct: 473 ASAEGEIRHVASNLCIDTQFKEQNQRFGLRKCASEDKDGGGEQDLRLTRWHDIRPKGRKI 532
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C D + K P+ LF+C + G + +++ +
Sbjct: 533 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRMPQK 566
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
Q+ H + +CL+ L M +CD ++ QQW ++ +
Sbjct: 567 QIYHPVSGQCLTADENGKGFLHMKKCDSSSKLQQWAWQTI 606
>gi|432110716|gb|ELK34193.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Myotis davidii]
Length = 466
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 70/147 (47%), Gaps = 14/147 (9%)
Query: 42 KQVISTQKSSL--ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFS 99
KQ ++K +L +R W+ FK++ +P A GD++E+ LR KL CKDF
Sbjct: 252 KQAPYSRKKALANSVRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFK 311
Query: 100 WFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSG 154
WF++ V +++ PE P F G +N G + C D + + G CHG G
Sbjct: 312 WFLETVYPELHV--PEDRPGFF-GMLQNKGLKDYCFDYNPPSEHDLTGHQVLLYLCHGMG 368
Query: 155 SSQLFRLNAEGQLGHG----ERCVDAD 177
+Q F ++ ++ + E C+ +
Sbjct: 369 QNQFFEHTSQNEIRYNTHQPEACIAVE 395
>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 582
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/179 (27%), Positives = 84/179 (46%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 385 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHV--P 442
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD A + CHG G +Q F + ++
Sbjct: 443 EDRPG-WHGAIRSMGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 501
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+K V M P F V + F GT+ + F+ C ++ T +G
Sbjct: 502 SVTELCAEVPEQKKHVGMQNCPKDGFPVPTNIIWHFKEDGTIFHPFSGLCLSAYRTPEG 560
>gi|261260064|sp|A8Y236.2|GLT10_CAEBR RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
10; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 10; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
Length = 629
Score = 60.8 bits (146), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 50/220 (22%), Positives = 87/220 (39%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P D GD+ +R+KL CK F WFMK +A+D +P + P
Sbjct: 419 WMDEYKETLYKHRPGIGNADAGDLKLMKGIREKLQCKSFDWFMKEIAFDQDKYYPAIEPK 478
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGS----GSSQLFRLNAEGQLGHGER-- 172
GE +++ + C+D+ + G C G Q RL + R
Sbjct: 479 ASAEGEIRHVASNLCIDTQFKEQNQRFGLRKCASEDKDGGGEQDLRLTRWHDIRPKGRKI 538
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C D + K P+ LF+C + G + +++ +
Sbjct: 539 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRMPQK 572
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
Q+ H + +CL+ L M +CD ++ QQW ++ +
Sbjct: 573 QIYHPVSGQCLTADENGKGFLHMKKCDSSSKLQQWAWQTI 612
>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
norvegicus]
Length = 578
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 97/229 (42%), Gaps = 44/229 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R+ W+ +K+ +P A DISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 REAEVWMDDYKEHFYNRNPPARKETYDDISERKLLRERLQCKSFDWYLKNVFSNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 494
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
E C + +Q+ V M P L +N+ F GT+
Sbjct: 495 RFNSVTELCAEVPQQKDYVGMQNCPKD------------GLPIPVNIIWHFKEDGTI--- 539
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
FH TR CLS + + + M CD + Q WRF++
Sbjct: 540 --FHPHTRL--------CLSAYRTADGRPSVQMKTCDALDKNQIWRFEK 578
>gi|354468855|ref|XP_003496866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Cricetulus griseus]
gi|344247257|gb|EGW03361.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Cricetulus
griseus]
Length = 535
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 347 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLDNV-------YP 399
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 400 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 457
Query: 171 ERCV 174
+ C+
Sbjct: 458 DLCL 461
>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
Length = 578
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 60/229 (26%), Positives = 96/229 (41%), Gaps = 44/229 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GD+SE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDLSERKLLRERLKCKSFDWYLKNVFSNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 494
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
E C + +Q+ V M P L +N+ F GT+
Sbjct: 495 RFNSVTELCAEVPQQKDYVGMQNCPKD------------GLPVPVNIIWHFKEDGTI--- 539
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
FH TR CLS + + M CD + Q WRF++
Sbjct: 540 --FHPHTRL--------CLSAYRTAEGRPSVHMKTCDALDKNQLWRFEK 578
>gi|395531657|ref|XP_003767891.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Sarcophilus harrisii]
Length = 542
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 9/122 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 354 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 406
Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P + CLD++G A V+G CH SG +Q + L + + H +
Sbjct: 407 ELRVPDHQDIAFGALQQGNNCLDTLGHFADGVVGVYECHNSGGNQEWALTKDKSVKHMDL 466
Query: 173 CV 174
C+
Sbjct: 467 CL 468
>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
Length = 534
Score = 60.5 bits (145), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 55/104 (52%), Gaps = 7/104 (6%)
Query: 40 ADKQVISTQKSSLELRKRSSWLKVFKQI---RSLGSPLAMF-LDMGDISEQVALRDKLHC 95
D V+ K+ + R W+ +KQ R L P + D+GD+S Q LR +L C
Sbjct: 309 TDNDVLQILKNYM--RVAEVWMDDYKQYYYKRMLRGPKNVTNFDLGDLSSQKPLRQRLGC 366
Query: 96 KDFSWFMKHVAYDVYTKFPELPPNLF-WGEAKNLGTQKCLDSMG 138
+DF WFM+ VA D+ +P P++ G +++GT CLDS G
Sbjct: 367 RDFGWFMREVASDLVKHYPLKDPDVLQQGRIQSVGTGLCLDSDG 410
>gi|308452095|ref|XP_003088913.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
gi|308244364|gb|EFO88316.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
Length = 620
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 52/223 (23%), Positives = 87/223 (39%), Gaps = 34/223 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P D GD+ +R+KL CK F WFMK +A+D +P + P
Sbjct: 410 WMDEYKETLYKHRPGVGSADAGDLKLMKGVREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 469
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
GE ++ + C+D+ + G C G Q RL + R
Sbjct: 470 ASAQGEIRHGASNLCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDVRPKGRKI 529
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C D + K P+ LF+C + G + +++ S +
Sbjct: 530 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRVSQK 563
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEVRPD 274
Q+ H + +CLS L M +CD + Q+W ++ + D
Sbjct: 564 QIYHPVSGQCLSADENGKGFLYMKKCDSTSDLQKWTWQTIDND 606
>gi|195148068|ref|XP_002014996.1| GL18655 [Drosophila persimilis]
gi|194106949|gb|EDW28992.1| GL18655 [Drosophila persimilis]
Length = 646
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 91/229 (39%), Gaps = 38/229 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K+ + S +D GD++ Q A+R++L CK F W++K VA D KF
Sbjct: 432 RVAEVWMDEYKEHVYSRDPQTYNNIDAGDLTRQRAIRERLQCKSFDWYLKEVAPDFLKKF 491
Query: 114 PELP-PNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
P + PN G +N+ CLD+M A +G C + + +Q + L + +L
Sbjct: 492 PPVELPNYASGAFQNVAFATHCLDTMNLGAHNAVGLYSCAENRTHPQFNQFWLLTPDREL 551
Query: 168 GH--GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
H C+D + +++C N G + W
Sbjct: 552 RHNGASNCLDVQD------------------------VHANATVWMWSCHNQGG--NQFW 585
Query: 226 RFHNSTRQLEHAIL-KRCLS--VHPETNQLAMLRCDENNSYQQWRFKEV 271
+ + + L HA KRC+ V + CD N Q+W V
Sbjct: 586 YYDRAHKWLVHAPNGKRCMEGIVENGIGTVYTNECDPANDRQKWEIGYV 634
>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
Length = 571
Score = 60.5 bits (145), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 56/122 (45%), Gaps = 9/122 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P + CLD++G A V+G CH +G +Q + L E + H +
Sbjct: 436 ELRVPDHQDIALGXLQQGTNCLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHMDL 495
Query: 173 CV 174
C+
Sbjct: 496 CL 497
>gi|198474477|ref|XP_001356707.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
gi|198138408|gb|EAL33772.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
Length = 646
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 91/229 (39%), Gaps = 38/229 (16%)
Query: 55 RKRSSWLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
R W+ +K+ + S +D GD++ Q A+R++L CK F W++K VA D KF
Sbjct: 432 RVAEVWMDEYKEHVYSRDPQTYNNIDAGDLTRQRAIRERLQCKSFDWYLKEVAPDFLKKF 491
Query: 114 PELP-PNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSYCHGSGS----SQLFRLNAEGQL 167
P + PN G +N+ CLD+M A +G C + + +Q + L + +L
Sbjct: 492 PPVELPNYASGAFQNVAFATHCLDTMNLGAHNAVGLYSCAENRTHPQFNQFWLLTPDREL 551
Query: 168 GH--GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
H C+D + +++C N G + W
Sbjct: 552 RHNGASNCLDVQDMHA------------------------NATVWMWSCHNQGG--NQFW 585
Query: 226 RFHNSTRQLEHAIL-KRCLS--VHPETNQLAMLRCDENNSYQQWRFKEV 271
+ + + L HA KRC+ V + CD N Q+W V
Sbjct: 586 YYDRAHKWLVHAPNGKRCMEGIVENGIGTVYTNECDPANDRQKWEIGYV 634
>gi|417403505|gb|JAA48553.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 633
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 104/229 (45%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
Y P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 AYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----NACTYKGHKTV-----------------ATG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 586 EQKWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631
>gi|410910520|ref|XP_003968738.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3-like
[Takifugu rubripes]
Length = 598
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 87/251 (34%), Gaps = 43/251 (17%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H R I+ LR W+ ++ L + M +D GDIS++
Sbjct: 375 RVAHIARVKKPYHSNIAYHTRRNALRVAEVWMDEYRSNVYLAWNIPMENHGIDYGDISQR 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIG 146
VALR L CK F W+++ +VY + LF+GE +N G
Sbjct: 435 VALRKSLQCKSFEWYLE----NVYPEMRRYNNTLFYGEIRNSKVSHLCMDQGIKENHTAT 490
Query: 147 TSYCHGSGSSQLFRLNAEGQL--------GHGERCVDADKQEKLVEMLKLPVTFCVLAGV 198
CHG G QL R EGQL G RCV D+
Sbjct: 491 LHPCHGWG-PQLGRYTKEGQLFLGPLGSTGEDTRCVVDDQ-------------------- 529
Query: 199 KLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCD 258
L + L NC + W F + + A RCL V P L
Sbjct: 530 ------LSSFPQLINCDRVTNAKQMTWHFSQNESIVNQAT-GRCLEVVPANVYFGHLLIL 582
Query: 259 ENNSYQQWRFK 269
+ + Q+W K
Sbjct: 583 QPCTGQRWIIK 593
>gi|226482458|emb|CAX73828.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
Length = 603
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 96/235 (40%), Gaps = 42/235 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P LD GD++EQ +R++L CK F WFM VA+D+ K+P
Sbjct: 385 RVAEVWMDEYKEYLYKRRPSYRHLDPGDLTEQFKIRERLKCKSFKWFMTEVAFDLVKKYP 444
Query: 115 ELPP-NLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCH----GSGSSQLFRLNAEGQ-- 166
+ P ++ GE +++ CLD+ + + C G S Q F N
Sbjct: 445 LIEPISVAVGEIRSVSDSSLCLDATDASEYTPVKLRPCDKDNPGVSSVQKFEYNYRQDIR 504
Query: 167 -LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
L HG C+D L K +F L++C G + W
Sbjct: 505 VLNHGT-CLDEPG----------------LKNEKTVF--------LYSCHGQGG--NQRW 537
Query: 226 RFH------NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPD 274
+ ++ +L H CL + +++ CD + Q+W +++++ D
Sbjct: 538 KIQPVNNDKSNPIRLIHGANAGCLEADTKNRVVSVKPCDYGSFTQRWTWEKLQFD 592
>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
Length = 570
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 382 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 434
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 492
Query: 171 ERCV 174
+ C+
Sbjct: 493 DLCL 496
>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
Length = 526
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 338 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 390
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 391 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 448
Query: 171 ERCV 174
+ C+
Sbjct: 449 DLCL 452
>gi|26338209|dbj|BAC32790.1| unnamed protein product [Mus musculus]
Length = 570
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 382 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 434
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 492
Query: 171 ERCV 174
+ C+
Sbjct: 493 DLCL 496
>gi|226482456|emb|CAX73827.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
Length = 603
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 96/235 (40%), Gaps = 42/235 (17%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P LD GD++EQ +R++L CK F WFM VA+D+ K+P
Sbjct: 385 RVAEVWMDEYKEYLYKRRPSYRHLDPGDLTEQFKIRERLKCKSFKWFMTEVAFDLVKKYP 444
Query: 115 ELPP-NLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSYCH----GSGSSQLFRLNAEGQ-- 166
+ P ++ GE +++ CLD+ + + C G S Q F N
Sbjct: 445 LIEPISVAVGEIRSVSDSSLCLDATDASEYTPVKLRPCDKDNPGVSSVQKFEYNYRQDIR 504
Query: 167 -LGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
L HG C+D L K +F L++C G + W
Sbjct: 505 VLNHGT-CLDEPG----------------LKNEKTVF--------LYSCHGQGG--NQRW 537
Query: 226 RFH------NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRPD 274
+ ++ +L H CL + +++ CD + Q+W +++++ D
Sbjct: 538 KIQPVNNDKSNPIRLIHGANAGCLEADTKNRVVSVKPCDYGSFTQRWTWEKLQFD 592
>gi|195030214|ref|XP_001987963.1| GH10909 [Drosophila grimshawi]
gi|193903963|gb|EDW02830.1| GH10909 [Drosophila grimshawi]
Length = 668
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 82/204 (40%), Gaps = 36/204 (17%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLG-TQKCLD 135
+D GD++ Q A+R++L CK F WFMK VA D FP + PP G +++ CLD
Sbjct: 479 IDAGDLTRQRAVRERLQCKSFDWFMKEVAPDFLVHFPPVEPPPYASGAIQSVAYPNYCLD 538
Query: 136 SMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGH-GERCVDADKQEKLVEMLKLPV 190
SMG + +G C SQ + L+ +L C+D
Sbjct: 539 SMGLGSNNAVGMFSCAEDKESPQDSQNWVLSIHRELRRIDHECLDVQD------------ 586
Query: 191 TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAIL-KRCLSVHPET 249
+ + ++ C N G + W + + H + K+C+ E
Sbjct: 587 ------------SHVNATVWMWTCHNQGG--NQFWYYDREKYWIVHGLHGKQCMEAFVEN 632
Query: 250 N--QLAMLRCDENNSYQQWRFKEV 271
N ++ C+E N Q+W F V
Sbjct: 633 NVAKVCTNACEEGNVRQRWNFGMV 656
>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
musculus]
gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 570
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 382 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 434
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 435 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 492
Query: 171 ERCV 174
+ C+
Sbjct: 493 DLCL 496
>gi|344278311|ref|XP_003410938.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Loxodonta africana]
Length = 572
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 384 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 436
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L + + H
Sbjct: 437 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 494
Query: 171 ERCV 174
+ C+
Sbjct: 495 DLCL 498
>gi|67970320|dbj|BAE01503.1| unnamed protein product [Macaca fascicularis]
Length = 250
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 47 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 106
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 107 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 160
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G K +
Sbjct: 161 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV------------- 200
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
T + W + L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 201 ----TGEQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 248
>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
aries]
Length = 633
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + V LK C G K +
Sbjct: 544 FEYSAQREIRHNIQKELCLHAAQG---VVQLKA----CAYKGRKTV-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
T + W + L + K CLS E +++ C+ ++S Q+W F +
Sbjct: 583 ---ATGEQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDSLQKWIFNQ 631
>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
Length = 544
Score = 60.1 bits (144), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 356 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 408
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 409 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 466
Query: 171 ERCV 174
+ C+
Sbjct: 467 DLCL 470
>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 548
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 360 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 412
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 413 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 470
Query: 171 ERCV 174
+ C+
Sbjct: 471 DLCL 474
>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
(Silurana) tropicalis]
gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
Length = 582
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 71/145 (48%), Gaps = 16/145 (11%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K++ +P A + GDISE+ LR++L CK F W++K+V D++ P
Sbjct: 385 RAAEVWMDGYKELFYNRNPPARKENYGDISERKLLRERLQCKSFDWYLKNVFPDLH--IP 442
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP A + CHG G +Q F ++
Sbjct: 443 EDRPG-WHGAVRSMGISNECLDY---NAPDHNPTGAHLSLFGCHGQGGNQFFEYTTMREI 498
Query: 168 GHG---ERCVDADKQEKLVEMLKLP 189
E C + +Q+ + M+ P
Sbjct: 499 RFNSVTELCAEVPEQQTYIGMMHCP 523
>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
Length = 536
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 348 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 400
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 401 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 458
Query: 171 ERCV 174
+ C+
Sbjct: 459 DLCL 462
>gi|349605004|gb|AEQ00388.1| Polypeptide N-acetylgalactosaminyltransferase 3-like protein,
partial [Equus caballus]
Length = 337
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/222 (23%), Positives = 102/222 (45%), Gaps = 44/222 (19%)
Query: 60 WLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE 115
W+ +K+I + + + GD+S++ A++ +L CK+F+W++ ++ +VY P+
Sbjct: 147 WMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFAIKHRLQCKNFTWYLNNIYPEVYV--PD 204
Query: 116 LPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAEGQLGHG- 170
L P + G K+ G CLD +G P ++ T CHG G +Q F +A+ ++ H
Sbjct: 205 LNP-VISGYIKSFGQSLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQHEIRHNI 260
Query: 171 --ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFH 228
E C+ A + LV++ C G K + T + W
Sbjct: 261 QKELCLHA--AQGLVQLKA-----CTYKGHKTV-----------------ATGEQIWEIQ 296
Query: 229 NSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ L + L+ CL+ + E +++ C+ ++ Q+W F +
Sbjct: 297 KD-QLLYNPFLQMCLTANGE--HPSLVSCNPSDPLQKWIFSQ 335
>gi|109099998|ref|XP_001096023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Macaca mulatta]
gi|297264195|ref|XP_002798936.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Macaca mulatta]
gi|355564937|gb|EHH21426.1| hypothetical protein EGK_04492 [Macaca mulatta]
gi|355750584|gb|EHH54911.1| hypothetical protein EGM_04018 [Macaca fascicularis]
Length = 633
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 105/229 (45%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631
>gi|332234083|ref|XP_003266237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Nomascus leucogenys]
Length = 633
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G K +
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV------------- 583
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
T + W + L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 584 ----TGEQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631
>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
Length = 618
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++ +V +P
Sbjct: 385 RAAEVWMDEYKHFYYAAVPSARNVPYGNIQSRLELRKKLGCKPFKWYLDNV-------YP 437
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 438 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 495
Query: 171 ERCV 174
+ C+
Sbjct: 496 DLCL 499
>gi|402888519|ref|XP_003907606.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Papio
anubis]
Length = 633
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 105/229 (45%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631
>gi|125815709|ref|XP_001338018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Danio
rerio]
Length = 604
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 66/149 (44%), Gaps = 16/149 (10%)
Query: 29 GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
++ H ER I LR W+ FK + + M +D GD+SE
Sbjct: 374 ARVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMNNPGVDFGDVSE 433
Query: 86 QVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTA 141
+VALR K++CK F W+++HV +PE+ + +GE +N + CLD
Sbjct: 434 RVALRKKMNCKSFRWYLEHV-------YPEMRIYNNTITYGEVRNSKASGYCLDQGSEED 486
Query: 142 PAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R EG L G
Sbjct: 487 ERAILYP-CHGM-SSQLARYTTEGLLQLG 513
>gi|291391661|ref|XP_002712292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oryctolagus cuniculus]
Length = 633
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 70/135 (51%), Gaps = 17/135 (12%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++++L CK+F+W++ V +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKNRLQCKNFTWYLNTVYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY PEL P + G K +G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PELNP-VISGYIKTVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDA 176
++ H E C+ A
Sbjct: 550 NEIRHNIQKELCLHA 564
>gi|326918600|ref|XP_003205576.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Meleagris gallopavo]
Length = 303
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 38/74 (51%), Gaps = 1/74 (1%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR L CKDF WFM VA+DV +P
Sbjct: 228 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKHLKCKDFKWFMAAVAWDVPKYYP 287
Query: 115 EL-PPNLFWGEAKN 127
+ PP WGE +N
Sbjct: 288 PVEPPPAAWGEVRN 301
>gi|326430087|gb|EGD75657.1| hypothetical protein PTSG_07775 [Salpingoeca sp. ATCC 50818]
Length = 711
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 67/125 (53%), Gaps = 5/125 (4%)
Query: 53 ELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
+LR W+ +K+I + D GD+SE+ A+R++L+C F WF+++V D++
Sbjct: 418 DLRVAEVWMDEYKEI-VYDTRGFYGKDPGDLSERRAIRERLNCHSFKWFLENVHPDLWV- 475
Query: 113 FPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
P++ P F G + + C+D+M IG CHG ++Q +RL +G + +G
Sbjct: 476 -PDIHPK-FSGIISDPSKRICVDNMQHDRGGPIGMYGCHGM-TTQRWRLMNDGVISNGVA 532
Query: 173 CVDAD 177
C+ D
Sbjct: 533 CLRWD 537
>gi|341897758|gb|EGT53693.1| CBN-GLY-10 protein [Caenorhabditis brenneri]
Length = 620
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/220 (22%), Positives = 88/220 (40%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P D GD+ A+R+KL CK F WFMK +A+D +P + P
Sbjct: 410 WMDEYKETLYKHRPGVGNADAGDLKLMKAVREKLQCKSFDWFMKEIAFDQDKYYPAIEPK 469
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
GE ++ + C+D+ + G C G Q RL + R
Sbjct: 470 PSAQGEIRHAASNLCIDTQFKEQNQRFGLRKCTSDEKDGGGEQDLRLTRWHDIRPKGRKI 529
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C D + + K P+ LF+C + G + +++ + +
Sbjct: 530 CFDVSQS-----IDKAPII-------------------LFDCHSMKG--NQLFKYRVAAK 563
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
Q+ H + +CL+ L M +CD + Q+W ++ +
Sbjct: 564 QIYHPVSGQCLTGDENGKGFLHMKKCDSTSDLQKWTWQTI 603
>gi|308457549|ref|XP_003091148.1| CRE-GLY-10 protein [Caenorhabditis remanei]
gi|308258137|gb|EFP02090.1| CRE-GLY-10 protein [Caenorhabditis remanei]
Length = 620
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 51/220 (23%), Positives = 86/220 (39%), Gaps = 34/220 (15%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P D GD+ +R+KL CK F WFMK +A+D +P + P
Sbjct: 410 WMDEYKETLYKHRPGVGSADAGDLKLMKGVREKLQCKSFDWFMKEIAFDQDKYYPAVEPK 469
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER-- 172
GE ++ + C+D+ + G C G Q RL + R
Sbjct: 470 ASAQGEIRHGASNLCIDTQFKEQNQRFGLRKCTSDDKDGGGEQDLRLTRWHDVRPKGRKI 529
Query: 173 CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTR 232
C D + K P+ LF+C + G + +++ S +
Sbjct: 530 CFDVS-----TSVDKAPII-------------------LFDCHSMKG--NQLFKYRVSQK 563
Query: 233 QLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKEV 271
Q+ H + +CLS L M +CD + Q+W ++ +
Sbjct: 564 QIYHPVSGQCLSADENGKGFLYMKKCDSTSDLQKWTWQTI 603
>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Callithrix jacchus]
Length = 633
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 105/229 (45%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKTFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGHPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + L CLS + E +++ C+ ++S Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFLNMCLSANGE--HPSLVSCNPSDSLQKWIFSQ 631
>gi|300797404|ref|NP_001179787.1| polypeptide N-acetylgalactosaminyltransferase 3 [Bos taurus]
Length = 633
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 103/235 (43%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGRPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A L V+L C +
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA-----------------ALGAVQLKAC-------AYKGH 579
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ + W + L + K CLS E +++ C+ ++S Q+W F +
Sbjct: 580 KTVAIGEQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDSLQKWIFNQ 631
>gi|401399841|ref|XP_003880648.1| putative UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3 [Neospora caninum
Liverpool]
gi|325115059|emb|CBZ50615.1| putative UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3 [Neospora caninum
Liverpool]
Length = 635
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/235 (26%), Positives = 100/235 (42%), Gaps = 46/235 (19%)
Query: 46 STQKSSLELRKRSS--WLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWF 101
S+ S+ + K + W+ + + R +G P + + ++ R + CK F WF
Sbjct: 424 SSPGDSIAINKMRTMMWMDEYADLAWRVIGQPKVNYRPE-SLEKRREWRKRKGCKSFRWF 482
Query: 102 MKHV--AYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPA-VIGTSYCHGSGSSQL 158
M++V DV T ++P + G +N + CLD+MG +P +G CHG G +Q
Sbjct: 483 MENVFPEGDVVT-LDDVP---YLGPLRNAKVEMCLDNMGWASPGHAVGLGRCHG-GETQT 537
Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
F ++GH D + C+ KL +CR
Sbjct: 538 FMFFR--KVGHVMPVNDDEA--------------CLQPSGKLDWCR-------------- 567
Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVRP 273
GT W F S+ QL K+CLS QL M+ CD+ ++YQ W +K +P
Sbjct: 568 GTSQFWWDF-TSSGQLMFRETKQCLSAF--GRQLRMVDCDDTDAYQIWSWKAYKP 619
>gi|327262105|ref|XP_003215866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Anolis carolinensis]
Length = 575
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/219 (25%), Positives = 94/219 (42%), Gaps = 41/219 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ L+ +L+CK F W++++V +P
Sbjct: 387 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRLNCKPFKWYLENV-------YP 439
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L + + H
Sbjct: 440 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 497
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
+ C L + + P G+++ L C + G W S
Sbjct: 498 DLC--------LTVVDRAP----------------GSLIKLQGCRENDGR--QKWEQIES 531
Query: 231 TRQLEHAILKRCL-SVHPETNQLAMLRCDENNSYQQWRF 268
+L H CL S + + L + C+ + S QQW+F
Sbjct: 532 NAKLRHVGSNLCLDSRNAKNGGLTVEVCNPSLS-QQWKF 569
>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
Length = 455
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 58/110 (52%), Gaps = 5/110 (4%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ + + P A + GDIS+++ LR+KL C+ F W+M++V ++ P
Sbjct: 313 RAAEAWMDGYAKFYYKREPGARGVPYGDISDRLKLREKLKCRSFKWYMRNVYPEL--NVP 370
Query: 115 ELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAE 164
E + F GE + G KCLDS+G + T CHG G +Q + + +
Sbjct: 371 EGVNDKF-GELRQGG--KCLDSIGGKPGDRVSTFPCHGGGGNQAWDMTKD 417
>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
gallus]
Length = 566
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR +L CK F W++++V +P
Sbjct: 378 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENV-------YP 430
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L + + H
Sbjct: 431 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 488
Query: 171 ERCV 174
+ C+
Sbjct: 489 DLCL 492
>gi|47199070|emb|CAF89237.1| unnamed protein product [Tetraodon nigroviridis]
Length = 129
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/107 (35%), Positives = 54/107 (50%), Gaps = 12/107 (11%)
Query: 80 MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLG-TQKCLDSMG 138
GD++E+ LR+KL CK F WF+K++ D++ PE P +F G KN G C D
Sbjct: 5 FGDVTERRKLREKLGCKSFGWFLKNIYPDLHV--PEDNPGMF-GMLKNRGKADHCFDYNP 61
Query: 139 RTAPAVIGTSY----CHGSGSSQLFRLNAEGQLGHGER----CVDAD 177
V+G CHG G +Q F + +G+L + R CV D
Sbjct: 62 TDDDVVVGERVILYPCHGMGQNQFFEYSKDGELRYNTRTPAGCVMGD 108
>gi|326428122|gb|EGD73692.1| glycosyl transferase [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 51/205 (24%), Positives = 80/205 (39%), Gaps = 51/205 (24%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYD-VYTKFPELPPNLFWGEAKNLGTQKCLDSM 137
D+G + + L+ +L CK F W++K+V + + T ++ G KN C+D+M
Sbjct: 393 DIGPLDHMLELKQQLQCKSFDWYLKNVYPEGIITDLSDIRA---LGTVKNTANNMCVDNM 449
Query: 138 GRT-APAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
A A +G CHG GS TF +LA
Sbjct: 450 QHMYADAKMGAYACHGHGSQ----------------------------------TFLILA 475
Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTV-------DGPWRFHNSTRQLEHAILKRCLSVHPET 249
K + V NL C S T+ D W + + L++ ++CL+V +
Sbjct: 476 RTK----EIRPVGNLELCLTSDTTISWCESRHDVTWDYDQANMLLKNTKTQKCLAVG-DG 530
Query: 250 NQLAMLRCDENNSYQQWRFKEVRPD 274
N L CD +N +W F + PD
Sbjct: 531 NNLVTEDCDLHNDRHKWTFDDKHPD 555
>gi|395836156|ref|XP_003791031.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Otolemur garnettii]
Length = 571
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 383 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 435
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G V+G CH +G +Q + L E + H
Sbjct: 436 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFTDGVVGVYECHNAGGNQEWALTKEKAVKHI 493
Query: 171 ERCV 174
+ C+
Sbjct: 494 DLCL 497
>gi|395844920|ref|XP_003795196.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Otolemur garnettii]
Length = 633
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/231 (23%), Positives = 102/231 (44%), Gaps = 48/231 (20%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K++G CLD +G P ++ T CHG G +Q F ++
Sbjct: 496 VYV--PDLNP-VISGYIKSIGKPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSSL 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFNCFNSSG 219
++ H E C+ A K PV C G K +
Sbjct: 550 REIRHNIQKELCLHAAKG---------PVQLKACAYKGHKTVV----------------- 583
Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
T++ W + L + + K CLS + E L + C+ ++ Q+W F +
Sbjct: 584 TIEQIWEIQKD-QLLYNPLFKMCLSANGEHPNL--VSCNPSDPLQKWIFSQ 631
>gi|47216162|emb|CAG10036.1| unnamed protein product [Tetraodon nigroviridis]
Length = 534
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 100/233 (42%), Gaps = 36/233 (15%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +KQ +P A GDIS ++ LR+KL C+ F+W++K++ +
Sbjct: 321 VRAAEVWMDSYKQHFYNRNPAARKETYGDISGRLLLREKLKCQSFTWYLKNI-------Y 373
Query: 114 PEL--PPNL--FWGEAKNLG-TQKCLDSMGRTAP--AVIGTSY----CHGSGSSQLFRLN 162
PEL P + + G +NLG + +CLD AP +V G CHG G +Q F
Sbjct: 374 PELHIPEDRAGWHGAVRNLGISSECLDY---NAPEHSVTGAQLSLFGCHGQGGNQYFEYT 430
Query: 163 AEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSG 219
++ ++ E C + + + V M P L R +L ++
Sbjct: 431 SQKEIRFNTVTELCAEVLEGKVAVGMRHCPGDGESRPASILWEFRQVRPPSLRKERENNE 490
Query: 220 TVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLA---MLRCDENNSYQQWRFK 269
DG + H C++V+ + A M RCD + QQW+F+
Sbjct: 491 KPDG---------SIYHPHTDTCITVYRTSTGRADAKMARCDPGDKTQQWKFE 534
>gi|195030212|ref|XP_001987962.1| GH10908 [Drosophila grimshawi]
gi|193903962|gb|EDW02829.1| GH10908 [Drosophila grimshawi]
Length = 684
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 44/81 (54%), Gaps = 2/81 (2%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL-PPNLFWGEAKNLG-TQKCLD 135
+D GD++ Q A+R++L CK F WFMK VA D FP + PP G +++ CLD
Sbjct: 495 IDAGDLTRQRAVRERLQCKSFDWFMKEVAPDFLVHFPPVEPPPYASGAIQSVAYPNYCLD 554
Query: 136 SMGRTAPAVIGTSYCHGSGSS 156
SMG+ +G C + S
Sbjct: 555 SMGKGWKHAVGLFTCAQNKKS 575
>gi|169790935|ref|NP_938048.2| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Mus
musculus]
gi|148688066|gb|EDL20013.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9, isoform CRA_a [Mus
musculus]
Length = 604
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 16/148 (10%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
+ALR+KL C+ F W++++V +PE+ L +GE +N + CLD
Sbjct: 435 LALREKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 487
Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R +AEG L G
Sbjct: 488 RAILYP-CHGM-SSQLVRYSAEGLLQLG 513
>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
Length = 518
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 32/116 (27%), Positives = 55/116 (47%), Gaps = 3/116 (2%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P GDIS+++ LR KL CK+F W++K+V +++ +
Sbjct: 328 WMDGYKRYFYKRQPHLKGHPFGDISKRLELRKKLKCKNFDWYIKNVVPEIFLPNSSI--- 384
Query: 120 LFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
+ GE +N + C+DS+G A IG CH +Q ++ + C D
Sbjct: 385 IARGELRNPASGDCIDSLGAGEHAYIGIYKCHKQMGNQYLVYTKNEEIIVDDNCFD 440
>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Taeniopygia guttata]
Length = 669
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR +L CK F W++++V +P
Sbjct: 481 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRMELRKRLSCKPFKWYLENV-------YP 533
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L + + H
Sbjct: 534 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 591
Query: 171 ERCV 174
+ C+
Sbjct: 592 DLCL 595
>gi|169790937|ref|NP_001116111.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform B [Mus
musculus]
gi|74142966|dbj|BAE42509.1| unnamed protein product [Mus musculus]
gi|148688067|gb|EDL20014.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9, isoform CRA_b [Mus
musculus]
Length = 237
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 68/148 (45%), Gaps = 16/148 (10%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 8 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
+ALR+KL C+ F W++++V +PE+ L +GE +N + CLD
Sbjct: 68 LALREKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 120
Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R +AEG L G
Sbjct: 121 RAILYP-CHGM-SSQLVRYSAEGLLQLG 146
>gi|327283203|ref|XP_003226331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Anolis carolinensis]
Length = 644
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 66/145 (45%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 377 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMMNPGVDFGDVSER 436
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAK-NLGTQKCLDSMGRTAPAVI 145
VALR +L C+ F W+++ +VY++ + +GE + N + CLD I
Sbjct: 437 VALRQRLQCRSFKWYLE----NVYSEMRIYNNTITYGEVRNNKASGYCLDQGAEDDDKAI 492
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R ++EG L G
Sbjct: 493 LYP-CHGM-SSQLVRYSSEGLLQLG 515
>gi|190358441|ref|NP_001121823.1| polypeptide N-acetylgalactosaminyltransferase 2 [Danio rerio]
Length = 559
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 57/122 (46%), Gaps = 9/122 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK P A + G+I ++ ++ +L CK F W++++V +P
Sbjct: 371 RAAEVWMDDFKNFYYAAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLENV-------YP 423
Query: 115 EL--PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGER 172
EL P + Q CLD++G A V+G CH +G +Q + L + + H +
Sbjct: 424 ELRVPDHQDIAFGALQQGQNCLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHMDL 483
Query: 173 CV 174
C+
Sbjct: 484 CL 485
>gi|21912817|gb|AAM81087.1|AF321881_1 UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1 [Toxoplasma gondii]
gi|21912813|gb|AAM81085.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1 [Toxoplasma gondii]
gi|221487005|gb|EEE25251.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1, putative
[Toxoplasma gondii GT1]
Length = 751
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 13/112 (11%)
Query: 53 ELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+LR W+ + +I L P L +GD++E ALRD+L CKDF+W++K++
Sbjct: 543 KLRTAHVWMGEYARIVELVIPRLPQDKPLGDLTELKALRDRLKCKDFNWYLKNI------ 596
Query: 112 KFPEL-PPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
+PEL PPNL G +N CLD++ T IG CH +Q F
Sbjct: 597 -YPELEPPNLAHAMTGAMRNPKFNCCLDTL-TTKNQEIGVYPCHFEHGTQAF 646
>gi|431895640|gb|ELK05066.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Pteropus alecto]
Length = 367
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR L CK F W++++V +P
Sbjct: 179 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKTLACKPFKWYLENV-------YP 231
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 232 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKEKSVKHM 289
Query: 171 ERCV 174
+ C+
Sbjct: 290 DLCL 293
>gi|237831699|ref|XP_002365147.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1, putative
[Toxoplasma gondii ME49]
gi|211962811|gb|EEA98006.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1, putative
[Toxoplasma gondii ME49]
gi|221506688|gb|EEE32305.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1, putative
[Toxoplasma gondii VEG]
Length = 751
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 13/112 (11%)
Query: 53 ELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+LR W+ + +I L P L +GD++E ALRD+L CKDF+W++K++
Sbjct: 543 KLRTAHVWMGEYARIVELVIPRLPQDKPLGDLTELKALRDRLKCKDFNWYLKNI------ 596
Query: 112 KFPEL-PPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
+PEL PPNL G +N CLD++ T IG CH +Q F
Sbjct: 597 -YPELEPPNLAHAMTGAMRNPKFNCCLDTL-TTKNQEIGVYPCHFEHGTQAF 646
>gi|301604565|ref|XP_002931923.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Xenopus (Silurana) tropicalis]
Length = 604
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
VALR KL C+ F W+++ +VY + + +GE +N T CLD I
Sbjct: 435 VALRQKLQCRSFKWYLE----NVYPEMRVYNNTITYGELRNSKATGYCLDQGSEDDNKAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R ++EG L G
Sbjct: 491 LYP-CHGM-SSQLVRYSSEGLLQLG 513
>gi|56756104|gb|AAW26230.1| SJCHGC09400 protein [Schistosoma japonicum]
Length = 737
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 38/134 (28%), Positives = 66/134 (49%), Gaps = 15/134 (11%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
LR WL +K+ M +D+G++SE+ ALR+KL C F W++ ++ +
Sbjct: 496 LRLADVWLDDYKRFYHARIGFEM-VDIGNVSERKALREKLKCHSFDWYLTNI-------Y 547
Query: 114 PEL---PPNLFWGEAKNLGTQKCLD----SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQ 166
PEL L G+ ++ CLD S ++ +I T CH G +Q + L++E +
Sbjct: 548 PELFVPSKALASGDIESAAGPHCLDAPLPSENDSSSVIIKTRPCHKQGGNQFWLLSSENE 607
Query: 167 LGHGERCVDADKQE 180
+ + C D+ Q+
Sbjct: 608 IRRDDYCFDSGIQK 621
>gi|358332242|dbj|GAA50924.1| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 403
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 89/192 (46%), Gaps = 37/192 (19%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDS- 136
+D GD+SE+ A+ ++L C F W++K +VY +F ++ G+ ++ C+DS
Sbjct: 239 IDFGDVSERKAILERLKCHSFDWYLK----NVYPEFFVPTDSVAHGDIESEAGPHCIDSP 294
Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
+ ++G CH G +Q + ++ G++ +C D A
Sbjct: 295 LKGDGKVIVGMWPCHREGGNQYWLMSKLGEIRRDNKCWD--------------------A 334
Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLR 256
G++ +G V LF+C G + S +++H +RC+++ + L+M
Sbjct: 335 GIE-----VGRVA-LFDCHGVRGNQHF---VYTSNNEIKHG--ERCVTLSEDNKSLSMEY 383
Query: 257 CDENNSYQQWRF 268
C N++ Q+W+F
Sbjct: 384 CS-NSARQRWKF 394
>gi|158300338|ref|XP_001238288.2| AGAP012253-PA [Anopheles gambiae str. PEST]
gi|157013111|gb|EAU75785.2| AGAP012253-PA [Anopheles gambiae str. PEST]
Length = 593
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 55/232 (23%), Positives = 92/232 (39%), Gaps = 48/232 (20%)
Query: 54 LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+R W+ +K+I G P + + G ++ + A+R+ CK FS+++++
Sbjct: 396 IRLAEVWMDEYKRIIFDIYGIPHYLEEEFGSVATRKAIRESAKCKPFSYYLENA------ 449
Query: 112 KFPE----LPPNLFWGEAKN--LGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
FPE L P F GE N LG CL RT +G + C Q + N
Sbjct: 450 -FPEMHNPLVPGAFRGEVHNMALGNGSCLTY--RTRDRFLGMAPCDHLEKDQYWTHNYYQ 506
Query: 166 QLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
+L C+DA +GTV+ ++ C S G + W
Sbjct: 507 ELNSYRNCIDA----------------------------VGTVVEVYQCHRSRG--NQAW 536
Query: 226 RFHNSTRQLEHAILKRCLSVHPETNQLAML-RCDENNSYQQWRFKEVRPDWS 276
+ ++Q+ CL+++ +T +L +CD QQW + D S
Sbjct: 537 KVLVESQQILSVARNLCLALNLQTKTTLLLEKCDATKPSQQWNVSYIELDVS 588
>gi|301614636|ref|XP_002936794.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Xenopus (Silurana) tropicalis]
Length = 625
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 55/95 (57%), Gaps = 8/95 (8%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
+ GD+S+++ LR +L CK+F+W++ ++ ++Y PE P L G+ KN+G CLD G
Sbjct: 458 EYGDLSKRLDLRHRLQCKNFTWYLNNIYPEMYV--PERHP-LIHGDLKNVGRDLCLDVGG 514
Query: 139 RT---APAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
P ++ + CHG G +Q F ++ ++ H
Sbjct: 515 ENHGDKPLIMYS--CHGLGGNQYFEYTSKHEIRHN 547
>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Cavia porcellus]
Length = 531
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR +L C+ F W++++V +P
Sbjct: 343 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCRPFQWYLENV-------YP 395
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 396 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 453
Query: 171 ERCV 174
+ C+
Sbjct: 454 DLCL 457
>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
Length = 415
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 49/83 (59%), Gaps = 11/83 (13%)
Query: 82 DISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL--PPNLFW--GEAKNLGTQKCLDSM 137
D+S+++ALR+KL CK+F W++ +V +PEL P F G+ +N + CLD++
Sbjct: 339 DVSQRLALREKLKCKNFKWYLDNV-------YPELEVPDTNFAASGQVRNPSSDMCLDTL 391
Query: 138 GRTAPAVIGTSYCHGSGSSQLFR 160
G+ +G CHG G +Q+ R
Sbjct: 392 GKKDDTPLGLYQCHGQGGNQVKR 414
>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Ornithorhynchus anatinus]
Length = 628
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR +L CK F W++++V +P
Sbjct: 440 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLSCKPFKWYLENV-------YP 492
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L + + H
Sbjct: 493 ELRVPDHQDIAFG-ALQQGT-NCLDTLGHFADGVVGVYECHNAGGNQEWALTKDRSVKHM 550
Query: 171 ERCV 174
+ C+
Sbjct: 551 DLCL 554
>gi|397507787|ref|XP_003824367.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
paniscus]
Length = 633
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 106/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQSLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G K +
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KGCTYKGHKTVV------------- 583
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
T + W + L + K CLS + E +++ C+ ++ Q+W F +
Sbjct: 584 ----TGEQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWVFSQ 631
>gi|390347275|ref|XP_003726736.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 507
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 38/123 (30%), Positives = 60/123 (48%), Gaps = 14/123 (11%)
Query: 72 SPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPE--LPPNL-FWGEAKNL 128
+P +D GDIS ++ +RD+L CK+F W++++V +PE LP N G N
Sbjct: 319 TPEYQAVDAGDISSRIRIRDELKCKNFEWYLENV-------YPENILPLNFQALGRFMNE 371
Query: 129 GTQKCLDSM----GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVE 184
G C+D + GR A + + C +Q F N QL H C+ A + + V
Sbjct: 372 GVNLCIDVLHATDGRRMGAHLAVNACREGALAQTFSWNDLSQLRHDRFCITAVEGDNHVM 431
Query: 185 MLK 187
+L+
Sbjct: 432 LLE 434
>gi|157114758|ref|XP_001652407.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883560|gb|EAT47785.1| AAEL001146-PA [Aedes aegypti]
Length = 552
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 84/218 (38%), Gaps = 38/218 (17%)
Query: 54 LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+R W+ + Q+ G P + GD+S++ LR+ L+CK F W++++
Sbjct: 364 VRVAEVWMDQYAQVVYDMFGGP-EFRGNFGDVSDRKKLRESLNCKSFKWYLENA------ 416
Query: 112 KFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
FPEL + +G G+ +G ++C R G E
Sbjct: 417 -FPELEDPVSYGVGH-----------GKFTNLGVGKNFCP--------RYRKAGYTFRME 456
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
C D D Q + ML T V G L +F C G + WR+ ST
Sbjct: 457 PCTDDDYQHWVHNMLGEISTSNVCLDYD------GITLYMFECHKGQG--NQKWRYSKST 508
Query: 232 RQLEHAILKRCLSVHPETN-QLAMLRCDENNSYQQWRF 268
+Q + K CL V P +L +C+ Q+W F
Sbjct: 509 KQFTNVKNKVCLDVGPAPEMKLVAEKCNATKESQKWEF 546
>gi|348534088|ref|XP_003454535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oreochromis niloticus]
Length = 559
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ ++ +L+CK F W++++V +P
Sbjct: 371 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLEMKKRLNCKPFKWYLENV-------YP 423
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G + G CLD++G A V+G CH +G +Q + L + + H
Sbjct: 424 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 481
Query: 171 ERCV 174
+ C+
Sbjct: 482 DLCL 485
>gi|432098371|gb|ELK28171.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Myotis davidii]
Length = 633
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 52/228 (22%), Positives = 100/228 (43%), Gaps = 42/228 (18%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEG 165
VY P+L P + G K+ G CLD + + P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSFGQPLCLDVGENNQGSKPLILYT--CHGLGGNQYFEYSAQH 550
Query: 166 QLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
++ H E C+ A + + C G K + T +
Sbjct: 551 EIRHNIQKELCLHAAPG-------PVQLKTCTYKGHKTV-----------------ATGE 586
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
W L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 587 QIWEIQKDLL-LYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631
>gi|440897124|gb|ELR48890.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Bos
grunniens mutus]
Length = 499
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 56/119 (47%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR +L CKDF WF+ V +++
Sbjct: 299 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERRQLRARLGCKDFKWFLNTVYPELHV-- 356
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F G +N G + C D + G CHG G +Q F ++ ++
Sbjct: 357 PEDRPGFF-GMLQNKGLRDYCFDYNPPNEHEITGHQVILYRCHGMGQNQFFEYTSQNEI 414
>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
glaber]
Length = 567
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR +L C+ F W++++V +P
Sbjct: 379 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKRLGCQPFQWYLENV-------YP 431
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L E + H
Sbjct: 432 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGIYECHNAGGNQEWALTKEKSVKHM 489
Query: 171 ERCV 174
+ C+
Sbjct: 490 DLCL 493
>gi|426222263|ref|XP_004005316.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Ovis aries]
Length = 467
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 8/122 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR +L CKDF WF+ V +++
Sbjct: 267 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERRQLRARLGCKDFKWFLNTVYPELHV-- 324
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQLG 168
PE P F G +N G + C D + G CHG G +Q F ++ ++
Sbjct: 325 PEDRPGFF-GMLQNKGLKDYCFDYNPPNEHEITGHQVILYRCHGMGQNQFFEYTSQNEIR 383
Query: 169 HG 170
+
Sbjct: 384 YN 385
>gi|313234048|emb|CBY19624.1| unnamed protein product [Oikopleura dioica]
Length = 827
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 62/112 (55%), Gaps = 10/112 (8%)
Query: 54 LRKRSSWLKVFKQIR-SLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDV-YT 111
+R WL FK++ G+P+ +D G++S ++ LR+ L C+ FSW+M++V ++ YT
Sbjct: 697 IRVADVWLDEFKEVYFRRGAPILKNIDPGNMSSRIQLRENLQCQPFSWYMENVLPELDYT 756
Query: 112 KFPELPPNLFWGE----AKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
+L +F GE A+N ++C DS G+ A I CHG +Q +
Sbjct: 757 MNDDL---IFAGEIISQARNRTNRQCFDSTGKDN-AQIQIFPCHGLLGNQYY 804
>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Ailuropoda melanoleuca]
Length = 578
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 84/179 (46%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L C+ F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAVRSMGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+K V M P F + A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKKYVGMQNCPKDGFPIPANIIWHFKEDGTIFHPHSGLCLSAFRTPEG 556
>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
Length = 491
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/164 (27%), Positives = 77/164 (46%), Gaps = 11/164 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L C+ F W++K+V +++ P
Sbjct: 320 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLKCQSFDWYLKNVFSNLHV--P 377
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD A + CHG G +Q F + ++
Sbjct: 378 EDRPG-WHGAVRSMGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 436
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLN 210
E C + +Q+K V M P F + A + F GT+ +
Sbjct: 437 SVTELCAEVPEQKKYVGMQNCPKDGFPIPANIIWHFKEDGTIFH 480
>gi|114581503|ref|XP_515871.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
troglodytes]
gi|410331347|gb|JAA34620.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Pan
troglodytes]
Length = 633
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 106/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G K +
Sbjct: 544 FEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KGCTYKGHKTVV------------- 583
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
T + W + L + K CLS + E +++ C+ ++ Q+W F +
Sbjct: 584 ----TGEQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWVFSQ 631
>gi|326436017|gb|EGD81587.1| polypeptide N-acetylgalactosaminyltransferase 14 [Salpingoeca sp.
ATCC 50818]
Length = 645
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 94/229 (41%), Gaps = 34/229 (14%)
Query: 53 ELRKRSSWLKVFKQI-RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+LR W+ +K I + + SPL +G + +R + +CK F W++++V +++
Sbjct: 442 KLRAAEVWMDEYKDIVKRVMSPLPRGKTLGPLDMMFDIRKRHNCKPFKWYLENVYPEMFV 501
Query: 112 KFPELPPNLFW-GEAKNLGTQKCLDSMG-RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
P P ++ GE +N T C D++G R A IG CH S +Q F L+ G
Sbjct: 502 --PYDPDHVVASGEVRNPHTDACFDTLGARHQGARIGAYPCHHSHGTQEFVLSTGGD--- 556
Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
+ + + C+ G + ++ C + G W +
Sbjct: 557 -------------IRVAAMDFDTCLDRG------NGDGSIGIWPCHQTGGNQK--WNYDP 595
Query: 230 STRQLE----HAILKRCLSVHPETN-QLAMLRCDENNSYQQWRFKEVRP 273
+ +L H + + P + +L + CD N QQW FKE P
Sbjct: 596 KSGRLSDGDGHICAEAQRTATPSSPFKLVLSPCDLTNPAQQWIFKEAEP 644
>gi|281343220|gb|EFB18804.1| hypothetical protein PANDA_020124 [Ailuropoda melanoleuca]
Length = 275
Score = 58.2 bits (139), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R +W+ F + P L GDIS Q LR +L CKDF WFM VA+DV +P
Sbjct: 203 RVAETWMDEFAEYIYQRRPEYRHLSTGDISAQKELRKQLKCKDFKWFMAAVAWDVPKYYP 262
Query: 115 EL-PPNLFWGEAK 126
+ PP WGE +
Sbjct: 263 PVEPPPAAWGEVR 275
>gi|301775621|ref|XP_002923231.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Ailuropoda melanoleuca]
Length = 601
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDSFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W++ +VY + L +GE +N + CLD I
Sbjct: 435 LALRQRLKCRSFKWYLD----NVYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +AEG L G
Sbjct: 491 LYP-CHGM-SSQLVRYSAEGLLQLG 513
>gi|301608341|ref|XP_002933751.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Xenopus (Silurana) tropicalis]
Length = 586
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/231 (23%), Positives = 98/231 (42%), Gaps = 49/231 (21%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K + + +A GDIS+++ L+ L CK+F+W+++++ +
Sbjct: 389 QVRLAEVWMDDYKVLYYRRNEQAAKIAKEKSFGDISKRLKLKADLQCKNFTWYLENIYPE 448
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLD-----SMGRTAPAVIGTSYCHGSGSSQLFRLNA 163
++ P+ P ++G KN GTQ C+D + G P + CHG G +Q F +
Sbjct: 449 MFV--PDRDPT-YYGAIKNEGTQNCIDVGENNNYGSQLPIMYP---CHGMGGNQYFEYST 502
Query: 164 EGQLGHGERCVDADKQEKLVEMLKLPVTFCV---LAGVKLIFCRLGTVLNLFNCFNSSGT 220
+L H LK + C VKL+ C+ + N++
Sbjct: 503 HKELRHN---------------LKTQLCLCSKYEPGPVKLVDCQ-------YKGQNTTVP 540
Query: 221 VDGPWRFHNSTRQLEHAILKRCLSV---HPETNQLAMLRCDENNSYQQWRF 268
+ W F R + + CL+ HP T + C+ ++ Q W F
Sbjct: 541 GNEEWEF-TKDRLIRNVGSNMCLTAQGDHPSTAE-----CNPDDIDQLWSF 585
>gi|281340802|gb|EFB16386.1| hypothetical protein PANDA_012334 [Ailuropoda melanoleuca]
Length = 600
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDSFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W++ +VY + L +GE +N + CLD I
Sbjct: 435 LALRQRLKCRSFKWYLD----NVYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +AEG L G
Sbjct: 491 LYP-CHGM-SSQLVRYSAEGLLQLG 513
>gi|154152027|ref|NP_001093795.1| polypeptide N-acetylgalactosaminyltransferase 12 [Bos taurus]
gi|151553796|gb|AAI48122.1| GALNT12 protein [Bos taurus]
Length = 579
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/119 (29%), Positives = 57/119 (47%), Gaps = 8/119 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR +L CKDF WF+ V +++
Sbjct: 379 VRAAEVWMDEFKELYYHRNPQARLEPFGDVTERRQLRARLGCKDFKWFLNTVYPELHV-- 436
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAEGQL 167
PE P F+G +N G + C D + G CHG G +Q F ++ ++
Sbjct: 437 PEDRPG-FFGMLQNKGLRDYCFDYNPPNEHEITGHQVILYRCHGMGQNQFFEYTSQNEI 494
>gi|153266878|ref|NP_004473.2| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
gi|209572629|sp|Q14435.2|GALT3_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|62822129|gb|AAY14678.1| unknown [Homo sapiens]
gi|109731077|gb|AAI13568.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|109731742|gb|AAI13566.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|119631729|gb|EAX11324.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3), isoform
CRA_b [Homo sapiens]
gi|313883200|gb|ADR83086.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3)
[synthetic construct]
Length = 633
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 103/225 (45%), Gaps = 44/225 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ W + L + LK CLS + E +++ C+ ++ Q+W
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKW 627
>gi|74004468|ref|XP_535940.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Canis lupus familiaris]
Length = 632
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 53/232 (22%), Positives = 106/232 (45%), Gaps = 34/232 (14%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 429 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 488
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ + Y P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 489 NTIYPEAYV--PDLNP-VISGYIKSIGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 542
Query: 159 FRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSS 218
F +A+ ++ H + Q++L C+ A L+ R T + +
Sbjct: 543 FEYSAQHEIRH-------NIQKEL----------CLHAAQGLVQLRACT----YKGHRTV 581
Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ + W + L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 582 ASGEQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 630
>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
Length = 442
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 37/126 (29%), Positives = 63/126 (50%), Gaps = 15/126 (11%)
Query: 39 DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDF 98
DA K+++ + E+ W+ +K+I P DMGD++++ LR LHC+DF
Sbjct: 325 DAWKKLVHNNRRMAEV-----WMDDYKEIYYRKHPEYRKYDMGDVTQRKLLRKGLHCRDF 379
Query: 99 SWFMKHVAYDVYTKFPELPPNLFWGEAKNLG------TQKCLDSMGRTAPAVIGTSYCHG 152
SW++ HV +Y P++ P + G+ ++ T CLD + + G CHG
Sbjct: 380 SWYLSHVFPTLYV--PDIRP-IAHGQVSHVTSISSEQTGLCLDVI-KAGKEPAGVFPCHG 435
Query: 153 SGSSQL 158
G +Q+
Sbjct: 436 KGGTQV 441
>gi|291290949|ref|NP_001167507.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Xenopus
laevis]
gi|83405263|gb|AAI10707.1| Unknown (protein for MGC:130697) [Xenopus laevis]
Length = 622
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 4/128 (3%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVIS + L + ++ + + + GD+S+++ L+ LHCK+F+W++
Sbjct: 420 QVISRNQVRLAEVWMDDYKIIYYRRNDQAAKMVKEKSFGDVSKRLKLKADLHCKNFTWYL 479
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
+++ +++ P+ P + G KN G QKCLD Y CHG G +Q F
Sbjct: 480 ENIYPELFV--PDRDPT-YSGAVKNEGAQKCLDVGENNHGGKPLIMYPCHGMGGNQYFEY 536
Query: 162 NAEGQLGH 169
+ +L H
Sbjct: 537 STHKELRH 544
>gi|1617312|emb|CAA63371.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
(GalNAc-T3) [Homo sapiens]
Length = 633
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 103/225 (45%), Gaps = 44/225 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLRCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ W + L + LK CLS + E +++ C+ ++ Q+W
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKW 627
>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
Length = 721
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 54/242 (22%), Positives = 97/242 (40%), Gaps = 60/242 (24%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGT--QKC 133
D GD+S++ LR L CK F W++ ++ +PEL ++ GE +NLG + C
Sbjct: 482 DWGDVSDRRKLRTDLKCKSFKWYLDNI-------YPELFIPGDSVAHGEIRNLGYGGRTC 534
Query: 134 LDSMG--RTAPAVIGTSYCHGSGSSQLFR------LNAEGQL------------GHGERC 173
LD+ + +GT CH G +Q+ L+A+ + G G
Sbjct: 535 LDAPAGKKHQKKAVGTYPCHRQGGNQIANVPKGMCLDAKEKSEEETPVSVYECHGQGGNQ 594
Query: 174 VDAD-------------KQEKLVEMLKLPVTFCVLAGVKLIFCRLGTV------------ 208
V A E L+ + + + + + + G +
Sbjct: 595 VSASMSTSSELRKAGGGDSESLIPGFSISLLYGFIFQSYWMLSKAGEIRRDDSCLDYAGK 654
Query: 209 -LNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWR 267
+ LF C G + W + +T+QL H +CL++ ++L M C+++ + QQW
Sbjct: 655 DVTLFGCHGGKG--NQFWTYRENTKQLHHGTSGKCLAISESKDKLLMEECNQSLNRQQWT 712
Query: 268 FK 269
+
Sbjct: 713 LE 714
>gi|315139010|ref|NP_001186711.1| POC1B-GALNT4 protein isoform 2 [Homo sapiens]
gi|194387348|dbj|BAG60038.1| unnamed protein product [Homo sapiens]
Length = 406
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 81/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 209 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 266
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 267 EDRPGWH-GAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 325
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 326 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 384
>gi|405967231|gb|EKC32417.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 570
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/198 (21%), Positives = 80/198 (40%), Gaps = 33/198 (16%)
Query: 76 MFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLD 135
+F D GD++++ LR++L C F WF+K+V D++ + +AK + +D
Sbjct: 399 LFGDYGDVTDRKKLRERLQCHSFDWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVD 458
Query: 136 SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
+ P + CH G +Q + L+ G++ + C+D E ++
Sbjct: 459 NHNYHKP--VNMWPCHNQGGNQYWMLSKNGEIRRDDGCLDYSGGESVI------------ 504
Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAML 255
++ C G + +R NS + HA ++C+ + +L M
Sbjct: 505 ---------------VYPCHGQKGNQEWQYREDNS---IYHANTQKCMETSVDGQKLTMK 546
Query: 256 RCDENNSYQQWRFKEVRP 273
C + Q W +K P
Sbjct: 547 TCTGIDR-QIWTWKRKSP 563
>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Anolis carolinensis]
Length = 592
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 94/242 (38%), Gaps = 62/242 (25%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K++ +P A GD++E+ LR+KL CKDF W++ ++ +
Sbjct: 390 VRAAEVWMDSYKELYYHRNPHARMEPYGDVTERRLLREKLKCKDFKWYLDNI-------Y 442
Query: 114 PEL--PPNL--FWGEAKNLGTQK-CLDSMGRTAPAVIGTSY----CHGSGSSQLFRLNAE 164
PEL P + ++G KN G C D + G CHG G +Q F +
Sbjct: 443 PELHVPEDRLGYFGMLKNKGMANFCFDYNPPNEHDITGHVVILYPCHGMGQNQFFEYTSY 502
Query: 165 GQL----GHGERCVDADKQEKLVEML-------KLPVTFCVLAGVKLIFCRLGTVLNL-- 211
++ H E C D ++M K+P K + GT++++
Sbjct: 503 HEIRYNTRHPEACAAVDPGTDYLKMYLCQDDIQKIP------ENQKFMLREDGTLVHMQT 556
Query: 212 FNCFNS-SGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
C + S +G N +LR N+ YQ+W FKE
Sbjct: 557 LKCLQAESNAYNG--------------------------NPAPVLRPCSNSEYQKWFFKE 590
Query: 271 VR 272
R
Sbjct: 591 KR 592
>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
Length = 626
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 64/135 (47%), Gaps = 11/135 (8%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K++ + +D+GD S + LR L CK F W++++V D KF +
Sbjct: 434 WMDDYKELFYMHRGDLKTIDIGDTSARKKLRKDLKCKSFKWYLENVLPD---KFIMTEHS 490
Query: 120 LFWGEAKN--LGTQKCLDSMGRTA--PAVIGTSYCHGS-GSSQLFRLNAEGQLGHGERCV 174
L +G N G Q CLD++ R P +G CH SQ+F L+ GQL E C
Sbjct: 491 LGYGRVMNDAFGKQLCLDNLQRNEDQPYNLGQYPCHAQMAMSQVFALSKLGQLRREESCA 550
Query: 175 DADKQ---EKLVEML 186
+ E V+ML
Sbjct: 551 EVQDNVSAEAPVKML 565
>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
laevis]
gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
Length = 582
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 57/228 (25%), Positives = 96/228 (42%), Gaps = 44/228 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K++ +P A + GDISE+ LR++L CK F W++K V +++ P
Sbjct: 385 RAAEVWMDGYKELFYNRNPPAQKENYGDISERKLLRERLQCKSFDWYLKKVFPELH--IP 442
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP A + CHG G +Q F + ++
Sbjct: 443 EDRPG-WHGAVRSMGISSECLDY---NAPEHNPTGAHLSLFGCHGQGGNQFFEYTTKREI 498
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
E C + Q+ + M+ C G +S +
Sbjct: 499 RFNSVTELCAEVPDQQTYIGMMH-----CTKDG-------------------NSVPKNIA 534
Query: 225 WRFHNSTRQLEHAILKRCLSVH--PETN-QLAMLRCDENNSYQQWRFK 269
W F N + H C+S + PE + M CD ++ Q W+F+
Sbjct: 535 WEFRNDG-TIYHPSSGMCISSYRTPEGRADVQMKGCDPSDKNQLWKFE 581
>gi|440909764|gb|ELR59641.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Bos grunniens
mutus]
Length = 604
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDGFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W++ + VY + L +GE +N + CLD I
Sbjct: 435 LALRQRLKCRSFKWYLDN----VYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +AEG L G
Sbjct: 491 LYP-CHGM-SSQLVRYSAEGLLQLG 513
>gi|410968769|ref|XP_003990872.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3 [Felis catus]
Length = 633
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 107/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ + Y P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NTIYPEAYV--PDLNP-VISGYIKSIGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G K +
Sbjct: 544 FEYSAQREIRHNIQKELCLHA--AQGLVQLRA-----CAYKGHKTV-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+SG + W + L + LK CLS + E +++ C+ ++ Q+W F +
Sbjct: 583 -ASG--EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 631
>gi|291190646|ref|NP_001167159.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
gi|223648406|gb|ACN10961.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
Length = 560
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ FK P A + G+I ++ ++ +L C+ F W++++V +P
Sbjct: 372 RAAEVWMDEFKNFYYAAVPSARNVPYGNIQSRMEMKKRLGCQPFKWYLENV-------YP 424
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G + G CLD++G A V+G CH +G +Q + L + + H
Sbjct: 425 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 482
Query: 171 ERCV 174
+ C+
Sbjct: 483 DLCL 486
>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
Length = 621
Score = 57.4 bits (137), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 98/233 (42%), Gaps = 40/233 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVIS + L + ++F + + +A GDI+E+ LR++LHCK+F+W++
Sbjct: 421 QVISRNQVRLAEVWMDDYKEIFYRRNQQAAQMAREKTYGDITERRRLRERLHCKNFTWYL 480
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSY-CHGSGSSQLFRL 161
++V +++ P+L P + G KN GT CLD Y CHG G +Q F
Sbjct: 481 QNVYPEMFV--PDLNPTSY-GAIKNEGTNSCLDVGENNHGGKPLIMYPCHGMGGNQYFEY 537
Query: 162 NAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAG---VKLIFCRLGTVLNLFNCFNSS 218
+ L H + KQ C+ AG V+L CR F +
Sbjct: 538 TTQRDLRH-----NVGKQ------------LCLRAGAGPVQLGECR-------FRGQPTR 573
Query: 219 GTVDGPWRFHNSTRQLEHAILKRCLSV---HPETNQLAMLRCDENNSYQQWRF 268
D W HA + CL+ HP AM CD ++ +Q W F
Sbjct: 574 VPPDEEWELTQDHLIKNHAS-RMCLTARGKHP-----AMAPCDPSDPHQLWSF 620
>gi|169790926|ref|NP_001116108.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Homo
sapiens]
Length = 603
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 433
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 489
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512
>gi|143811393|sp|Q9HCQ5.3|GALT9_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 9;
AltName: Full=Polypeptide GalNAc transferase 9;
Short=GalNAc-T9; Short=pp-GaNTase 9; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 9;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
Length = 603
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 433
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 489
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512
>gi|193786287|dbj|BAG51570.1| unnamed protein product [Homo sapiens]
Length = 460
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 231 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 290
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 291 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 346
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 347 LYP-CHGM-SSQLVRYSADGLLQLG 369
>gi|38327556|ref|NP_068580.2| polypeptide N-acetylgalactosaminyltransferase 9 isoform B [Homo
sapiens]
gi|21360816|gb|AAM49722.1|AF458594_1 hypothetical protein [Homo sapiens]
gi|127802610|gb|AAH93819.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9 (GalNAc-T9) [Homo
sapiens]
gi|127802793|gb|AAH93817.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9 (GalNAc-T9) [Homo
sapiens]
gi|313882776|gb|ADR82874.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9 (GalNAc-T9)
(GALNT9), transcript variant B [synthetic construct]
Length = 237
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 8 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 67
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 68 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 123
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 124 LYP-CHGM-SSQLVRYSADGLLQLG 146
>gi|432891064|ref|XP_004075530.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3-like
[Oryzias latipes]
Length = 529
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 65/146 (44%), Gaps = 20/146 (13%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
LR W+ +K L + M +D GDIS++V LR L CK F W++ +VY
Sbjct: 330 LRVAEVWMDEYKSNVYLAWNIPMENHGIDYGDISQRVVLRKNLQCKSFQWYLD----NVY 385
Query: 111 TKFPELPPNLFWGEAKNLG-TQKCLDS-MGRTAPAVIGTSYCHGSG--------SSQLFR 160
+ LF+GE +N + C+D M A++ CHG G SQLF
Sbjct: 386 PEMRRYNNTLFYGEIRNSKVSHLCMDQGMKENHTAILHP--CHGWGPQLGRYTKDSQLF- 442
Query: 161 LNAEGQLGHGERCVDADKQEKLVEML 186
L G G RCV D+ ++L
Sbjct: 443 LGPLGSTGEDTRCVVDDQVSSFPQLL 468
>gi|10336504|dbj|BAB13699.2| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 603
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 433
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 489
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512
>gi|410912128|ref|XP_003969542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Takifugu rubripes]
Length = 558
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ L+ ++ CK F W++++V +P
Sbjct: 370 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENV-------YP 422
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G + G CLD++G A V+G CH +G +Q + L + + H
Sbjct: 423 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 480
Query: 171 ERCV 174
+ C+
Sbjct: 481 DLCL 484
>gi|189066640|dbj|BAG36187.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 102/225 (45%), Gaps = 44/225 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
VY P+L P + G K+ G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 VYV--PDLNP-VISGYIKSAGQPLCLD-VGENNQGGKPLIMYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + LV++ C G K + T
Sbjct: 550 HEIRHNIQKELCLHA--AQGLVQL-----KACTYKGHKTVV-----------------TG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
+ W + L + LK CLS + E +++ C+ ++ Q+W
Sbjct: 586 EQIWEIQKD-QLLYNPFLKMCLSANGE--HPSLVSCNPSDPLQKW 627
>gi|334326859|ref|XP_001378345.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
[Monodelphis domestica]
Length = 603
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 374 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 433
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + + +GE +N + CLD I
Sbjct: 434 LALRQRLQCRSFKWYLE----NVYPEMRIYNNTITYGEVRNSKASGYCLDQGAEDDDRAI 489
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +AEG L G
Sbjct: 490 LYP-CHGM-SSQLVRYSAEGLLQLG 512
>gi|350592396|ref|XP_003483457.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Sus scrofa]
Length = 217
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 8 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDGFKSHVYMAWNIPMTNPGVDFGDVSER 67
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W++ + VY + L +GE +N + CLD I
Sbjct: 68 LALRQRLKCRSFKWYLDN----VYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 123
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +AEG L G
Sbjct: 124 LYP-CHGM-SSQLVRYSAEGLLQLG 146
>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
catus]
Length = 582
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 78/164 (47%), Gaps = 11/164 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 385 RAAEVWMDQYKEHFYNRNPPARKEAYGDISERKLLRERLKCKSFDWYLKNVFSNLHV--P 442
Query: 115 ELPPNLFWGEAKNLG-TQKCLD-SMGRTAPAVIGTSY--CHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD + + P S CHG G +Q F + ++
Sbjct: 443 EDRPG-WHGAIRSMGISSECLDYNSPDSNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 501
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLN 210
E C + +Q+ V M P F + A + F GT+ +
Sbjct: 502 SVTELCAEVPEQKNYVGMQNCPKDGFPIPANIIWHFEEDGTIFH 545
>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
2 [Sus scrofa]
Length = 578
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 96/226 (42%), Gaps = 38/226 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSIGISSECLDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
E C + +Q+ V M P + N+ F GT+ F
Sbjct: 498 SVTELCAEVPEQKNHVGMQNCPKDGSPIP------------TNIIWHFKEDGTI-----F 540
Query: 228 HNSTRQLEHAILKRCLSVH--PETNQLAMLR-CDENNSYQQWRFKE 270
H H+ L CLS + PE +R CD + Q W+F++
Sbjct: 541 H------PHSGL--CLSAYRTPEGRPDVQMRTCDAADKNQIWKFEK 578
>gi|119618953|gb|EAW98547.1| hCG31690 [Homo sapiens]
Length = 477
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 231 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 290
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 291 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 346
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 347 LYP-CHGM-SSQLVRYSADGLLQLG 369
>gi|397487071|ref|XP_003814635.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Pan
paniscus]
Length = 603
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 433
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 489
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512
>gi|157114760|ref|XP_001652408.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883561|gb|EAT47786.1| AAEL001151-PA [Aedes aegypti]
Length = 592
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 93/235 (39%), Gaps = 51/235 (21%)
Query: 44 VISTQKSSLE---LRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDF 98
+I T+K + LR W+ +KQI G P ++GD+S + +R+K CK F
Sbjct: 381 LIQTKKDVVRANSLRLAEVWMDEYKQIIFDIYGLPRYPVEEIGDVSHRKQIREKAKCKTF 440
Query: 99 SWFMKHVAYDVYTKFPELPPNL----FWGEAKN--LGTQKCLDSMGRTAPAVIGTSYCHG 152
++ V FPE+ + F GE KN LG CL+ T + T C
Sbjct: 441 KYY-------VQAAFPEMNNPMVEGAFHGEVKNMALGNDTCLEYQLDTNTVRMAT--CDH 491
Query: 153 SGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLF 212
+ Q + N +L + C+D G + ++
Sbjct: 492 QETGQFWAHNYYQELNSHKHCLDY----------------------------TGDTMGVY 523
Query: 213 NCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLR-CDENNSYQQW 266
C S G + W++ T+Q++ +CL + ET + ++ CDE Q+W
Sbjct: 524 GCHRSRG--NQAWQYVKKTKQIKSVKHGKCLGLSLETMKTLLVEDCDEKKESQKW 576
>gi|169790950|ref|NP_001116116.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Rattus
norvegicus]
gi|149063717|gb|EDM14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 604
Score = 57.0 bits (136), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 16/148 (10%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
+ALR KL C+ F W++++V +PE+ L +GE +N + CLD
Sbjct: 435 LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 487
Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R +A+G L G
Sbjct: 488 RAILYP-CHGM-SSQLVRYSADGLLQLG 513
>gi|449279337|gb|EMC86972.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Columba livia]
Length = 519
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 290 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 349
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + + +GE +N + CLD I
Sbjct: 350 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 405
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R ++EG L G
Sbjct: 406 LYP-CHGM-SSQLVRYSSEGVLQLG 428
>gi|301608339|ref|XP_002933739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Xenopus (Silurana) tropicalis]
Length = 622
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 56/93 (60%), Gaps = 8/93 (8%)
Query: 80 MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGR 139
GDIS+++ L+ L CK+F+W+++++ +++ P+ P ++G+ KN G+Q CLD+ +
Sbjct: 457 FGDISKRLKLKADLQCKNFTWYLENIYPEMFV--PDRDPT-YYGKVKNEGSQNCLDAGEK 513
Query: 140 T---APAVIGTSYCHGSGSSQLFRLNAEGQLGH 169
P ++ + C+G G +Q F + +L H
Sbjct: 514 NHGGKPLIM--NLCNGMGGTQYFEYSTHKELRH 544
>gi|449476915|ref|XP_004176604.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 9 [Taeniopygia
guttata]
Length = 604
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + + +GE +N + CLD I
Sbjct: 435 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R ++EG L G
Sbjct: 491 LYP-CHGM-SSQLVRYSSEGVLQLG 513
>gi|354479146|ref|XP_003501774.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Cricetulus griseus]
Length = 610
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 16/148 (10%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 381 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 440
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
+ALR KL C+ F W++++V +PE+ L +GE +N + CLD
Sbjct: 441 LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 493
Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R +A+G L G
Sbjct: 494 RAILYP-CHGM-SSQLVRYSADGLLQLG 519
>gi|405977048|gb|EKC41520.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 635
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 65/128 (50%), Gaps = 14/128 (10%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K++ + D GDISE+V LR +L+CK F WF+ +V +
Sbjct: 439 MRLAEVWMDEYKRLFYTHRKDLLGQDYGDISERVELRKRLNCKSFKWFLDNV-------Y 491
Query: 114 PE--LPPNLF--WGEAKNLGTQKCLDSMGRTAPAV--IGTSYC-HGSGSSQLFRLNAEGQ 166
PE +P WG +N + CLD++ + V +G C +G+ ++++F L+ +
Sbjct: 492 PEKFIPDENVHAWGMVRNPPSNLCLDTLQKDEKTVFDMGIYSCQNGASANEVFSLSINDE 551
Query: 167 LGHGERCV 174
L E C+
Sbjct: 552 LRREEACL 559
>gi|360043880|emb|CCD81426.1| putative n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 526
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 51/225 (22%), Positives = 92/225 (40%), Gaps = 36/225 (16%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ P +D GD+S+Q LR +L CK F WFM VA+D+ +P
Sbjct: 318 WMDEYKEYIYRSLPECRKVDPGDLSQQHNLRKRLQCKSFKWFMTEVAFDLTKAYPPPEEV 377
Query: 120 LF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYC----HGSGSSQLFRLNAEGQLGHGER- 172
LF GE +++ C+D+ T + + S+C G Q F + + + +R
Sbjct: 378 LFATGEIRSVAFPYLCIDAARATKRSPVKLSFCSAISKGHNYMQSFEYSFKQDIKAVKRS 437
Query: 173 -CVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGT----VDGPWRF 227
C+D + PV F + C +G + +R
Sbjct: 438 LCLDVPDAKP-----NKPVLF-------------------YPCHGQNGNQHWHLQSAYRN 473
Query: 228 HNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKEVR 272
+ + + H+ CL ++ + + RC N Q+W ++++R
Sbjct: 474 NENPVIIAHSTSGLCLDLNLRGLSVFVRRCSSNLRTQRWNWRDLR 518
>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
Length = 575
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 378 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 435
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 436 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 494
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 495 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 553
>gi|157820525|ref|NP_001100621.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform B [Rattus
norvegicus]
gi|149063716|gb|EDM14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 237
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 67/148 (45%), Gaps = 16/148 (10%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 8 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
+ALR KL C+ F W++++V +PE+ L +GE +N + CLD
Sbjct: 68 LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 120
Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R +A+G L G
Sbjct: 121 RAILYP-CHGM-SSQLVRYSADGLLQLG 146
>gi|355753170|gb|EHH57216.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
fascicularis]
Length = 542
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 53/228 (23%), Positives = 89/228 (39%), Gaps = 49/228 (21%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ FK++ +P A GD++E+ LR KL CKDF WF++ V +++
Sbjct: 350 VRAAEVWMDEFKELYYHRNPHARLEPFGDVTERKQLRAKLQCKDFKWFLETVYPELHV-- 407
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTS----YCHGSGSSQLFRLNAEGQLGH 169
PE P F C D ++G CHG G +Q F ++ ++ +
Sbjct: 408 PEDRPGFF--------GMYCFDYNPPDENQIVGHQVILYVCHGMGHNQFFEYTSQKEIRY 459
Query: 170 G----ERCVDADKQEKLVEMLKLPVTFCVLAGVKLI---FCRLGTVLNLFNCFNSSGTVD 222
E C+ V AG+ ++ C N G++
Sbjct: 460 NTHQPEGCI------------------AVEAGMDILIMHLCEETAPENQKFILQEDGSL- 500
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
FH +++ A K + + +LR N+ +Q+W FKE
Sbjct: 501 ----FHEQSKKCVQAARKES-----SDSFVPLLRDCTNSDHQKWFFKE 539
>gi|401407036|ref|XP_003882967.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1 [Neospora caninum
Liverpool]
gi|325117383|emb|CBZ52935.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1 [Neospora caninum
Liverpool]
Length = 762
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 38/112 (33%), Positives = 58/112 (51%), Gaps = 13/112 (11%)
Query: 53 ELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYT 111
+LR W+ + +I L P L +GD++E ALRD+L CKDF+W++++V
Sbjct: 554 KLRAAHVWMGEYARIVELVVPRLPQGQSLGDLTELKALRDRLQCKDFNWYLENV------ 607
Query: 112 KFPEL-PPNL---FWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLF 159
+PEL PPN+ G +N C+D++ T IG CH +Q F
Sbjct: 608 -YPELEPPNVAHALTGAMRNPKFNCCVDTL-TTKNQEIGVYPCHFEHGTQAF 657
>gi|344246331|gb|EGW02435.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Cricetulus
griseus]
Length = 557
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 328 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 387
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR KL C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 388 LALRQKLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 443
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 444 LYP-CHGM-SSQLVRYSADGLLQLG 466
>gi|301603772|ref|XP_002931524.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2-like [Xenopus
(Silurana) tropicalis]
Length = 569
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 35/124 (28%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ L+ L CK F W++++V +P
Sbjct: 381 RAAEVWMDDYKNFYYAAVPSARNVPYGNIQSRLELKKNLGCKPFKWYLENV-------YP 433
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G A GT CLD++G A V+G CH +G +Q + L + + H
Sbjct: 434 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 491
Query: 171 ERCV 174
+ C+
Sbjct: 492 DLCL 495
>gi|345791109|ref|XP_543349.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Canis
lupus familiaris]
Length = 600
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 371 RVAHIERSKKPYNNDIDYYAKRNALRAAEVWMDSFKSHVYMAWNIPMTNPGVDFGDVSER 430
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 431 LALRQRLKCRSFKWYLE----NVYPEMRTYNDTLTYGEVRNSKASGYCLDQGAEDDDHAI 486
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R + EG L G
Sbjct: 487 LYP-CHGM-SSQLVRYSTEGLLQLG 509
>gi|47217176|emb|CAG11012.1| unnamed protein product [Tetraodon nigroviridis]
Length = 598
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 60/124 (48%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ L+ ++ CK F W++++V +P
Sbjct: 409 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELKKRVGCKPFKWYLENV-------YP 461
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G + G CLD++G A V+G CH +G +Q + L + + H
Sbjct: 462 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGVYECHNAGGNQEWALTKDKSVKHM 519
Query: 171 ERCV 174
+ C+
Sbjct: 520 DLCL 523
>gi|432852860|ref|XP_004067421.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Oryzias latipes]
Length = 556
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 59/124 (47%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ ++ +L CK F W++ +V +P
Sbjct: 368 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLEMKKRLGCKPFKWYLDNV-------YP 420
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +G + G CLD++G A V+G CH +G +Q + L + + H
Sbjct: 421 ELRVPDHQDIAFGALQQGGN--CLDTLGHFADGVVGIYECHNAGGNQEWALTKDKSVKHM 478
Query: 171 ERCV 174
+ C+
Sbjct: 479 DLCL 482
>gi|402911950|ref|XP_003918561.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Papio anubis]
Length = 237
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 8 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 68 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 123
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 124 LYP-CHGM-SSQLVRYSADGLLQLG 146
>gi|380813218|gb|AFE78483.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Macaca
mulatta]
Length = 603
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 374 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 433
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 434 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 489
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 490 LYP-CHGM-SSQLVRYSADGLLQLG 512
>gi|62148926|dbj|BAD93347.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-3 [Rattus
norvegicus]
Length = 633
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 104/235 (44%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NTIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGDKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + V LK CV G + I
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGRRTI-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + + + CLS + E +++ CD + Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFRMCLSSNGE--HPSLVPCDTTDLLQKWIFSQ 631
>gi|410922253|ref|XP_003974597.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Takifugu rubripes]
Length = 604
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 64/146 (43%), Gaps = 10/146 (6%)
Query: 29 GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
++ H ER I LR W+ +K + + M +D GD+SE
Sbjct: 374 ARVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEYKSHVYMAWNIPMNNPGVDFGDVSE 433
Query: 86 QVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAV 144
++ALR +L C+ F W+++H VY + + +GE +N + CLD
Sbjct: 434 RLALRKRLQCRSFRWYLEH----VYPEMRVYNNTITYGEVRNSKASGYCLDQGAEDDDKA 489
Query: 145 IGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R + EG L G
Sbjct: 490 ILYP-CHGM-SSQLARYSTEGLLQLG 513
>gi|334330196|ref|XP_003341314.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Monodelphis
domestica]
Length = 631
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 55/227 (24%), Positives = 104/227 (45%), Gaps = 44/227 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ FK+I + + + GDIS+++ +R +L CK+F+W++ ++ +
Sbjct: 434 QVRLAEVWMDEFKEIFYRRNTEAAKIVKQKAYGDISKRLDIRHRLQCKNFTWYLNNIYPE 493
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
+Y P+L P + G +N+G CLD +G P ++ T CH G +Q F + +
Sbjct: 494 IYV--PDLNP-VISGYIQNIGRHLCLD-VGENNQGGKPLIMYT--CHFLGGNQYFEXSEQ 547
Query: 165 GQLGHGERCVDADKQEKLVEMLKLPVTF--CVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
++ H + ++E + L+ PV C G K + TVD
Sbjct: 548 HEIRH------SIQKELCLHALQGPVQMKACSYKGQK------------------TFTVD 583
Query: 223 -GPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
W L + L RCL+ + E +++ C+ ++ +Q+W F
Sbjct: 584 EQKWEIRKD-HLLYNPSLHRCLTGNGE--HPSLVPCNPSDVFQKWDF 627
>gi|193786146|dbj|BAG51429.1| unnamed protein product [Homo sapiens]
Length = 354
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 66/249 (26%), Positives = 98/249 (39%), Gaps = 36/249 (14%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 125 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSER 184
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 185 LALRQRLKCRSFKWYLE----NVYPEMRVYNNTLTYGEVRNSKASAYCLDQGAEDGDRAI 240
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRL 205
CHG SSQL R +A+G L G L LP + C L+
Sbjct: 241 -LYPCHGM-SSQLVRYSADGLLQLG----------PLGSTAFLPDSKC------LVDDGT 282
Query: 206 GTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN-----QLAMLRCDEN 260
G + L C + + W F S + A RCL V + +L + RC
Sbjct: 283 GRMPTLKKCEDVARPTQRLWDFTQSGPIVSRAT-GRCLEVEMSKDANFGLRLVVQRC--- 338
Query: 261 NSYQQWRFK 269
S Q+W +
Sbjct: 339 -SGQKWMIR 346
>gi|426374733|ref|XP_004054218.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Gorilla gorilla gorilla]
Length = 237
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 8 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 67
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 68 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 123
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 124 LYP-CHGM-SSQLVRYSADGLLQLG 146
>gi|355564838|gb|EHH21338.1| hypothetical protein EGK_04374, partial [Macaca mulatta]
Length = 524
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 295 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 354
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 355 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 410
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 411 LYP-CHGM-SSQLVRYSADGLLQLG 433
>gi|355786668|gb|EHH66851.1| hypothetical protein EGM_03916, partial [Macaca fascicularis]
Length = 524
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 295 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 354
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 355 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 410
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 411 LYP-CHGM-SSQLVRYSADGLLQLG 433
>gi|354487360|ref|XP_003505841.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Cricetulus griseus]
Length = 633
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/235 (24%), Positives = 103/235 (43%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
V +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NTVYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + V LK CV G + +
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTV-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + + K CLS E L + CD ++ Q+W F +
Sbjct: 583 ---APGEQIWDIQKG-QLLYNPLFKMCLSATGEHPNL--VPCDTSDLLQKWIFSQ 631
>gi|51490862|emb|CAD44536.1| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
Length = 168
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 91/194 (46%), Gaps = 40/194 (20%)
Query: 80 MGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGR 139
GD+S++ ++ +L CK+F+W++ ++ +VY P+L P + G K++G CLD +G
Sbjct: 2 FGDLSKRFEIKHRLRCKNFTWYLNNIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGE 57
Query: 140 T----APAVIGTSYCHGSGSSQLFRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTF 192
P ++ T CHG G +Q F +A+ ++ H E C+ A + LV++
Sbjct: 58 NNQGGKPLIMYT--CHGLGGNQYFEYSAQHEIRHNIQKELCLHA--AQGLVQL-----KA 108
Query: 193 CVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQL 252
C G K + T + W + L + LK CLS + E
Sbjct: 109 CTYKGHKTVV-----------------TGEQIWEIQKD-QLLYNPFLKMCLSANGE--HP 148
Query: 253 AMLRCDENNSYQQW 266
+++ C+ ++ Q+W
Sbjct: 149 SLVSCNPSDPLQKW 162
>gi|395513519|ref|XP_003760971.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
[Sarcophilus harrisii]
Length = 570
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 341 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 400
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + + +GE +N + CLD I
Sbjct: 401 LALRQRLQCRSFKWYLE----NVYPEMRIYNNTITYGEVRNSKASGYCLDQGAEDDDRAI 456
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +AEG L G
Sbjct: 457 LYP-CHGM-SSQLVRYSAEGLLQLG 479
>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Meleagris gallopavo]
Length = 579
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 39/142 (27%), Positives = 69/142 (48%), Gaps = 10/142 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A + GDISE+ LR++L CK F+W++++V +++ P
Sbjct: 382 RAAEVWMDEYKEHFYNRNPPARKENYGDISERKLLRERLKCKSFNWYLRNVFSELHV--P 439
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSM---GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD + A + CHG G +Q F + +
Sbjct: 440 EDRPG-WHGAVRSVGISSECLDYVLPEHNPTGAHLSLFGCHGQGGNQFFEYTSNKEFRFN 498
Query: 171 ---ERCVDADKQEKLVEMLKLP 189
E C + +QE + M P
Sbjct: 499 SVTELCAEVPEQEDFIGMRSCP 520
>gi|405967230|gb|EKC32416.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 347
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 41/195 (21%), Positives = 78/195 (40%), Gaps = 33/195 (16%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
D GD++++ LR++L C F WF+K+V D++ + +AK + +D+
Sbjct: 179 DYGDVTDRKKLRERLQCHSFDWFVKNVYPDLFVPGEAIASGEIRSKAKPMCIDSAVDNHN 238
Query: 139 RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGV 198
P + CH G +Q + L+ G++ + C+D E ++
Sbjct: 239 YHKP--VNMWPCHNQGGNQYWMLSKNGEIRRDDGCLDYSGGESVI--------------- 281
Query: 199 KLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCD 258
++ C G + +R NS + HA ++C+ + +L M C
Sbjct: 282 ------------VYPCHGQKGNQEWQYREDNS---IYHANTQKCMETSVDGQKLTMKTCT 326
Query: 259 ENNSYQQWRFKEVRP 273
+ Q W +K P
Sbjct: 327 GIDR-QIWTWKRKSP 340
>gi|348550781|ref|XP_003461209.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Cavia porcellus]
Length = 604
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 435 LALRQRLKCRSFRWYLE----NVYPEMRVYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 491 LYP-CHGM-SSQLVRYSADGLLQLG 513
>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Sus scrofa]
gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Sus scrofa]
Length = 582
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 95/226 (42%), Gaps = 38/226 (16%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 385 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLGCKSFDWYLKNVFSNLHV--P 442
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD A + CHG G +Q F + ++
Sbjct: 443 EDRPG-WHGAIRSIGISSECLDYNSPENNPTGANLSLFGCHGQGGNQFFEYTSNREIRFN 501
Query: 171 ---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRF 227
E C + +Q+ V M P + N+ F GT+ P
Sbjct: 502 SVTELCAEVPEQKNHVGMQNCPKDGSPIP------------TNIIWHFKEDGTIFHP--- 546
Query: 228 HNSTRQLEHAILKRCLSVH--PETNQLAMLR-CDENNSYQQWRFKE 270
H+ L CLS + PE +R CD + Q W+F++
Sbjct: 547 --------HSGL--CLSAYRTPEGRPDVQMRTCDAADKNQIWKFEK 582
>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
Length = 494
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 58/124 (46%), Gaps = 13/124 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P+A G I + LR +L CK F W++++V +P
Sbjct: 305 RTAEVWMDEYKRFYYAARPMARSALYGSIKSRKELRKRLQCKPFKWYLQNV-------YP 357
Query: 115 EL----PPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
EL ++ +GE K + CLD++G A +G CHG +Q + L + + H
Sbjct: 358 ELQIPDSQDVSFGELKQ--GKSCLDTLGSQAGGSVGMFDCHGQAGNQEWALTKKSTVRHL 415
Query: 171 ERCV 174
+ C+
Sbjct: 416 DLCL 419
>gi|324522446|gb|ADY48063.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Ascaris suum]
Length = 210
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/153 (28%), Positives = 73/153 (47%), Gaps = 26/153 (16%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
LR + WL + + P ++ GDISE A+R++LHCK F W+++++ ++
Sbjct: 6 LRAANVWLDEYLEKFYEAKPYLRGVNFGDISEMKAIRERLHCKPFKWYLENIYPELLPN- 64
Query: 114 PELP-------PNLFW---GEAKNL---GTQKCLDS------MGRTAPAVIGTSYCHGSG 154
LP PN +W G+ ++ GT CL + R A A++ CH S
Sbjct: 65 -NLPSRESLQRPNPYWKSTGDKYHIKLHGTTLCLAGESNNGRISRGARAIL--ERCHKSA 121
Query: 155 SSQLFRLNAEGQL---GHGERCVDADKQEKLVE 184
Q++R G+L G C+D+ K +L++
Sbjct: 122 RQQMWRWTKLGELRPMGSSSLCLDSLKGPRLLK 154
>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4 [Equus caballus]
Length = 703
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/182 (27%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR +L CK F W++K+V +++ P
Sbjct: 506 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRKRLKCKSFDWYLKNVFSNLHV--P 563
Query: 115 ELPPNLFWGEAKNLGT-QKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
E P + G +++G +CLD AP A + CHG G +Q F ++ ++
Sbjct: 564 EDRPG-WHGAIRSMGIPSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSKKEI 619
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
E C + +Q+ V M P F + A + F GT+ + + C ++ T
Sbjct: 620 RFNSVTELCAEVPEQKNHVGMQNCPKDGFSIPANIIWHFKEDGTIFHPHSGQCLSAYWTP 679
Query: 222 DG 223
+G
Sbjct: 680 EG 681
>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
abelii]
Length = 578
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
troglodytes]
gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
paniscus]
gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
Length = 578
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
gorilla gorilla]
Length = 578
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|326923175|ref|XP_003207815.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Meleagris gallopavo]
Length = 709
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 78/200 (39%), Gaps = 51/200 (25%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLF--WGEAKNLGTQKCLDS 136
++GD+S+Q ALR++L C+ F W++++V +P+L L G N+ T +C+ +
Sbjct: 545 ELGDLSQQKALRERLRCRSFRWYLENV-------YPDLAAPLVKAGGLLVNVATARCIAA 597
Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
+ + C + +Q F L HGE CV
Sbjct: 598 QNTS----LALQACDVNNKAQHFNYTWLRLLRHGELCV---------------------- 631
Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQ---LEHAILKR-----CLSVHPE 248
+G L C D WR + + EH +++ CL VHP
Sbjct: 632 ------VPMGAAPGLRPC--GGWESDLAWRHSSMAARPELAEHLVVQHQHPPTCLLVHPV 683
Query: 249 TNQLAMLRCDENNSYQQWRF 268
L + C N YQQWRF
Sbjct: 684 LLALRLGACSATNRYQQWRF 703
>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
Length = 578
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
Length = 578
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
norvegicus]
gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
Length = 633
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/235 (23%), Positives = 104/235 (44%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ +VY P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NTIYPEVYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGDKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + V LK CV G + I
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + + + CLS + E +++ CD + Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFRMCLSSNGE--HPSLVPCDTTDLLQKWIFSQ 631
>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryctolagus cuniculus]
Length = 579
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A D GDISE+ LR++L CK F W++K+V ++ P
Sbjct: 382 RAAEVWMDDYKEHFYNRNPPARKEDYGDISERKLLRERLKCKSFDWYLKNVFSSLHV--P 439
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 440 EDRPG-WHGAIRSKGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 498
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +++ V M P F + A + F GT+ + + C ++ T +G
Sbjct: 499 SVTELCAEVPEKKNHVGMQNCPKDGFPIPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 557
>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
anubis]
Length = 578
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 75/164 (45%), Gaps = 11/164 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSKGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLN 210
E C + +Q+ V M P F V A + F GT+ +
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFLVPANIIWHFKEDGTIFH 541
>gi|348516166|ref|XP_003445610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
[Oreochromis niloticus]
Length = 604
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 66/149 (44%), Gaps = 16/149 (10%)
Query: 29 GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
++ H ER I LR W+ +K + + M +D GD+SE
Sbjct: 374 ARVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDEYKSHVYMAWNIPMNNPGVDFGDVSE 433
Query: 86 QVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTA 141
++ALR +L C+ F W+++HV +PE+ + +GE +N + CLD
Sbjct: 434 RLALRKRLQCRSFRWYLEHV-------YPEMRIYNNTITYGEVRNSKASGYCLDQGSEDD 486
Query: 142 PAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
I CHG SSQL R + EG L G
Sbjct: 487 DKAILYP-CHGM-SSQLARYSTEGLLQLG 513
>gi|351698595|gb|EHB01514.1| Polypeptide N-acetylgalactosaminyltransferase 9, partial
[Heterocephalus glaber]
Length = 519
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 290 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDEFKSHVYMAWNIPMTSPGVDFGDVSER 349
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 350 LALRQRLKCRSFRWYLE----NVYPEMRVYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 405
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 406 LYP-CHGM-SSQLVRYSADGLLQLG 428
>gi|296213304|ref|XP_002753213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
[Callithrix jacchus]
Length = 311
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 100/252 (39%), Gaps = 42/252 (16%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 82 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 141
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKN-LGTQKCLDSMGRTAP 142
+ALR KL C+ F W++++V +PE+ L +GE +N + CLD
Sbjct: 142 LALRQKLKCRSFKWYLENV-------YPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDDD 194
Query: 143 AVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIF 202
I CHG SSQL R +A+G L G L LP + C++ +
Sbjct: 195 RAI-LYPCHGM-SSQLVRYSADGLLQLG----------PLGSTAFLPDSKCLVDDGR--- 239
Query: 203 CRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN-----QLAMLRC 257
G L C + + W F S + A RCL V + +L + RC
Sbjct: 240 ---GRTPTLKKCEDVARPAQRLWDFTQSGPIVSRAT-GRCLEVEMAKDANFGLRLVVQRC 295
Query: 258 DENNSYQQWRFK 269
S Q+W +
Sbjct: 296 ----SGQKWMIR 303
>gi|51316092|sp|Q9GM01.1|GALT9_MACFA RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 9;
AltName: Full=Polypeptide GalNAc transferase 9;
Short=GalNAc-T9; Short=pp-GaNTase 9; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 9;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|11041469|dbj|BAB17277.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Macaca
fascicularis]
Length = 606
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 377 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 436
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 437 LALRQRLKCRSFKWYLE----NVYPEMRIYNNTLTYGEVRNSKASGYCLDQGAEDGDRAI 492
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 493 -LYPCHGM-SSQLVRYSADGLLQLG 515
>gi|67613609|ref|XP_667313.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3 [Cryptosporidium
hominis TU502]
gi|54658437|gb|EAL37084.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3 [Cryptosporidium
hominis]
Length = 732
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 96/228 (42%), Gaps = 40/228 (17%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
LR W+ F QI + +P + +G + + L+ KL CK F WF+ +VA + Y
Sbjct: 526 LRTARVWMDEFYQITEMLAPNPN-IKLGSFDDMLHLKKKLKCKPFRWFLDNVAPETYIT- 583
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSM-GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
+L LF GE +N CLDSM G+T IG +CHG +Q F ++ H +
Sbjct: 584 -QLDHLLFVGEIRNKKLNNICLDSMGGQTDGDKIGVFHCHGKKGTQAFMMS-----NHTQ 637
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
+ K+ ++C+ + +K C + N+ WR N
Sbjct: 638 QIRIVSKE-----------SYCIGSNLKYAACSNSEITNI-------------WRLENEM 673
Query: 232 RQLEHAILKR-CLSVHPETN-----QLAMLRCDENNSYQQWRFKEVRP 273
+ K CLS+ + + + +L CD N+ Q W + +P
Sbjct: 674 IKANVEPNKYVCLSLTEDNDSSTKHKAELLDCDPNDPSQHWNVNKFKP 721
>gi|395840154|ref|XP_003792930.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9
[Otolemur garnettii]
Length = 604
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 44/145 (30%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 435 LALRQRLKCRSFRWYLE----NVYPEMRIYNDTLTYGEVRNSKASGYCLDQGAEDDDRAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +A+G L G
Sbjct: 491 LYP-CHGM-SSQLVRYSADGLLQLG 513
>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 578
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|66363212|ref|XP_628572.1| extracellular protein with a signal peptide followed by family 2
glycosyltransferase and ricin domains [Cryptosporidium
parvum Iowa II]
gi|46229580|gb|EAK90398.1| extracellular protein with a signal peptide followed by family 2
glycosyltransferase and ricin domains [Cryptosporidium
parvum Iowa II]
gi|448826184|gb|AAQ95174.2| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T1, partial
[Cryptosporidium parvum]
Length = 637
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 91/221 (41%), Gaps = 41/221 (18%)
Query: 53 ELRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVY 110
+LR W+ F + R +G P +D G + E++ LR++L C F WF+++V +
Sbjct: 426 KLRTLYLWMDEFGDLAWRVMGRPR---VDTGPLDERIKLRERLRCNSFKWFLENVNPEAE 482
Query: 111 TKFPELPPNLFWGEAKNLGTQKCLDSMGRTAP-AVIGTSYCHGSGSSQLFRLNAEGQLGH 169
K + P + G KN+G+ C+D+ G P + CH +G +Q F + H
Sbjct: 483 VKSIDDVP--YIGNIKNIGSNLCIDTDGFNNPGGKVKLWSCH-TGETQNFMYFKTSK--H 537
Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
++ + C+ KL +C + W
Sbjct: 538 WMVTINDES--------------CITEKFKLDWCNEHSYH---------------WDVEI 568
Query: 230 STRQLEHAILKRCLS-VHPETNQLAMLRCDENNSYQQWRFK 269
++ A++KR S + NQL + C+ NN +QQW +
Sbjct: 569 ISKSPLTALIKRSNSCITAVDNQLKLTPCETNNKFQQWEIE 609
>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
Length = 1150
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 66/146 (45%), Gaps = 16/146 (10%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFW-GEAKNLGTQK-CLDS 136
D GD+++ ALR+KL CK F WF+K VA ++ +P + P F G +++ C+D+
Sbjct: 555 DAGDLTKMKALREKLQCKPFEWFLKEVAPEILQLYPPVEPEPFASGAIQSIAEPTLCIDT 614
Query: 137 MGRTAPAVIGTSYC-----HGSGSSQLFRL--NAEGQLGHGERCVDADKQEKLVEMLKLP 189
M R IG C H +Q F L + + Q E+C D + + P
Sbjct: 615 MQRPRGNPIGMHPCDSDLIHPKNMNQYFVLSWHRDIQQKSDEQCFDVPESAP-----RSP 669
Query: 190 VTFCVLAGVKLI--FCRLGTVLNLFN 213
VT +K + R ++ ++
Sbjct: 670 VTIYTCHNIKYLQHLPRTSVIIPFYD 695
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/67 (31%), Positives = 37/67 (55%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P+ +D GD+++ LR +L+CK F WF+++VA D+ +P
Sbjct: 933 RVAEVWMDEYKKYVYRKDPMRFSIDAGDLTKMKELRRRLNCKPFRWFIENVAPDLIDWYP 992
Query: 115 ELPPNLF 121
+ P F
Sbjct: 993 PIEPEPF 999
>gi|354484387|ref|XP_003504370.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Cricetulus griseus]
Length = 532
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 95/229 (41%), Gaps = 44/229 (19%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K +P A GDISE+ LR++L CK F W++++V +++ P
Sbjct: 335 RAAEVWMDEYKDHFYNRNPPARKEVYGDISERKLLRERLKCKSFHWYLENVFSNLHV--P 392
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAP------AVIGTSYCHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP A + CHG G +Q F + ++
Sbjct: 393 EDRPG-WHGAIRSMGISSECLDY---NAPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEI 448
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGP 224
E C + +Q+ V M P L +N+ F GT+
Sbjct: 449 RFNSVTELCAEVPEQKHHVGMQNCPKD------------GLPAPVNIIWLFKEDGTI--- 493
Query: 225 WRFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
FH T CLS + + M CD ++ Q WRF++
Sbjct: 494 --FHPHT--------GLCLSAFRTSEGRPSVQMKTCDAHDKNQIWRFEK 532
>gi|410057869|ref|XP_001172345.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9, partial
[Pan troglodytes]
Length = 288
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 76/171 (44%), Gaps = 15/171 (8%)
Query: 6 PAV--IGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQVISTQKSSLELRKRSSWLKV 63
PAV +G + GS ++ + ++ H ER I LR W+
Sbjct: 84 PAVGGVGAAVWQCGGSMEVLPCS---RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDD 140
Query: 64 FKQIRSLGSPLAMF---LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNL 120
FK + + M +D GD+SE++ALR +L C+ F W+++ +VY + L
Sbjct: 141 FKSHVYMAWNIPMTNPGVDFGDVSERLALRQRLKCRSFKWYLE----NVYPEMRVYNNTL 196
Query: 121 FWGEAKN-LGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
+GE +N + CLD I CHG SSQL R +A+G L G
Sbjct: 197 TYGEVRNSKASGYCLDQGAEDGDRAI-LYPCHGM-SSQLVRYSADGLLQLG 245
>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
aries]
Length = 582
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/182 (28%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V ++ P
Sbjct: 385 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHV--P 442
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY------CHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP TS CHG G +Q F + ++
Sbjct: 443 EDRPG-WHGAIRSIGISSECLDY---NAPDNNPTSANLSLFGCHGQGGNQFFEYTSNKEI 498
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
E C + + +K V M P F + A + F GT+ + + C ++ T
Sbjct: 499 RFNSVTELCAEVPELKKHVGMQNCPKDGFPIPANIIWHFKEDGTIFHPHSGLCLSAYRTP 558
Query: 222 DG 223
+G
Sbjct: 559 EG 560
>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
Length = 578
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/179 (27%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEIPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|158289989|ref|XP_311577.4| AGAP010367-PA [Anopheles gambiae str. PEST]
gi|157018424|gb|EAA07231.4| AGAP010367-PA [Anopheles gambiae str. PEST]
Length = 587
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 91/223 (40%), Gaps = 37/223 (16%)
Query: 60 WLKVFKQ-IRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPP 118
W+ +KQ I D GD+S+ +R+KL CK F WF++ VA ++ +P + P
Sbjct: 374 WMDEYKQYIYDRDPERYAKTDAGDMSKMKTIREKLMCKPFKWFLQEVAPEIIELYPPVEP 433
Query: 119 NLFW-GEAKNLGTQK-CLDSMGRTAPAVIGTSYCHGS-----GSSQLF--RLNAEGQLGH 169
+ G +++ C+D+M R IG C S +Q F + + Q +
Sbjct: 434 EPYASGSIQSVADSSLCIDTMQRGRGEPIGLYPCSNSLIEPTNHNQYFVHSWHRDIQHKY 493
Query: 170 GERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHN 229
GE C D + + G+ + +F C G + +++ +
Sbjct: 494 GEGCFDVPQ------------------------SKPGSPVTIFTCHMHQG--NQFFQYDH 527
Query: 230 STRQLEHAILKR-CLSVHPETNQLAMLRCDENNSYQQWRFKEV 271
TRQ+ K C ++L + RC+ Q+WR ++
Sbjct: 528 DTRQIYAGKDKSFCFDADGMNDKLILNRCNRKQESQRWRMGQM 570
>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
grunniens mutus]
Length = 566
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V ++ P
Sbjct: 369 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHV--P 426
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY------CHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP TS CHG G +Q F + ++
Sbjct: 427 EDRPG-WHGAIRSIGISSECLDY---NAPDNNPTSANLSLFGCHGQGGNQFFEYTSNKEI 482
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
E C + + +K V M P F + A + F GTV + + C ++ T
Sbjct: 483 RFNSVTELCAEVPELKKHVGMQNCPKDGFPIPANIIWHFKEDGTVFHPHSGLCLSAFRTQ 542
Query: 222 DG 223
+G
Sbjct: 543 EG 544
>gi|281344573|gb|EFB20157.1| hypothetical protein PANDA_017448 [Ailuropoda melanoleuca]
Length = 539
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 35/110 (31%), Positives = 55/110 (50%), Gaps = 13/110 (11%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 343 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 395
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFR 160
EL ++ +G A GT CLD++G A V+G CH +G +Q +R
Sbjct: 396 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQGWR 443
>gi|118098416|ref|XP_415088.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Gallus gallus]
Length = 604
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 375 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 434
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + + +GE +N + CLD I
Sbjct: 435 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 490
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +++G L G
Sbjct: 491 LYP-CHGM-SSQLVRYSSDGVLQLG 513
>gi|260809642|ref|XP_002599614.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
gi|229284894|gb|EEN55626.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
Length = 432
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 60/113 (53%), Gaps = 10/113 (8%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K + P ++ G++ + LR LHC DFSW++++V ++Y P++ P
Sbjct: 320 WMDDYKNVIYRKHPDYKTVETGNVMPRKVLRKALHCHDFSWYVQNVYPNLYV--PDVRP- 376
Query: 120 LFWGEAKNLGTQKCLDSMG--RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
+ +G+ + G KCLD++ + P + G CHG G +Q + G++ G
Sbjct: 377 VAYGQVRMTG--KCLDAVSPEKEQPKLFG---CHGLGGNQYWEFTRAGEVRFG 424
>gi|195379310|ref|XP_002048423.1| GJ11361 [Drosophila virilis]
gi|194155581|gb|EDW70765.1| GJ11361 [Drosophila virilis]
Length = 540
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/221 (23%), Positives = 89/221 (40%), Gaps = 33/221 (14%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R WL +K P + +G + +Q ALR +L+CK F WF+ H+ D ++P
Sbjct: 329 RVAEVWLDEYKDKLYANMPHLTHIKVGSLKKQKALRQRLNCKPFKWFLDHLGRDFLEQYP 388
Query: 115 -ELPPNLFWGEAKNLGTQK-CLD-SMGRTAPAVIGTSY-CHGSGSSQLFRLNAEGQLGHG 170
+ P + +G ++L CL+ + P ++ Q + L+ L
Sbjct: 389 MDEPLDFAFGAVQSLAAPTLCLERTESAEHPQLLPCDEDLMYPKLEQKWSLSHFRDLHSS 448
Query: 171 ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS 230
RC++ +Q E+ L+ C + +G + W F +
Sbjct: 449 SRCLEVHQQHPNAEIW------------------------LWQCHHHAG--NQFWSFDLN 482
Query: 231 TRQLEHA---ILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
++Q+ H +RCL E ++ CD NN QQW+F
Sbjct: 483 SKQIIHGQNRAEQRCLEALVEQRKVVANICDANNLRQQWKF 523
>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
Length = 578
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 84/182 (46%), Gaps = 19/182 (10%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V ++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFSTLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMGRTAPAVIGTSY------CHGSGSSQLFRLNAEGQL 167
E P + G +++G + +CLD AP TS CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSIGISSECLDY---NAPDNNPTSANLSLFGCHGQGGNQFFEYTSNKEI 494
Query: 168 GHG---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTV 221
E C + + +K V M P F + A + F GTV + + C ++ T
Sbjct: 495 RFNSVTELCAEVPELKKHVGMQNCPKDGFPIPANIIWHFKEDGTVFHPHSGLCLSAFRTQ 554
Query: 222 DG 223
+G
Sbjct: 555 EG 556
>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 497
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 62/124 (50%), Gaps = 7/124 (5%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +K+ +D GD+S+++ LR KL C +F+W++K++ ++
Sbjct: 335 MRVAEVWMDHYKKFFYATQTKLHMVDYGDVSDRLELRRKLGCHNFTWYLKNIIPEMI--L 392
Query: 114 PELPPNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERC 173
P N F GE +N T CLDS + + C + S Q+F L ++ QL G+ C
Sbjct: 393 PVDDANYF-GEIRNDATGLCLDS---ASGKPLRVDIC-AATSDQIFTLTSDHQLRIGKEC 447
Query: 174 VDAD 177
+ D
Sbjct: 448 LMKD 451
>gi|170038571|ref|XP_001847122.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
gi|167882321|gb|EDS45704.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
Length = 560
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 58/244 (23%), Positives = 96/244 (39%), Gaps = 52/244 (21%)
Query: 40 ADKQVISTQKSSLELRKRSSWLKVFKQI--RSLGSPLAMFLDMGDISEQVALRDKLHCKD 97
A+K V+ LR W+ +KQ+ G P ++GD+S + +R K +CK
Sbjct: 351 ANKDVVRANS----LRLAEVWMDEYKQVIFDIYGLPRYPVEEVGDVSSRKEVRRKANCKT 406
Query: 98 FSWFMKHVAYDVYTKFPELPPNL----FWGEAKN--LGTQKCLDSMGRTAPAVIGTSYCH 151
F ++++ T +PE+ L F GE KN LG CL A +G + C
Sbjct: 407 FRYYIE-------TAYPEMKNPLIEGAFRGEVKNAALGNDTCLTY--HAATNTVGMASCD 457
Query: 152 GSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNL 211
+ SQ + N +L + C+D G+ L +
Sbjct: 458 HAEKSQFWVHNYYQELNSYKHCLDY----------------------------TGSELGV 489
Query: 212 FNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQ-LAMLRCDENNSYQQWRFKE 270
F C G + W++ + Q++ +CL++ E+ L M CD + Q+W
Sbjct: 490 FGCHRGRG--NQAWQYKIDSGQIKSVKHGKCLALDMESRTFLKMEDCDGSKEVQKWLVTP 547
Query: 271 VRPD 274
V D
Sbjct: 548 VSLD 551
>gi|444727591|gb|ELW68073.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Tupaia chinensis]
Length = 554
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 13/107 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL+CK F W++++V +P
Sbjct: 300 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLNCKPFKWYLENV-------YP 352
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
EL ++ +G A GT CLD++G A V+G CH +G +Q
Sbjct: 353 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQ 397
>gi|71994070|ref|NP_001022877.1| Protein GLY-9, isoform b [Caenorhabditis elegans]
gi|58081905|emb|CAI46621.1| Protein GLY-9, isoform b [Caenorhabditis elegans]
Length = 325
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 87/215 (40%), Gaps = 39/215 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K++ + D+GD++ + LR +L+CK F WF+ ++A KF
Sbjct: 138 WMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIA---KGKFIMDEDV 194
Query: 120 LFWGEAKNL--GTQKCLDSMGRTA--PAVIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
+ +G + GT+ C D++ R ++G +C G GSS QL L+ EG L C
Sbjct: 195 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCA 254
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS-TRQ 233
+ +++ C N W + N R
Sbjct: 255 SEEN-----------------GNIRMKTCSKKAQFN------------ERWAYENKMIRN 285
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
L+ ++ P N + ++ CDE + +Q+W F
Sbjct: 286 LKSGKCMSTANLKPGDNAI-VVECDEKDEHQKWNF 319
>gi|221504299|gb|EEE29974.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3, putative
[Toxoplasma gondii VEG]
Length = 635
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 78/187 (41%), Gaps = 41/187 (21%)
Query: 90 RDKLHCKDFSWFMKHV--AYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPA-VIG 146
R + CK F WFM++V DV T ++P + G +N CLD+MG +P +G
Sbjct: 471 RKRKGCKSFRWFMENVFPEGDVVT-LDDVP---YLGPLRNDKIGMCLDNMGWASPGHAVG 526
Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLG 206
YCHG G +Q F ++GH D + C+ +L +CR
Sbjct: 527 LEYCHG-GDTQTFMFFR--KVGHVMPVNDDEA--------------CLQPSGRLDWCR-- 567
Query: 207 TVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
GT W F S+ QL K+CLS +L M+ CD+ + YQ W
Sbjct: 568 ------------GTAQFWWDF-TSSGQLMFRETKQCLSAF--GRKLRMVECDDTDPYQIW 612
Query: 267 RFKEVRP 273
+ P
Sbjct: 613 SWTAYNP 619
>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Anolis carolinensis]
Length = 583
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 94/228 (41%), Gaps = 42/228 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A + GD+SE+ LR KL C +F W++K++ +++ P
Sbjct: 386 RAAEVWMDDYKEHFYNRNPPARKENFGDLSERKLLRKKLQCNNFDWYLKNIFPNLHV--P 443
Query: 115 ELPPNLFWGEAKNLG-TQKCLDSMG---RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G +++G + +CLD A + CHG G +Q F ++
Sbjct: 444 EDRPG-WHGAIRSMGISSECLDYNSPEHNPTGAHVSLFGCHGQGGNQFFEYTVNQEIRFN 502
Query: 171 ---ERCVDADKQEKLVEMLKLPV--TFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPW 225
E C + +Q+ V M P TF + +K F GT+
Sbjct: 503 SVTELCAEVPEQKDFVGMRTCPKDGTF-IPETIKWQFREDGTIY---------------- 545
Query: 226 RFHNSTRQLEHAILKRCLSVH--PETN-QLAMLRCDENNSYQQWRFKE 270
H +CLS + PE + M C+ + Q W+F++
Sbjct: 546 ----------HPHSAKCLSAYRTPEGRADIQMTACNAGDKNQAWKFEK 583
>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
Length = 578
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 48/179 (26%), Positives = 82/179 (45%), Gaps = 13/179 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ +R++L CK F W++K+V +++ P
Sbjct: 381 RVAEVWMDEYKEHFYNRNPPARKEAYGDISERKLIRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPG-WHGAIRSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 497
Query: 171 ---ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 498 SVTELCAEVPEQKNYVGMQNCPKDGFLVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
Length = 579
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 89/215 (41%), Gaps = 39/215 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K++ + D+GD++ + LR +L+CK F WF+ ++A + ++
Sbjct: 392 WMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDV--- 448
Query: 120 LFWGEAKNL--GTQKCLDSMGRTAP--AVIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
+ +G + GT+ C D++ R ++G +C G GSS QL L+ EG L C
Sbjct: 449 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCA 508
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS-TRQ 233
+ +++ C + + W + N R
Sbjct: 509 SEEN-----------------GNIRMKTC------------SKKAQFNERWAYENKMIRN 539
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
L+ ++ P N + ++ CDE + +Q+W F
Sbjct: 540 LKSGKCMSTANLKPGDNAI-VVECDEKDEHQKWNF 573
>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
Length = 518
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 2/99 (2%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
D GD+S + ALR++LHCK F W++ ++ D++ L G + CLDS
Sbjct: 338 DYGDVSARKALRERLHCKSFKWYLDNIYPDLFIPGESLASGEVNGVFNSQSQPACLDSAA 397
Query: 139 --RTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVD 175
+ I CH G +Q + L+ G++ E C D
Sbjct: 398 DKKAYNKAIKLWPCHNMGGNQYWMLSKSGEIRRDEGCFD 436
>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Nomascus leucogenys]
Length = 578
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/178 (26%), Positives = 77/178 (43%), Gaps = 11/178 (6%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR++L CK F W++K+V +++ P
Sbjct: 381 RAAEVWMDEYKEHFYNRNPPARKEAYGDISERKLLRERLRCKSFDWYLKNVFPNLHV--P 438
Query: 115 ELPPNLFWGEAKNLGTQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG- 170
E P + +CLD A + CHG G +Q F + ++
Sbjct: 439 EDRPGWHGAIHSRGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFNS 498
Query: 171 --ERCVDADKQEKLVEMLKLPVT-FCVLAGVKLIFCRLGTVLNLFN--CFNSSGTVDG 223
E C + +Q+ V M P F V A + F GT+ + + C ++ T +G
Sbjct: 499 VTELCAEVPEQKNYVGMQNCPKDGFPVPANIIWHFKEDGTIFHPHSGLCLSAYRTPEG 556
>gi|195172682|ref|XP_002027125.1| GL20074 [Drosophila persimilis]
gi|194112938|gb|EDW34981.1| GL20074 [Drosophila persimilis]
Length = 597
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 21/129 (16%)
Query: 39 DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKD 97
D D I+T + +L W+ F I L P L D+GD++ +V LR KL CK
Sbjct: 395 DRDTHGINTARMAL------VWMDEFINIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKS 448
Query: 98 FSWFMKHVAYDVYTKFPE-LPPN---LFWGEAKNLGTQKCLDSM--GRTAPAVIGTSYCH 151
F+W++K++ +PE PN + WG+ K++ + CLD + P +G C
Sbjct: 449 FAWYLKNI-------YPEKFVPNADVVGWGKVKSVSSNLCLDDLLQNNEKPYNVGLYPCG 501
Query: 152 GS-GSSQLF 159
SQLF
Sbjct: 502 KVLQKSQLF 510
>gi|237841027|ref|XP_002369811.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3, putative
[Toxoplasma gondii ME49]
gi|37622165|gb|AAQ95173.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3 [Toxoplasma gondii]
gi|37724831|gb|AAO20278.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3 [Toxoplasma gondii]
gi|211967475|gb|EEB02671.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3, putative
[Toxoplasma gondii ME49]
gi|221483678|gb|EEE21990.1| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3, putative
[Toxoplasma gondii GT1]
Length = 635
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 54/187 (28%), Positives = 78/187 (41%), Gaps = 41/187 (21%)
Query: 90 RDKLHCKDFSWFMKHV--AYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRTAPA-VIG 146
R + CK F WFM++V DV T ++P + G +N CLD+MG +P +G
Sbjct: 471 RKRKGCKSFRWFMENVFPEGDVVT-LDDVP---YLGPLRNDKIGMCLDNMGWASPGHAVG 526
Query: 147 TSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLG 206
YCHG G +Q F ++GH D + C+ +L +CR
Sbjct: 527 LEYCHG-GDTQTFMFFR--KVGHVMPVNDDEA--------------CLQPSGRLDWCR-- 567
Query: 207 TVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQW 266
GT W F S+ QL K+CLS +L M+ CD+ + YQ W
Sbjct: 568 ------------GTAQFWWDF-TSSGQLMFRETKQCLSAF--GRKLRMVECDDTDPYQIW 612
Query: 267 RFKEVRP 273
+ P
Sbjct: 613 SWTAYNP 619
>gi|125810093|ref|XP_001361353.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
gi|54636528|gb|EAL25931.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
Length = 597
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 62/129 (48%), Gaps = 21/129 (16%)
Query: 39 DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKD 97
D D I+T + +L W+ F I L P L D+GD++ +V LR KL CK
Sbjct: 395 DRDTHGINTARMAL------VWMDEFINIFFLNRPDLKFHADIGDVTHRVMLRKKLRCKS 448
Query: 98 FSWFMKHVAYDVYTKFPE-LPPN---LFWGEAKNLGTQKCLDSM--GRTAPAVIGTSYCH 151
F+W++K++ +PE PN + WG+ K++ + CLD + P +G C
Sbjct: 449 FAWYLKNI-------YPEKFVPNADVVGWGKVKSVSSNLCLDDLLQNNEKPYNVGLYPCG 501
Query: 152 GS-GSSQLF 159
SQLF
Sbjct: 502 KVLQKSQLF 510
>gi|198414821|ref|XP_002124597.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
4, partial [Ciona intestinalis]
Length = 230
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 64/128 (50%), Gaps = 8/128 (6%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLA-MFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTK 112
+R WL +K+ + PL+ D+GD++++ LR +L+C DF W++ +V + +
Sbjct: 36 MRLAEVWLDDYKRHFYVRVPLSERNQDVGDVTKRKELRQRLNCHDFKWYLDNVFPEQFV- 94
Query: 113 FPELPPNLFWGEAKNLGTQKCLD-SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG- 170
PE P + L CLD S T AV+ CHG S+Q F ++ ++ +
Sbjct: 95 -PEDRPGFYGPIESVLLKDNCLDVSFDATNQAVVQMYQCHGM-SNQFFEYTSKNEIRYNQ 152
Query: 171 --ERCVDA 176
E+C+D
Sbjct: 153 ISEKCLDT 160
>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
Length = 579
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 46/215 (21%), Positives = 89/215 (41%), Gaps = 39/215 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K++ + D+GD++ + LR +L+CK F WF+ ++A + ++
Sbjct: 392 WMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNCKPFKWFLDNIAKGKFIMDEDV--- 448
Query: 120 LFWGEAKNL--GTQKCLDSMGRTAP--AVIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
+ +G + GT+ C D++ R ++G +C G GSS QL L+ EG L C
Sbjct: 449 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSKEGNLRRENTCA 508
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNS-TRQ 233
+ +++ C + + W + N R
Sbjct: 509 SEEN-----------------GNIRMKTC------------SKKAQFNERWAYENKMIRN 539
Query: 234 LEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRF 268
L+ ++ P N + ++ CDE + +Q+W F
Sbjct: 540 LKSGKCMSTANLKPGDNAI-VVECDEKDEHQKWNF 573
>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 100/229 (43%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ + +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYLNTIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
Y P+L P + G K++G CLD +G P ++ T CHG G +Q F +A+
Sbjct: 496 AYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSAQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + V LK CV G + I
Sbjct: 550 REIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-----------------APG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + + K CLS + E L + CD + Q+W F +
Sbjct: 586 EQIWEIRKD-QLLYNPLFKMCLSSNGEHPNL--VPCDATDLLQKWIFSQ 631
>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
Length = 507
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/196 (26%), Positives = 87/196 (44%), Gaps = 37/196 (18%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMG 138
D GD S+++ LR L CK F W++++V + KF + +G +N + CLD++
Sbjct: 338 DAGDFSDRLQLRRDLKCKSFKWYLENVYPE---KFIPDENVIAYGMVRNPSSNLCLDTLS 394
Query: 139 RTAPAV--IGTSYCHGSGSS-QLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVL 195
+ V +G C G SS QLF L+ +L E C+D+ E
Sbjct: 395 KDEKMVFNLGLYGCQGGVSSNQLFSLSQSNELRREESCMDSVGGE--------------- 439
Query: 196 AGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETN--QLA 253
G+ + L C S G + W ++ T+ + H+ K CL E N ++
Sbjct: 440 ----------GSPVKLMPCHGSRGHQE--WTYNLETKVIVHSSGK-CLDKVEEKNGGEVK 486
Query: 254 MLRCDENNSYQQWRFK 269
+ +C+ +S Q W F+
Sbjct: 487 VEKCNGKDS-QIWEFE 501
>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 102/235 (43%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ + Y P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + V LK CV G + I
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + + K CLS + E L + CD + Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFKMCLSSNGEHPNL--VPCDATDLLQKWIFSQ 631
>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
musculus]
Length = 633
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 57/235 (24%), Positives = 102/235 (43%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ + Y P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 490 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + V LK CV G + I
Sbjct: 544 FEYSAQREIRHNIQKELCLHATQG---VVQLKA----CVYKGHRTI-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + + K CLS + E L + CD + Q+W F +
Sbjct: 583 ---APGEQIWEIRKD-QLLYNPLFKMCLSSNGEHPNL--VPCDATDLLQKWIFSQ 631
>gi|326929546|ref|XP_003210923.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Meleagris gallopavo]
Length = 481
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 65/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 252 RVAHIERTKKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMANPGVDFGDVSER 311
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + + +GE +N + CLD I
Sbjct: 312 IALRQRLQCRSFKWYLE----NVYPEMRVYNNTVTYGEVRNSKASGYCLDQGAEEDDKAI 367
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +++G L G
Sbjct: 368 LYP-CHGM-SSQLVRYSSDGVLQLG 390
>gi|256083753|ref|XP_002578103.1| peptidase [Schistosoma mansoni]
Length = 1860
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/98 (31%), Positives = 49/98 (50%), Gaps = 2/98 (2%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ P +D GD+S+Q LR +L CK F WFM VA+D+ +P
Sbjct: 313 RVAEVWMDEYKEYIYRSLPECRKVDPGDLSQQHNLRKRLQCKSFKWFMTEVAFDLTKAYP 372
Query: 115 ELPPNLF-WGEAKNLG-TQKCLDSMGRTAPAVIGTSYC 150
LF GE +++ C+D+ T + + S+C
Sbjct: 373 PPEEVLFATGEIRSVAFPYLCIDAARATKRSPVKLSFC 410
>gi|432851710|ref|XP_004067046.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4-like
[Oryzias latipes]
Length = 609
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/177 (27%), Positives = 77/177 (43%), Gaps = 11/177 (6%)
Query: 29 GQLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISE 85
++ H ER + +++ LR W+ FK + + M +D+GDISE
Sbjct: 380 ARIAHIERAHKPYTEDLTSNVRRNALRVAEVWMDEFKSHVYMAWNIPMEDSGIDIGDISE 439
Query: 86 QVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAV 144
+ ALR +L CK F W++ ++Y + + +G KN L + CLD G V
Sbjct: 440 RKALRKRLQCKTFRWYL----VNIYPEMRMYSDTIAYGVLKNSLKSDLCLDQ-GPDGDNV 494
Query: 145 IGTSYCHG-SGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLP-VTFCVLAGVK 199
CHG + S + E +G +D D + LV++ P +T C A K
Sbjct: 495 PILYLCHGMTPQSVYYTQTQELHIGLLSPTIDDDDNKCLVDVNSKPRLTECSYAAAK 551
>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 621
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/135 (28%), Positives = 60/135 (44%), Gaps = 20/135 (14%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
WL +K + P M +D GDIS++V LR L CK F WF++++ D + F +
Sbjct: 417 WLDEYKYLFYANQPETMKIDPGDISQRVQLRKNLQCKSFRWFLQNIYPDSHYNFAFVGV- 475
Query: 120 LFWGEAKNLGTQKCLD-----------SMGRTAPAVIGTSY----CHGSGSSQLFRLNAE 164
G+ KN+ + CLD G+ A V + CH G QLF +
Sbjct: 476 ---GQLKNVASGACLDFGKAAGHGGKEFKGKDATNVTSNTVELWPCH-DGKIQLFIRTDK 531
Query: 165 GQLGHGERCVDADKQ 179
+ + C+D + Q
Sbjct: 532 KEFRYIHMCLDYNVQ 546
>gi|444726436|gb|ELW66970.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Tupaia chinensis]
Length = 651
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 423 RVAHIERTRKPYSGDIDYYARRNALRAAEVWMDGFKSHVYMAWNIPMTNPGVDFGDVSER 482
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W++ +VY + L +GE +N + CLD I
Sbjct: 483 LALRRRLKCRSFKWYLD----NVYPEMRVYNDTLTYGEVRNSKASGYCLDQGAEDDDQAI 538
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R +AEG L G
Sbjct: 539 -LYPCHGM-SSQLVRYSAEGLLQLG 561
>gi|195120520|ref|XP_002004772.1| GI19414 [Drosophila mojavensis]
gi|193909840|gb|EDW08707.1| GI19414 [Drosophila mojavensis]
Length = 604
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/129 (31%), Positives = 61/129 (47%), Gaps = 21/129 (16%)
Query: 39 DADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSP-LAMFLDMGDISEQVALRDKLHCKD 97
D D I+T + +L W+ + + L P L D+GD++ +V LR KL CK
Sbjct: 401 DRDTHGINTARMAL------VWMDEYINVFFLNRPDLKFHPDIGDVTHRVVLRKKLRCKS 454
Query: 98 FSWFMKHVAYDVYTKFPE-LPPNL---FWGEAKNLGTQKCLDSM--GRTAPAVIGTSYCH 151
F W++K+V +PE PN+ WG+ K + + CLD + P +G C
Sbjct: 455 FEWYLKNV-------YPEKFVPNMNVKAWGKVKAVNSNLCLDDLLNNNEKPYNLGLYACG 507
Query: 152 GS-GSSQLF 159
+ SQLF
Sbjct: 508 KALQKSQLF 516
>gi|431894865|gb|ELK04658.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Pteropus alecto]
Length = 633
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/235 (22%), Positives = 103/235 (43%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 430 QVIARNQVRLAEVWMDDYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 489
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
++ +VY P+L P + G K+ G CLD +G P ++ T CHG G +Q
Sbjct: 490 NNIYPEVYV--PDLNP-VISGYIKSFGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 543
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F + + ++ H E C+ A + LV++ C G K +
Sbjct: 544 FEYSVQHEIRHNIQKELCLHA--AQGLVQL-----KACAYKGHKTV-------------- 582
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W L + + K CLS + E +++ C+ ++ Q+W F +
Sbjct: 583 ---AAGEQIWEIQKD-HLLYNPLFKMCLSANGE--HPSLVSCNPSDLLQKWIFSQ 631
>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 626
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 12/123 (9%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K P + GDISE+ ALR+KL CK F W++ +V KF
Sbjct: 420 WMDNYKYYFYQNRPELRKISFGDISERKALRNKLQCKSFKWYLDNV---YPNKFVPSEKV 476
Query: 120 LFWGEAKNLGTQKCLDSMGR----TAPAVIGTSYCH---GSGSSQLFRLNAEGQLGHGER 172
+G A+N T CLDSM T P +G CH SG +QL ++ +
Sbjct: 477 FAFGNARNPNTGMCLDSMSHNYDNTEP--LGIYPCHKDTNSGGNQLVSYTWRHEIRKEDS 534
Query: 173 CVD 175
C +
Sbjct: 535 CAE 537
>gi|355689613|gb|AER98891.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mustela putorius
furo]
Length = 302
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 106/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 100 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 159
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ + Y P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 160 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 213
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G + +
Sbjct: 214 FEYSAQHEIRHNIQRELCLHA--AQGLVQLRA-----CAYKGHRTV-------------- 252
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+SG + W + L + K CLS + E +++ C+ ++ Q+W F +
Sbjct: 253 -ASG--EQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 301
>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Takifugu rubripes]
Length = 583
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 96/229 (41%), Gaps = 46/229 (20%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
+R W+ +KQ +P A GDIS ++ LRDKL C+ F+W++K++ D++
Sbjct: 386 VRAAEVWMDSYKQHFYNRNPPARKETYGDISGRLLLRDKLKCQSFNWYLKNIYPDLH--I 443
Query: 114 PELPPNLFW-GEAKNLG-TQKCLDSMGRTAP--AVIGTSY----CHGSGSSQLFRLNAEG 165
PE W G ++LG +CLD AP +V G CHG G +Q F ++
Sbjct: 444 PE--DRAGWHGAVRHLGINSECLDY---NAPEHSVTGAHLSLFGCHGQGGNQYFEYTSQK 498
Query: 166 QLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVD 222
++ E C + + + M C G R+ ++L
Sbjct: 499 EIRFNTVTELCAEVVEGRASIGM-----KHCPGDGE----TRIPSIL------------- 536
Query: 223 GPWRFHNSTRQLEHAILKRCLSVHPET---NQLAMLRCDENNSYQQWRF 268
W F + H C++ + + + M+RC+ + QQW+F
Sbjct: 537 --WEFRQDG-SIYHPHTDMCITAYRTSAGRTDVKMMRCNPGDKNQQWKF 582
>gi|301784471|ref|XP_002927652.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like,
partial [Ailuropoda melanoleuca]
Length = 463
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 34/107 (31%), Positives = 53/107 (49%), Gaps = 13/107 (12%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K P A + G+I ++ LR KL CK F W++++V +P
Sbjct: 315 RAAEVWMDEYKNFYYAAVPSARNVPYGNIQSRLELRKKLSCKPFKWYLENV-------YP 367
Query: 115 ELP----PNLFWGEAKNLGTQKCLDSMGRTAPAVIGTSYCHGSGSSQ 157
EL ++ +G A GT CLD++G A V+G CH +G +Q
Sbjct: 368 ELRVPDHQDIAFG-ALQQGTN-CLDTLGHFADGVVGVYECHNAGGNQ 412
>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
Length = 520
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/124 (29%), Positives = 61/124 (49%), Gaps = 8/124 (6%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K+ + D+GD+SE+ ALR KL CK F W++++VA + + L N
Sbjct: 330 WMDDYKKFYYIHRLDLKEKDVGDLSERKALRQKLKCKSFKWYLENVAKNKFV----LDEN 385
Query: 120 L-FWGEAKNLGTQKCLDSMGRTAPAVIGTSYC---HGSGSSQLFRLNAEGQLGHGERCVD 175
+ +G +N ++ CLD++ R I S +G +Q+F L +G L C
Sbjct: 386 VAAFGSLRNPSSELCLDTLQRDEGEAIPLSVFPCQNGKSEAQIFSLTNDGILRRELTCAK 445
Query: 176 ADKQ 179
D+
Sbjct: 446 IDRD 449
>gi|66357692|ref|XP_626024.1| extracellular protein with a ricin domain and a family 2
glycosyltransferase domain [Cryptosporidium parvum Iowa
II]
gi|46227210|gb|EAK88160.1| extracellular protein with a ricin domain and a family 2
glycosyltransferase domain [Cryptosporidium parvum Iowa
II]
gi|448826186|gb|AAQ95175.2| UDP-N-acetyl-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase T3, partial
[Cryptosporidium parvum]
Length = 732
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 96/228 (42%), Gaps = 40/228 (17%)
Query: 54 LRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKF 113
LR W+ F QI + +P + +G + + L+ KL CK F WF+ +VA + Y
Sbjct: 526 LRTARVWMDEFYQITEMLAPNPN-IKLGSFDDMLHLKKKLKCKPFRWFLDNVAPETYIT- 583
Query: 114 PELPPNLFWGEAKNLGTQK-CLDSM-GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGE 171
+L LF GE +N CLDSM G+T IG +CHG +Q F ++ H +
Sbjct: 584 -QLDHLLFVGEIRNKKLDNICLDSMGGQTDGDKIGVFHCHGKKGTQAFMMS-----NHTQ 637
Query: 172 RCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNST 231
+ K+ ++C+ + +K C + N+ WR N
Sbjct: 638 QIRIVSKE-----------SYCIGSNLKYAACSNSEMTNV-------------WRLENEM 673
Query: 232 RQLEHAILKR-CLSVHPETN-----QLAMLRCDENNSYQQWRFKEVRP 273
+ K CLS+ + + + +L CD N+ Q W + +P
Sbjct: 674 IKANVEPNKYVCLSLTEDNDSSTKHKAELLDCDPNDPSQHWNVNKFKP 721
>gi|301783121|ref|XP_002926975.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Ailuropoda melanoleuca]
gi|281344477|gb|EFB20061.1| hypothetical protein PANDA_016676 [Ailuropoda melanoleuca]
Length = 632
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/235 (22%), Positives = 106/235 (45%), Gaps = 40/235 (17%)
Query: 43 QVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFM 102
QVI+ + L + ++F + + + + GD+S++ ++ +L CK+F+W++
Sbjct: 429 QVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRLQCKNFTWYL 488
Query: 103 KHVAYDVYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQL 158
+ + Y P+L P + G K++G CLD +G P ++ T CHG G +Q
Sbjct: 489 NTIYPEAYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQY 542
Query: 159 FRLNAEGQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCF 215
F +A+ ++ H E C+ A + LV++ C G + +
Sbjct: 543 FEYSAQHEIRHNIQRELCLHA--AQGLVQLRA-----CAYKGHRTV-------------- 581
Query: 216 NSSGTVDGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+SG + W + L + K CLS + E +++ C+ ++ Q+W F +
Sbjct: 582 -ASG--EQIWEIQKD-QLLYNPFFKMCLSANGE--HPSLVSCNPSDPLQKWIFSQ 630
>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
Length = 522
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/142 (26%), Positives = 69/142 (48%), Gaps = 10/142 (7%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A + GD+SE+ LR++L CK F+W++K++ +++ P
Sbjct: 325 RAAEVWMDEYKEHFYNRNPSARKENYGDLSERKILRERLKCKSFNWYLKNIFAELHV--P 382
Query: 115 ELPPNLFWGEAKNLG-TQKCLD-SMGRTAP--AVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++ G +CLD ++ P A + CHG G +Q F + ++
Sbjct: 383 EDRPG-WHGAIRSAGIASECLDYALPENHPTGAHLSLFGCHGQGGNQFFEYTSNKEIRFN 441
Query: 171 ---ERCVDADKQEKLVEMLKLP 189
E C + + E + M P
Sbjct: 442 SVTELCAEVPEHEDFITMRNCP 463
>gi|432934421|ref|XP_004081934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oryzias latipes]
Length = 758
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 84/199 (42%), Gaps = 49/199 (24%)
Query: 79 DMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAK--NLGTQKCLDS 136
D+G+++EQ+ LR KL CK+F W++++V +PEL L E N G ++CL +
Sbjct: 593 DIGNLTEQIELRKKLKCKNFKWYLENV-------YPELDAPLVKAEGLVFNQGLRRCL-A 644
Query: 137 MGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDADKQEKLVEMLKLPVTFCVLA 196
M + + + C + SQ F + G+ C+ Q+ L
Sbjct: 645 MFKDS---LSFETCDLNKQSQHFSYTWLRHVRQGDLCIAP--QQSL-------------- 685
Query: 197 GVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL-------EHAILKRCLSVHPET 249
++ +C N+ + W FH S++ E A CL
Sbjct: 686 ----------NSFSVVSCDNAKAELR--W-FHKSSKSTLADHFIAEFASQHMCLEAESRD 732
Query: 250 NQLAMLRCDENNSYQQWRF 268
+ L + CD NNS+Q+W+F
Sbjct: 733 DSLHLSPCDSNNSFQKWQF 751
>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
Length = 579
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 50/216 (23%), Positives = 87/216 (40%), Gaps = 41/216 (18%)
Query: 60 WLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPELPPN 119
W+ +K++ + D+GD++ + LR +L+CK F WF+ ++A KF
Sbjct: 392 WMDDYKRLYYMHREDLRTKDVGDLTSRHELRKRLNCKSFKWFLDNIA---KGKFIMDEDV 448
Query: 120 LFWGEAKNL--GTQKCLDSMGRTAPA--VIGTSYCHGSGSS-QLFRLNAEGQLGHGERCV 174
+ +G + GT+ C D++ R ++G +C G GSS QL L+ EG L C
Sbjct: 449 VAYGALHTVVSGTRMCTDTLQRDEKMSQLLGVFHCQGKGSSPQLMSLSREGNLRRENTCA 508
Query: 175 DADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTVDGPWRFHNSTRQL 234
+ V++ C N R+ + L
Sbjct: 509 SEEN-----------------GNVRMKTCSKKAQFNE--------------RWAYEKQML 537
Query: 235 EHAILKRCLSVH--PETNQLAMLRCDENNSYQQWRF 268
+ +CLS + ++ CDEN+ +Q+W F
Sbjct: 538 RNLKSGKCLSTANLKAGDDAVVIDCDENDPHQKWNF 573
>gi|403292275|ref|XP_003937178.1| PREDICTED: uncharacterized protein LOC101029503 [Saimiri boliviensis
boliviensis]
Length = 1450
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 66/145 (45%), Gaps = 12/145 (8%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 1221 RVAHIERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 1280
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR KL C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 1281 LALRQKLKCRSFKWYLE----NVYPEMRTYNNTLTYGEVRNSKASGYCLDQGAEDDDRAI 1336
Query: 146 GTSYCHGSGSSQLFRLNAEG--QLG 168
CHG SSQL R +A+G QLG
Sbjct: 1337 -LYPCHGM-SSQLVRYSADGLLQLG 1359
>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 697
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 14/106 (13%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCL 134
+D GD+SE+ ALR+KL C+ F W++ ++ +PEL L G+ ++ CL
Sbjct: 524 IDFGDVSERKALREKLKCRSFDWYLTNI-------YPELFIPSKALASGDIESAAGPHCL 576
Query: 135 DSM----GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
DS G VI CH G +Q + L+ ++ E C D+
Sbjct: 577 DSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDS 622
>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 654
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 52/106 (49%), Gaps = 14/106 (13%)
Query: 78 LDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFPEL---PPNLFWGEAKNLGTQKCL 134
+D GD+SE+ ALR+KL C+ F W++ ++ +PEL L G+ ++ CL
Sbjct: 524 IDFGDVSERKALREKLKCRSFDWYLTNI-------YPELFIPSKALASGDIESAAGPHCL 576
Query: 135 DSM----GRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHGERCVDA 176
DS G VI CH G +Q + L+ ++ E C D+
Sbjct: 577 DSPTPRNGDKKRTVIKIWPCHKQGGNQFWLLSPNNEIRRDEYCFDS 622
>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
glaber]
Length = 582
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 95/228 (41%), Gaps = 42/228 (18%)
Query: 55 RKRSSWLKVFKQIRSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYDVYTKFP 114
R W+ +K+ +P A GDISE+ LR +L CK F W++K+V +++ P
Sbjct: 385 RAAEVWMDDYKEHFYNRNPPARKEAYGDISERKLLRKQLRCKSFDWYLKNVFSNLHV--P 442
Query: 115 ELPPNLFWGEAKNLG-TQKCLD---SMGRTAPAVIGTSYCHGSGSSQLFRLNAEGQLGHG 170
E P + G ++LG + +CLD A + CHG G +Q F + ++
Sbjct: 443 EDRPG-WHGAIRSLGISSECLDYNSPDNNPTGANLSLFGCHGQGGNQFFEYTSNKEIRFN 501
Query: 171 ---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGT--VLNLFNCFNSSGTVDGPW 225
E C + +Q+ V M P R G+ +N+ F GT+ P
Sbjct: 502 SVTELCAEVPEQKTYVGMQTCP--------------RDGSPAPVNIIWHFKEDGTIFHP- 546
Query: 226 RFHNSTRQLEHAILKRCLSVHPETN---QLAMLRCDENNSYQQWRFKE 270
H+ L CLS + + M C+ + Q WRF++
Sbjct: 547 ----------HSGL--CLSAYRTAEGRPDVQMRTCEALDKNQIWRFEK 582
>gi|291412932|ref|XP_002722732.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Oryctolagus cuniculus]
Length = 535
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 64/145 (44%), Gaps = 10/145 (6%)
Query: 30 QLGHGERCVDADKQVISTQKSSLELRKRSSWLKVFKQIRSLGSPLAMF---LDMGDISEQ 86
++ H ER I LR W+ FK + + M +D GD+SE+
Sbjct: 306 RVAHIERSRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMTNPGVDFGDVSER 365
Query: 87 VALRDKLHCKDFSWFMKHVAYDVYTKFPELPPNLFWGEAKN-LGTQKCLDSMGRTAPAVI 145
+ALR +L C+ F W+++ +VY + L +GE +N + CLD I
Sbjct: 366 LALRRRLKCRSFQWYLE----NVYPEMRVYSDTLAYGEVRNSKASGYCLDQGSEDDDQAI 421
Query: 146 GTSYCHGSGSSQLFRLNAEGQLGHG 170
CHG SSQL R + +G L G
Sbjct: 422 LYP-CHGM-SSQLVRYSTDGLLQLG 444
>gi|350593559|ref|XP_003133495.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Sus
scrofa]
Length = 633
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 53/229 (23%), Positives = 100/229 (43%), Gaps = 44/229 (19%)
Query: 53 ELRKRSSWLKVFKQI----RSLGSPLAMFLDMGDISEQVALRDKLHCKDFSWFMKHVAYD 108
++R W+ +K+I + + + GD+S++ ++ +L CK+F+W++ ++ +
Sbjct: 436 QVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKRRLQCKNFTWYLNNIYPE 495
Query: 109 VYTKFPELPPNLFWGEAKNLGTQKCLDSMGRT----APAVIGTSYCHGSGSSQLFRLNAE 164
Y P+L P + G K++G CLD +G P ++ T CHG G +Q F + +
Sbjct: 496 AYV--PDLNP-VISGYIKSVGQPLCLD-VGENNQGGKPLILYT--CHGLGGNQYFEYSVQ 549
Query: 165 GQLGHG---ERCVDADKQEKLVEMLKLPVTFCVLAGVKLIFCRLGTVLNLFNCFNSSGTV 221
++ H E C+ A + V LK C G K + T
Sbjct: 550 HEIRHNIQKELCLHAAQG---VVQLKT----CAYKGHKTV-----------------ATG 585
Query: 222 DGPWRFHNSTRQLEHAILKRCLSVHPETNQLAMLRCDENNSYQQWRFKE 270
+ W + L + K CLS E +++ C+ ++ Q+W F +
Sbjct: 586 EQIWEIQKD-QLLYNPFFKMCLSASGE--HPSLVSCNPSDPLQKWIFSQ 631
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.135 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,392,518,095
Number of Sequences: 23463169
Number of extensions: 172500921
Number of successful extensions: 344855
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1338
Number of HSP's successfully gapped in prelim test: 389
Number of HSP's that attempted gapping in prelim test: 340991
Number of HSP's gapped (non-prelim): 3150
length of query: 277
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 137
effective length of database: 9,074,351,707
effective search space: 1243186183859
effective search space used: 1243186183859
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 76 (33.9 bits)