BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy736
(304 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
[Tribolium castaneum]
gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
Length = 602
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 209/293 (71%), Positives = 224/293 (76%), Gaps = 12/293 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTSIVIVFHNEAWSTLLRTVWSVINRSPR LLKEIILVDDASER +
Sbjct: 143 CKDKKYPKLLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRPLLKEIILVDDASEREHLGR 202
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA- 114
+ + + I G +H K V +D + T E + A
Sbjct: 203 KLEEYVQTLPVPVIVLRTHKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLAR 262
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KTVVCPIIDVISD+TFEYITASDMTWGGFNWKLNFRWYRVP REM RR DR++P
Sbjct: 263 IVQDRKTVVCPIIDVISDETFEYITASDMTWGGFNWKLNFRWYRVPQREMERRNNDRTAP 322
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
LRTPTMAGGLF+IDK+YFYELGSYDEGMDIWGGENLEMSFRVWQCGG LEIIPCSHVGHV
Sbjct: 323 LRTPTMAGGLFSIDKEYFYELGSYDEGMDIWGGENLEMSFRVWQCGGKLEIIPCSHVGHV 382
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAA 281
FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG +S V +A
Sbjct: 383 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGARSVPVGDVSA 435
>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Nasonia vitripennis]
Length = 635
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/283 (72%), Positives = 222/283 (78%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K +P LP TSIVIVFHNEAWSTLLRTVWSVINRSPR LLKEIILVDDASER +
Sbjct: 174 CKSKKFPKLLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRALLKEIILVDDASEREHLKQ 233
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
+ D + Y+ ++ G +L +H K V +D + T E + A
Sbjct: 234 KLEDYVETLPVPTYVYRTEKRSGLIRARLL-GAKHVKGQVITFLDAHCECTEGWLEPLLA 292
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEYITASDMTWGGFNWKLNFRWYRV REM RR GDR++
Sbjct: 293 RIAHDKKTVVCPIIDVISDDTFEYITASDMTWGGFNWKLNFRWYRVAQREMDRRNGDRTA 352
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
PLRTPTMAGGLF+IDKDYFYELG+YDEGMDIWGGENLEMSFRVWQCGGILEI PCSHVGH
Sbjct: 353 PLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGILEISPCSHVGH 412
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG
Sbjct: 413 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 455
>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Megachile rotundata]
Length = 605
Score = 396 bits (1017), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/283 (71%), Positives = 221/283 (78%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE+ +
Sbjct: 151 CKTKKYPKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 210
Query: 59 PIIDVISDQTF-EYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
+ D + Y+ ++ G +L +H K V +D + T E + A
Sbjct: 211 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGA-KHVKGQVITFLDAHCECTEGWLEPLLA 269
Query: 115 K------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLNFRWYRV REM RR GDR++
Sbjct: 270 RIAENRSTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTA 329
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
PLRTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMSFRVWQCGG LEI PCSHVGH
Sbjct: 330 PLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGH 389
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFRDKSPYTFPGGVSK+VLHNAARVAEVWMDEWRDFYYAMNPG
Sbjct: 390 VFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAMNPG 432
>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
Length = 592
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 198/288 (68%), Positives = 223/288 (77%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CKKK YPT LPTTSIVIVFHNEAWSTLLRT+WSVINRSPR LLKEIILVDDASER
Sbjct: 130 CKKKHYPTKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASER----- 184
Query: 61 IDVISDQTFEYITASD-----MTWGGFNWKLREK---NRHKKTVVCPIIDVISDQT---F 109
D + Q +Y+ + G + +R + +H K V +D + T
Sbjct: 185 -DHLGQQLEDYVQTLPVHTYVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWL 243
Query: 110 EYITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A KTVVCPIIDVISD+TFEY+TASD TWGGFNWKLNFRWYRVP REM RR
Sbjct: 244 EPLLARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRRN 303
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
DR++PLRTPTMAGGLF+ID+DYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILEI PC
Sbjct: 304 HDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPC 363
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFRDKSPYTFPGGV+ IVL NAARVAEVW+DEW++FYY M+PG
Sbjct: 364 SHVGHVFRDKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFYYQMSPG 411
>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
echinatior]
Length = 605
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 201/283 (71%), Positives = 223/283 (78%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K Y +LP TSIVIVFHNEAW+TLLRTVWSVINRSPR+LLKEIILVDDASER +
Sbjct: 151 CKNKKYLKYLPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKQ 210
Query: 59 PIID-VISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
+ D VI+ Y+ ++ G +L +H K V +D + T E + +
Sbjct: 211 DLEDYVITLPVPTYVYRTEKRSGLIRARLLGA-KHVKGQVITFLDAHCECTEGWLEPLLS 269
Query: 115 K------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ TVVCPIIDVISD TFEYI+ASDMTWGGFNWKLNFRWYRV REM RR DR++
Sbjct: 270 RIANDRHTVVCPIIDVISDDTFEYISASDMTWGGFNWKLNFRWYRVAQREMDRRNSDRTA 329
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
PLRTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMSFRVWQCGG LEI PCSHVGH
Sbjct: 330 PLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGH 389
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG
Sbjct: 390 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 432
>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 199/303 (65%), Positives = 227/303 (74%), Gaps = 23/303 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CKKK YP LPTTSIVIVFHNEAWSTLLRT+WSVINRSPR LLKEIILVDDASER
Sbjct: 133 CKKKHYPAKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASER----- 187
Query: 61 IDVISDQTFEYITASD-----MTWGGFNWKLREK---NRHKKTVVCPIIDVISDQT---F 109
+ + Q EY+ + G + +R + +H K V +D + T
Sbjct: 188 -EHLGRQLEEYVRTLPVPTFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWL 246
Query: 110 EYITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A KTVVCPIIDVISD+TFEY+TASD TWGGFNWKLNFRWYRVP REM RR
Sbjct: 247 EPLLARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRRN 306
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
DR++PLRTPTMAGGLF+ID+DYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILEI PC
Sbjct: 307 HDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEISPC 366
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCAAHF 283
SHVGHVFRDKSPYTFPGGV+ IVL NAARVAEVW+DEW++FYY M+PG + + +
Sbjct: 367 SHVGHVFRDKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFYYQMSPGARKASAGDVSER 426
Query: 284 RML 286
R L
Sbjct: 427 RAL 429
>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
saltator]
Length = 605
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/283 (70%), Positives = 221/283 (78%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K Y +LP TSIVIVFHNEAW+TLLRTVWSVINRSPR+LLKE+ILVDDASER +
Sbjct: 151 CKNKKYNKYLPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEVILVDDASERDHLKQ 210
Query: 59 PIIDVISDQTF-EYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
+ D I+ Y+ ++ G +L +H K V +D + T E + +
Sbjct: 211 DLEDYIATLPVPTYVYRTEKRSGLIRARLLGA-KHVKGQVITFLDAHCECTEGWLEPLLS 269
Query: 115 K------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLNFRWYRV REM RR DR++
Sbjct: 270 RIANDRHTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRNSDRTA 329
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
PLRTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMSFRVWQCGG LEI PCSHVGH
Sbjct: 330 PLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGH 389
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG
Sbjct: 390 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 432
>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 600
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 198/303 (65%), Positives = 228/303 (75%), Gaps = 23/303 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CKKK Y LPTTSIVIVFHNEAWSTLLRT+WSVINRSPR LLKEIILVDDASER
Sbjct: 135 CKKKHYSAKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASER----- 189
Query: 61 IDVISDQTFEYITASDMTW-----GGFNWKLREK---NRHKKTVVCPIIDVISDQT---F 109
D + Q +Y++ ++ G + +R + +H K V +D + T
Sbjct: 190 -DHLGKQLEDYVSTLPVSTFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWL 248
Query: 110 EYITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A KTVVCPIIDVISD+TFEY+TASD TWGGFNWKLNFRWYRVP REM RR
Sbjct: 249 EPLLARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPSREMQRRN 308
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
DR++PLRTPTMAGGLF+ID+DYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILEI PC
Sbjct: 309 HDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPC 368
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCAAHF 283
SHVGHVFRDKSPYTFPGGV+ IVL NAARVAEVW+DEW++FYY M+PG + + +
Sbjct: 369 SHVGHVFRDKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFYYQMSPGARKASAGDVSER 428
Query: 284 RML 286
R L
Sbjct: 429 RAL 431
>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Acyrthosiphon pisum]
Length = 615
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 199/305 (65%), Positives = 217/305 (71%), Gaps = 53/305 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YPT +PTTSIVIVFHNEAWSTLLRTVWSVINRSPR+LLKEI+LVDDASER
Sbjct: 160 CKSKQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGK 219
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ ++ Q ++ A + G L
Sbjct: 220 KLEDYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLAR 279
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++KTVVCP+IDVISD T FEY+TASDMTWGGFNWKLN
Sbjct: 280 IVLNRKTVVCPVIDVISDDT---------------------FEYVTASDMTWGGFNWKLN 318
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFY+LGSYDEGMDIWGGENLEMS
Sbjct: 319 FRWYRVPQREMTRRNQDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMS 378
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG LEIIPCSHVGHVFRDKSPY+FPGGVSKIVLHNAARVAEVWMDEWRDFYYAM
Sbjct: 379 FRVWQCGGTLEIIPCSHVGHVFRDKSPYSFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 438
Query: 269 NPGKS 273
NPG S
Sbjct: 439 NPGAS 443
>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
Length = 626
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/288 (67%), Positives = 221/288 (76%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YPT LPTTS+VIVFHNEAW+TLLRT+WS INRSPR LLKEIILVDDASE+
Sbjct: 170 CKAKRYPTLLPTTSVVIVFHNEAWTTLLRTIWSTINRSPRPLLKEIILVDDASEK----- 224
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQT---F 109
+ + + EYI ++ F + R +H K V +D + T
Sbjct: 225 -EHLGKKLEEYIKTLPVSTRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWL 283
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + ++ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLNFRWYRVP REM RRG
Sbjct: 284 EPLLSRIVEDRSTVVCPIIDVISDTTFEYIQASDMTWGGFNWKLNFRWYRVPEREMQRRG 343
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR++PLRTPTMAGGLFAID++YFY++GSYDEGMDIWGGENLEMSFRVWQCGG+LEI+PC
Sbjct: 344 GDRTAPLRTPTMAGGLFAIDREYFYKIGSYDEGMDIWGGENLEMSFRVWQCGGVLEIVPC 403
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFRDKSPY+FPGGV +VL NAARVAEVWMDEW +FYYAMNPG
Sbjct: 404 SHVGHVFRDKSPYSFPGGVQAVVLKNAARVAEVWMDEWGEFYYAMNPG 451
>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
Length = 632
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/303 (65%), Positives = 223/303 (73%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ K YP+ LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR+LLKEIILVDDASER +
Sbjct: 179 CRHKHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGK 238
Query: 59 PIIDVISD---QTF------------------EYITASDMTW---------GGFNWKLRE 88
+ D ++ +TF E++T +T+ G L
Sbjct: 239 QLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLAR 298
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++++TVVCPIIDVISD+T FEYITASD TWGGFNWKLN
Sbjct: 299 IVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGGFNWKLN 337
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGGENLEMS
Sbjct: 338 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 397
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWRDFYYAM
Sbjct: 398 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWRDFYYAM 457
Query: 269 NPG 271
+ G
Sbjct: 458 STG 460
>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 605
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 193/297 (64%), Positives = 217/297 (73%), Gaps = 12/297 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK K Y LPTTS+VIVFHNEAWSTLLRTVWSVINRSP+ L+KEIILVDDAS +
Sbjct: 148 CKTKKYFELLPTTSVVIVFHNEAWSTLLRTVWSVINRSPKPLIKEIILVDDASVQPHLGK 207
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA- 114
+ + + G +H K V +D + T E + A
Sbjct: 208 KLENYVKTLPVPVTVLRTPKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLAR 267
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KTVVCPIIDVISD+TFEYITASD TWGGFNW+LNFRWYRVP REM RR D++ P
Sbjct: 268 ITEDRKTVVCPIIDVISDETFEYITASDTTWGGFNWRLNFRWYRVPKREMDRRNNDKTVP 327
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+RTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMSFRVWQCGG LEI+PCSHVGHV
Sbjct: 328 IRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEIVPCSHVGHV 387
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAHFRM 285
FRDKSPYTFPGGVS+IVLHNA RVAEVWMDEWRDFYYAMNPG K V + ++
Sbjct: 388 FRDKSPYTFPGGVSQIVLHNANRVAEVWMDEWRDFYYAMNPGAKKIEVGDITSRLKL 444
>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
Length = 630
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 195/309 (63%), Positives = 215/309 (69%), Gaps = 65/309 (21%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 177 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 231
Query: 61 IDVISDQTFEYITA--------------------------------------SDMTWGGF 82
D + Q EY+ + T G
Sbjct: 232 -DFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWL 290
Query: 83 NWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG 142
L ++++TVVCPIIDVISD+T FEYITASD TWGG
Sbjct: 291 EPLLARIVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGG 329
Query: 143 FNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGG 202
FNWKLNFRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYELGSYDEGMDIWGG
Sbjct: 330 FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYELGSYDEGMDIWGG 389
Query: 203 ENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWR 262
ENLEMSFR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWR
Sbjct: 390 ENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWR 449
Query: 263 DFYYAMNPG 271
DFYY+M+ G
Sbjct: 450 DFYYSMSTG 458
>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
Length = 633
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 196/303 (64%), Positives = 222/303 (73%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ K YP+ LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR+LLKEIILVDDASER +
Sbjct: 180 CRHKHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGK 239
Query: 59 PIIDVISD---QTF------------------EYITASDMTW---------GGFNWKLRE 88
+ D ++ +TF E++T +T+ G L
Sbjct: 240 QLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLAR 299
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++++TVVCPIIDVISD T FEYITASD TWGGFNWKLN
Sbjct: 300 IVQNRRTVVCPIIDVISDDT---------------------FEYITASDSTWGGFNWKLN 338
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRVP REM RR DR++PLRTPTMAGGLF+IDK+YFYE+GSYDEGMDIWGGENLEMS
Sbjct: 339 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKEYFYEIGSYDEGMDIWGGENLEMS 398
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWRDFYYAM
Sbjct: 399 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWRDFYYAM 458
Query: 269 NPG 271
+ G
Sbjct: 459 STG 461
>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
Length = 630
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/309 (62%), Positives = 215/309 (69%), Gaps = 65/309 (21%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 177 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 231
Query: 61 IDVISDQTFEYITA--------------------------------------SDMTWGGF 82
D + Q EY+ + T G
Sbjct: 232 -DFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWL 290
Query: 83 NWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG 142
L ++++TVVCPIIDVISD+T FEYITASD TWGG
Sbjct: 291 EPLLARIVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGG 329
Query: 143 FNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGG 202
FNWKLNFRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGG
Sbjct: 330 FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGG 389
Query: 203 ENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWR 262
ENLEMSFR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWR
Sbjct: 390 ENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWR 449
Query: 263 DFYYAMNPG 271
DFYY+M+ G
Sbjct: 450 DFYYSMSTG 458
>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 617
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/309 (62%), Positives = 215/309 (69%), Gaps = 65/309 (21%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 164 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 218
Query: 61 IDVISDQTFEYITA--------------------------------------SDMTWGGF 82
D + Q EY+ + T G
Sbjct: 219 -DFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWL 277
Query: 83 NWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG 142
L ++++TVVCPIIDVISD+T FEYITASD TWGG
Sbjct: 278 EPLLARIVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGG 316
Query: 143 FNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGG 202
FNWKLNFRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGG
Sbjct: 317 FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGG 376
Query: 203 ENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWR 262
ENLEMSFR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWR
Sbjct: 377 ENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWR 436
Query: 263 DFYYAMNPG 271
DFYY+M+ G
Sbjct: 437 DFYYSMSTG 445
>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
Length = 536
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/309 (62%), Positives = 215/309 (69%), Gaps = 65/309 (21%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 83 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 137
Query: 61 IDVISDQTFEYITA--------------------------------------SDMTWGGF 82
D + Q EY+ + T G
Sbjct: 138 -DFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWL 196
Query: 83 NWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG 142
L ++++TVVCPIIDVISD+T FEYITASD TWGG
Sbjct: 197 EPLLARIVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGG 235
Query: 143 FNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGG 202
FNWKLNFRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGG
Sbjct: 236 FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGG 295
Query: 203 ENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWR 262
ENLEMSFR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWR
Sbjct: 296 ENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWR 355
Query: 263 DFYYAMNPG 271
DFYY+M+ G
Sbjct: 356 DFYYSMSTG 364
>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
Length = 639
Score = 379 bits (973), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/282 (67%), Positives = 212/282 (75%), Gaps = 11/282 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ KSYP LPTTSIVIVFHNEAWSTLLRTVWS+I RSPR LL EIILVDDASER +
Sbjct: 180 CRDKSYPGLLPTTSIVIVFHNEAWSTLLRTVWSIITRSPRELLAEIILVDDASERDYLGK 239
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA- 114
+ D +++ G + K V +D + T E + A
Sbjct: 240 ELEDHVANFPVPVHVLRTHKRSGLIRARLIGAKQVKGQVITFLDAHCECTEGWLEPLLAR 299
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
K VVCPIIDVISD++FEY+TASDMTWGGFNWKLNFRWYRVP REM RR GDR+ P
Sbjct: 300 VAENRKIVVCPIIDVISDESFEYVTASDMTWGGFNWKLNFRWYRVPQREMDRRNGDRTQP 359
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
LRTPTMAGGLF+IDKDYF E+G+YDEGMDIWGGENLEMSFRVWQCGG LEIIPCSHVGHV
Sbjct: 360 LRTPTMAGGLFSIDKDYFEEIGTYDEGMDIWGGENLEMSFRVWQCGGELEIIPCSHVGHV 419
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
FRDKSPY+FPGGV+KIV NAARVAEVWMD W+DF+Y MNPG
Sbjct: 420 FRDKSPYSFPGGVAKIVNKNAARVAEVWMDRWKDFFYEMNPG 461
>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
floridanus]
Length = 608
Score = 379 bits (973), Expect = e-103, Method: Compositional matrix adjust.
Identities = 196/303 (64%), Positives = 210/303 (69%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP +LP TSIVIVFHNEAW+TLLRTVWSVINRSPR+LLKEIILVDDASER
Sbjct: 154 CKNKKYPKYLPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKK 213
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ + Q ++ A + T G L
Sbjct: 214 ELEKHITELPVPTYVYRTEKRSGLIRARLLGAKYVKGQVITFLDAHCECTEGWLEPLLSR 273
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 274 IANDRHTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 312
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMS
Sbjct: 313 FRWYRVAQREMDRRNGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 372
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG LEI CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM
Sbjct: 373 FRVWQCGGTLEISSCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 432
Query: 269 NPG 271
NPG
Sbjct: 433 NPG 435
>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
Length = 630
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 194/303 (64%), Positives = 217/303 (71%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 177 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGK 236
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ + +S ++ A + T G L
Sbjct: 237 QLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLAR 296
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++++TVVCPIIDVISD+T FEYITASD TWGGFNWKLN
Sbjct: 297 IVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGGFNWKLN 335
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGGENLEMS
Sbjct: 336 FRWYRVPSREMSRRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWRDFYYAM
Sbjct: 396 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWRDFYYAM 455
Query: 269 NPG 271
+ G
Sbjct: 456 STG 458
>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus terrestris]
gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus terrestris]
Length = 606
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 198/318 (62%), Positives = 217/318 (68%), Gaps = 54/318 (16%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE+
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ ++ Q ++ A + T G L
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 272 IAEDRTTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 310
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDKDYFYELG+YDEGMDIWGGENLEMS
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG LEI PCSHVGHVFRDKSPYTFPGGVSK+VLHNAARVAEVWMDEWRDFYYAM
Sbjct: 371 FRVWQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAM 430
Query: 269 NPG-KSASVSTCAAHFRM 285
NPG +S +V + ++
Sbjct: 431 NPGARSVAVGDVSERIKL 448
>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
Length = 621
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 193/303 (63%), Positives = 218/303 (71%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C+ K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR+LLKEIILVDDASER
Sbjct: 168 CRHKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGK 227
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ + ++ + ++ A + T G L
Sbjct: 228 QLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVAGEVITFLDAHCECTEGWLEPLLAR 287
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++++TVVCPIIDVISD+T FEYITASD TWGGFNWKLN
Sbjct: 288 IVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGGFNWKLN 326
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGGENLEMS
Sbjct: 327 FRWYRVPQREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 386
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWRDFYYAM
Sbjct: 387 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWRDFYYAM 446
Query: 269 NPG 271
+ G
Sbjct: 447 STG 449
>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
Length = 630
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 194/303 (64%), Positives = 217/303 (71%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 177 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGK 236
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ + +S ++ A + T G L
Sbjct: 237 QLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLAR 296
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++++TVVCPIIDVISD+T FEYITASD TWGGFNWKLN
Sbjct: 297 IVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGGFNWKLN 335
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGGENLEMS
Sbjct: 336 FRWYRVPSREMSRRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMS 395
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVW+DEWRDFYYAM
Sbjct: 396 FRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWLDEWRDFYYAM 455
Query: 269 NPG 271
+ G
Sbjct: 456 STG 458
>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus impatiens]
Length = 606
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 196/303 (64%), Positives = 210/303 (69%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE+
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ ++ Q ++ A + T G L
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 272 IAEDRTTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 310
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDKDYFYELG+YDEGMDIWGGENLEMS
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG LEI PCSHVGHVFRDKSPYTFPGGVSK+VLHNAARVAEVWMDEWRDFYYAM
Sbjct: 371 FRVWQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAM 430
Query: 269 NPG 271
NPG
Sbjct: 431 NPG 433
>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 3 [Bombus impatiens]
Length = 607
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 196/309 (63%), Positives = 213/309 (68%), Gaps = 53/309 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE+
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ ++ Q ++ A + T G L
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 272 IAEDRTTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 310
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDKDYFYELG+YDEGMDIWGGENLEMS
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG LEI PCSHVGHVFRDKSPYTFPGGVSK+VLHNAARVAEVWMDEWRDFYYAM
Sbjct: 371 FRVWQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAM 430
Query: 269 NPGKSASVS 277
NP + +V+
Sbjct: 431 NPEGARNVA 439
>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis florea]
Length = 605
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 196/303 (64%), Positives = 209/303 (68%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
CK K Y +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE
Sbjct: 151 CKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 210
Query: 55 -------RVVCPI------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
R+ P + Q ++ A + T G L
Sbjct: 211 DLEHYVKRLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 270
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 271 IAEDRTTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 309
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMS
Sbjct: 310 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 369
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG LEI PCSHVGHVFRDKSPYTFPGGVSK+VLHNAARVAEVWMDEWRDFYYAM
Sbjct: 370 FRVWQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAM 429
Query: 269 NPG 271
NPG
Sbjct: 430 NPG 432
>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis mellifera]
Length = 606
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/303 (64%), Positives = 209/303 (68%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE+
Sbjct: 152 CKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ + Q ++ A + T G L
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 271
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 272 IAEDRTTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 310
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMS
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 370
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG LEI PCSHVGHVFRDKSPYTFPGGVSK+VLHNAARVAEVWMDEWRDFYYAM
Sbjct: 371 FRVWQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAM 430
Query: 269 NPG 271
NPG
Sbjct: 431 NPG 433
>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
Length = 675
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 196/337 (58%), Positives = 222/337 (65%), Gaps = 76/337 (22%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K YP+ LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 177 CRRKHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 231
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------HKKTVVCPIIDVISDQT 108
D + Q +Y+ + + LR + R H V +D + T
Sbjct: 232 -DFLGKQLEDYVAKLPVK----TFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECT 286
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
++ +TVVCPIIDVISD+TFEYITASD TWGGFNWKLNFRWYRVP REM
Sbjct: 287 EGWLEPLLARIVQNRRTVVCPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREM 346
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILE
Sbjct: 347 ARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILE 406
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWM--------------------- 258
IIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVWM
Sbjct: 407 IIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGVLEIAPCSRVGHVFRKST 466
Query: 259 ------------------------DEWRDFYYAMNPG 271
D+W++FYY+ PG
Sbjct: 467 PYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPG 503
>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
Length = 675
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 196/337 (58%), Positives = 221/337 (65%), Gaps = 76/337 (22%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 177 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 231
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------HKKTVVCPIIDVISDQT 108
D + Q EY+ + + LR + R H V +D + T
Sbjct: 232 -DFLGKQLEEYVAKLPVK----TFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECT 286
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
++ +TVVCPIIDVISD+TFEYITASD TWGGFNWKLNFRWYRVP REM
Sbjct: 287 EGWLEPLLARIVQNRRTVVCPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREM 346
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILE
Sbjct: 347 ARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILE 406
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWM--------------------- 258
IIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVWM
Sbjct: 407 IIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGVLEIAPCSRVGHVFRKST 466
Query: 259 ------------------------DEWRDFYYAMNPG 271
D+W++FYY+ PG
Sbjct: 467 PYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPG 503
>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
Length = 594
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/269 (69%), Positives = 205/269 (76%), Gaps = 11/269 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CKKK YP LPTTSIVIVFHNEAWSTLLRT+WSVINRSPR LLKEIILVDDASER +
Sbjct: 140 CKKKHYPAKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGR 199
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA- 114
+ D + I + G +H K V +D + T E + A
Sbjct: 200 QLEDYVKTLPVSTIVLRTVKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLAR 259
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KTVVCPIIDVISD+TFEY+TASD TWGGFNWKLNFRWYRVP REM RR DR++P
Sbjct: 260 IVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRRNHDRTAP 319
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
LRTPTMAGGLF+ID+DYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILEI PCSHVGHV
Sbjct: 320 LRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHV 379
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWM 258
FRDKSPYTFPGGV+ IVL NAARVAEVWM
Sbjct: 380 FRDKSPYTFPGGVANIVLKNAARVAEVWM 408
>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
Length = 677
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 194/337 (57%), Positives = 222/337 (65%), Gaps = 76/337 (22%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR+LLKEIILVDDASER
Sbjct: 179 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASER----- 233
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------HKKTVVCPIIDVISDQT 108
D + + +Y+ + + LR + R H V +D + T
Sbjct: 234 -DFLGKKLEDYVAKLPVR----TFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECT 288
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
++ +TVVCPIIDVISD+TFEYITASD TWGGFNWKLNFRWYRVP REM
Sbjct: 289 EGWLEPLLARIVQNRRTVVCPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREM 348
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILE
Sbjct: 349 ARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILE 408
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWM--------------------- 258
IIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARVAEVWM
Sbjct: 409 IIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGILEIAPCSRVGHVFRKST 468
Query: 259 ------------------------DEWRDFYYAMNPG 271
D+W++FYY+ PG
Sbjct: 469 PYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPG 505
>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Acyrthosiphon pisum]
Length = 615
Score = 366 bits (940), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 186/305 (60%), Positives = 208/305 (68%), Gaps = 53/305 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YPT +PTTSIVIVFHNEAWSTLLRTVWSVINRSPR+LLKEI+LVDDASER
Sbjct: 160 CKSKQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGK 219
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ ++ Q ++ A + G L
Sbjct: 220 KLEDYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLAR 279
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++KTVVCP+IDVISD T FEY+TASDMTWGGFNWKLN
Sbjct: 280 IVLNRKTVVCPVIDVISDDT---------------------FEYVTASDMTWGGFNWKLN 318
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFY+LGSYDEGMDIWGGENLEMS
Sbjct: 319 FRWYRVPQREMTRRNQDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMS 378
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+W CGG LEI PCSHVGHVFR +PYTFPGG S IV HN AR+AEVWMDEW+ FYYA+
Sbjct: 379 FRIWMCGGTLEISPCSHVGHVFRKSTPYTFPGGTSHIVNHNNARLAEVWMDEWKHFYYAI 438
Query: 269 NPGKS 273
NPG S
Sbjct: 439 NPGAS 443
>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 460
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 183/303 (60%), Positives = 214/303 (70%), Gaps = 23/303 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVI SPR LL+EIILVDDASER
Sbjct: 7 CKDKVYPEKLPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASER----- 61
Query: 61 IDVISDQTFEYITASD-----MTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------F 109
+ + Q +Y+ D M G + +R + V +I +
Sbjct: 62 -EHLGKQLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWL 120
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCP+IDVISD+TFEYI+ASD+TWGGFNWKLNFRWYRVP RE+ RRG
Sbjct: 121 EPLLARIAEDRTRVVCPVIDVISDETFEYISASDLTWGGFNWKLNFRWYRVPQRELDRRG 180
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLFAIDKDYF ELG YDEGMDIWGGENLE+SFR+W CGG LEI+PC
Sbjct: 181 GDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPC 240
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCAAHF 283
SHVGHVFR +PYTFPGG SKIV HN AR+AEVW+DEW++FY+A+NP +H
Sbjct: 241 SHVGHVFRKSTPYTFPGGTSKIVNHNNARLAEVWLDEWKEFYFAINPAAKNVDKGDLSHR 300
Query: 284 RML 286
R L
Sbjct: 301 RNL 303
>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
Length = 495
Score = 357 bits (917), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 182/290 (62%), Positives = 211/290 (72%), Gaps = 26/290 (8%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K+YP LPTTS+VIVFHNEAWSTLLRTV SVINRSP LLKEIILVDDASE+
Sbjct: 46 CKSKTYPVESLPTTSVVIVFHNEAWSTLLRTVHSVINRSPPPLLKEIILVDDASEK---- 101
Query: 60 IIDVISDQTFEYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT-- 108
D + Q EY++ + G +L+ R + V+ +D + T
Sbjct: 102 --DFLGRQLDEYLSKLSVHVYVLRMEKRTGLIRARLKGAARAEGKVIT-FLDAHCECTEG 158
Query: 109 ------FE-YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
FE + K+VVCPIIDVISD+TFEYIT SDMTWGGFNWKLNFRWY VP RE+ R
Sbjct: 159 WLEPLLFEIHKNRKSVVCPIIDVISDETFEYITGSDMTWGGFNWKLNFRWYPVPQREVER 218
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
RGGDRS PLR+PTMAGGL AI++DYFYE+GSYD+GMDIWGGENLEMSFR+W CGG L I+
Sbjct: 219 RGGDRSLPLRSPTMAGGLLAIERDYFYEIGSYDDGMDIWGGENLEMSFRIWMCGGTLLIV 278
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
CSHVGHVFR +PYTFPGG +I+ HN AR+AEVWMDEWR FYY +NPG
Sbjct: 279 TCSHVGHVFRKATPYTFPGGTGRIINHNNARLAEVWMDEWRSFYYKINPG 328
>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 621
Score = 356 bits (914), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 177/288 (61%), Positives = 210/288 (72%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVI SPR LL+EIILVDDASER
Sbjct: 168 CKDKVYPEKLPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASER----- 222
Query: 61 IDVISDQTFEYITASD-----MTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------F 109
+ + + +Y+ + M G + +R + V +I +
Sbjct: 223 -EHLGKKLEDYVVKLEVPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQHWL 281
Query: 110 EYITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCP+IDVISD+TFEYI+ASDMTWGGFNWKLNFRWYRVP RE+ RRG
Sbjct: 282 EPLLARIAEDRTRVVCPVIDVISDETFEYISASDMTWGGFNWKLNFRWYRVPQREVERRG 341
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLF+IDKDYF ELG YDEGMDIWGGENLE+SFR+W CGG LEI+PC
Sbjct: 342 GDRTLPIRTPTMAGGLFSIDKDYFNELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPC 401
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFR +PY+FPGG S+IV HN AR+AEVW+DEW+DFY+A+NP
Sbjct: 402 SHVGHVFRKSTPYSFPGGTSRIVNHNNARLAEVWLDEWKDFYFAINPA 449
>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
ricinus]
Length = 582
Score = 355 bits (911), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 181/303 (59%), Positives = 212/303 (69%), Gaps = 23/303 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LPTTS+ IVFHNEAWSTLLRTV SVI SPR LL+EIILVDDASER
Sbjct: 129 CKDKVYPEKLPTTSVDIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASER----- 183
Query: 61 IDVISDQTFEYITASD-----MTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------F 109
+ + Q +Y+ D M G + +R + V +I +
Sbjct: 184 -EHLGKQLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWL 242
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCP+IDVISD+TFEYI+ASD+TWGGFNWKLNFR YRVP RE+ RRG
Sbjct: 243 EPLLARIAEDRTRVVCPVIDVISDETFEYISASDLTWGGFNWKLNFRGYRVPQRELDRRG 302
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLFAIDKDYF ELG YDEGMDIWGGENLE+SFR+W CGG LEI+PC
Sbjct: 303 GDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPC 362
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCAAHF 283
SHVGHVFR +PYTFPGG SKIV HN AR+AEVW+DEW++FY+A+NP +H
Sbjct: 363 SHVGHVFRKSTPYTFPGGTSKIVNHNNARLAEVWLDEWKEFYFAINPAAKNVDKGDLSHR 422
Query: 284 RML 286
R L
Sbjct: 423 RNL 425
>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
Length = 541
Score = 352 bits (903), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 190/351 (54%), Positives = 213/351 (60%), Gaps = 107/351 (30%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 46 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 100
Query: 61 IDVISDQTFEYITA--------------------------------------SDMTWGGF 82
D + Q EY+ + T G
Sbjct: 101 -DFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWL 159
Query: 83 NWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG 142
L ++++TVVCPIIDVISD+T FEYITASD TWGG
Sbjct: 160 EPLLARIVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGG 198
Query: 143 FNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGG 202
FNWKLNFRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGG
Sbjct: 199 FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGG 258
Query: 203 ENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARV--------- 253
ENLEMSFR+WQCGGILEIIPCSHVGHVFRDKSPYTFPGGV+KIVLHNAARV
Sbjct: 259 ENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVAKIVLHNAARVWMCGGVLEI 318
Query: 254 ---------------------------------AEVWMDEWRDFYYAMNPG 271
EVW+D+W++FYY+ PG
Sbjct: 319 APCSRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPG 369
>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
Length = 630
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 179/309 (57%), Positives = 205/309 (66%), Gaps = 65/309 (21%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y + LPTTSIVIVFHNEAW+TLLRTVWSVINRSPR LLKEIILVDDASER
Sbjct: 177 CRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASER----- 231
Query: 61 IDVISDQTFEYITA--------------------------------------SDMTWGGF 82
D + Q EY+ + T G
Sbjct: 232 -DFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWL 290
Query: 83 NWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG 142
L ++++TVVCPIIDVISD+T FEYITASD TWGG
Sbjct: 291 EPLLARIVQNRRTVVCPIIDVISDET---------------------FEYITASDSTWGG 329
Query: 143 FNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGG 202
FNWKLNFRWYRVP REM RR DR++PLRTPTMAGGLF+IDKDYFYE+GSYDEGMDIWGG
Sbjct: 330 FNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGG 389
Query: 203 ENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWR 262
ENLEMSFRVW CGG+LEI PCS VGHVFR +PYTFPGG ++IV HN AR+ EVW+D+W+
Sbjct: 390 ENLEMSFRVWMCGGVLEIAPCSRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEVWLDDWK 449
Query: 263 DFYYAMNPG 271
+FYY+ PG
Sbjct: 450 EFYYSFYPG 458
>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus impatiens]
Length = 606
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 181/303 (59%), Positives = 201/303 (66%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE+
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ ++ Q ++ A + T G L
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 272 IAEDRTTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 310
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDKDYFYELG+YDEGMDIWGGENLEMS
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMS 370
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+W CGG LEI CSHVGHVFR +PYTFPGG SKIV HN AR+AEVW+D+W+ FYY +
Sbjct: 371 FRIWMCGGTLEIATCSHVGHVFRKSTPYTFPGGTSKIVNHNNARLAEVWLDQWKYFYYNI 430
Query: 269 NPG 271
NPG
Sbjct: 431 NPG 433
>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 624
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 172/291 (59%), Positives = 210/291 (72%), Gaps = 27/291 (9%)
Query: 1 CKKKSYP--TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
CK+KSYP + LP TS+VIVFHNEAWSTLLRTV S+INRSPR LL EI+LVDDASER
Sbjct: 163 CKRKSYPPNSDLPDTSVVIVFHNEAWSTLLRTVHSIINRSPRELLNEILLVDDASER--- 219
Query: 59 PIIDVISDQTFEYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT- 108
+ + + +YI S+ G +L+ + + V+ +D + T
Sbjct: 220 ---EELGKKLEDYIARLPVSTRVIRSEERTGLIRARLKGAKQARGKVIT-FLDAHCECTE 275
Query: 109 -------FEYITAKT-VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM 160
+E +T VVCPIIDVI D +FEYIT SDMTWGGFNWKLNFRWY VP RE+
Sbjct: 276 GWLEPLLYEIHKDRTAVVCPIIDVIGDDSFEYITGSDMTWGGFNWKLNFRWYPVPQRELD 335
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
RRGGDRS+P +TPTMAGGLF+ID+DYFYE+GSYDEGMDIWGGENLEMSFRVW CGG + I
Sbjct: 336 RRGGDRSNPTKTPTMAGGLFSIDRDYFYEVGSYDEGMDIWGGENLEMSFRVWMCGGKVYI 395
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
+ CS VGHVFR SPY++PGGV++I+ HN R+ EVWMDE++DF+Y +NPG
Sbjct: 396 VTCSRVGHVFRKTSPYSWPGGVARIINHNTQRIVEVWMDEYKDFFYKINPG 446
>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Metaseiulus occidentalis]
Length = 583
Score = 340 bits (871), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 169/287 (58%), Positives = 203/287 (70%), Gaps = 23/287 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+KK+YP LPTTSIVIVFHNEAW+TLLRTV S+I SPR L+ EIILVDDASE
Sbjct: 126 CRKKTYPDRLPTTSIVIVFHNEAWTTLLRTVHSIIQMSPRELIAEIILVDDASE------ 179
Query: 61 IDVISDQTFEYIT-----ASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------F 109
D + + +Y+ + G + +R + +TV +I +
Sbjct: 180 FDHLGQKLEDYVAKLPVPVHVLRTGKRSGLIRARLIGAETVTGQVITFLDAHCECTEGWL 239
Query: 110 EYITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCP+IDVISD+ F Y+ ASD TWGGFNWKLNFRWYRVP RE RRG
Sbjct: 240 EPLLARIAEDNTRVVCPVIDVISDENFAYVPASDQTWGGFNWKLNFRWYRVPQRENDRRG 299
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLFA+DK YF +LG YDEGMDIWGGENLEMSFR+W CGG LEI+ C
Sbjct: 300 GDRTLPVRTPTMAGGLFAMDKAYFEKLGKYDEGMDIWGGENLEMSFRIWMCGGTLEIVTC 359
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
SHVGHVFR +PYTFPGG KIV HN AR+A+VW+DEW+DFY+A+NP
Sbjct: 360 SHVGHVFRKSTPYTFPGGTGKIVNHNNARLADVWLDEWKDFYFAINP 406
>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Ornithorhynchus anatinus]
Length = 556
Score = 339 bits (870), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 169/289 (58%), Positives = 203/289 (70%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LP T +VIVFHNEAWSTLLRTV+SVINRSPR+LL E+ILVDDASER
Sbjct: 105 CKTKIYPDELPNTRVVIVFHNEAWSTLLRTVFSVINRSPRSLLSEVILVDDASER----- 159
Query: 61 IDVISDQTFEYITASDM---------TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEY 111
D + Y+ D+ G +LR + V+ +D + TF +
Sbjct: 160 -DFLKTSLENYVKNLDVPVKIIRMEQRSGLIRARLRGAAASRGQVIT-FLDAHCECTFGW 217
Query: 112 ITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 218 LEPLLARIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRR 277
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+
Sbjct: 278 KGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVT 337
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 338 CSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 386
>gi|71896287|ref|NP_001025547.1| polypeptide N-acetylgalactosaminyltransferase 1 [Xenopus (Silurana)
tropicalis]
gi|60649677|gb|AAH90583.1| galnt1 protein [Xenopus (Silurana) tropicalis]
Length = 452
Score = 339 bits (870), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 169/283 (59%), Positives = 206/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAW+TLLRTV SVINRSPR LL+EIILVDDASER +
Sbjct: 106 CKTKVYPDSLPTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLQEIILVDDASEREFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + T ++ + G +LR K V+ +D + T ++
Sbjct: 166 PLETYVKKLTVPVHVLRMEQRSGLIRARLRGAAASKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRRGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus laevis]
gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
Length = 556
Score = 338 bits (866), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 170/283 (60%), Positives = 202/283 (71%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TSIVIVFHNEAWSTLLRTV SVINRSP L+ EIILVDDASER +
Sbjct: 105 CKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDASERDFLKT 164
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + + I + G +L N K ++ +D + TF ++
Sbjct: 165 PLENYVKHLEVAVKILRMEQRSGLIRARLSGANVAKGKIIT-FLDAHCECTFGWLEPLLA 223
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 224 RIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 283
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+IDK YF ELG+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 284 PVRTPTMAGGLFSIDKKYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 343
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMD+++DF+Y ++PG
Sbjct: 344 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPG 386
>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
[Bos taurus]
Length = 519
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 66 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 125
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 126 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 184
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 185 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 244
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 245 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 304
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 305 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 347
>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Canis lupus familiaris]
gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Ailuropoda melanoleuca]
gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
Length = 559
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
adamanteus]
Length = 556
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 170/283 (60%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRT++SV+NRSP LL EIILVDDASER +
Sbjct: 105 CKTKVYPDELPTTSVVIVFHNEAWSTLLRTIYSVMNRSPHYLLSEIILVDDASERDFLKL 164
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
P+ + + + Q I + G +LR K V+ +D + T E + A
Sbjct: 165 PLENYVRNLQVPVKIIRMEQRSGLIRARLRGAAASKGQVIT-FLDAHCECTTGWLEPLLA 223
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
K VVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 224 RIKEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 283
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 284 PVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 343
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 344 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 386
>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
aries]
Length = 559
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
catus]
Length = 559
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Equus caballus]
Length = 559
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
Length = 591
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 166/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 138 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 197
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 198 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 256
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 257 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 316
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 317 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 376
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 377 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 419
>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Sus scrofa]
Length = 559
Score = 337 bits (864), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
taurus]
gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
taurus]
gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
Length = 559
Score = 337 bits (864), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mustela putorius
furo]
Length = 461
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Callithrix jacchus]
gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
boliviensis boliviensis]
Length = 559
Score = 336 bits (862), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 166/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
Length = 559
Score = 336 bits (862), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE++ F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKTFFYIISPG 387
>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Anolis carolinensis]
Length = 557
Score = 336 bits (862), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 202/283 (71%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TS+VIVFHNEAWSTLLRT++SVINR+P LL EIILVDDASER +
Sbjct: 106 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFLKV 165
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + Q I + G +LR K V+ +D + T ++
Sbjct: 166 PLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVIT-FLDAHCECTLGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
K VVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 387
>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
laevis]
gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
Length = 559
Score = 336 bits (862), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 168/289 (58%), Positives = 203/289 (70%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LPTTS+VIVFHNEAW+TLLRTV SVINRSPR LL+EI+LVDDASER
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLREIVLVDDASER----- 160
Query: 61 IDVISDQTFEYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEY 111
D + Y+ + G +LR K V+ +D + T +
Sbjct: 161 -DFLKRALETYVKKLSVPVHVIRMEQRSGLIRARLRGAAASKGQVIT-FLDAHCECTVGW 218
Query: 112 ITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ +TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 219 LEPLLARINHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRR 278
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+
Sbjct: 279 RGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVT 338
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 339 CSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pongo abelii]
Length = 612
Score = 336 bits (862), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Anolis carolinensis]
Length = 556
Score = 336 bits (862), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 202/283 (71%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TS+VIVFHNEAWSTLLRT++SVINR+P LL EIILVDDASER +
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFLKV 164
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + Q I + G +LR K V+ +D + T ++
Sbjct: 165 PLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVIT-FLDAHCECTLGWLEPLLA 223
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
K VVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 224 RIKEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 283
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 284 PVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 343
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 344 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 386
>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Ornithorhynchus anatinus]
Length = 559
Score = 336 bits (861), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 166/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKFDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Ornithorhynchus anatinus]
Length = 558
Score = 336 bits (861), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 166/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 105 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 164
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 165 PLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVIT-FLDAHCECTVGWLEPLLA 223
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 224 RIKFDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 283
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 284 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 343
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 344 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 386
>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 559
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
Length = 483
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 169/283 (59%), Positives = 202/283 (71%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TSIVIVFHNEAWSTLLRTV SVINRSP L+ EIILVDD+SER +
Sbjct: 32 CKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDSSERDFLKS 91
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + + I + G +LR N K ++ +D + T ++
Sbjct: 92 PLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQIIT-FLDAHCECTIGWLEPLLA 150
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 151 RIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 210
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+IDK YF ELG+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 211 PVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 270
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMD+++DF+Y ++PG
Sbjct: 271 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPG 313
>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
(Silurana) tropicalis]
gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus (Silurana) tropicalis]
Length = 498
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 169/283 (59%), Positives = 202/283 (71%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TSIVIVFHNEAWSTLLRTV SVINRSP L+ EIILVDD+SER +
Sbjct: 32 CKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDSSERDFLKS 91
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + + I + G +LR N K ++ +D + T ++
Sbjct: 92 PLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQIIT-FLDAHCECTIGWLEPLLA 150
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 151 RIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 210
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+IDK YF ELG+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 211 PVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 270
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMD+++DF+Y ++PG
Sbjct: 271 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPG 313
>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Anolis carolinensis]
Length = 498
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 202/283 (71%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TS+VIVFHNEAWSTLLRT++SVINR+P LL EIILVDDASER +
Sbjct: 32 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFLKV 91
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + Q I + G +LR K V+ +D + T ++
Sbjct: 92 PLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVIT-FLDAHCECTLGWLEPLLA 150
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
K VVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 151 RIKEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 210
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 211 PVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 270
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 271 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 313
>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDSLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Cavia porcellus]
Length = 559
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
Length = 553
Score = 335 bits (859), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLENYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SD T+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMAGSDKTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Nomascus leucogenys]
gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pan troglodytes]
gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
paniscus]
gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
gorilla gorilla]
gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [synthetic
construct]
gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
fascicularis]
gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
Length = 559
Score = 335 bits (859), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
Length = 559
Score = 335 bits (859), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Loxodonta africana]
Length = 559
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TS+VIVFHNEAWSTLLRTV SV+NRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDALPRTSVVIVFHNEAWSTLLRTVHSVLNRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
anubis]
Length = 559
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLERYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Monodelphis domestica]
Length = 556
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 167/283 (59%), Positives = 199/283 (70%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL EIILVDDASER +
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEIILVDDASERDFLKM 164
Query: 61 IDVISDQTFEY---ITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI----- 112
+ E I + G +LR K V+ +D + T ++
Sbjct: 165 ALENYVKNLEVPVKIIRMEQRSGLIRARLRGAAASKGQVIT-FLDAHCECTLGWLEPLLA 223
Query: 113 ----TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ KTVVCPIID+ISD FEY SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 224 RIKESRKTVVCPIIDLISDDNFEYTAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 283
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGH
Sbjct: 284 PVRTPTMAGGLFSIDRNYFEEIGAYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGH 343
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 344 VFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 386
>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
mutus]
Length = 564
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV S+IN SPR +L+EI+LVDDASER +
Sbjct: 111 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSIINHSPRHMLEEIVLVDDASERDFLKR 170
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 171 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 229
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 230 RIKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 289
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 290 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 349
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 350 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 392
>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 334 bits (857), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 166/283 (58%), Positives = 206/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
P+ + ++ + G +L+ + V+ +D + T E + A
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVIT-FLDAHCECTAGWLEPLLA 224
Query: 115 K------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|432098984|gb|ELK28470.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Myotis davidii]
Length = 501
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 166/282 (58%), Positives = 204/282 (72%), Gaps = 13/282 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKQDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++P
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISP 386
>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 166/283 (58%), Positives = 206/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
P+ + ++ + G +L+ + V+ +D + T E + A
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVIT-FLDAHCECTAGWLEPLLA 224
Query: 115 K------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
Acetylgalactosaminyltransferase-T1
Length = 472
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 166/283 (58%), Positives = 206/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 19 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 78
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITA 114
P+ + ++ + G +L+ + V+ +D + T E + A
Sbjct: 79 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVIT-FLDAHCECTAGWLEPLLA 137
Query: 115 K------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 138 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 197
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 198 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 257
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 258 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 300
>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Taeniopygia guttata]
Length = 559
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K Y LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Meleagris gallopavo]
gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
Length = 559
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K Y LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Sarcophilus harrisii]
Length = 559
Score = 333 bits (855), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKVDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+ YF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
Length = 499
Score = 333 bits (855), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 46 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 105
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 106 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 164
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 165 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 224
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 225 PVRTPTMAGGLFSIDIDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 284
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 285 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 327
>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Monodelphis domestica]
Length = 559
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 165/283 (58%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKVDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+ YF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 164/283 (57%), Positives = 205/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKL+FRWY VP REM RR GDR+
Sbjct: 225 RIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLDFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
caballus]
Length = 453
Score = 333 bits (854), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 166/282 (58%), Positives = 204/282 (72%), Gaps = 13/282 (4%)
Query: 2 KKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCP 59
K K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EI+LVDDASER + P
Sbjct: 1 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 60
Query: 60 IIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 61 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLAR 119
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P
Sbjct: 120 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 179
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGHV
Sbjct: 180 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 239
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
FR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 240 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 281
>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Takifugu rubripes]
Length = 556
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 170/289 (58%), Positives = 202/289 (69%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP +P TSIVIVFHNEAWSTLLRTV SVINRSPR LL EI+LVDDASER
Sbjct: 105 CKTKVYPDDVPNTSIVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASER----- 159
Query: 61 IDVISDQTFEY---------ITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT--- 108
D + + Y I + G +LR K V+ +D + T
Sbjct: 160 -DFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVIT-FLDAHCECTVGW 217
Query: 109 FEYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
E + A+ VVCPIIDVISD+TFEY+ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 218 LEPLLARIKEDRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRR 277
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GDR+ P+RTPTMAGGLF+IDK YF E+GSYD GMDIWGGENLEMSFR+WQCGG LEI+
Sbjct: 278 KGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVT 337
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
CSHVGHVFR +PY+FPGG +++ N R+AEVWMD+++DF+Y ++PG
Sbjct: 338 CSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYIISPG 386
>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 559
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 164/283 (57%), Positives = 204/283 (72%), Gaps = 13/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +++EI+LVDDASER +
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMVEEIVLVDDASERDFLKR 165
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + ++ + G +L+ K V+ +D + T ++
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVIT-FLDAHCECTVGWLEPLLA 224
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+TVVCPII VISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 225 RIKQDRRTVVCPIICVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGH
Sbjct: 285 PVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 345 VFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 387
>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
Length = 488
Score = 331 bits (848), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 165/285 (57%), Positives = 199/285 (69%), Gaps = 31/285 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K+YP LP S+VIVFHNEAW TLLR+V S+INR+PR L+EIILVDDASER V
Sbjct: 46 CKSKTYPKELPRMSVVIVFHNEAWCTLLRSVNSIINRTPRPYLEEIILVDDASERGVPVK 105
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISD 106
++ + ++ G +LR K V+ P++ I++
Sbjct: 106 LERMGKRS-----------GLIRARLRGSGAAKGPVITFLDAHIECTEGWAEPLLTRIAE 154
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RRGGDR
Sbjct: 155 DR------TTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRGGDR 208
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ PLRTPTMAGGLFAIDK YF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHV
Sbjct: 209 TMPLRTPTMAGGLFAIDKSYFEEIGTYDSGMDIWGGENLEISFRIWQCGGTLEIVTCSHV 268
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
GHVFR +PYTFPGG +I+ N R+AEVWMD ++DF+Y ++PG
Sbjct: 269 GHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDNFKDFFYIISPG 313
>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Oryzias latipes]
Length = 556
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 166/288 (57%), Positives = 200/288 (69%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LPTTSIVIVFHNEAWSTLLRTV SVI+RSPR LL EI+LVDDASER
Sbjct: 105 CKTKVYADDLPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASER----- 159
Query: 61 IDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDVISDQT------F 109
D + + Y+ ++ + +R + R +I +
Sbjct: 160 -DFLKKKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWL 218
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCPIIDVISD+TFEY+ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 219 EPLLARIKEDRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 278
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLF+IDK YF E+GSYD GMDIWGGENLEMSFR+WQCGG LEI+ C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFR +PY+FPGG +++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 339 SHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDEFKDFFYIISPG 386
>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Meleagris gallopavo]
Length = 556
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 166/282 (58%), Positives = 195/282 (69%), Gaps = 11/282 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TS+VIVFHNEAWSTLLRTV SV+ RSPR LL EIILVDDASER +
Sbjct: 105 CKTKVYPEELPNTSVVIVFHNEAWSTLLRTVHSVLARSPRRLLAEIILVDDASEREFLKA 164
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITAK 115
+ + + G + V +D + T E + A+
Sbjct: 165 SLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVVTFLDAHCECTRGWLEPLLAR 224
Query: 116 ------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P
Sbjct: 225 IREDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 284
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+RTPTMAGGLF+ID++YF E+GSYD GMDIWGGENLEMSFRVWQCGG LEI+ CSHVGHV
Sbjct: 285 VRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGSLEIVTCSHVGHV 344
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
FR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 345 FRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 386
>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Oryzias latipes]
Length = 557
Score = 328 bits (842), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 166/288 (57%), Positives = 200/288 (69%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LPTTSIVIVFHNEAWSTLLRTV SVI+RSPR LL EI+LVDDASER
Sbjct: 106 CKTKVYADDLPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASER----- 160
Query: 61 IDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDVISDQT------F 109
D + + Y+ ++ + +R + R +I +
Sbjct: 161 -DFLKKKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWL 219
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCPIIDVISD+TFEY+ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 220 EPLLARIKEDRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 279
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLF+IDK YF E+GSYD GMDIWGGENLEMSFR+WQCGG LEI+ C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 339
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFR +PY+FPGG +++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 340 SHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDEFKDFFYIISPG 387
>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
1 [Ciona intestinalis]
Length = 573
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 166/289 (57%), Positives = 203/289 (70%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LPTTSIVIVFHNEAWSTLLRTV S+INRSP LL+EIILVDDASER +
Sbjct: 119 CKSKKYPEKLPTTSIVIVFHNEAWSTLLRTVHSIINRSPSHLLEEIILVDDASERDFLGA 178
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PII-D 102
P+ + +T + + G +LR + V+ P++ +
Sbjct: 179 PLERYVRKLRTLVRVVRMEKRTGLIRARLRGASVSTGQVITFLDAHCECTEGWLEPLLSE 238
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ D+T TVVCPIIDVISD+TFE++ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 239 IAKDRT-------TVVCPIIDVISDETFEFMVGSDMTYGGFNWKLNFRWYPVPQREMDRR 291
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GDR+ P+R+PTMAGGLF+IDK YF ELG+YD GMDIWGGENLE+SFR+WQCGG L I+
Sbjct: 292 KGDRTLPVRSPTMAGGLFSIDKSYFEELGTYDAGMDIWGGENLEISFRIWQCGGTLLIVT 351
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
CSHVGHVFR +PYTFPGG +I+ N R+AEVWMD +++F+Y + PG
Sbjct: 352 CSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDSFKNFFYIITPG 400
>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Oryzias latipes]
Length = 558
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 166/288 (57%), Positives = 200/288 (69%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LPTTSIVIVFHNEAWSTLLRTV SVI+RSPR LL EI+LVDDASER
Sbjct: 107 CKTKVYADDLPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASER----- 161
Query: 61 IDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDVISDQT------F 109
D + + Y+ ++ + +R + R +I +
Sbjct: 162 -DFLKKKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWL 220
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCPIIDVISD+TFEY+ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 221 EPLLARIKEDRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 280
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLF+IDK YF E+GSYD GMDIWGGENLEMSFR+WQCGG LEI+ C
Sbjct: 281 GDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 340
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFR +PY+FPGG +++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 341 SHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDEFKDFFYIISPG 388
>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
gallus]
Length = 556
Score = 328 bits (840), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 166/282 (58%), Positives = 195/282 (69%), Gaps = 11/282 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K YP LP TS+VIVFHNEAWSTLLRTV SV+ RSPR LL EIILVDDASER +
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVHSVVARSPRRLLAEIILVDDASEREFLKA 164
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITAK 115
+ + + G + V +D + T E + A+
Sbjct: 165 SLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVITFLDAHCECTRGWLEPLLAR 224
Query: 116 ------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P
Sbjct: 225 IWEDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 284
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+RTPTMAGGLF+ID++YF E+GSYD GMDIWGGENLEMSFRVWQCGG LEI+ CSHVGHV
Sbjct: 285 VRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGSLEIVTCSHVGHV 344
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
FR +PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 345 FRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 386
>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Oreochromis niloticus]
Length = 556
Score = 328 bits (840), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 165/288 (57%), Positives = 199/288 (69%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSIVIVFHNEAWSTLLRTV SVINRSP+ LL EIILVDDASER
Sbjct: 105 CKTKVYSDDLPNTSIVIVFHNEAWSTLLRTVHSVINRSPKHLLVEIILVDDASER----- 159
Query: 61 IDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDVISDQT------F 109
D + + Y+ ++ + +R + R +I +
Sbjct: 160 -DFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTVGWL 218
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCPIIDVISD+TFEY+ SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 219 EPLLARIKEDRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 278
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGGLF+IDK YF E+GSYD GMDIWGGENLEMSFR+WQCGG LEI+ C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFR +PY+FPGG +++ N R+AEVWMD+++DF+Y ++PG
Sbjct: 339 SHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYIISPG 386
>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
Length = 559
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 168/303 (55%), Positives = 198/303 (65%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVINRSPR LL+EI+LVDDASER
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLEEIVLVDDASERDFLKR 165
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + +Q I A + T G L
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 226 IKHDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 264
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y +
Sbjct: 325 FRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYII 384
Query: 269 NPG 271
+PG
Sbjct: 385 SPG 387
>gi|395823173|ref|XP_003804166.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1 [Otolemur garnettii]
Length = 539
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 160/277 (57%), Positives = 196/277 (70%), Gaps = 21/277 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LPTTS+VIVFHNEAWSTLLRTV SV+NRSPR +++EI+LVDDASER
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVVNRSPRHMIEEIVLVDDASER----- 160
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCP 120
DQ ++ ++ + + + ++ Y KTVVCP
Sbjct: 161 -----DQHQRRQMIGSLSHADWDCSSQHVAHQGASDILKAFEICF-----YDFRKTVVCP 210
Query: 121 IIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLF 180
IIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P+RTPTMAGGLF
Sbjct: 211 IIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLF 270
Query: 181 AIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS------HVGHVFRDKS 234
+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CS VGHVFR +
Sbjct: 271 SIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSXXXXXXXVGHVFRKAT 330
Query: 235 PYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 331 PYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 367
>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 565
Score = 324 bits (831), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 168/295 (56%), Positives = 200/295 (67%), Gaps = 27/295 (9%)
Query: 1 CKKKSYP--TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
CKKK+YP LP TSIVIVFHNEAWSTL+R V S+INRSPR LL+EIILVDDASER
Sbjct: 110 CKKKTYPPHNTLPKTSIVIVFHNEAWSTLIRNVHSIINRSPRMLLEEIILVDDASER--- 166
Query: 59 PIIDVISDQTFEYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT- 108
D + + +Y+ D G +LR V+ +D + T
Sbjct: 167 ---DFLGKELEDYVKKLPVRVRVERMDKRSGLIRARLRGAGVSTGEVIT-FLDAHCECTQ 222
Query: 109 --FEYITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM 160
E + A+ VVCPIIDVISD+TFE+ SDMT+GGFNWKLNFRWY VP REM
Sbjct: 223 GWLEPLMARIAEDRSRVVCPIIDVISDETFEFHAGSDMTYGGFNWKLNFRWYSVPKREMD 282
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
RR GDR+ PL TPTMAGGLFAI KDYF E+G+YD GMDIWGGENLEMSFR+W CGG LEI
Sbjct: 283 RRKGDRTIPLNTPTMAGGLFAIHKDYFEEIGTYDAGMDIWGGENLEMSFRIWMCGGTLEI 342
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSAS 275
+ CSHVGHVFR +PY+FPGG I+ N R+AEVWMD+++ F+Y ++PG S
Sbjct: 343 VTCSHVGHVFRKTTPYSFPGGTGAIINKNNRRLAEVWMDDYKTFFYKISPGSKKS 397
>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
Length = 632
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 169/311 (54%), Positives = 203/311 (65%), Gaps = 46/311 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP +P TS+VIVFHNEAWSTLLRTV SVINRSPR LL EI+LVDDASER
Sbjct: 105 CKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASER----- 159
Query: 61 IDVISDQTFEY---------ITASDMTWGGFNWKLREKNRHKKTVVC------------- 98
D + + Y I + G +LR K V+
Sbjct: 160 -DFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWL 218
Query: 99 -PIIDVISDQTFEYITA-----------------KTVVCPIIDVISDQTFEYITASDMTW 140
P++ I + ++ TA VVCPIIDVISD+TFEY+ SDMT+
Sbjct: 219 EPLLARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMAGSDMTY 278
Query: 141 GGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIW 200
GGFNWKLNFRWY VP REM RR GDR+ P+RTPTMAGGLF+IDK YF E+GSYD GMDIW
Sbjct: 279 GGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIW 338
Query: 201 GGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDE 260
GGENLEMSFR+WQCGG LEI+ CSHVGHVFR +PY+FPGG +++ N R+AEVWMD+
Sbjct: 339 GGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDD 398
Query: 261 WRDFYYAMNPG 271
++DF+Y ++PG
Sbjct: 399 FKDFFYIISPG 409
>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oryzias latipes]
Length = 557
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 167/303 (55%), Positives = 199/303 (65%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV SVI+RSPR+LL+EI+LVDDASER
Sbjct: 104 CKNKLYPDDLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPRSLLEEIVLVDDASERDFLKR 163
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ V +Q I A + T G L
Sbjct: 164 QLEQYVRRLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTLGWLEPLLTR 223
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ K+TVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 224 IKQDKRTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 262
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTIPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y +
Sbjct: 323 FRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYII 382
Query: 269 NPG 271
+PG
Sbjct: 383 SPG 385
>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 580
Score = 323 bits (829), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 165/292 (56%), Positives = 202/292 (69%), Gaps = 27/292 (9%)
Query: 1 CKKKSYP--TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
CK+KSYP + LP+TSIVIVFHNEAWSTLLR++ S+INRSPR LL EIILVDDASER
Sbjct: 122 CKRKSYPPVSELPSTSIVIVFHNEAWSTLLRSIHSIINRSPRELLTEIILVDDASER--- 178
Query: 59 PIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI--SDQTFE- 110
D + Q +Y+ + G + +R + R V ++ + DQ
Sbjct: 179 ---DFLGQQLDDYVKRLQVPVHVERMGTRSGLIRARLRGAGLVKGHVLGFLXSHDQCSAS 235
Query: 111 -----YITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
Y+ A + VVCPIIDVISD F + T SDMT+GGFNWKL FRWY VP RE
Sbjct: 236 SLRPVYLEASRRHDRRNVVCPIIDVISDDNFAFHTGSDMTYGGFNWKLQFRWYPVPQREA 295
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
RRGGDR+ PLR+PTMAGGLF+IDK YF E+G+YD GMD+WGGENLE+SFR+W CGG LE
Sbjct: 296 DRRGGDRTIPLRSPTMAGGLFSIDKTYFEEIGTYDAGMDVWGGENLEISFRIWMCGGTLE 355
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
I+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMD++R FYY ++PG
Sbjct: 356 IVTCSHVGHVFRKSTPYTFPGGTGRIINRNNQRLAEVWMDDFRHFYYRISPG 407
>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Ailuropoda melanoleuca]
Length = 556
Score = 323 bits (829), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 196/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSPR LL E+ILVDDASER
Sbjct: 105 CKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
Length = 516
Score = 323 bits (829), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 196/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSPR LL E+ILVDDASER
Sbjct: 60 CKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKL 119
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 120 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 179
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 180 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 218
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 219 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 278
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 279 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 338
Query: 269 NPG 271
+PG
Sbjct: 339 SPG 341
>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Ailuropoda melanoleuca]
Length = 557
Score = 323 bits (827), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 196/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSPR LL E+ILVDDASER
Sbjct: 106 CKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKL 165
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 166 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 225
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 226 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 264
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 325 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 384
Query: 269 NPG 271
+PG
Sbjct: 385 SPG 387
>gi|348585735|ref|XP_003478626.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cavia porcellus]
Length = 568
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/304 (54%), Positives = 195/304 (64%), Gaps = 53/304 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPGK 272
+P K
Sbjct: 384 SPAK 387
>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
Length = 613
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/281 (56%), Positives = 198/281 (70%), Gaps = 23/281 (8%)
Query: 8 TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPI----- 60
T +PTTSI+IVFHNEAW+TLLRT+ SVINRSPR LL+EII++DD S+R +V P+
Sbjct: 168 TGMPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIK 227
Query: 61 ----------IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
++ S +T S M G L + + P+I +++
Sbjct: 228 ALPVPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTEGWLEPLISRVAEDR-- 285
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
K VV PIIDVISD TFEY+TAS+ TWGGFNW LNFRWY VP RE+ RRG DRS P+
Sbjct: 286 ----KRVVAPIIDVISDDTFEYVTASETTWGGFNWHLNFRWYSVPKRELNRRGSDRSMPI 341
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
+TPT+AGGLFAIDK +FY++GSYDEGM +WGGENLE+SFRVW CGG LEI PCS VGHVF
Sbjct: 342 QTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVF 401
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
R ++PYTFPGG +K++ HNAAR AEVWMDE++ F+Y M P
Sbjct: 402 RKQTPYTFPGGTAKVIHHNAARTAEVWMDEYKAFFYKMVPA 442
>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oreochromis niloticus]
Length = 557
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 198/303 (65%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAW+TLLRTV SVI+RSP TLL+EI+LVDDASER
Sbjct: 104 CKNKLYPDNLPRTSVVIVFHNEAWTTLLRTVHSVIDRSPHTLLEEIVLVDDASERDFLKQ 163
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ V +Q I A + T G L
Sbjct: 164 QLERYVRKLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLAR 223
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ +KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 224 IKQDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 262
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y +
Sbjct: 323 FRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYII 382
Query: 269 NPG 271
+PG
Sbjct: 383 SPG 385
>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
Length = 615
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 161/278 (57%), Positives = 196/278 (70%), Gaps = 23/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPI------- 60
LPTTSI+IVFHNEAW+TLLRT+ SVINRSPR LL+EIILVDD S+R +V P+
Sbjct: 171 LPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIILVDDKSDRDYLVKPLDAYIKKF 230
Query: 61 --------IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI 112
++ S +T S M G L + P++ +++
Sbjct: 231 PVPVHLVHLEDRSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPLVTRVAEDR---- 286
Query: 113 TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
K VV PIIDVISD TFEY+TAS+ TWGGFNW LNFRWY VP RE+ RRG DRS P++T
Sbjct: 287 --KRVVAPIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRGADRSMPIQT 344
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
PT+AGGLFAIDK +FY++GSYDEGM +WGGENLE+SFRVW CGG LEI PCS VGHVFR
Sbjct: 345 PTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRK 404
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
++PYTFPGG +K++ HNAAR AEVWMDE++ F+Y M P
Sbjct: 405 QTPYTFPGGTAKVIHHNAARTAEVWMDEYKAFFYKMVP 442
>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 559
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 198/303 (65%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LPTTS+VIVFHNEAWSTLLRTV SV +RSPR +L+EI+LVDDASER
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVTDRSPRHMLEEIVLVDDASERDFLKR 165
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + +Q I A + T G L
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGASVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ +KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 226 IKQDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 264
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y +
Sbjct: 325 FRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYII 384
Query: 269 NPG 271
+PG
Sbjct: 385 SPG 387
>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Saimiri boliviensis boliviensis]
Length = 556
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER +
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASEREFLKL 164
Query: 61 ------------IDVI-------------------SDQTFEYITAS-DMTWGGFNWKLRE 88
+ +I Q ++ A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
musculus]
Length = 594
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 107 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 166
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 167 TLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 226
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 227 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 265
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 325
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 326 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 385
Query: 269 NPG 271
+PG
Sbjct: 386 SPG 388
>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pan troglodytes]
gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Pongo abelii]
gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pongo abelii]
gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
paniscus]
gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
Length = 556
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
aries]
Length = 556
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
norvegicus]
gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
Length = 556
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
Length = 556
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
Length = 592
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Danio rerio]
Length = 557
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 167/303 (55%), Positives = 197/303 (65%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAW+TLLRTV SVI+RSPR LL+EI+LVDDASER
Sbjct: 104 CKTKVYPDDLPRTSVVIVFHNEAWTTLLRTVHSVIDRSPRHLLEEIVLVDDASERDFLKR 163
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ V +Q I A + T G L
Sbjct: 164 QLEHYVRKLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLSR 223
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
KKTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 224 IKLDKKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 262
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y +
Sbjct: 323 FRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYII 382
Query: 269 NPG 271
+PG
Sbjct: 383 SPG 385
>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Nomascus leucogenys]
Length = 557
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 106 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 165
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 166 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 225
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 226 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 264
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 325 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 384
Query: 269 NPG 271
+PG
Sbjct: 385 SPG 387
>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
Length = 556
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
Length = 516
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 65 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 124
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 125 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 184
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 185 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 223
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 224 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 283
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 284 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 343
Query: 269 NPG 271
+PG
Sbjct: 344 SPG 346
>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
Length = 556
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Otolemur garnettii]
Length = 556
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER +
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 61 --------IDV-----------------------ISDQTFEYITAS-DMTWGGFNWKLRE 88
+DV Q ++ A + T G L
Sbjct: 165 TLENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
Length = 561
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
Length = 519
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 68 CKTKVYPDQLPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 127
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 128 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 187
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 188 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 226
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 227 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 286
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 287 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 346
Query: 269 NPG 271
+PG
Sbjct: 347 SPG 349
>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cricetulus griseus]
Length = 497
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 46 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 105
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 106 TLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 165
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 166 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 204
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 264
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 265 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 324
Query: 269 NPG 271
+PG
Sbjct: 325 SPG 327
>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Oryctolagus cuniculus]
Length = 557
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 106 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 165
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 166 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 225
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 226 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 264
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 324
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 325 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 384
Query: 269 NPG 271
+PG
Sbjct: 385 SPG 387
>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
catus]
Length = 546
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 196/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSPR LL E+ILVDDASER
Sbjct: 95 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKL 154
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 155 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASRGQVITFLDAHCECTLGWLEPLLAR 214
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 215 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 253
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 254 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 313
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 314 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 373
Query: 269 NPG 271
+PG
Sbjct: 374 SPG 376
>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
Length = 611
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 159/278 (57%), Positives = 196/278 (70%), Gaps = 23/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPI------- 60
LPTTSI+IVFHNEAW+TLLRT+ SVINRSPR LL+EII++DD S+R +V P+
Sbjct: 170 LPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKKF 229
Query: 61 --------IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI 112
++ S +T S M G L + P++ +++
Sbjct: 230 PIPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPLVHRVAEDR---- 285
Query: 113 TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
K VV PIIDVISD TFEY+TAS+ TWGGFNW LNFRWY VP RE+ RRG DRS P++T
Sbjct: 286 --KRVVAPIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRGSDRSMPIQT 343
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
PT+AGGLFAIDK +FY++GSYDEGM +WGGENLE+SFRVW CGG LEI PCS VGHVFR
Sbjct: 344 PTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRK 403
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
++PYTFPGG +K++ HNAAR AEVWMDE++ F+Y M P
Sbjct: 404 QTPYTFPGGTAKVIHHNAARTAEVWMDEYKAFFYKMVP 441
>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
3 [Canis lupus familiaris]
Length = 556
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TS+VIVFHNEAWSTLLRTV+SVINRSPR LL E+ILVDDASER
Sbjct: 105 CKTKVYADELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Otolemur garnettii]
Length = 558
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER +
Sbjct: 107 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 166
Query: 61 --------IDV-----------------------ISDQTFEYITAS-DMTWGGFNWKLRE 88
+DV Q ++ A + T G L
Sbjct: 167 TLENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 226
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 227 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 265
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 266 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 325
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 326 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 385
Query: 269 NPG 271
+PG
Sbjct: 386 SPG 388
>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
taurus]
gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
Length = 556
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 164/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK + YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTRVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Anolis carolinensis]
Length = 559
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 166/303 (54%), Positives = 197/303 (65%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LPTTS+VIVFHNEAWSTLLRTV SVINRSPR +L+EIILVDDASER
Sbjct: 106 CKTKVYSDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHILEEIILVDDASERDFLKR 165
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + +Q I A + T G L
Sbjct: 166 LLENYVKKLQIPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++TVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 226 IKADRRTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 264
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+S
Sbjct: 265 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 324
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y +
Sbjct: 325 FRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYII 384
Query: 269 NPG 271
+PG
Sbjct: 385 SPG 387
>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
Length = 612
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 160/279 (57%), Positives = 195/279 (69%), Gaps = 23/279 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVI--- 64
+P TSI+IVFHNEAW+TLLRT+ SVINRSPR LL+EIILVDD S+R +V P+ I
Sbjct: 169 MPKTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIILVDDKSDRDYLVKPLDSYIKMF 228
Query: 65 ------------SDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI 112
S +T S+M G L + P++ +++
Sbjct: 229 PIPIHLVHLENRSGLIRARLTGSEMAKGKILLFLDAHVEVTDGWLEPLVSRVAEDR---- 284
Query: 113 TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
K VV PIIDVISD TFEY+TAS+ TWGGFNW LNFRWY VP RE+ RRG DRS P++T
Sbjct: 285 --KRVVAPIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRGSDRSMPIQT 342
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
PT+AGGLFAIDK +FY++GSYDEGM +WGGENLE+SFRVW CGG LEI PCS VGHVFR
Sbjct: 343 PTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRK 402
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
++PYTFPGG +K++ HNAAR AEVWMDE++ F+Y M P
Sbjct: 403 QTPYTFPGGTAKVIHHNAARTAEVWMDEYKAFFYKMVPA 441
>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Callithrix jacchus]
Length = 561
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 194/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+ YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Callithrix jacchus]
Length = 556
Score = 320 bits (820), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 194/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+ YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
partial [Papio anubis]
Length = 444
Score = 320 bits (819), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 165/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 39 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 98
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 99 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 158
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 159 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 197
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMS
Sbjct: 198 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMS 257
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 258 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 317
Query: 269 NPG 271
+PG
Sbjct: 318 SPG 320
>gi|351714167|gb|EHB17086.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Heterocephalus
glaber]
Length = 330
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/303 (54%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 46 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKF 105
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 106 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 165
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 166 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 204
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLE+S
Sbjct: 205 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEIS 264
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 265 FRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 324
Query: 269 NPG 271
+PG
Sbjct: 325 SPG 327
>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Danio rerio]
Length = 556
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/303 (53%), Positives = 195/303 (64%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K+YP LP TSIVIVFHNEAWSTLLRTV S INRSPR LL EI+LVDDASER
Sbjct: 105 CKTKTYPDDLPNTSIVIVFHNEAWSTLLRTVHSAINRSPRQLLYEILLVDDASERDFLKE 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + +Q I A + T G +
Sbjct: 165 KLEDYVATLEVPVRILRMEQRTGLIRARLRGAAATRGQVITFLDAHCECTTGWLEPLMAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
++ VVCPIIDVIS D+TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRRAVVCPIIDVIS---------------------DETFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+ YF E+G+YD GMDIWGGENLEMS
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDSGMDIWGGENLEMS 323
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PY+FPGG +++ N R+AEVWMDE++DF+Y +
Sbjct: 324 FRIWQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDEFKDFFYII 383
Query: 269 NPG 271
+PG
Sbjct: 384 SPG 386
>gi|351712481|gb|EHB15400.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 399
Score = 317 bits (813), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 161/288 (55%), Positives = 200/288 (69%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LPTTS+VIVFHNEAWSTLLRTV SVIN SPR +++EI+LVDDA+ER
Sbjct: 85 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINCSPRHMVEEIVLVDDANER----- 139
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLRE---KNRHK-----KTVVCPIIDVISDQTFEYI 112
D + Y+ + + R ++R K K V +D + T ++
Sbjct: 140 -DFLKRTLESYVKKLKVPVHVIRMEHRSGLIRDRLKGDAVSKGQVIIFLDAHCECTVGWL 198
Query: 113 TA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+TVVCPIIDVISD TFE + SDMT+GGFNWKLNFRWY VP REM RR
Sbjct: 199 EPLLTRIKQDRRTVVCPIIDVISDDTFECMAGSDMTYGGFNWKLNFRWYLVPQREMDRRK 258
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GDR+ P+RTPTMAGG F+ID+DYF E+G+YD GMDIWG ENLE+SFR+WQCGG LEI+ C
Sbjct: 259 GDRTLPVRTPTMAGGCFSIDRDYFQEIGTYDAGMDIWGRENLEISFRIWQCGGTLEIVTC 318
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVF+ +PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 319 SHVGHVFQKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 366
>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
Length = 588
Score = 317 bits (811), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 162/290 (55%), Positives = 197/290 (67%), Gaps = 27/290 (9%)
Query: 1 CKKKSYPTF--LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
CKK Y LP TSI+ VFHNEAWSTLLR++ SVINRSPR +L+EIILVDD SE+
Sbjct: 132 CKKHDYANLGALPKTSIIFVFHNEAWSTLLRSIHSVINRSPREMLEEIILVDDKSEK--- 188
Query: 59 PIIDVISDQTFEY---------ITASDMTWGGFNWKLREKNRHKKTVVCPIIDV------ 103
D + Q +Y I G +L E + K V +D
Sbjct: 189 ---DFLGKQLDDYVKNLPVPVHIIRQQHREGLIRARL-EGAKIAKGEVLTFLDAHIEASP 244
Query: 104 --ISDQTFEYITAKT-VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM 160
+ +E +T V+CPIIDVISD TFE++T SD+T+GGFNWKLNFRWY VP RE+
Sbjct: 245 GWLEPLLYEIKKDRTNVICPIIDVISDDTFEFLTGSDLTYGGFNWKLNFRWYPVPQREVD 304
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
RRGGDRS P++TPTMAGGLF+IDK YFYE+GSYD GMDIWGGENLEMSFR+W CGG + I
Sbjct: 305 RRGGDRSLPMQTPTMAGGLFSIDKSYFYEIGSYDSGMDIWGGENLEMSFRIWMCGGTVLI 364
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVFR +PYTFPGG S+I+ N R+AEVWMD+++ F+Y +NP
Sbjct: 365 ATCSHVGHVFRKATPYTFPGGTSQIINKNNRRLAEVWMDDYKKFFYIVNP 414
>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
malayi]
Length = 584
Score = 313 bits (801), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 160/282 (56%), Positives = 193/282 (68%), Gaps = 25/282 (8%)
Query: 8 TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC--PIIDVIS 65
T LP SI+IVFHNEAWSTLLRT+ SVINRSP L+KE+IL+DD S R P+ I
Sbjct: 143 TSLPMVSIIIVFHNEAWSTLLRTLHSVINRSPLHLIKEVILIDDLSNRTYLRKPLDTYIK 202
Query: 66 --DQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTF 109
F I + + G +L+ K V+ P++D +S
Sbjct: 203 RFSLPFHLIHLPERS-GLIRARLQGAKVAKGKVLLFLDAHVEVTEGWLEPLLDRVSTDR- 260
Query: 110 EYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
K VV PIIDVISD+ FEYITASD+TWGGFNW LNFRWY VP REM RR DRS P
Sbjct: 261 -----KRVVAPIIDVISDENFEYITASDVTWGGFNWHLNFRWYPVPMREMERRNHDRSVP 315
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
L+TPT+AGGLFAID+ +FY++GSYDEGM+IWGGENLE+SFRVW CGG LEI PCS VGHV
Sbjct: 316 LQTPTIAGGLFAIDRQFFYDIGSYDEGMEIWGGENLEISFRVWMCGGSLEIHPCSRVGHV 375
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
FR +PY+FPGG ++++ HNAAR AEVWMDE++D +Y M P
Sbjct: 376 FRKHTPYSFPGGTARVIHHNAARTAEVWMDEYKDIFYGMVPA 417
>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
Length = 584
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 158/282 (56%), Positives = 193/282 (68%), Gaps = 25/282 (8%)
Query: 8 TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC--PIIDVIS 65
T LP SI+IVFHNEAWSTLLRT+ SVINRSP L+KE+IL+DD S R P+ I
Sbjct: 143 TSLPMVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRKPLDTYIK 202
Query: 66 --DQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTF 109
F I + + G +L+ K V+ P++D +S
Sbjct: 203 RFSLPFHLIHLPERS-GLIRARLQGAKVAKGKVLLFLDAHVEVTEGWLEPLLDRVSTDR- 260
Query: 110 EYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
K VV PIIDVISD+ FEYITASD+TWGGFNW LNFRWY VP REM RR DRS P
Sbjct: 261 -----KRVVAPIIDVISDENFEYITASDVTWGGFNWHLNFRWYPVPMREMERRNHDRSVP 315
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
L+TPT+AGGLFAID+ +FY++GSYDEGM++WGGENLE+SFRVW CGG LEI PCS VGHV
Sbjct: 316 LQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPCSRVGHV 375
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
FR +PY+FPGG ++++ HN AR AEVWMDE++D +Y+M P
Sbjct: 376 FRKHTPYSFPGGTARVIHHNTARTAEVWMDEYKDIFYSMVPA 417
>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
Length = 584
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 159/282 (56%), Positives = 194/282 (68%), Gaps = 25/282 (8%)
Query: 8 TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVIS 65
T LPT SI+IVFHNEAWSTLLRT+ SVINRSP L+KE+IL+DD S R + P+ I
Sbjct: 143 TSLPTVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRSPLDLYIK 202
Query: 66 --DQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTF 109
F I + + G +L+ K V+ P++D +S
Sbjct: 203 RFSLPFHLIHLPERS-GLIRARLQGAKIAKGKVLLFLDAHVEVTEGWLEPLLDRVS---- 257
Query: 110 EYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
+ K VV PIIDVISD+ FEYITASD+TWGGFNW LNFRWY VP REM RR DRS P
Sbjct: 258 --VDRKRVVAPIIDVISDENFEYITASDITWGGFNWHLNFRWYPVPMREMERRNHDRSVP 315
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
L+TPT+AGGLFAID+ +FY++GSYDEGM++WGGENLE+SFRVW CGG LEI PCS VGHV
Sbjct: 316 LQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPCSRVGHV 375
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
FR +PY+FPGG + ++ NAAR AEVWMDE++D +Y M P
Sbjct: 376 FRKHTPYSFPGGTANVIHRNAARTAEVWMDEYKDIFYKMVPA 417
>gi|440905500|gb|ELR55875.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Bos grunniens
mutus]
Length = 412
Score = 310 bits (794), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 166/334 (49%), Positives = 199/334 (59%), Gaps = 65/334 (19%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER +
Sbjct: 16 CKTRVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 75
Query: 61 IDVISDQTFEY---ITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
+ E I + G +LR K V+ +D + T ++
Sbjct: 76 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVIT-FLDAHCECTLGWLEPLLA 134
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 135 RIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 194
Query: 169 PLR----------------------------------------------------TPTMA 176
P+R TPTMA
Sbjct: 195 PVRLLPEKWLSLKVIEHTSPAQCLASAISLLDKEKASTSGESGGTGFKQSQLIERTPTMA 254
Query: 177 GGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPY 236
GGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGHVFR +PY
Sbjct: 255 GGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPY 314
Query: 237 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
TFPGG ++ N R+AEVWMDE++DF+Y ++P
Sbjct: 315 TFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISP 348
>gi|157113705|ref|XP_001652065.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108877647|gb|EAT41872.1| AAEL006558-PA [Aedes aegypti]
Length = 368
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 135/160 (84%), Positives = 149/160 (93%)
Query: 112 ITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
+ KTVVCPIIDVISD+TFEY+TASD TWGGFNWKLNFRWYRVP REM RR DR++PLR
Sbjct: 28 LDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRRNHDRTAPLR 87
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TPTMAGGLF+ID+DYFYE+GSYDEGMDIWGGENLEMSFR+WQCGGILEI PCSHVGHVFR
Sbjct: 88 TPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFR 147
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
DKSPYTFPGGV+ IVL NAARVAEVW+DEW++FYY M+PG
Sbjct: 148 DKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFYYQMSPG 187
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/121 (38%), Positives = 60/121 (49%), Gaps = 28/121 (23%)
Query: 42 LLKEIILVDDASERVVCPIIDVISDQTFEYITASDMTWGGFNWKL---------REKNRH 92
LL I+L + VVCPIIDVISD+TFEY+TASD TWGGFNWKL RE R
Sbjct: 22 LLARIVL---DRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRR 78
Query: 93 KKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFE---YITASDMTWGGFNWKLNF 149
P + T+ + + D +E Y D+ WGG N +++F
Sbjct: 79 NHDRTAP------------LRTPTMAGGLFSIDRDYFYEIGSYDEGMDI-WGGENLEMSF 125
Query: 150 R 150
R
Sbjct: 126 R 126
>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 561
Score = 303 bits (777), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 158/290 (54%), Positives = 195/290 (67%), Gaps = 27/290 (9%)
Query: 1 CKKKSYP--TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
CKKK YP LPTTSI+IVFHNEAWSTL+R + S+INRSPR +L+EIILVDDASER
Sbjct: 102 CKKKIYPPSQKLPTTSIIIVFHNEAWSTLIRNIHSIINRSPREILEEIILVDDASER--- 158
Query: 59 PIIDVISDQTFEYITASDMT---------WGGFNWKLREKNRHKKTVVCPIIDVISDQT- 108
D + Q +Y+ + G +LR V+ +D + T
Sbjct: 159 ---DFLGKQLDDYVRGLSVRVRVVRMAERSGIVGARLRGAAISTGEVLT-FLDAHCECTK 214
Query: 109 --FEYITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM 160
E + A+ VV P+ID ISD+TFEY + ++ GGFNW+LNFRWY + RE
Sbjct: 215 GWLEPLIARIAEDRTRVVSPVIDSISDETFEYNSVPELGCGGFNWRLNFRWYPMSKREKK 274
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
RR GD + P+ TPTMAGGLF+I K+YFY +G+YDEGMDIWGGENLEMSFR+W CGG LEI
Sbjct: 275 RRKGDATIPINTPTMAGGLFSIHKEYFYRIGTYDEGMDIWGGENLEMSFRIWMCGGTLEI 334
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+PCSHVGHVFR KSPYTFPGGV+ +V +N R+AEVWMDE++ FYY P
Sbjct: 335 VPCSHVGHVFRGKSPYTFPGGVATVVHNNNRRLAEVWMDEYKSFYYKTVP 384
>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Takifugu rubripes]
Length = 557
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 161/303 (53%), Positives = 189/303 (62%), Gaps = 53/303 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV SVI+RSP TLL+EIILVDDASER
Sbjct: 104 CKNKLYPDNLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKR 163
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ V DQ I A + T G L
Sbjct: 164 PLEQYVRRLEVPVRVVRMDQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLAR 223
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ +KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 224 IKKDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 262
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+R AGG +DYF E+G+YD GMDIWGGENLE+S
Sbjct: 263 FRWYPVPQREMDRRKGDRTLPVRWVRCAGGXXXXXRDYFQEIGTYDAGMDIWGGENLEIS 322
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R+AEVWMDE+++F+Y +
Sbjct: 323 FRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYII 382
Query: 269 NPG 271
+PG
Sbjct: 383 SPG 385
>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 637
Score = 296 bits (758), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 150/282 (53%), Positives = 188/282 (66%), Gaps = 12/282 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C+ +P+ LPTTSIVIVFHNE STLLRT+ S++ RSP ++EII+VDDAS +
Sbjct: 181 CRSHEFPSDLPTTSIVIVFHNEGNSTLLRTLTSIVMRSPTEFIQEIIMVDDASVDREYLK 240
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTV-VCPIIDV---ISDQTFEYITA 114
I++ + + T K R K K T +D S EY+
Sbjct: 241 DILETFVKELPVRVEIIRNTQRLGLMKSRLKGAEKATGDTLTFLDAHIECSPGWLEYLLY 300
Query: 115 K------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VVCPIIDVI+D F Y+T SDMTWGGFNW+LNFRWY VP RE +RR D S
Sbjct: 301 EVKKDRTAVVCPIIDVINDDDFAYLTGSDMTWGGFNWRLNFRWYPVPNREEVRRNYDHSL 360
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
PL +PTMAGGLF ID+ YFYE+G+YD GM++WGGENLEMSFRVWQCGG + I PCSHVGH
Sbjct: 361 PLLSPTMAGGLFTIDRKYFYEIGAYDPGMEVWGGENLEMSFRVWQCGGKVLIHPCSHVGH 420
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR ++PYTFPGG K++ HN R+ EVW+D+++DF YA+ P
Sbjct: 421 VFRKQTPYTFPGGTGKVIFHNNKRLVEVWLDKYKDFVYAIMP 462
>gi|339242863|ref|XP_003377357.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316973849|gb|EFV57398.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 383
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/287 (51%), Positives = 191/287 (66%), Gaps = 24/287 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--- 57
C+ K+Y + LPTTS++IVFHNEAWSTLLRTV+SVINRSP+ LLKEIILVDD S+R
Sbjct: 49 CRNKTYSSALPTTSVIIVFHNEAWSTLLRTVFSVINRSPKKLLKEIILVDDCSQRAFLKK 108
Query: 58 ----------CPIIDVISDQTFEYITA----SDMTWGGFNWKLREKNRHKKTVVCPIIDV 103
P++ V S + I A ++ G L + + P++D
Sbjct: 109 ALDNFVLNLPVPVLIVRSKERIGLIQARILGAEKASGDVLTFLDSHCECTEGWLEPLLDR 168
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I+ K V P+IDVI+D+TF+Y D+ GGFNW L FRWY PP E+ RRG
Sbjct: 169 IA------FDRKIAVAPVIDVINDETFQYQKGIDVYRGGFNWNLQFRWYSSPPSELKRRG 222
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT+AGGLF+ID+ +F+E+G+YD+ M IWGGENLEMSFR+WQCGG LEIIPC
Sbjct: 223 NDVTHPVRTPTIAGGLFSIDRQFFFEIGAYDKEMKIWGGENLEMSFRIWQCGGQLEIIPC 282
Query: 224 SHVGHVFRDKSPYTFP-GGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
SHVGHVFR KSP+ FP G ++ + N RVAEVWMDEW+ +Y ++
Sbjct: 283 SHVGHVFRKKSPHDFPRGNSARTLTTNLVRVAEVWMDEWKSLFYIIS 329
>gi|241133788|ref|XP_002404588.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215493637|gb|EEC03278.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 459
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 150/282 (53%), Positives = 190/282 (67%), Gaps = 13/282 (4%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CKK Y T P TSI+IVFHNEAWSTLLRTV S INRSPR LL+EI+LVDDASER
Sbjct: 3 CKKLKYNTEGYPDTSIIIVFHNEAWSTLLRTVHSAINRSPRHLLREILLVDDASERSKFT 62
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTF---EYIT- 113
P + + + F + +NR + + + D F IT
Sbjct: 63 APASYFLPQTLHIFSIQGGIFRRIFASAVFHQNR---SFLSVFVSGREDSLFIANLRITR 119
Query: 114 -AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
A VVCP+ID+I+D+TF Y+ + +M WG FNW+L+FRW+ V RE RR G+ ++P RT
Sbjct: 120 QATVVVCPVIDIINDETFAYVRSFEMHWGAFNWELHFRWFPVGEREHKRRSGNATAPFRT 179
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
P MAGGLF+ID+ YFYE+G+YD+ MDIWGGEN+E+SFR+WQCGG +E++PCSHVGH+FR
Sbjct: 180 PVMAGGLFSIDRGYFYEMGAYDDQMDIWGGENMEISFRIWQCGGSVEVVPCSHVGHLFRR 239
Query: 233 KSPYTF--PGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGK 272
SPYTF PGGV ++ N ARVA VWMDEW FY+ MN G+
Sbjct: 240 TSPYTFPNPGGVGSVLFSNLARVAAVWMDEWAAFYFNMNRGE 281
>gi|391342179|ref|XP_003745400.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Metaseiulus occidentalis]
Length = 610
Score = 292 bits (747), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/288 (53%), Positives = 190/288 (65%), Gaps = 25/288 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C+ SY LP TS++IVFHNEAWSTLLRTV SVINRSPR L+KEI+LVDDAS+R
Sbjct: 147 CRNISYAYDLPDTSVIIVFHNEAWSTLLRTVHSVINRSPRDLVKEIMLVDDASDREFLKR 206
Query: 56 --------VVCPIIDVISDQTFEYITASDMTWGGFNWK-LREKNRHKKTV---VCPIIDV 103
+ PI + S + I A M K L + H + + P++
Sbjct: 207 SLDAYVRSLNFPIKVIRSPKRSGLIRARLMGARAAEGKVLTFLDAHCECTTGWLEPLLQR 266
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I + VVCPIID+I D TF Y+ + ++ WG NW+++FRWY V P + +R
Sbjct: 267 IKEDR------TRVVCPIIDIIHDDTFAYVKSFELHWGAINWEMHFRWYPVGPHVLKQRH 320
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GD S P +TP MAGGLF+IDK+YFYE+G+YDE MDIWGGEN+EMSFR+WQCGG LEI+PC
Sbjct: 321 GDPSEPFKTPVMAGGLFSIDKEYFYEMGAYDEQMDIWGGENVEMSFRIWQCGGSLEIVPC 380
Query: 224 SHVGHVFRDKSPYTF--PGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
SHVGHVFR SPYTF P GV I+ N ARVAEVWMD+W +FY+ MN
Sbjct: 381 SHVGHVFRRSSPYTFPHPKGVGGILFSNLARVAEVWMDDWAEFYFNMN 428
>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
Length = 670
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 143/281 (50%), Positives = 187/281 (66%), Gaps = 11/281 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ Y TS+VIVFHNEAWSTL+RTV SVINRS L+EIILVDDASE+ ++
Sbjct: 121 CRSIKYEKISLKTSVVIVFHNEAWSTLMRTVQSVINRSSVDYLEEIILVDDASEKDELIA 180
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDV---ISDQTFEYITAK 115
+ + + G K V +D ++D E + ++
Sbjct: 181 LVESFLKTIPVAHTLIRLPQRSGLIVGRVRGAEIAKGDVLTFLDAHVEVTDGWLEPLLSR 240
Query: 116 T------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P+IDVISD TF+Y+TA++ TWGGF+W +NFRWY+ RE RRG ++++P
Sbjct: 241 ISEDRTRVVAPVIDVISDDTFQYVTAAESTWGGFSWTMNFRWYQASAREQKRRGKNKTTP 300
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+RTPT+AGGLF+ID+ YF+++G+YDEGM IWGGENLE+SFRVW CGG LEI PCSHVGHV
Sbjct: 301 IRTPTIAGGLFSIDRKYFFDIGAYDEGMRIWGGENLEISFRVWMCGGTLEINPCSHVGHV 360
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FR ++PYTF GG S ++ NA R AEVWMDE+++FYY M P
Sbjct: 361 FRKQTPYTFEGGTSNVIYGNARRTAEVWMDEYKEFYYKMTP 401
>gi|242020557|ref|XP_002430719.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515909|gb|EEB17981.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 511
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 147/286 (51%), Positives = 184/286 (64%), Gaps = 24/286 (8%)
Query: 2 KKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPII 61
K ++ P LPT S+VIVFHNEAWSTLLRTV SVI+RSPR LL EIILVDD S R
Sbjct: 59 KYQNLPELLPT-SVVIVFHNEAWSTLLRTVQSVIDRSPRELLTEIILVDDGSTR------ 111
Query: 62 DVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYIT 113
+ + EY+ + K RE + K V +D + T ++
Sbjct: 112 KFLKEDLDEYVARLPVPVKVIRTKEREGLIRARMIGAKEAKGQVLTFLDAHCECTKGWLE 171
Query: 114 A---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
K VVCP+ID+I+D TF Y+ + ++ WG FNW L+FRWY + E+ +R
Sbjct: 172 PLLVRVSEDRKKVVCPVIDIINDDTFAYVRSFELHWGAFNWNLHFRWYTLGTTEIKKRKN 231
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + P TP MAGGLFAI +DYFYE+G+YDE M IWGGENLEMSFR WQCGG +EI+PCS
Sbjct: 232 DVTEPFPTPAMAGGLFAIRRDYFYEIGAYDEQMKIWGGENLEMSFRGWQCGGSVEIVPCS 291
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVGH+FR SPYTFPGGV +I+ N ARVA VWMDEW++F++ NP
Sbjct: 292 HVGHLFRKSSPYTFPGGVGEILHANLARVALVWMDEWQEFFFKFNP 337
>gi|328723398|ref|XP_001946977.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 1 [Acyrthosiphon pisum]
gi|328723400|ref|XP_003247833.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 2 [Acyrthosiphon pisum]
Length = 624
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 144/273 (52%), Positives = 184/273 (67%), Gaps = 13/273 (4%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISD- 66
LP+++++IVFHNEAWSTL+RTV SVI+RSP+ LL EIILVDDAS R + + D ++
Sbjct: 161 LPSSTVIIVFHNEAWSTLMRTVQSVIDRSPKYLLNEIILVDDASTRKFLEKELDDYVAKL 220
Query: 67 QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---------KTV 117
I S G +L R K + +D + T ++ A K V
Sbjct: 221 PVLTRIIRSPKRIGLIKARLM-GARQAKGKILVFLDAHCECTLGWLEALVSRVAEDRKRV 279
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
VCP+ID+ISD+TF Y+ + ++ WG FNW L+FRWY ++M+ D + RTP MAG
Sbjct: 280 VCPVIDIISDETFAYVRSFELHWGAFNWDLHFRWYTRTTPDIMKGQRDITQAFRTPAMAG 339
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLFA+DK YF+ELG YDE M+IWGGENLE+SFRVWQCGG +EI PCSHVGHVFR SPYT
Sbjct: 340 GLFAMDKSYFFELGGYDERMEIWGGENLELSFRVWQCGGSIEIAPCSHVGHVFRKSSPYT 399
Query: 238 FPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FPGGVS ++ N ARVA VWMDEW++FY+ NP
Sbjct: 400 FPGGVSHVLYTNLARVALVWMDEWQEFYFKFNP 432
>gi|157113401|ref|XP_001657811.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108877741|gb|EAT41966.1| AAEL006452-PA [Aedes aegypti]
Length = 661
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 195/327 (59%), Gaps = 57/327 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K YP+ LPTTSI+IVFHNEAWS LLRTVWSVI RSPR L+KEI+LVDDAS+R
Sbjct: 133 CVSKEYPSKLPTTSIIIVFHNEAWSVLLRTVWSVIIRSPRHLIKEILLVDDASDRRFLKN 192
Query: 56 ----------VVCPIID----------------VISDQTFEYITAS-DMTWGGFNWKLRE 88
VV I+ V + T ++ A + + G L
Sbjct: 193 DLENYVQKLPVVISILRLNKREGLVAARLMGARVATGDTLTFLDAHCECSPGWLEPLLAR 252
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ K VVCP+ID+ISD F YI ++FE+ WG FNW+++
Sbjct: 253 VQENPKKVVCPVIDIISDDNFSYI---------------KSFEF------HWGAFNWQMH 291
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY + E+ R D + P TP MAGGLF ID+ YF+++G+YDE + IWGG+NLEMS
Sbjct: 292 FRWYTLSDEELAERRKDTTMPFHTPAMAGGLFTIDRKYFFDVGAYDERLKIWGGDNLEMS 351
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FR+WQCGG +EI PCSHVGH+FR SPYTFPGGVS I+ N ARVA VWMD+W F++
Sbjct: 352 FRIWQCGGEIEIAPCSHVGHLFRKSSPYTFPGGVSGILNENLARVALVWMDDWAKFFFKF 411
Query: 269 NPG----KSASVSTCAAHFRMLSYSSW 291
N G KS +VS+ A + LS S+
Sbjct: 412 NKGTEEFKSLNVSSRVALKKHLSCKSF 438
>gi|158296916|ref|XP_317241.4| AGAP008229-PA [Anopheles gambiae str. PEST]
gi|157014942|gb|EAA12407.4| AGAP008229-PA [Anopheles gambiae str. PEST]
Length = 663
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 153/327 (46%), Positives = 194/327 (59%), Gaps = 57/327 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K YP LPTTSI+IVFHNEAWS LLRTVWSVINRSP+ L++EI+LVDDAS+R
Sbjct: 133 CVSKLYPAKLPTTSIIIVFHNEAWSVLLRTVWSVINRSPKGLVREILLVDDASDRRFLKH 192
Query: 56 ------------------------VVCPIID--VISDQTFEYITAS-DMTWGGFNWKLRE 88
V ++ + + T ++ A + + G L
Sbjct: 193 ELDNYVQKLPLSVTILRLNKREGLVAARLLGARMATGDTLTFLDAHCECSPGWLEPLLAR 252
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ K VVCP+ID+ISD F YI ++FE+ WG FNW L+
Sbjct: 253 VQENPKKVVCPVIDIISDDNFSYI---------------KSFEF------HWGAFNWPLH 291
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY + E+ R D ++P RTP MAGGLF ID+ YF+++G+YDE + IWGG+NLEMS
Sbjct: 292 FRWYTLSDEELAERRKDTTTPFRTPAMAGGLFTIDRKYFFDIGAYDERLKIWGGDNLEMS 351
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
FRVWQCGG +EI PCSHVGH+FR SPYTFPGGVS I+ N ARVA VWMD+W F++
Sbjct: 352 FRVWQCGGEVEIAPCSHVGHLFRKSSPYTFPGGVSGILNENLARVALVWMDDWAKFFFKF 411
Query: 269 NPG----KSASVSTCAAHFRMLSYSSW 291
N G KS +VS A R L+ S+
Sbjct: 412 NKGTEEFKSLNVSNRLALKRSLNCKSF 438
>gi|307183924|gb|EFN70514.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Camponotus
floridanus]
Length = 471
Score = 280 bits (715), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 144/278 (51%), Positives = 179/278 (64%), Gaps = 23/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP TSI+IVFHNEAWSTLLRTV SVINRSPR LLKEIILVDD SER + P+ D +
Sbjct: 58 LPKTSIIIVFHNEAWSTLLRTVHSVINRSPRELLKEIILVDDNSEREFLKNPLDDYVKTL 117
Query: 68 TFE-YITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYI 112
+ + S+ G +L + K V+ P+++ I
Sbjct: 118 SVPTRVLRSNARIGLIKARLLGAHNAKGEVLTFLDAHCECTVGWLEPLLEAIGK------ 171
Query: 113 TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
A VV P+ID+I+D TF Y + ++ WG FNW L+FRW + R + R + P RT
Sbjct: 172 NATRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRDNIIEPFRT 231
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
P MAGGLF+++KDYF++LGSYD+ M IWGGENLE+SFR WQCGG +EI PCSHVGH+FR
Sbjct: 232 PAMAGGLFSMNKDYFFKLGSYDDEMRIWGGENLELSFRTWQCGGSVEIAPCSHVGHLFRK 291
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
SPYTFPGGV I+ N ARVA VWMD+W DFY+ NP
Sbjct: 292 SSPYTFPGGVGDILYGNLARVALVWMDQWADFYFKFNP 329
>gi|345497732|ref|XP_001601595.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Nasonia vitripennis]
Length = 610
Score = 279 bits (714), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 143/278 (51%), Positives = 181/278 (65%), Gaps = 23/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP+TS++IVFHNEAWSTLLRTV SVINRSPR LL+EIILVDD S+R + P+ + ++
Sbjct: 157 LPSTSVIIVFHNEAWSTLLRTVHSVINRSPRKLLEEIILVDDNSDRDFLRKPLDEYVAQL 216
Query: 68 TFE-YITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYI 112
+ SD G N +L N K V+ P+++ IS
Sbjct: 217 NVPTRVLRSDKRVGLVNARLMGANEAKGEVLTFLDAHCECTAGWLEPLLEAISK------ 270
Query: 113 TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
VV P+ID+I+D TF Y + ++ WG FNW L+FRW + + R + P +T
Sbjct: 271 NRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLMLNGALLRERRENIVDPFKT 330
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
P MAGGLF++D++YF+ELGSYDE M IWGGENLE+SFRVWQCGG +EI PCSHVGH+FR
Sbjct: 331 PAMAGGLFSMDREYFFELGSYDEHMRIWGGENLELSFRVWQCGGSVEIAPCSHVGHIFRK 390
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
SPYTFPGGV +I+ N ARVA VWMDEW FY+ NP
Sbjct: 391 SSPYTFPGGVDEILYGNLARVALVWMDEWGKFYFNFNP 428
>gi|340711409|ref|XP_003394268.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Bombus terrestris]
Length = 604
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 146/278 (52%), Positives = 183/278 (65%), Gaps = 25/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TSI+IVFHNEAWSTLLRTV+SVINRSPR LL+EIILVDD S+R D + D
Sbjct: 154 LPKTSIIIVFHNEAWSTLLRTVYSVINRSPRHLLEEIILVDDNSDR------DFLKDALD 207
Query: 70 EYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI------TA 114
E++ S G N +L NR K V+ +D + T ++ A
Sbjct: 208 EHVKNLKVSTKVLRSRKRIGLVNARLLGANRAKGEVLT-FLDAHCECTVGWLEPLLEAVA 266
Query: 115 KT---VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
K VV P+ID+I+D TF Y + ++ WG FNW L+FRW + R + R + P R
Sbjct: 267 KNRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPFR 326
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TP MAGGLF+++++YF+ELGSYD+ M IWGGENLE+SFRVWQCGG +EI PCSHVGH+FR
Sbjct: 327 TPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLFR 386
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
SPYTFPGGV +I+ N ARVA VWMDEW +FY+ N
Sbjct: 387 KSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFN 424
>gi|350416150|ref|XP_003490858.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Bombus impatiens]
Length = 604
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 146/278 (52%), Positives = 183/278 (65%), Gaps = 25/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TSI+IVFHNEAWSTLLRTV+SVINRSPR LL+EIILVDD S+R D + D
Sbjct: 154 LPKTSIIIVFHNEAWSTLLRTVYSVINRSPRHLLEEIILVDDNSDR------DFLKDALD 207
Query: 70 EYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI------TA 114
E++ S G N +L NR K V+ +D + T ++ A
Sbjct: 208 EHVKNLKVSTKVLRSRKRIGLVNARLLGANRAKGEVLT-FLDAHCECTVGWLEPLLEAVA 266
Query: 115 KT---VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
K VV P+ID+I+D TF Y + ++ WG FNW L+FRW + R + R + P R
Sbjct: 267 KNRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPFR 326
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TP MAGGLF+++++YF+ELGSYD+ M IWGGENLE+SFRVWQCGG +EI PCSHVGH+FR
Sbjct: 327 TPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLFR 386
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
SPYTFPGGV +I+ N ARVA VWMDEW +FY+ N
Sbjct: 387 KSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFN 424
>gi|189240187|ref|XP_975207.2| PREDICTED: similar to AGAP008229-PA [Tribolium castaneum]
Length = 575
Score = 276 bits (707), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 184/296 (62%), Gaps = 54/296 (18%)
Query: 6 YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDV 63
YPT+ P TSI+IVFHNEAWSTLLRTVWSVINRSP LL+EIILVDD+SER + P+ D
Sbjct: 119 YPTY-PKTSIIIVFHNEAWSTLLRTVWSVINRSPPELLEEIILVDDSSERKFLKKPLDDY 177
Query: 64 ISD-----------------------------QTFEYITAS-DMTWGGFNWKLREKNRHK 93
+++ ++ A + T G L + +
Sbjct: 178 VANLPVPTKVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEALLSVIKQDR 237
Query: 94 KTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
VVCP+ID+I+D TF Y+ ++FE + WG FNW L FRW+
Sbjct: 238 TAVVCPVIDIINDDTFAYV---------------KSFE------LHWGAFNWNLQFRWFT 276
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+ RE+ R D + P TPTMAGGLFAID++YF+E+G+YD+GM+IWGGENLEMSFR+WQ
Sbjct: 277 LGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQ 336
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
CGG ++I PCS VGH+FR SPY+FPGG++K + N ARVA VWMD+W FY+ N
Sbjct: 337 CGGKVQIAPCSRVGHLFRKSSPYSFPGGINKTLFSNLARVARVWMDDWARFYFKFN 392
>gi|270011650|gb|EFA08098.1| hypothetical protein TcasGA2_TC005702 [Tribolium castaneum]
Length = 607
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 184/296 (62%), Gaps = 54/296 (18%)
Query: 6 YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDV 63
YPT+ P TSI+IVFHNEAWSTLLRTVWSVINRSP LL+EIILVDD+SER + P+ D
Sbjct: 151 YPTY-PKTSIIIVFHNEAWSTLLRTVWSVINRSPPELLEEIILVDDSSERKFLKKPLDDY 209
Query: 64 ISD-----------------------------QTFEYITAS-DMTWGGFNWKLREKNRHK 93
+++ ++ A + T G L + +
Sbjct: 210 VANLPVPTKVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEALLSVIKQDR 269
Query: 94 KTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
VVCP+ID+I+D TF Y+ ++FE + WG FNW L FRW+
Sbjct: 270 TAVVCPVIDIINDDTFAYV---------------KSFE------LHWGAFNWNLQFRWFT 308
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+ RE+ R D + P TPTMAGGLFAID++YF+E+G+YD+GM+IWGGENLEMSFR+WQ
Sbjct: 309 LGGRELKLRKNDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQ 368
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
CGG ++I PCS VGH+FR SPY+FPGG++K + N ARVA VWMD+W FY+ N
Sbjct: 369 CGGKVQIAPCSRVGHLFRKSSPYSFPGGINKTLFSNLARVARVWMDDWARFYFKFN 424
>gi|380030377|ref|XP_003698825.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Apis florea]
Length = 595
Score = 276 bits (706), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 145/278 (52%), Positives = 181/278 (65%), Gaps = 25/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TSI+IVFHNEAWSTLLRTV+SVI+RSPR LL+EIILVDD S+R D + D
Sbjct: 145 LPKTSIIIVFHNEAWSTLLRTVYSVIDRSPRQLLEEIILVDDNSDR------DFLKDTLD 198
Query: 70 EYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI------TA 114
E++ S G N +L N K V+ +D + T ++ A
Sbjct: 199 EHVKNLQVSTKVLRSRKRIGLVNARLLGANNAKGEVLT-FLDAHCECTVGWLEPLLEAVA 257
Query: 115 KT---VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
K VV P+ID+I+D TF Y + ++ WG FNW L+FRW + R + R + P R
Sbjct: 258 KNRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPFR 317
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TP MAGGLF++++DYF+ELGSYD M IWGGENLE+SFRVWQCGG +EI PCSHVGH+FR
Sbjct: 318 TPAMAGGLFSMNRDYFFELGSYDNQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLFR 377
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
SPYTFPGGV +I+ N ARVA VWMDEW +FY+ N
Sbjct: 378 KSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFN 415
>gi|383848548|ref|XP_003699911.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Megachile rotundata]
Length = 604
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 145/282 (51%), Positives = 184/282 (65%), Gaps = 25/282 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TSI+IVFHNEAWSTLLRTV+SV+NRSPR LL+EIILVDD S+R + + D
Sbjct: 154 LPRTSIIIVFHNEAWSTLLRTVYSVVNRSPRHLLEEIILVDDDSDR------EFLKDALD 207
Query: 70 EYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI------TA 114
E++ + S G N +L N K V+ +D + T ++ A
Sbjct: 208 EHVKSLRVPTKVLRSKKRIGLVNARLLGANEAKGEVLT-FLDAHCECTVGWLEPLLEAVA 266
Query: 115 KT---VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
K VV P+ID+I+D TF Y + ++ WG FNW L+FRW + R + R + P R
Sbjct: 267 KNKTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPFR 326
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TP MAGGLF++++DYF+ELGSYD+ M IWGGENLE+SFRVWQCGG +EI PCSHVGH+FR
Sbjct: 327 TPAMAGGLFSMNRDYFFELGSYDDQMKIWGGENLELSFRVWQCGGSVEIAPCSHVGHLFR 386
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKS 273
SPYTFPGGV +I+ N ARVA VWMDEW +FY+ N S
Sbjct: 387 KSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFNAEAS 428
>gi|195149249|ref|XP_002015570.1| GL10955 [Drosophila persimilis]
gi|194109417|gb|EDW31460.1| GL10955 [Drosophila persimilis]
Length = 667
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/295 (48%), Positives = 183/295 (62%), Gaps = 25/295 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y + LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDASER
Sbjct: 140 CRDKKYDSGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASER----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQT---F 109
+ Q YI + + K R H + V +D + +
Sbjct: 195 -SYLKRQLESYIRVLTVPTRIYRMKKRSGLVPARLLGAEHARGDVLTFLDAHCECSRGWL 253
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR- 162
E + A+ V+CP+ID+ISD F Y + WGGFNW+L+FRW+ +
Sbjct: 254 EPLLARIKEARNVVICPVIDIISDDNFSYTKTFENHWGGFNWQLSFRWFSSERKRQTTEI 313
Query: 163 -GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D ++P+ TP MAGGLFAID++YFYE+GSYD M +WGGEN+EMSFR+WQCGG +EI
Sbjct: 314 TAKDSTAPIATPGMAGGLFAIDRNYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEIS 373
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F G S SV
Sbjct: 374 PCSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYFVMLYTSGLSLSV 428
>gi|125806852|ref|XP_001360187.1| GA18187 [Drosophila pseudoobscura pseudoobscura]
gi|54635358|gb|EAL24761.1| GA18187 [Drosophila pseudoobscura pseudoobscura]
Length = 667
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 143/295 (48%), Positives = 183/295 (62%), Gaps = 25/295 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y + LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDASER
Sbjct: 140 CRDKKYDSGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASER----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQT---F 109
+ Q YI + + K R H + V +D + +
Sbjct: 195 -SYLKRQLESYIRVLTVPTRIYRMKKRSGLVPARLLGAEHARGDVLTFLDAHCECSRGWL 253
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR- 162
E + A+ V+CP+ID+ISD F Y + WGGFNW+L+FRW+ +
Sbjct: 254 EPLLARIKEARNVVICPVIDIISDDNFSYTKTFENHWGGFNWQLSFRWFSSERKRQTTEI 313
Query: 163 -GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D ++P+ TP MAGGLFAID++YFYE+GSYD M +WGGEN+EMSFR+WQCGG +EI
Sbjct: 314 TAKDSTAPIATPGMAGGLFAIDRNYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEIS 373
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F G S SV
Sbjct: 374 PCSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYFVMLYTSGLSLSV 428
>gi|71987795|ref|NP_001022646.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
gi|3047201|gb|AAC13676.1| GLY6c [Caenorhabditis elegans]
gi|14530525|emb|CAC42318.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
Length = 562
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/289 (49%), Positives = 185/289 (64%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ +YP LPTTS++IV+HNEA+STLLRTVWSVI+RSP+ LLKEIILVDD S+R +
Sbjct: 147 CRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLRY 206
Query: 59 PIID------------VISDQTFEYITASDM----TWGGFNWKLREKNRHKKTVVCPIID 102
P +D + S + I A M G L K + P++
Sbjct: 207 PTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + K V CP+ID+I+D TF+Y +M GGFNW L FRWY +P +
Sbjct: 267 RIK------LNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQH 320
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P+ +PTMAGGLF+I+++YF ELG YD GMDIWGGENLEMSFR+WQCGG +EI+P
Sbjct: 321 LLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILP 380
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVFR SP+ FPG S VL+ N RVAEVWMD+W+ ++Y + P
Sbjct: 381 CSHVGHVFRKSSPHDFPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAP 429
>gi|332030446|gb|EGI70134.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Acromyrmex
echinatior]
Length = 595
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 139/273 (50%), Positives = 180/273 (65%), Gaps = 13/273 (4%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP TSI+IVFHNEAWSTLLRTV SVINRSP+ LL+EIILVDD SER + + D + +
Sbjct: 163 LPKTSIIIVFHNEAWSTLLRTVHSVINRSPKELLEEIILVDDNSEREFLKNSLDDYVKNL 222
Query: 68 TFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI---------TAKTV 117
+ + S+ G +L N K V+ +D + T ++ A +
Sbjct: 223 SVSTRVLRSNERIGLIKARLLGANDAKGEVLT-FLDAHCECTIGWLEPLLEAVGKNATRI 281
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
V P+ID+I+D TF Y + ++ WG FNW L+FRW + R + R + P RTP MAG
Sbjct: 282 VAPVIDIINDNTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRDNIVEPFRTPAMAG 341
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLF++++DYF++LGSYD+ M IWGGENLE+SFR WQCGG +EI PCSHVGH+FR SPYT
Sbjct: 342 GLFSMNRDYFFKLGSYDDQMRIWGGENLELSFRAWQCGGSIEIAPCSHVGHLFRKSSPYT 401
Query: 238 FPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FPGGV I+ N ARVA VWMD+W +FY+ NP
Sbjct: 402 FPGGVGDILYGNLARVALVWMDQWAEFYFKFNP 434
>gi|328783898|ref|XP_003250361.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Apis
mellifera]
Length = 603
Score = 275 bits (702), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/278 (52%), Positives = 181/278 (65%), Gaps = 25/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TSI+IVFHNEAWSTLLRTV+SVI+RSP LL+EIILVDD S+R D + D
Sbjct: 153 LPKTSIIIVFHNEAWSTLLRTVYSVIDRSPIQLLEEIILVDDNSDR------DFLKDALD 206
Query: 70 EYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI------TA 114
E+I S G N +L N+ K V+ +D + T ++ A
Sbjct: 207 EHIKNLQVSTKVLRSKKRIGLVNARLLGANKAKGEVLT-FLDAHCECTVGWLEPLLEAVA 265
Query: 115 KT---VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
K VV P+ID+I+D TF Y + ++ WG FNW L+FRW + R + R + P R
Sbjct: 266 KNRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPFR 325
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TP MAGGLF++++DYF+ELGSYD M IWGGENLE+SFRVWQCGG +EI PCSHVGH+FR
Sbjct: 326 TPAMAGGLFSMNRDYFFELGSYDNQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLFR 385
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
SPYTFPGGV +I+ N ARVA VWMDEW +FY+ N
Sbjct: 386 KSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFN 423
>gi|308487864|ref|XP_003106127.1| CRE-GLY-6 protein [Caenorhabditis remanei]
gi|308254701|gb|EFO98653.1| CRE-GLY-6 protein [Caenorhabditis remanei]
Length = 693
Score = 275 bits (702), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/289 (50%), Positives = 186/289 (64%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ +YP LPTTS++IV+HNEA+STLLRTVWSVI+RSP+ LL+EI+LVDD S+R +
Sbjct: 147 CRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLREILLVDDFSDRDFLRY 206
Query: 59 PIID------------VISDQTFEYITASDM----TWGGFNWKLREKNRHKKTVVCPIID 102
P +D + S+Q I A M G L K + P++
Sbjct: 207 PKLDESLKPLPTDIKIIRSNQRVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + K V CP+ID+I+D TF+Y +M GGFNW L FRWY +P +
Sbjct: 267 RIK------LNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTEMAKQH 320
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P+ +PTMAGGLF+ID++YF ELG YD GMDIWGGENLEMSFR+WQCGG +EI+P
Sbjct: 321 LLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILP 380
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVFR SP+ FPG S VL+ N RVAEVWMDEW+ ++Y + P
Sbjct: 381 CSHVGHVFRKSSPHDFPGKSSGKVLNANLLRVAEVWMDEWKYYFYKIAP 429
>gi|268574330|ref|XP_002642142.1| C. briggsae CBR-GLY-6 protein [Caenorhabditis briggsae]
Length = 617
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 146/289 (50%), Positives = 185/289 (64%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ +YP LPTTS++IV+HNEA+STLLRTVWSVI+RSP+ LLKEIILVDD S+R +
Sbjct: 147 CRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLKEIILVDDFSDREFLRY 206
Query: 59 PIID------------VISDQTFEYITASDM----TWGGFNWKLREKNRHKKTVVCPIID 102
P +D + S + I A M G L K + P++
Sbjct: 207 PKLDESIKPIPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + K V CP+ID+I+D TF+Y +M GGFNW L FRWY +P +
Sbjct: 267 RIK------LNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPSSMAKQH 320
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P+ +PTMAGGLF+ID++YF ELG YD GMDIWGGENLEMSFR+WQCGG +EI+P
Sbjct: 321 LLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILP 380
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVFR SP+ FPG S VL+ N RVAEVWMDEW+ ++Y + P
Sbjct: 381 CSHVGHVFRKSSPHDFPGKSSGKVLNANLLRVAEVWMDEWKYYFYKIAP 429
>gi|71987784|ref|NP_001022644.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
gi|51315809|sp|O61394.1|GALT6_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 6;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 6; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|3047197|gb|AAC13674.1| GLY6a [Caenorhabditis elegans]
gi|3878104|emb|CAA19707.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
Length = 618
Score = 274 bits (700), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 144/289 (49%), Positives = 185/289 (64%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ +YP LPTTS++IV+HNEA+STLLRTVWSVI+RSP+ LLKEIILVDD S+R +
Sbjct: 147 CRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLRY 206
Query: 59 PIID------------VISDQTFEYITASDM----TWGGFNWKLREKNRHKKTVVCPIID 102
P +D + S + I A M G L K + P++
Sbjct: 207 PTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + K V CP+ID+I+D TF+Y +M GGFNW L FRWY +P +
Sbjct: 267 RIK------LNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQH 320
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P+ +PTMAGGLF+I+++YF ELG YD GMDIWGGENLEMSFR+WQCGG +EI+P
Sbjct: 321 LLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILP 380
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVFR SP+ FPG S VL+ N RVAEVWMD+W+ ++Y + P
Sbjct: 381 CSHVGHVFRKSSPHDFPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAP 429
>gi|195120313|ref|XP_002004673.1| GI20058 [Drosophila mojavensis]
gi|193909741|gb|EDW08608.1| GI20058 [Drosophila mojavensis]
Length = 668
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 139/296 (46%), Positives = 184/296 (62%), Gaps = 26/296 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K Y +PTTS++IVFHNEAWS LLRT+ SVINRSPR LL+EIILVDDAS+R
Sbjct: 140 CREKRYTQNMPTTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASDR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQT---F 109
+ Q YI + + K R +H + V +D + +
Sbjct: 195 -SFLKRQLEAYIEVLKVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWL 253
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWY---RVPPREMM 160
E + A+ V+CP+ID+ISD F Y + WG FNW+L+FRW+ R + +
Sbjct: 254 EPLLARIKESRNVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKTRQAIA 313
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
+ D ++P+ TP MAGGLFAID+ YFYE+G+YD M IWGGEN+EMSFR+WQCGG +EI
Sbjct: 314 KENKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDRDMRIWGGENVEMSFRIWQCGGRIEI 373
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
PCSHVGH+FR +PYTFPGG+S+++ N AR A VWMD+W+ F G S S
Sbjct: 374 SPCSHVGHIFRSSTPYTFPGGMSEVLTSNLARAATVWMDDWQYFVMLYTAGLSLSA 429
>gi|71987788|ref|NP_001022645.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
gi|3047199|gb|AAC13675.1| GLY6b [Caenorhabditis elegans]
gi|14530524|emb|CAC42317.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
Length = 617
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 144/289 (49%), Positives = 185/289 (64%), Gaps = 25/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ +YP LPTTS++IV+HNEA+STLLRTVWSVI+RSP+ LLKEIILVDD S+R +
Sbjct: 147 CRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLRY 206
Query: 59 PIID------------VISDQTFEYITASDM----TWGGFNWKLREKNRHKKTVVCPIID 102
P +D + S + I A M G L K + P++
Sbjct: 207 PTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + K V CP+ID+I+D TF+Y +M GGFNW L FRWY +P +
Sbjct: 267 RIK------LNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQH 320
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P+ +PTMAGGLF+I+++YF ELG YD GMDIWGGENLEMSFR+WQCGG +EI+P
Sbjct: 321 LLDPTGPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILP 380
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVFR SP+ FPG S VL+ N RVAEVWMD+W+ ++Y + P
Sbjct: 381 CSHVGHVFRKSSPHDFPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAP 429
>gi|195402751|ref|XP_002059968.1| GJ14949 [Drosophila virilis]
gi|194140834|gb|EDW57305.1| GJ14949 [Drosophila virilis]
Length = 666
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 140/294 (47%), Positives = 183/294 (62%), Gaps = 24/294 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDAS+R
Sbjct: 140 CRDKRYAHGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRQLLKEIILVDDASDR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQT---F 109
+ Q YI ++ + K R +H + V +D + +
Sbjct: 195 -SFLKRQLEAYIKVLNVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWL 253
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVP-PREMMRR 162
E + A+ V+CP+ID+ISD F Y + WG FNW+L+FRW+ R+ +
Sbjct: 254 EPLLARIKESREVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTSVK 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D ++P+ TP MAGGLFAID+ YFYE+G+YD M IWGGEN+EMSFR+WQCGG +EI P
Sbjct: 314 PKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDSEMRIWGGENVEMSFRIWQCGGRIEISP 373
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
CSHVGH+FR +PYTFPGG+S+++ N AR A VWMD+W+ F G S S
Sbjct: 374 CSHVGHIFRSSTPYTFPGGMSEVLTANLARAATVWMDDWQYFVMLYTAGLSLSA 427
>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/281 (50%), Positives = 178/281 (63%), Gaps = 12/281 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C K YP LPTTS+++V+HNEA STLLR V S+INRSP LL EIILVDDAS E +
Sbjct: 46 CANKVYPKKLPTTSVILVYHNEARSTLLRNVHSIINRSPHDLLAEIILVDDASDQEHLGK 105
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ D I+ Y+ G ++ K V+ C + + +
Sbjct: 106 SLEDYIAKLPVSVYVVKMKGRSGLIRARMAGAAVAKGQVLTFLDSHCEVTEGWLEPMLAR 165
Query: 112 ITAK--TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
I T VCP+IDVISD TF+Y +D GGF W L F+W+ VP RE +RR GD + P
Sbjct: 166 IAEDRTTSVCPVIDVISDDTFQYQHGNDPQMGGFGWSLFFKWFPVPKREQIRRKGDPTEP 225
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+R TMAGGLFAIDK YF ELG YD G +IWGGENLE+SF++W CGG LE IPCSHVGHV
Sbjct: 226 VRVSTMAGGLFAIDKSYFEELGQYDPGFNIWGGENLELSFKLWMCGGKLEFIPCSHVGHV 285
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FR KSPY FP G + V N R+AEVW+DE+++FYY ++P
Sbjct: 286 FRKKSPYHFPPGTN-YVNKNNKRLAEVWLDEYKNFYYRISP 325
>gi|195474291|ref|XP_002089425.1| GE24246 [Drosophila yakuba]
gi|194175526|gb|EDW89137.1| GE24246 [Drosophila yakuba]
Length = 667
Score = 270 bits (689), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 140/293 (47%), Positives = 180/293 (61%), Gaps = 31/293 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y + LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDAS+R
Sbjct: 140 CRDKKYASGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYI 112
+ Q Y+ + F K R + + V +D + + ++
Sbjct: 195 -SYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWL 253
Query: 113 ---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ K V+CP+ID+ISD F Y + WG FNW+L+FRW+ + RR
Sbjct: 254 EPLLSRIKESRKVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWF---SSDRKRRT 310
Query: 164 GDRSS-----PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
D SS P+ TP MAGGLFAID+ YFYE+GSYD M +WGGEN+EMSFR+WQCGG +
Sbjct: 311 ADNSSKDSTAPIATPGMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRV 370
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
EI PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F G
Sbjct: 371 EISPCSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYFIMLYTSG 423
>gi|195027660|ref|XP_001986700.1| GH20386 [Drosophila grimshawi]
gi|193902700|gb|EDW01567.1| GH20386 [Drosophila grimshawi]
Length = 666
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/294 (47%), Positives = 179/294 (60%), Gaps = 24/294 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP TS++IVFHNEAWS LLRT+ SVINRSPR LL+EIILVDDAS R
Sbjct: 140 CRDKRYANSLPNTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASNR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQT---F 109
+ Q YI + + K R +H + V +D + +
Sbjct: 195 -SFLKRQLEAYIQVLAVPTRLYRMKERSGLVPARLLGAQHARGDVLTFLDAHCECSRGWL 253
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVP-PREMMRR 162
E + A+ V+CP+ID+ISD F Y + WG FNW+L+FRW+ R+
Sbjct: 254 EPLLARIGESREVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTTAN 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D ++P+ TP MAGGLFAID+ YFYE+G+YD M IWGGEN+EMSFR+WQCGG +EI P
Sbjct: 314 TKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDSDMRIWGGENVEMSFRIWQCGGRIEISP 373
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
CSHVGH+FR +PYTFPGG+S+++ N AR A VWMD+W+ F G S S
Sbjct: 374 CSHVGHIFRSSTPYTFPGGMSEVLTANLARAATVWMDDWQYFVMLYTAGLSLSA 427
>gi|194753512|ref|XP_001959056.1| GF12252 [Drosophila ananassae]
gi|190620354|gb|EDV35878.1| GF12252 [Drosophila ananassae]
Length = 667
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 177/290 (61%), Gaps = 25/290 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDAS+R
Sbjct: 140 CRDKKYTGSLPHTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDRTY--- 196
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYI 112
+ Q Y+ + + K R H + V +D + + ++
Sbjct: 197 ---LKRQLESYVKVLSVPTKIYRMKKRSGLVPARLLGAEHARGDVLTFLDAHCECSRGWL 253
Query: 113 ---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ K V+CP+ID+ISD F Y + WG FNW+L+FRW+ +
Sbjct: 254 EPLLARIKESRKVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQATVS 313
Query: 164 G--DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
G D ++P+ TP MAGGLF+ID+ YFYE+GSYD M +WGGEN+EMSFR+WQCGG +EI
Sbjct: 314 GAKDSTAPIATPGMAGGLFSIDRKYFYEMGSYDANMRVWGGENVEMSFRIWQCGGRVEIS 373
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F G
Sbjct: 374 PCSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYFVMLYTSG 423
>gi|324503401|gb|ADY41481.1| N-acetylgalactosaminyltransferase 6 [Ascaris suum]
Length = 927
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/285 (51%), Positives = 188/285 (65%), Gaps = 16/285 (5%)
Query: 1 CKKKSYP--TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER-VV 57
C+ K YP + LPTTS++IV+HNEA+STLLRTV SVI+RSP+ +LKEIILVDD S R +
Sbjct: 147 CRDKIYPAPSELPTTSVIIVYHNEAFSTLLRTVVSVIDRSPKEVLKEIILVDDFSSRSFL 206
Query: 58 CPIID--VISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYI 112
+D V++ I + G +L N V+ +D + T E +
Sbjct: 207 KDDLDNFVVTLGIRIKIIRAQRRVGLIRARLMGANEADGEVLT-FLDSHCECTKGWLEPL 265
Query: 113 TA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
A K VVCP+IDVI+D+TF Y ++ GGFNW L FRWY VPP + R D
Sbjct: 266 LARIKENRKAVVCPVIDVINDRTFAYQKGIELFRGGFNWNLQFRWYAVPPDIVKGRANDP 325
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ P+++PTMAGGLF+IDK YF ELG+YD GM+IWGGEN+E+SFR+WQCGG +EI+PCSHV
Sbjct: 326 TMPIQSPTMAGGLFSIDKRYFEELGAYDPGMEIWGGENIEISFRIWQCGGRIEILPCSHV 385
Query: 227 GHVFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFYYAMNP 270
GH+FR SP+ FPG S KI+ N RVAEVWMDEW+ +Y P
Sbjct: 386 GHIFRKASPHDFPGKSSGKILNSNLLRVAEVWMDEWKYLFYKTAP 430
>gi|194863912|ref|XP_001970676.1| GG10775 [Drosophila erecta]
gi|190662543|gb|EDV59735.1| GG10775 [Drosophila erecta]
Length = 667
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/293 (47%), Positives = 180/293 (61%), Gaps = 31/293 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y + LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDAS+R
Sbjct: 140 CRDKKYASGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYI 112
+ Q Y+ + F K R + + V +D + + ++
Sbjct: 195 -SYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWL 253
Query: 113 ---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ K V+CP+ID+ISD F Y + WG FNW+L+FRW+ + R+
Sbjct: 254 EPLLSRIKESRKVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWF---SSDRKRQT 310
Query: 164 GDRSS-----PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
D SS P+ TP MAGGLFAID+ YFYE+GSYD M +WGGEN+EMSFR+WQCGG +
Sbjct: 311 ADNSSKDSTAPIATPGMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRV 370
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
EI PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F G
Sbjct: 371 EISPCSHVGHVFRSSTPYTFPGGMSEVLTENLARAATVWMDDWQYFVMLYTSG 423
>gi|19921720|ref|NP_610256.1| polypeptide GalNAc transferase 3 [Drosophila melanogaster]
gi|51316121|sp|Q9Y117.1|GALT3_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
Short=pp-GaNTase 3; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|5052600|gb|AAD38630.1|AF145655_1 BcDNA.GH09147 [Drosophila melanogaster]
gi|7304264|gb|AAF59298.1| polypeptide GalNAc transferase 3 [Drosophila melanogaster]
Length = 667
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 177/290 (61%), Gaps = 25/290 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y + LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDAS+R
Sbjct: 140 CRDKKYASGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYI 112
+ Q Y+ + F K R + + V +D + + ++
Sbjct: 195 -SYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWL 253
Query: 113 ---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM--R 161
+ K V+CP+ID+ISD F Y + WG FNW+L+FRW+ +
Sbjct: 254 EPLLSRIKESRKVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTAGN 313
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D + P+ TP MAGGLFAID+ YFYE+GSYD M +WGGEN+EMSFR+WQCGG +EI
Sbjct: 314 SSKDSTDPIATPGMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEIS 373
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F G
Sbjct: 374 PCSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYFIMLYTSG 423
>gi|195581118|ref|XP_002080381.1| GD10277 [Drosophila simulans]
gi|194192390|gb|EDX05966.1| GD10277 [Drosophila simulans]
Length = 667
Score = 266 bits (680), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 135/283 (47%), Positives = 176/283 (62%), Gaps = 25/283 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDAS+R
Sbjct: 140 CRDKKYAGGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYI 112
+ Q Y+ + F K R + + V +D + + ++
Sbjct: 195 -SYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWL 253
Query: 113 ---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM--R 161
+ K V+CP+ID+ISD F Y + WG FNW+L+FRW+ +
Sbjct: 254 EPLLSRIKESRKVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQATGN 313
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D ++P+ TP MAGGLFAID+ YFYE+GSYD M +WGGEN+EMSFR+WQCGG +EI
Sbjct: 314 SSKDSTAPIATPGMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEIS 373
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDF 264
PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F
Sbjct: 374 PCSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYF 416
>gi|195332013|ref|XP_002032693.1| GM20824 [Drosophila sechellia]
gi|194124663|gb|EDW46706.1| GM20824 [Drosophila sechellia]
Length = 667
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 135/283 (47%), Positives = 176/283 (62%), Gaps = 25/283 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP+TS++IVFHNEAWS LLRT+ SVINRSPR LLKEIILVDDAS+R
Sbjct: 140 CRDKKYAGGLPSTSVIIVFHNEAWSVLLRTLTSVINRSPRHLLKEIILVDDASDR----- 194
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYI 112
+ Q Y+ + F K R + + V +D + + ++
Sbjct: 195 -SYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWL 253
Query: 113 ---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM--R 161
+ K V+CP+ID+ISD F Y + WG FNW+L+FRW+ +
Sbjct: 254 EPLLSRINESRKVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTAGN 313
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D ++P+ TP MAGGLFAID+ YFYE+GSYD M +WGGEN+EMSFR+WQCGG +EI
Sbjct: 314 SSKDSTAPIATPGMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEIS 373
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDF 264
PCSHVGHVFR +PYTFPGG+S+++ N AR A VWMD+W+ F
Sbjct: 374 PCSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYF 416
>gi|195430254|ref|XP_002063171.1| GK21532 [Drosophila willistoni]
gi|194159256|gb|EDW74157.1| GK21532 [Drosophila willistoni]
Length = 667
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 135/283 (47%), Positives = 177/283 (62%), Gaps = 25/283 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ + Y LP TS++IVFHNEAWS LLRT+ SVINRSP+ LLKEIILVDDAS+R
Sbjct: 140 CRDRKYTKNLPNTSVIIVFHNEAWSVLLRTITSVINRSPKHLLKEIILVDDASDRTY--- 196
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQT---F 109
+ Q YI + + K R +H + V +D + +
Sbjct: 197 ---LKRQLESYIKVLAVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWL 253
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ V+CP+ID+ISD F Y + WG FNW+L+FRW+ +
Sbjct: 254 EPLLARIRESRQVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSERKRSTTES 313
Query: 164 G--DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D ++P+ TP MAGGLFAID+ YFYE+GSYD M +WGGEN+EMSFR+WQCGG +EI
Sbjct: 314 SVKDLTAPIATPGMAGGLFAIDRKYFYEMGSYDSEMRVWGGENVEMSFRIWQCGGRIEIS 373
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDF 264
PCSHVGHVFR +PYTFPGG+S+++ +N AR A VWMD+W+ F
Sbjct: 374 PCSHVGHVFRSSTPYTFPGGMSEVLTNNLARAASVWMDDWQYF 416
>gi|344237432|gb|EGV93535.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Cricetulus
griseus]
Length = 413
Score = 264 bits (674), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 114/160 (71%), Positives = 137/160 (85%)
Query: 112 ITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
+ +TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P+R
Sbjct: 82 LEGRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVR 141
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGHVFR
Sbjct: 142 TPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFR 201
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
+PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 202 KATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 241
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 53/107 (49%), Gaps = 25/107 (23%)
Query: 56 VVCPIIDVISDQTFEYITASDMTWGGFNWKL---------REKNRHKKTVVCPIIDVISD 106
VVCPIIDVISD TFEY+ SDMT+GGFNWKL RE +R K P
Sbjct: 87 VVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP------- 139
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDM---TWGGFNWKLNFR 150
+ T+ + + D F+ I D WGG N +++FR
Sbjct: 140 -----VRTPTMAGGLFSIDRD-YFQEIGTYDAGMDIWGGENLEISFR 180
>gi|148664577|gb|EDK96993.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
gi|148664578|gb|EDK96994.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
Length = 400
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 114/157 (72%), Positives = 136/157 (86%)
Query: 115 KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPT 174
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P+RTPT
Sbjct: 72 RTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPT 131
Query: 175 MAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKS 234
MAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGHVFR +
Sbjct: 132 MAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKAT 191
Query: 235 PYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 192 PYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 228
Score = 64.3 bits (155), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 53/107 (49%), Gaps = 25/107 (23%)
Query: 56 VVCPIIDVISDQTFEYITASDMTWGGFNWKL---------REKNRHKKTVVCPIIDVISD 106
VVCPIIDVISD TFEY+ SDMT+GGFNWKL RE +R K P
Sbjct: 74 VVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP------- 126
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDM---TWGGFNWKLNFR 150
+ T+ + + D F+ I D WGG N +++FR
Sbjct: 127 -----VRTPTMAGGLFSIDRD-YFQEIGTYDAGMDIWGGENLEISFR 167
>gi|297264099|ref|XP_002798960.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Macaca mulatta]
Length = 375
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 115/157 (73%), Positives = 135/157 (85%)
Query: 115 KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPT 174
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P+RTPT
Sbjct: 49 KTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPT 108
Query: 175 MAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKS 234
MAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGHVFR +
Sbjct: 109 MAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKAT 168
Query: 235 PYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
PYTFPGG ++ N R+AEVWMDE++DF+Y ++PG
Sbjct: 169 PYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPG 205
>gi|74215848|dbj|BAE28617.1| unnamed protein product [Mus musculus]
Length = 330
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 114/157 (72%), Positives = 136/157 (86%)
Query: 115 KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPT 174
+TVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P+RTPT
Sbjct: 2 RTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPT 61
Query: 175 MAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKS 234
MAGGLF+ID+DYF E+G+YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGHVFR +
Sbjct: 62 MAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKAT 121
Query: 235 PYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
PYTFPGG +I+ N R+AEVWMDE+++F+Y ++PG
Sbjct: 122 PYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPG 158
Score = 63.9 bits (154), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 42/107 (39%), Positives = 53/107 (49%), Gaps = 25/107 (23%)
Query: 56 VVCPIIDVISDQTFEYITASDMTWGGFNWKL---------REKNRHKKTVVCPIIDVISD 106
VVCPIIDVISD TFEY+ SDMT+GGFNWKL RE +R K P
Sbjct: 4 VVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP------- 56
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDM---TWGGFNWKLNFR 150
+ T+ + + D F+ I D WGG N +++FR
Sbjct: 57 -----VRTPTMAGGLFSIDRD-YFQEIGTYDAGMDIWGGENLEISFR 97
>gi|341896063|gb|EGT51998.1| CBN-GLY-6 protein [Caenorhabditis brenneri]
Length = 617
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 180/304 (59%), Gaps = 55/304 (18%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ ++P LPTTS++IV+HNEA+STLLRTVWSVI+RSP+ LLKEIILVDD S+R +
Sbjct: 147 CRNITFPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLKY 206
Query: 59 PIID------------VISDQTFEYITASDM-------------------TWGGFNWKLR 87
P +D V S + I A M T G L
Sbjct: 207 PKLDESLKPLPTDIKIVRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 88 EKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKL 147
++K V CP+ID+ I+D TF+Y +M GGFNW L
Sbjct: 267 RIKLNRKAVPCPVIDI---------------------INDNTFQYQKGIEMFRGGFNWNL 305
Query: 148 NFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEM 207
FRWY +P D + P+ +PTMAGGLF+ID++YF ELG YD GMDIWGGENLEM
Sbjct: 306 QFRWYGMPSSMAKEHLLDPTGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEM 365
Query: 208 SFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFYY 266
SFR+WQCGG +EI+PCSHVGHVFR SP+ FPG S KI+ N RVAEVWMDEW+ ++Y
Sbjct: 366 SFRIWQCGGRVEILPCSHVGHVFRKSSPHDFPGKSSGKILNANLLRVAEVWMDEWKYYFY 425
Query: 267 AMNP 270
+ P
Sbjct: 426 KLAP 429
>gi|260823684|ref|XP_002606210.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
gi|229291550|gb|EEN62220.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
Length = 595
Score = 261 bits (667), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/288 (48%), Positives = 189/288 (65%), Gaps = 28/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ KSYP+ LP TSIVI F NEAWSTLLRTV SV++R+PR LL+EIIL+DD S++
Sbjct: 131 CRGKSYPSGLPKTSIVICFFNEAWSTLLRTVHSVLDRTPRELLQEIILIDDFSDQ----- 185
Query: 61 IDVISDQTFEYIT---------ASDMTWGGFNWKLREKNRHKKTVVCPIIDV---ISDQT 108
+ ++ EYI +D G +++ H V +D +S Q
Sbjct: 186 -SHLKEELEEYIRDHLPMVQLYRTDKREGLIRARVKGAT-HASGDVLMFLDSHCEVSKQW 243
Query: 109 FEYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
E + A+ VVCPIID+I+ TFEY TAS + GGFNW L+F+W +VP +++++
Sbjct: 244 LEPLLARIAEDRTRVVCPIIDIINSDTFEY-TASPLVRGGFNWGLHFKWDQVP-QQLLQG 301
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
++P+ +PTMAGGLFAID++YF ELG YDEGMDIWGGENLE+SFR+W CGG LEIIP
Sbjct: 302 PDGAAAPINSPTMAGGLFAIDREYFDELGRYDEGMDIWGGENLEISFRIWMCGGTLEIIP 361
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGHVFR + PY P G + N+ R+A VWMDE++D Y+++ P
Sbjct: 362 CSRVGHVFRKRRPYGSPNG-EDTMSKNSLRMAHVWMDEYKDQYFSLRP 408
>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 497
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 138/290 (47%), Positives = 180/290 (62%), Gaps = 29/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C K YP LPTTS+++VFHNEAWSTLLRT S+INRSPR LL E+ILVDD S E +
Sbjct: 68 CNDKKYPGKLPTTSVIVVFHNEAWSTLLRTTHSIINRSPRELLMEVILVDDCSTQEHLKK 127
Query: 59 PIIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ D ++ ++ ++ G +LR + K V+ P++
Sbjct: 128 PLDDYVAKLPVPVHVERMEVRSGLIRSRLRGGSVAKGDVLTYLDSHCECTEGWLEPLVSR 187
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I D KT V PIID+I D++F YI AS+ GGF W+L +W R+P E RR
Sbjct: 188 IGDDR------KTRVQPIIDIIDDRSFAYIGASESNSGGFTWQLQHQWVRIPEYEQNRRV 241
Query: 164 GD----RSSPL--RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
+ R L RTPTMAGGLF+I+K YF ++G+YD GMD+WGGEN+EMSFR+W CGG
Sbjct: 242 SEYDNIRQVTLFHRTPTMAGGLFSINKTYFEKMGAYDTGMDVWGGENIEMSFRIWMCGGK 301
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
+EIIPCS +GHV+R PY+FP G + NA RVAEVWMD ++ F+YA
Sbjct: 302 IEIIPCSRIGHVYRRYIPYSFPNGSDPTIYRNAMRVAEVWMDHYKKFFYA 351
>gi|291230380|ref|XP_002735141.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 510
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 140/289 (48%), Positives = 175/289 (60%), Gaps = 41/289 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ + YP L TTS+VIVFHNEAW+TLLRTV SVINRSPR LL EIILVDD S RV P+
Sbjct: 53 CQNREYPGVLQTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLTEIILVDDYSNRV--PV 110
Query: 61 IDVISDQ---------------TFEYITASD----MTWGGFNWKLREKNRHKKTVVCPII 101
+ Q T E +T D T G L K VVCP+I
Sbjct: 111 MVHHCQQREGLTRARLIGAAMATGEVVTFLDSHCECTRGWLEPLLARIAEDKTNVVCPVI 170
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
++ISD TFE+I SD T GGF+W+L F W+ VP RE+ R
Sbjct: 171 NIISDTTFEFING-----------SDAT---------QVGGFDWRLIFNWHVVPHRELQR 210
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
DR+SP+R+PTMAGGLF+I K++F LG+YD G D+WG ENLE+SF+ W CGG LE +
Sbjct: 211 IKFDRTSPVRSPTMAGGLFSIHKEFFTRLGTYDPGFDVWGAENLELSFKTWMCGGTLEFV 270
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVFR +SP+ FP ++ N R+AEVW+DE++ YY +P
Sbjct: 271 PCSHVGHVFRKRSPHRFPPTTHNVMQRNNRRLAEVWLDEYKYLYYNAHP 319
>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Saccoglossus kowalevskii]
Length = 2434
Score = 261 bits (666), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 136/283 (48%), Positives = 173/283 (61%), Gaps = 18/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
CKK + T LP TS++I FHNEA STLLRTV SV+NRSP +++KEIILVDD S+
Sbjct: 1985 CKKLDWKTALPQTSVIITFHNEARSTLLRTVVSVLNRSPTSIIKEIILVDDYSDNAEDGK 2044
Query: 57 ---VCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + V+ ++ E + +D G L + + P+I I +
Sbjct: 2045 ELEKIPKVKVLRNEKREGLMRSRVRGADYATGTILTFLDSHCECNQNWIEPLITKIQENN 2104
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
K VV PIIDVI+ F+Y+ AS GGF+W L F+W + P E +R D +
Sbjct: 2105 ------KAVVSPIIDVINMDNFQYVAASADLKGGFDWNLVFKWDYMTPAERNKRKSDPIA 2158
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLFAI K +F ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 2159 AIRTPMIAGGLFAISKSWFEELGKYDMMMDVWGGENLEISFRVWQCGGTLEIIPCSRVGH 2218
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR + PYTFPGG + N R AEVWMDE++ +YY+ P
Sbjct: 2219 VFRKQHPYTFPGGSGNVFAKNTRRAAEVWMDEYKKYYYSAVPS 2261
>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 573
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 140/280 (50%), Positives = 178/280 (63%), Gaps = 24/280 (8%)
Query: 9 FLP-TTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQ 67
LP TS++IVFHNEAWS LLRTV SV++R+P LL EIILVDDAS + + DQ
Sbjct: 124 LLPFKTSVIIVFHNEAWSALLRTVHSVLDRTPVQLLHEIILVDDASTQ------SHLGDQ 177
Query: 68 TFEYITASDM---------TWGGFNWKLR-EKNRHKKTVV-----CPIIDVISDQTFEYI 112
Y+ + + G +L K KT+ C + + ++I
Sbjct: 178 LKNYVKSLNKPVRIERMSSRSGLIRARLHGAKISTGKTLTFLDAHCEVTIGWLETLLKHI 237
Query: 113 T--AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
+ K +VCPIIDVIS TFEY+ SD TWG F+W+ NF W V RE+ R + + PL
Sbjct: 238 SENQKRIVCPIIDVISHDTFEYLLGSDRTWGTFDWQFNFHWETVVDREIDRINDEHNVPL 297
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTPTMAGGLF I ++YFYE+G+YDE M+IWGGEN+E+SFRVWQCGG L I PCS VGHVF
Sbjct: 298 RTPTMAGGLFTITREYFYEIGAYDEDMEIWGGENIELSFRVWQCGGELLIDPCSRVGHVF 357
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R SPYT+PGGVS I+ N R A VW+D++ FY+ +NP
Sbjct: 358 RKSSPYTWPGGVSHILHKNFVRTALVWLDQYSRFYFMLNP 397
>gi|196001849|ref|XP_002110792.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
gi|190586743|gb|EDV26796.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
Length = 515
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 143/283 (50%), Positives = 174/283 (61%), Gaps = 14/283 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK K YP LP TS+VIVFHNEAWSTLLRT+ SV++R+ LL EIILVDD S + +
Sbjct: 58 CKDKVYPGDLPPTSVVIVFHNEAWSTLLRTIHSVLDRTAPDLLIEIILVDDKSVVKELHA 117
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKN-RHKKTVVCPIIDVISDQTFE------- 110
P+ I+ I + G +L K+ K V +D +
Sbjct: 118 PLDAYIAKLAKVKIIRNKKREGLIRSRLNGKSFAASKAPVVTFLDAHCEANTGWLEPLLE 177
Query: 111 --YITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
Y TVVCP IDVISD+ F Y S + G FNW L+FRW V E RR
Sbjct: 178 RIYNDRSTVVCPEIDVISDENFAYQYGPSGLMRGIFNWDLHFRWRAVSTEEQKRRQSP-I 236
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTPTMAGGLFAI++DYF E+G+YDE MDIWGGENLE+SFR+WQCGG LEI+PCSHVG
Sbjct: 237 DPVRTPTMAGGLFAINRDYFKEIGTYDEEMDIWGGENLEISFRIWQCGGTLEIVPCSHVG 296
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR PY FP GV + N+ RVAEVWMD +++F+Y P
Sbjct: 297 HVFRKSQPYGFPKGVVDTLGKNSQRVAEVWMDGYKEFFYQRQP 339
>gi|449507774|ref|XP_004186276.1| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
partial [Taeniopygia guttata]
Length = 402
Score = 259 bits (663), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 114/157 (72%), Positives = 134/157 (85%)
Query: 115 KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPT 174
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P+RTPT
Sbjct: 70 KTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMERRKGDRTLPVRTPT 129
Query: 175 MAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKS 234
MAGGLF+ID+ YF E+G+YD GMDIWGGENLEMSFR+WQCGG LEI+ CSHVGHVFR +
Sbjct: 130 MAGGLFSIDRSYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKAT 189
Query: 235 PYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
PYTFPGG ++ N R+AEVWMD+++DF+Y ++PG
Sbjct: 190 PYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPG 226
>gi|350646654|emb|CCD58681.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 400
Score = 259 bits (663), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 141/283 (49%), Positives = 179/283 (63%), Gaps = 24/283 (8%)
Query: 9 FLP-TTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQ 67
LP TS++IVFHNEAWS LLRTV SV++R+P LL EIILVDDAS + + DQ
Sbjct: 124 LLPFKTSVIIVFHNEAWSALLRTVHSVLDRTPVQLLHEIILVDDASTQ------SHLGDQ 177
Query: 68 TFEYITASDM---------TWGGFNWKLR-EKNRHKKTVV-----CPIIDVISDQTFEYI 112
Y+ + + G +L K KT+ C + + ++I
Sbjct: 178 LKNYVKSLNKPVRIERMSSRSGLIRARLHGAKISTGKTLTFLDAHCEVTIGWLETLLKHI 237
Query: 113 T--AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
+ K +VCPIIDVIS TFEY+ SD TWG F+W+ NF W V RE+ R + + PL
Sbjct: 238 SENQKRIVCPIIDVISHDTFEYLLGSDRTWGTFDWQFNFHWETVVDREIDRINDEHNVPL 297
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTPTMAGGLF I ++YFYE+G+YDE M+IWGGEN+E+SFRVWQCGG L I PCS VGHVF
Sbjct: 298 RTPTMAGGLFTITREYFYEIGAYDEDMEIWGGENIELSFRVWQCGGELLIDPCSRVGHVF 357
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKS 273
R SPYT+PGGVS I+ N R A VW+D++ FY+ +NP S
Sbjct: 358 RKSSPYTWPGGVSHILHKNFVRTALVWLDQYSRFYFMLNPYSS 400
>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
Length = 481
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/288 (49%), Positives = 176/288 (61%), Gaps = 23/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE-----R 55
C+K+ +PT LP S+VIVFHNEAWSTLLRTV SV++RS L++EIILVDD SE
Sbjct: 65 CRKQLFPTNLPPASLVIVFHNEAWSTLLRTVHSVLDRSDPRLMREIILVDDCSEIKGHEE 124
Query: 56 VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVIS------DQTF 109
+ P+ I + + G +R + R K V P+I + D
Sbjct: 125 LQAPLEKYIQKLKIVKLVRNKKRQG----LIRARLRGYKEVTSPVIVFLDAHCEVVDGWL 180
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
E + A+ VVCP IDVIS + F Y AS + G FNW L+FRW +P E RR
Sbjct: 181 EPLLARIHENRSNVVCPEIDVISFENFGYSYASGIR-GVFNWNLHFRWRTLPAVEQQRRK 239
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
P+R+PTMAGGLFAI K YF ++G YD+ MDIWGGENLEMSFR+WQCGG LEIIPC
Sbjct: 240 S-VIDPIRSPTMAGGLFAIHKKYFEDIGLYDDEMDIWGGENLEMSFRIWQCGGNLEIIPC 298
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
SHVGHVFR PYTFP G + + N RVAEVWMD ++D +Y P
Sbjct: 299 SHVGHVFRKSQPYTFPKGAGETLNKNLQRVAEVWMDNYKDIFYNRFPN 346
>gi|321463472|gb|EFX74488.1| hypothetical protein DAPPUDRAFT_307282 [Daphnia pulex]
Length = 612
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 146/303 (48%), Positives = 177/303 (58%), Gaps = 54/303 (17%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC- 58
CK SY LPTTS++IVFHNEAWSTLLRTV SVINRSP LL EIILVDDAS R
Sbjct: 145 CKALSYNINELPTTSVIIVFHNEAWSTLLRTVHSVINRSPPKLLWEIILVDDASNRTFLK 204
Query: 59 -PIIDVIS--------------------------DQTFEYITASDM----TWGGFNWKLR 87
P+ D +S + T + +T D T G L
Sbjct: 205 KPLEDHVSVLPTTIIVLRSEKRIGLVRARLMGAREATGDVLTFLDAHCETTDGWLQPLLY 264
Query: 88 EKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKL 147
+ VCPIID+ISD TF + ++FE + GG +W L
Sbjct: 265 RIKTNPNVAVCPIIDIISDDTFALL---------------RSFE------LHHGGMSWNL 303
Query: 148 NFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEM 207
+FRW+ M R G+ S P RTP MAGGLFAI +DYF E+G+YD+ MDIWGGEN+EM
Sbjct: 304 HFRWFGASETLMAERRGNMSIPFRTPVMAGGLFAIGRDYFQEIGTYDDQMDIWGGENIEM 363
Query: 208 SFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
S R+WQCGG +E PCSHV HVFR SPYTFPGGV++I+ N AR A VWMDEW++F++
Sbjct: 364 SLRIWQCGGRVETSPCSHVAHVFRKSSPYTFPGGVNQILYSNLARAALVWMDEWKEFFFK 423
Query: 268 MNP 270
MNP
Sbjct: 424 MNP 426
>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Acyrthosiphon pisum]
Length = 581
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 135/283 (47%), Positives = 169/283 (59%), Gaps = 20/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C K Y LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S+
Sbjct: 134 CLTKKYRIDLPQTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSD------ 187
Query: 61 IDVISDQTFEYITASDMTWGGFNWKL-REKNRHKKTVVCPIIDVISDQT------FEYIT 113
D Q I + L R + R + P++ + E +
Sbjct: 188 -DSTDGQELSKIQKVKLIRNEKREGLMRSRVRGSEIATAPVLTFLDSHVECNVNWLEPLL 246
Query: 114 AKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
+ VVCPIIDVI+ F+YI AS GGF+W L F+W + +R D +
Sbjct: 247 DRVAEDPTRVVCPIIDVINMDNFQYIGASSELRGGFDWNLVFKWEYLSKEVRAQRQKDPT 306
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGGLF +DKDYF +LG+YD+ M+IWGGENLE+SFRVWQCGG LEIIPCS VG
Sbjct: 307 LPIRTPMIAGGLFVMDKDYFVKLGTYDKEMNIWGGENLEISFRVWQCGGSLEIIPCSRVG 366
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PYTFPGG + HN R AEVWMD+++ +YY P
Sbjct: 367 HVFRKRHPYTFPGGSGNVFAHNTRRAAEVWMDQYKRYYYNAVP 409
>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Saccoglossus kowalevskii]
Length = 1082
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 131/288 (45%), Positives = 178/288 (61%), Gaps = 26/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C Y LPT S++I FHNEAWSTLLRT++SVINRS LL+EIILVDD S+R +
Sbjct: 632 CDTVRYNKDLPTASVIISFHNEAWSTLLRTIYSVINRSKIKLLQEIILVDDYSDRDELKV 691
Query: 61 -IDVISDQTFE---YITASDMTWGGFNWKLREKNRHKKTVVC--------------PIID 102
+D F I + G +L ++ ++ P+I+
Sbjct: 692 ALDEYIQSNFNNKVKILHTTEREGLIRARLIGASKATGKILVFLDSHCEVNYNWLEPLIE 751
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I Y + T+ CP+ID+I +F Y +AS + GG NW L F+W VPP E++RR
Sbjct: 752 RI------YRDSSTIACPVIDIIDPDSFAY-SASPLVRGGVNWGLQFKWKNVPPVELLRR 804
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ P+++P MAGGLFA+D++YF +GSYD+ M IWGGE+LE+SFR+WQCGG LEI+P
Sbjct: 805 NSE-IEPIKSPIMAGGLFAVDRNYFEHIGSYDKDMQIWGGEHLELSFRIWQCGGTLEIVP 863
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR PYT PGG+ + HN+ RVAEVWMD+++ F+YA P
Sbjct: 864 CSRVGHIFRKSHPYTIPGGMENVFTHNSIRVAEVWMDDYKRFFYATRP 911
>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
Length = 518
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 138/291 (47%), Positives = 177/291 (60%), Gaps = 32/291 (10%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------ 53
C YP LP TS++IVFHNEAWSTLLRTV SV++RSP LL+EIILVDD+S
Sbjct: 62 CSSLKYPIHKLPQTSVIIVFHNEAWSTLLRTVHSVLDRSPPELLREIILVDDSSDHEELH 121
Query: 54 ---ERVVCPI--IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E+ V + + ++ ++ E + S + GF H + +D +
Sbjct: 122 STLEKYVAKLSKVKIVRNKAREGLIRSRLN--GF--------AHATSPTVTFLDAHCEAN 171
Query: 109 --------FEYITAKT-VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
+ + +T VVCP IDVISD+TFEY +S G FNW LNFRW VP E
Sbjct: 172 VGWLEPLLYRIMQNRTIVVCPEIDVISDETFEYTYSSGNVRGSFNWNLNFRWKAVPEYEN 231
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
RR R+ +R+PTMAGGLF I YF ++G YD+ M+IWGGENLE+SFR+WQCGG LE
Sbjct: 232 KRRAA-RTDGIRSPTMAGGLFTIHSQYFKDIGLYDKQMEIWGGENLELSFRIWQCGGQLE 290
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCSHVGHVFR PY+FP G + + N RVAEVWMD ++ ++Y P
Sbjct: 291 IIPCSHVGHVFRKSQPYSFPKGTGETLSKNLQRVAEVWMDGYKRYFYKRQP 341
>gi|156407314|ref|XP_001641489.1| predicted protein [Nematostella vectensis]
gi|156228628|gb|EDO49426.1| predicted protein [Nematostella vectensis]
Length = 353
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 132/281 (46%), Positives = 179/281 (63%), Gaps = 14/281 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C KSYP++LP+T++VI FHNEAWSTLLRTV SVI+RSP LL+EI+L+DD S + +
Sbjct: 26 CSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYLKS 85
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI------ 112
+ ++ + + G +L R K V +D + +++
Sbjct: 86 KLTAYVAKLRNVRVLRTSKREGLIRARL-IGARAAKGDVITFLDAHCEANVDWLQPLLSR 144
Query: 113 --TAKTVVC-PIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
+ +T+V P+ID+IS F Y GGF+W + F W+ +P R DR++P
Sbjct: 145 IHSDRTIVAVPVIDIISSTNFMYSGTPSAVIGGFSWDMQFTWHSLPNNRQSERK-DRTAP 203
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+RTPTMAGGLF+ID+ YF+E GSYDEGMD+WGGENLEMSFR+WQCGG LEI+PCS VGHV
Sbjct: 204 IRTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPCSRVGHV 263
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FR + PY+FPGG S++ + N ARV VWMDE+ + Y P
Sbjct: 264 FRTRFPYSFPGGYSEVSV-NLARVVHVWMDEYNQYVYMKRP 303
>gi|307215388|gb|EFN90069.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Harpegnathos
saltator]
Length = 493
Score = 256 bits (654), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 137/293 (46%), Positives = 173/293 (59%), Gaps = 53/293 (18%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVI--- 64
LP TSI+IVFHNEAWSTLLRTV SVI+RSPR LL+EIILVDD SER + P+ + +
Sbjct: 42 LPKTSIIIVFHNEAWSTLLRTVHSVIDRSPRELLEEIILVDDNSEREFLKNPLDEYVKKL 101
Query: 65 -----------------------SDQTFEYITASDM----TWGGFNWKLREKNRHKKTVV 97
SD E +T D T G L ++ ++
Sbjct: 102 SVPTKVLRSTERVGLIKARLLGASDAKGEVLTFLDAHCECTVGWLEPLLEAVGKNATRII 161
Query: 98 CPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPR 157
P+ID+I+D TF Y ++FE + WG FNW L+FRW + R
Sbjct: 162 SPVIDIINDNTFSYT---------------RSFE------LHWGAFNWDLHFRWLTLNGR 200
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
+ R P RTP MAGGLF+++++YF++LGSYD+ M IWGGENLE+SFR WQCGG
Sbjct: 201 LLKERRESIVEPFRTPAMAGGLFSMNRNYFFQLGSYDDQMRIWGGENLELSFRAWQCGGS 260
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+EI PCSHVGH+FR SPYTFPGGV I+ N RVA VWMD+W +FY+ NP
Sbjct: 261 IEIAPCSHVGHLFRKSSPYTFPGGVGDILYGNLVRVASVWMDQWAEFYFKFNP 313
>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 621
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 178/288 (61%), Gaps = 24/288 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
C K Y LP TS+++V+HNEAWS L+RTV SVINRSPR LL+EI+L+DDAS R
Sbjct: 149 CFAKKYSRNLPKTSVILVYHNEAWSVLMRTVHSVINRSPRHLLEEILLIDDASTREYLGR 208
Query: 59 PIIDVISDQTFEY-ITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ D I+ + + G +L+ K V+ P++D
Sbjct: 209 PLDDYITKLPVPVRVHHAKERRGLIGARLKGAELAKAPVLTFLDSHCECSKGWLEPLLDR 268
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTW-GGFNWKLNFRWYRVPPREMMRR 162
I+ TVVCP+I+ I D++F ++ A++++ GGF+W + F WY +P E R
Sbjct: 269 IA------ANRSTVVCPVINQIDDRSFAFVNATEVSHIGGFDWNIIFNWYNIPQSEKDRI 322
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GGD+S P+R+PTMAGGLF+IDK YF ELGSYD + WGGEN+E+S ++W CGGILE +P
Sbjct: 323 GGDKSEPVRSPTMAGGLFSIDKSYFEELGSYDPEFEFWGGENIELSLKIWMCGGILEFVP 382
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVFR +P+ + +V N R+AEVW+DE++ +YA P
Sbjct: 383 CSHVGHVFRKHNPHKYKNTTYNVVGRNNRRLAEVWLDEYKYLFYANQP 430
>gi|195114158|ref|XP_002001634.1| GI15842 [Drosophila mojavensis]
gi|193912209|gb|EDW11076.1| GI15842 [Drosophila mojavensis]
Length = 628
Score = 253 bits (646), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 133/278 (47%), Positives = 172/278 (61%), Gaps = 10/278 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S+ +
Sbjct: 187 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGM 246
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI--SDQTFEYITAKT-- 116
D+ I +D G ++R + +V+ + + ++Q E + +
Sbjct: 247 ELAKIDKV--RIIRNDKREGLVRSRVRGADAAVSSVLTFLDSHVECNEQWLEPLLERVRE 304
Query: 117 ----VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++ +RT
Sbjct: 305 DPTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERAARHNDPTTAIRT 364
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
P +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR
Sbjct: 365 PMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRK 424
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ PYTFPGG + N R AEVWMDE++ YY P
Sbjct: 425 RHPYTFPGGSGNVFAKNTRRAAEVWMDEYKQHYYNAVP 462
>gi|242024227|ref|XP_002432530.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212517982|gb|EEB19792.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 603
Score = 253 bits (646), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 138/283 (48%), Positives = 177/283 (62%), Gaps = 20/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
CK+K + T LP TS++I FHNEA STLLRT+ SV+NRSP L+KEIILVDD S
Sbjct: 158 CKRKKWRTDLPPTSVIITFHNEARSTLLRTIVSVMNRSPEHLIKEIILVDDFSDNPSDGE 217
Query: 54 ERVVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTVVC---PIID-VISDQ 107
E I ++ ++ E + S + L + H + V P+++ V+ D+
Sbjct: 218 ELAKIQKIKLVRNEKREGLMRSRVRGADLATAPILTFLDSHVECNVNWLEPLLERVVEDK 277
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VVCPIIDVIS TF+YI AS GGF+W L F+W + + +RR D +
Sbjct: 278 T-------RVVCPIIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLTLDQRLRRQQDPT 330
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
++TP +AGGLF ID+ YF LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VG
Sbjct: 331 RAIKTPMIAGGLFVIDRLYFDTLGKYDMQMDVWGGENLEISFRVWQCGGSLEIIPCSRVG 390
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PYTFPGG + N R AEVWMD+++ +YYA P
Sbjct: 391 HVFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKKYYYAAVP 433
>gi|312068074|ref|XP_003137043.1| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 547
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 178/290 (61%), Gaps = 26/290 (8%)
Query: 1 CKKKSY--PTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
C+ K+Y + LPTTS++IV+HNEA+STL+RTV SVI RSP LKEIILVDD S R
Sbjct: 122 CRAKTYLPSSELPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFL 181
Query: 59 PI-IDVISDQTFEYITA--SDMTWGGFNWKLREKNRHKKTVVC--------------PII 101
+D Q +I ++ G +L K V+ P++
Sbjct: 182 KAELDNFVAQLGTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLL 241
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I + K VVCP+IDVI+++TF Y ++ GGFNW L FRWY +PP +
Sbjct: 242 ARIKENR------KAVVCPVIDVINERTFAYQKGIELFRGGFNWNLQFRWYALPPEMIKS 295
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R D + P+ +PTMAGGLF+ID+ YF E+G+YD M+IWGGEN+E+S RVWQCGG +EI+
Sbjct: 296 RSNDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEIL 355
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVFR SP+ FP S +L+ N RVAEVWMDEW+ +Y P
Sbjct: 356 PCSHVGHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYRTAP 405
>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
2, partial [Ciona intestinalis]
Length = 582
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 132/273 (48%), Positives = 172/273 (63%), Gaps = 18/273 (6%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE-----RVVCPI--ID 62
LP TS+++ FHNEA STLLRTV SV+NRSP +L++EIILVDD S+ +++ I +
Sbjct: 145 LPATSVIVTFHNEARSTLLRTVVSVLNRSPPSLVREIILVDDFSDNAEDGQLLAQIEKVR 204
Query: 63 VISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTV 117
V+ + E I +D L K + P++ I+D V
Sbjct: 205 VLRNNQREGLMRSRIRGADAAAAPVLTFLDSHVECNKNWLEPLLQRIADDR------TAV 258
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
VCPIIDVI+ FEYI AS GGF+W L F+W + E R G+ ++P+ TP +AG
Sbjct: 259 VCPIIDVINMDNFEYIGASADLRGGFDWNLVFKWDYMSSEERRSRAGNPTAPISTPMIAG 318
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLF++DK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR + PYT
Sbjct: 319 GLFSMDKSYFNQLGKYDTAMDVWGGENLEISFRVWQCGGRLEIIPCSRVGHVFRKQHPYT 378
Query: 238 FPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FPGG + N R AEVWMD+++++YYA P
Sbjct: 379 FPGGSGNVFTRNTRRAAEVWMDDYKEYYYAAVP 411
>gi|393911417|gb|EFO27036.2| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 597
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/290 (46%), Positives = 178/290 (61%), Gaps = 26/290 (8%)
Query: 1 CKKKSY--PTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
C+ K+Y + LPTTS++IV+HNEA+STL+RTV SVI RSP LKEIILVDD S R
Sbjct: 111 CRAKTYLPSSELPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFL 170
Query: 59 PI-IDVISDQTFEYITA--SDMTWGGFNWKLREKNRHKKTVVC--------------PII 101
+D Q +I ++ G +L K V+ P++
Sbjct: 171 KAELDNFVAQLGTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLL 230
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I + K VVCP+IDVI+++TF Y ++ GGFNW L FRWY +PP +
Sbjct: 231 ARIKE------NRKAVVCPVIDVINERTFAYQKGIELFRGGFNWNLQFRWYALPPEMIKS 284
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R D + P+ +PTMAGGLF+ID+ YF E+G+YD M+IWGGEN+E+S RVWQCGG +EI+
Sbjct: 285 RSNDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEIL 344
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVFR SP+ FP S +L+ N RVAEVWMDEW+ +Y P
Sbjct: 345 PCSHVGHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYRTAP 394
>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
Length = 553
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 132/283 (46%), Positives = 175/283 (61%), Gaps = 20/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C + LP+TS++I FHNEA STLLRT+ SV+NRSP L+KEIILVDD S
Sbjct: 105 CLDLEFSKDLPSTSVIITFHNEARSTLLRTIVSVLNRSPSHLIKEIILVDDFSNDASDGR 164
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPII-DVISDQ 107
E V + ++ + E + +++ G F L + + P++ V+ D+
Sbjct: 165 ELVQIEKVILVRNSKREGLVRSRVKGAEIATGEFLTFLDSHCECNEGWLEPLLARVVEDR 224
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T +VCP+IDVI+ +F+YI AS GGF+W L F+W +P E R D +
Sbjct: 225 T-------RIVCPVIDVIAMDSFQYIAASTELRGGFDWNLVFKWELLPAEEKANRKTDPT 277
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGGLF ID+ YF +LGSYD MDIWGGENLE+SFR WQCGG LEI+PCS VG
Sbjct: 278 IPIRTPMIAGGLFVIDRQYFQKLGSYDLQMDIWGGENLEISFRTWQCGGRLEIVPCSRVG 337
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PY+FPGG I N R AEVWMD+++ +Y+A P
Sbjct: 338 HVFRKQHPYSFPGGSGTIFARNTRRAAEVWMDDYKKYYFAAVP 380
>gi|195386226|ref|XP_002051805.1| GJ10330 [Drosophila virilis]
gi|194148262|gb|EDW63960.1| GJ10330 [Drosophila virilis]
Length = 631
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 132/278 (47%), Positives = 172/278 (61%), Gaps = 10/278 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S+ +
Sbjct: 190 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 249
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI--SDQTFEYITAKT-- 116
D+ I +D G ++R + +V+ + + ++Q E + +
Sbjct: 250 ELAKIDKV--RIIRNDKREGLVRSRVRGADAAVSSVLTFLDSHVECNEQWLEPLLERVRE 307
Query: 117 ----VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++ +RT
Sbjct: 308 DPTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPTERAARHNDPTTAIRT 367
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
P +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR
Sbjct: 368 PMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRK 427
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 428 RHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 465
>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 576
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/306 (45%), Positives = 188/306 (61%), Gaps = 18/306 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + Y + LP TS++I FHNEAWS LLRTV SV+ R+P LL E+ILVDD S+
Sbjct: 80 CKNEKYTSDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAELILVDDFSDMAHLKA 139
Query: 61 IDVISDQTFE--YITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYI 112
I + F I + G ++R K +V+ C ++ + + I
Sbjct: 140 DLEIYMRQFSKVRILRLEKREGLIRARIRGAAISKGSVITYLDSHCECLEGWVEPLLDRI 199
Query: 113 --TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCP+IDVI D TFEY A GGF+W L F W+ +P ++ R+G
Sbjct: 200 KRNPKTVVCPVIDVIDDNTFEYHYSKAYFTNVGGFDWSLQFNWHAIPEKD--RKGRRDID 257
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+++PTMAGGLF+ID+ +F ELGSYD G+DIWGGENLE+SF++W CGGILEI+PCSHVGH
Sbjct: 258 PVKSPTMAGGLFSIDRTFFEELGSYDPGLDIWGGENLELSFKIWMCGGILEIVPCSHVGH 317
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM---NPGKSASVSTCAAHFRM 285
+FR +SPY + GV+ ++ N+ R+AEVWMDE++ +YY N G VS+ A +
Sbjct: 318 IFRKRSPYKWRSGVN-VLKRNSVRLAEVWMDEYKKYYYERINNNLGDFGDVSSRKALRKK 376
Query: 286 LSYSSW 291
L S+
Sbjct: 377 LQCKSF 382
>gi|194761420|ref|XP_001962927.1| GF15680 [Drosophila ananassae]
gi|190616624|gb|EDV32148.1| GF15680 [Drosophila ananassae]
Length = 630
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 168/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S
Sbjct: 189 CRNKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 248
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI + E + +D G L + + P+++ + +
Sbjct: 249 ELAKIDKVRVIRNDKREGLVRSRVRGADAAVSGVLTFLDSHVECNERWLEPLLERVRED- 307
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++
Sbjct: 308 -----PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTT 362
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 363 AIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 422
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 423 VFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 464
>gi|221130543|ref|XP_002162500.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Hydra magnipapillata]
Length = 578
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 129/288 (44%), Positives = 173/288 (60%), Gaps = 18/288 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+KKSY LPTT+I+I FHNE + LLRTV S +N+SP LLKEIILVDD S
Sbjct: 132 CRKKSYDKNLPTTTIIICFHNEGRAALLRTVVSALNKSPEHLLKEIILVDDFSDNPLDGE 191
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + P + +I + E + +DM G L + + P++ I D
Sbjct: 192 ELLALPRVKLIRNNQREGLIRSRVKGADMAVGEVLTFLDSHCECNEMWLEPLLQAIKD-- 249
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
K V PIIDVI + F+Y+++S GGF W LNF+W +PP +++ D ++
Sbjct: 250 ----NRKIVASPIIDVIGHEDFKYLSSSSDLRGGFGWNLNFKWDFLPPNHLIKHQQDGTA 305
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+ +P +AGGLF+I K +F ELG YD MD+WGGENLE+SFR WQCGG + IIPCS VGH
Sbjct: 306 FILSPVIAGGLFSIHKSWFEELGKYDPQMDVWGGENLEISFRTWQCGGEMYIIPCSRVGH 365
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
VFRD+ PY FPGG + N R AEVWMD+++ +Y+A P S+
Sbjct: 366 VFRDRHPYKFPGGSMNVFQKNTRRAAEVWMDDYKKYYFAAVPSARYSL 413
>gi|170046940|ref|XP_001851002.1| polypeptide N-acetylgalactosaminyltransferase 3 [Culex
quinquefasciatus]
gi|167869510|gb|EDS32893.1| polypeptide N-acetylgalactosaminyltransferase 3 [Culex
quinquefasciatus]
Length = 628
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 136/286 (47%), Positives = 171/286 (59%), Gaps = 56/286 (19%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K YPT LPTTSI+IVFHNEAWS LLRTVWSVI RSP+ L+KEI+LVDDAS+R
Sbjct: 133 CVSKEYPTKLPTTSIIIVFHNEAWSVLLRTVWSVIIRSPKHLIKEILLVDDASDRRFLKN 192
Query: 56 ----------VVCPIID----------------VISDQTFEYITAS-DMTWGGFNWKLRE 88
+V I+ V + T ++ A + + G L
Sbjct: 193 DLENYVQKLPLVVSILRLNKREGLVAARLMGARVATGDTLTFLDAHCECSPGWLEPLLAR 252
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ K VVCP+ID+ISD F YI ++FE+ WG FNW+++
Sbjct: 253 IKENPKKVVCPVIDIISDDNFSYI---------------KSFEF------HWGAFNWQMH 291
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY + E+ R D + P TP MAGGLF ID+ YF+++GSYDE + IWGG+NLEMS
Sbjct: 292 FRWYTLSDEELAERRKDTTLPFHTPAMAGGLFTIDRKYFFDVGSYDERLKIWGGDNLEMS 351
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVA 254
FR+WQCGG +EI PCSHVGH+FR SPYTFPGGVS V ++RVA
Sbjct: 352 FRIWQCGGEIEIAPCSHVGHLFRKSSPYTFPGGVSGNV---SSRVA 394
>gi|402594510|gb|EJW88436.1| hypothetical protein WUBG_00649 [Wuchereria bancrofti]
Length = 612
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 138/300 (46%), Positives = 183/300 (61%), Gaps = 36/300 (12%)
Query: 1 CKKKSY--PTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
C+ K+Y + LPTTS++IV+HNEA+STL+RTV SVI RSPR LKEIILVDD S R
Sbjct: 93 CRTKTYLPSSELPTTSVIIVYHNEAFSTLMRTVMSVILRSPRENLKEIILVDDFSTRTFL 152
Query: 59 PI-IDVISDQ--TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PII 101
+ ++ + Q T I ++ G +L N + V+ P++
Sbjct: 153 KVELEKLVAQLGTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLL 212
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I + K VVCP+ID+I+++TF Y ++ GGFNW L FRWY +PP +
Sbjct: 213 ARIKE------NRKAVVCPVIDIINERTFAYQKGIELFRGGFNWNLQFRWYALPPEMIKS 266
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFR----------V 211
R D + P+ +PTMAGGLF+ID+ YF E+G+YD MDIWGGEN+E+S R V
Sbjct: 267 RSDDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRLKLLKKNCFLV 326
Query: 212 WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNP 270
WQCGG +EI+PCSHVGHVFR SP+ FPG S +L+ N RVAEVWMDEW+ +Y P
Sbjct: 327 WQCGGRVEILPCSHVGHVFRRTSPHDFPGRKSGTILNSNLLRVAEVWMDEWKFHFYRTAP 386
>gi|195435185|ref|XP_002065582.1| GK14594 [Drosophila willistoni]
gi|194161667|gb|EDW76568.1| GK14594 [Drosophila willistoni]
Length = 635
Score = 250 bits (638), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 131/278 (47%), Positives = 172/278 (61%), Gaps = 10/278 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K + LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S+ +
Sbjct: 194 CRTKKFRNDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 253
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI--SDQTFEYITAKT-- 116
D+ I +D G ++R + +V+ + + ++Q E + +
Sbjct: 254 ELAKIDKV--RIIRNDKREGLVRSRVRGADAAVSSVLTFLDSHVECNEQWLEPLLERVRE 311
Query: 117 ----VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++ +RT
Sbjct: 312 DPTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERSVRHNDPTTAIRT 371
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
P +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR
Sbjct: 372 PMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRK 431
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 432 RHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 469
>gi|157128332|ref|XP_001661405.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108872614|gb|EAT36839.1| AAEL011095-PA [Aedes aegypti]
Length = 573
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 139/287 (48%), Positives = 177/287 (61%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+KK +P+ LP TS++I FHNEA STLLRT+ SV+NRSP L+ EIILVDD S
Sbjct: 131 CRKK-WPSNLPPTSVIITFHNEARSTLLRTIVSVLNRSPEHLIHEIILVDDYSDFPEDGQ 189
Query: 54 ERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT 113
E + VI ++ E + S ++R + +V+ +D + +++
Sbjct: 190 ELAKIHKVKVIRNEQREGLVRS---------RVRGADAATASVLT-FLDSHCECNVDWLE 239
Query: 114 A---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
VVCP+IDVIS TF+YI AS GGF+W L F+W + E R
Sbjct: 240 PLLIRVKEDPTRVVCPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSTAERHERQK 299
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D ++P+RTP +AGGLF IDK YF +LG YD MDIWGGENLE+SFRVWQCGG LEIIPCS
Sbjct: 300 DPTTPIRTPMIAGGLFVIDKVYFEKLGKYDTQMDIWGGENLEISFRVWQCGGSLEIIPCS 359
Query: 225 HVGHVFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVFR + PYTFPGG S I N R AEVWMD+++ +YYA P
Sbjct: 360 RVGHVFRKRHPYTFPGGGSGNIFAKNTRRAAEVWMDDYKQYYYAAVP 406
>gi|443721252|gb|ELU10645.1| hypothetical protein CAPTEDRAFT_228331 [Capitella teleta]
Length = 512
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/290 (45%), Positives = 172/290 (59%), Gaps = 26/290 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE----RV 56
C + LP TS++I FHNEA STLLRTV SV+NRSP L++EIILVDD S+
Sbjct: 67 CSALRWRKNLPKTSVIITFHNEARSTLLRTVVSVLNRSPEELIQEIILVDDFSDFPEDGE 126
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FE 110
ID + + +D G +R + R V P++ + E
Sbjct: 127 ELAKIDKVK------VLRNDQRQG----LIRSRIRGADAAVAPVLTFLDSHCECNVHWLE 176
Query: 111 YITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ + VV PIIDVI+ F+Y+ AS GGFNW L F+W + P E+ +R G
Sbjct: 177 PLLERVAEDPTRVVSPIIDVINMDNFQYVGASSNLKGGFNWNLVFKWDSLTPEEVTQRRG 236
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
+ ++P++TP +AGGLF IDK+ F E+G YD MD+WGGENLE+SFRVWQC G LEIIPCS
Sbjct: 237 NPTAPIKTPMIAGGLFVIDKERFEEIGKYDMMMDVWGGENLEISFRVWQCHGSLEIIPCS 296
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
VGHVFR + PYTFPGG + N R AEVWMDE++ +YYA P +
Sbjct: 297 RVGHVFRKQHPYTFPGGSGNVFARNTRRAAEVWMDEYKSYYYAEVPSAKS 346
>gi|195032291|ref|XP_001988471.1| GH11183 [Drosophila grimshawi]
gi|193904471|gb|EDW03338.1| GH11183 [Drosophila grimshawi]
Length = 640
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/278 (47%), Positives = 170/278 (61%), Gaps = 10/278 (3%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S+ +
Sbjct: 199 CRTKKYREDLPATSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 258
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI--SDQTFEYITAKT-- 116
D+ I +D G ++R + V+ + + ++Q E + +
Sbjct: 259 ELAKIDKV--RIIRNDKREGLVRSRVRGADAAVSNVLTFLDSHVECNEQWLEPLLERVRE 316
Query: 117 ----VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
VVCP+IDVIS F+YI AS GGF+W L F+W + E R D ++ +RT
Sbjct: 317 DPTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSASERTARHNDPTTAIRT 376
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
P +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR
Sbjct: 377 PMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRK 436
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 437 RHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 474
>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
Length = 556
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 174/280 (62%), Gaps = 12/280 (4%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK +YP LP+TS+VIVF+NEAWSTL+RTV SV++RSP LL E+ILVDD+S+ + P+
Sbjct: 103 CKSMTYPVDLPSTSVVIVFYNEAWSTLMRTVHSVLDRSPPDLLHEVILVDDSSDELHQPL 162
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA------ 114
+ + + + G +LR + +V +D + T ++
Sbjct: 163 EEYVRQLDKVRLHRNSQREGLIRARLRGLEQTSAPIVT-FLDAHCEVTIGWLEPLLNRIH 221
Query: 115 ---KTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
TVVCP ID I F Y S + G FNW L+F+W P E +RR + P+
Sbjct: 222 QDRTTVVCPEIDSIDLNNFAYKYGPSGVLRGTFNWDLSFKWSIAPTSERLRRTS-ATDPM 280
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFAID++YF ELG+YD G++IWG EN+E+SF+VWQCGG LEIIPCSHVGHVF
Sbjct: 281 RSPTMAGGLFAIDREYFLELGTYDRGLEIWGAENMELSFKVWQCGGKLEIIPCSHVGHVF 340
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R+ PY + I N RVAEVWMD+++ F+Y +P
Sbjct: 341 REVQPYDTSVSLHSIANKNYQRVAEVWMDDYKKFFYQRHP 380
>gi|194855488|ref|XP_001968556.1| GG24441 [Drosophila erecta]
gi|190660423|gb|EDV57615.1| GG24441 [Drosophila erecta]
Length = 631
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 167/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S
Sbjct: 190 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 249
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI + E + +D L + + P+++ + +
Sbjct: 250 ELAKIDKVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED- 308
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++
Sbjct: 309 -----PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTT 363
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 364 AIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 423
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 424 VFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 465
>gi|195471053|ref|XP_002087820.1| GE14879 [Drosophila yakuba]
gi|194173921|gb|EDW87532.1| GE14879 [Drosophila yakuba]
Length = 634
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 167/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S
Sbjct: 193 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDNPEDGL 252
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI + E + +D L + + P+++ + +
Sbjct: 253 ELAKIDKVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED- 311
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++
Sbjct: 312 -----PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTT 366
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 367 AIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 426
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 427 VFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 468
>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
Length = 564
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 133/284 (46%), Positives = 174/284 (61%), Gaps = 22/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++K + T LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S+
Sbjct: 121 CRRKLWRTDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDNP--ED 178
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISD 106
+ ++ + +D G ++R + +V+ P+++ +++
Sbjct: 179 GEELAKIQKVRVLRNDKREGLMRSRVRGADAATASVLTFLDSHCECNVNWLEPLLERVAE 238
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VVCP+IDVIS TF+YI AS GGF+W L F+W + E R D
Sbjct: 239 D------PTRVVCPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLGYAERESRQRDP 292
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ +RTP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS V
Sbjct: 293 TQAIRTPMIAGGLFVINKAYFEKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRV 352
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PYTFPGG + N R AEVWMD+++ FYYA P
Sbjct: 353 GHVFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKHFYYAAVP 396
>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 137/289 (47%), Positives = 177/289 (61%), Gaps = 15/289 (5%)
Query: 1 CKKK--SYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RV 56
CK K +YP LPTTS++I FH E S LLRTV SVINR+P LL E+I+VDD S+ ++
Sbjct: 56 CKAKHNTYPAKLPTTSVIICFHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKL 115
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
P+ D ++ T + G +L+ N K V+ C ++
Sbjct: 116 GKPLDDHVAQFTKVKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLA 175
Query: 111 YITA--KTVVCPIIDVISDQTFEYITASDMTW-GGFNWKLNFRWYRVPPREMMRRGGDRS 167
I A + VVCP I+VI+ TF Y +++ GGF+W L F+W +PP E R D S
Sbjct: 176 RIAADRRNVVCPAIEVINADTFAYQGSTNADQRGGFSWDLFFKWKGIPPEEQKLRNDD-S 234
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTPTMAGGLF+I + YF+++GSYDE MDIWGGENLE+SFRVW CGG LEI+ CS VG
Sbjct: 235 DPIRTPTMAGGLFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCSRVG 294
Query: 228 HVFRD-KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSAS 275
HVFR SPY FP GV + + N R+AEVWMDE++D YY P S
Sbjct: 295 HVFRKYTSPYKFPDGVERTLTKNFNRLAEVWMDEYKDLYYNKKPQAKNS 343
>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
Length = 518
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 129/284 (45%), Positives = 179/284 (63%), Gaps = 26/284 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C+ ++Y T LP TSI+++FHNEAWS LLRTV+S ++RSP L+KEIILVDD S E +
Sbjct: 54 CRDRNYATELPDTSIIVIFHNEAWSVLLRTVFSCLDRSPGHLVKEIILVDDFSDFEHLQA 113
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + Q + + G +L + + V+ P++D I
Sbjct: 114 PLQEFADSQEKVRLVRAKKREGLIRARLLGASVAQGNVLTFLDSHCECTMGWLEPLLDRI 173
Query: 105 SDQTFEYITAKTVVCPIIDVISDQT--FEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
S VV P+IDVI+D T ++Y +A + GGF+W L F W+ +P E RR
Sbjct: 174 SQ------NKSNVVTPVIDVINDDTIQYQYSSAKSTSVGGFDWNLQFNWHGIPDHEKKRR 227
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D P+R+PTMAGGLF+I ++YF LG+YD GMDIWGGENLE+SFR+W CGG L+I P
Sbjct: 228 KSD-VDPVRSPTMAGGLFSISREYFEYLGTYDPGMDIWGGENLELSFRIWMCGGSLDIAP 286
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
CSHVGH+FR +SPY++ GV+ +V N+ R+AEVW+DE+ +YY
Sbjct: 287 CSHVGHIFRKRSPYSWKTGVN-VVKKNSIRLAEVWLDEFSKYYY 329
>gi|34042922|gb|AAQ56700.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 615
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 167/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S
Sbjct: 174 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 233
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI + E + +D L + + P+++ + +
Sbjct: 234 ELAKIDKVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED- 292
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++
Sbjct: 293 -----PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTT 347
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 348 AIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 407
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 408 VFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 449
>gi|33589464|gb|AAQ22499.1| RE02655p [Drosophila melanogaster]
Length = 633
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 167/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S
Sbjct: 192 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 251
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI + E + +D L + + P+++ + +
Sbjct: 252 ELAKIDKVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED- 310
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++
Sbjct: 311 -----PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTT 365
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 366 AIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 425
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 426 VFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 467
>gi|198474621|ref|XP_001356764.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
gi|198138471|gb|EAL33829.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
Length = 639
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 170/284 (59%), Gaps = 22/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S+ +
Sbjct: 198 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDFSDHPEDGL 257
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISD 106
D+ I +D G +++ + +V+ P+++ + +
Sbjct: 258 ELAKIDKV--RIIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEKWLEPLLERVRE 315
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D
Sbjct: 316 D------PSRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERSVRHNDP 369
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
++ +RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS V
Sbjct: 370 TTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRV 429
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 430 GHVFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 473
>gi|195148230|ref|XP_002015077.1| GL19517 [Drosophila persimilis]
gi|194107030|gb|EDW29073.1| GL19517 [Drosophila persimilis]
Length = 638
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 170/284 (59%), Gaps = 22/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S+ +
Sbjct: 197 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDFSDHPEDGL 256
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISD 106
D+ I +D G +++ + +V+ P+++ + +
Sbjct: 257 ELAKIDKV--RIIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEKWLEPLLERVRE 314
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D
Sbjct: 315 D------PSRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERSVRHNDP 368
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
++ +RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS V
Sbjct: 369 TTAIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRV 428
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 429 GHVFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 472
>gi|62484229|ref|NP_608773.2| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|320594323|ref|NP_995625.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
gi|195576320|ref|XP_002078024.1| GD22759 [Drosophila simulans]
gi|51315875|sp|Q6WV19.2|GALT2_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
Short=pp-GaNTase 2; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 2; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2
gi|61678274|gb|AAF51113.3| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|194190033|gb|EDX03609.1| GD22759 [Drosophila simulans]
gi|318068299|gb|AAS64620.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
Length = 633
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 167/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S
Sbjct: 192 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 251
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI + E + +D L + + P+++ + +
Sbjct: 252 ELAKIDKVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED- 310
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++
Sbjct: 311 -----PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTT 365
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 366 AIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 425
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 426 VFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 467
>gi|195342262|ref|XP_002037720.1| GM18147 [Drosophila sechellia]
gi|194132570|gb|EDW54138.1| GM18147 [Drosophila sechellia]
Length = 606
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 167/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K Y LP TS++I FHNEA STLLRT+ SV+NRSP L++EI+LVDD S
Sbjct: 165 CRTKKYREDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDHPEDGL 224
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI + E + +D L + + P+++ + +
Sbjct: 225 ELAKIDKVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED- 283
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS F+YI AS GGF+W L F+W + P E R D ++
Sbjct: 284 -----PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTT 338
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF IDK YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 339 AIRTPMIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 398
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD+++ YY P
Sbjct: 399 VFRKRHPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVP 440
>gi|344268426|ref|XP_003406061.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Loxodonta africana]
Length = 560
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/297 (48%), Positives = 179/297 (60%), Gaps = 37/297 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LP TS+VIVFHNEAWSTLLRTV SVINRSP LL E+ILVDDASER +
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVHSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 61 IDVISDQTFEY---ITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
+ E I + G +LR K V+ +D + T ++
Sbjct: 165 TLENHVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVIT-FLDAHCECTLGWLEPLLA 223
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCPIIDVISD TFEY+ SDMT+GGFNWKLNFRWY VP REM RR GDR+
Sbjct: 224 RIKDDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTL 283
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI-----IPC 223
P+RTPTMAGGLF+ID++YF E+G+YD GMDIWGGENLEMSFR + ++E+ +P
Sbjct: 284 PVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRTY---SLMELESKNTVPY 340
Query: 224 S----HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEW-----RDFYYAMNPG 271
S H H Y ++ ++ + E W + W +DF+Y ++PG
Sbjct: 341 SVMSCHEAHAV----VYVNSRALTHVI---NKKQQEDWQEVWDGMNLKDFFYIISPG 390
>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
Length = 626
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 132/281 (46%), Positives = 172/281 (61%), Gaps = 18/281 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CK +Y T LP+ S++I+F NEAWS L+RT+WSVINRSPR LKEI+L+DD S+RV
Sbjct: 166 CKGLTYDTILPSASVIIIFTNEAWSPLIRTIWSVINRSPRKFLKEILLIDDFSDRVELQG 225
Query: 59 PIIDVISDQTFEYITASDMT--WGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT--- 113
+ I Q + + G +L V+ +D + T ++
Sbjct: 226 KLERYIETQLPSIVRLVRLKERQGLIRARLAGAKEATGEVII-FLDSHCEATLGWLEPLL 284
Query: 114 ------AKTVVCPIIDVISDQTFEYITASDMTW--GGFNWKLNFRWYRVPPREMMRRGGD 165
+ V+ PIIDVI D+T EY S ++ G F W +F W +P RE+ RRG
Sbjct: 285 QRIKEDKRAVLVPIIDVIDDKTLEYYHGSPESFQIGSFTWSGHFTWMDIPKREIKRRGS- 343
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
R P +PTMAGGLFAID+ YF++LGSYDEGMD+WGGENLEMSFR+W CGG LE IPCS
Sbjct: 344 RVGPTNSPTMAGGLFAIDRQYFWDLGSYDEGMDVWGGENLEMSFRIWMCGGSLETIPCSR 403
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
VGH+FR PYTFPG + N ARV EVWMD++++ +Y
Sbjct: 404 VGHIFRSFHPYTFPGNKDTHGI-NTARVVEVWMDDYKELFY 443
>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
Length = 669
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 137/274 (50%), Positives = 170/274 (62%), Gaps = 19/274 (6%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-------ERVVCPIID 62
LP TS++I FHNEA STLLRTV SV+NRSP L+ EIILVDD S E +
Sbjct: 231 LPATSVIITFHNEARSTLLRTVVSVLNRSPERLIHEIILVDDFSDFPEDGQELAKIQKVR 290
Query: 63 VISDQTFEYITASDMTWGGFNWK--LREKNRHKKTVVC---PIIDVISDQTFEYITAKTV 117
+I + E + S +T L + H + V P++ +++ V
Sbjct: 291 LIRNAKREGLVRSRVTGAAAATAKVLTFLDSHCECNVHWLEPLLARVAED------PTRV 344
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
VCP+IDVIS TF+YI AS GGF+W L F+W + E R D ++P+RTP +AG
Sbjct: 345 VCPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSGAERKERQRDPTAPIRTPMIAG 404
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLF ID+ YF +LG+YD MDIWGGENLE+SFRVWQCGG LEIIPCS VGHVFR + PYT
Sbjct: 405 GLFVIDRSYFEKLGTYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYT 464
Query: 238 FPGGVS-KIVLHNAARVAEVWMDEWRDFYYAMNP 270
FPGG S I N R AEVWMDE++ +YYA P
Sbjct: 465 FPGGGSGNIFAKNTRRAAEVWMDEYKRYYYAAVP 498
>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 632
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 133/284 (46%), Positives = 176/284 (61%), Gaps = 26/284 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
CK + Y LP+TS++I FHNEAWS LLRTV SVI R+P LL E+ILVDD S+ +
Sbjct: 172 CKTEKYLDDLPSTSVIICFHNEAWSVLLRTVHSVIERTPEHLLTEVILVDDFSDMDHLKK 231
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + +S I D G +L+ K VV P++D I
Sbjct: 232 PLEEYMSALKKVRIVRMDKREGLIRARLKGAAVSKGAVVTFLDSHCECMEGWIEPLLDRI 291
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ TVVCP+IDVI D+TFEY A GGF+W L F W+ +P R+ R
Sbjct: 292 KR------NSSTVVCPVIDVIDDETFEYHYSKAYFTNVGGFDWSLQFNWHAIPERDRKNR 345
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
P+R+PTMAGGLF+ID+ YF +LG+YD G DIWGGENLE+SF++W CGG LEI+P
Sbjct: 346 K-RHIDPVRSPTMAGGLFSIDRAYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVP 404
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
CSHVGHVFR +SPY + GV+ ++ N+ R+AEVW+DE++ +YY
Sbjct: 405 CSHVGHVFRKRSPYKWRTGVN-VLKKNSVRLAEVWLDEYKVYYY 447
>gi|390341984|ref|XP_003725567.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Strongylocentrotus purpuratus]
Length = 654
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 133/283 (46%), Positives = 183/283 (64%), Gaps = 17/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCP 59
CK + Y LPT SIVI F+NEAWSTLLRTV+SV++R+PR L+ E+ILVDD SE +
Sbjct: 179 CKYQVYSEELPTVSIVICFYNEAWSTLLRTVYSVLDRTPRRLIHELILVDDFSELTHLKK 238
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKN---RHKKTVVCPIIDV---ISDQTFEYIT 113
+D + F + + G +R + R+ V +D +++Q E +
Sbjct: 239 ELDQYMSKNFNGLVHV-IHNGQREGLIRARTIGARYATGDVLMFLDSHCEVNEQWLEPLL 297
Query: 114 AK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
+ TVVCPIID+I+ TF Y TAS + GGFNW ++F+W + R+++ + D
Sbjct: 298 ERIKADSHTVVCPIIDIINHDTFAY-TASPLVKGGFNWGMHFKWDTIRSRQLVGKE-DYV 355
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+ +PTMAGGLFA++++YF++LG YDEGMDIWGGENLE+SFR+WQCGG LEI+PCS VG
Sbjct: 356 KPIESPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGGKLEIVPCSRVG 415
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PY P NA RVAEVWMDE+++ +Y + P
Sbjct: 416 HVFRKRRPYGSP-NRQDTTTKNAVRVAEVWMDEYKEHFYQVQP 457
>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
gallus]
Length = 566
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 121 CQRKQWRIDLPATSVVITFHNEARSALLRTVVSVLKKSPSHLIKEIILVDDYSNDPDDGA 180
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 181 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 234
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 235 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 294
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 295 PIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 354
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 355 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 396
>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Taeniopygia guttata]
Length = 669
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 224 CQRKQWRIDLPATSVVITFHNEARSALLRTVVSVLKKSPSHLIKEIILVDDYSNDPDDGA 283
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 284 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 337
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 338 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 397
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 398 PIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 457
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 458 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 499
>gi|328794283|ref|XP_001122865.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like,
partial [Apis mellifera]
Length = 372
Score = 246 bits (629), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 137/242 (56%), Positives = 150/242 (61%), Gaps = 53/242 (21%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y +LP TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD SE+
Sbjct: 152 CKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 56 ------VVCPI--------------------IDVISDQTFEYITAS-DMTWGGFNWKLRE 88
P+ + Q ++ A + T G L
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 271
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ TVVCPIIDVISD TFEYI ASDMTWGGFNWKLN
Sbjct: 272 IAEDRTTVVCPIIDVISD---------------------DTFEYIPASDMTWGGFNWKLN 310
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWYRV REM RR GDR++PLRTPTMAGGLF+IDK+YFYELG+YDEGMDIWGGENLEMS
Sbjct: 311 FRWYRVAQREMDRRLGDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMS 370
Query: 209 FR 210
FR
Sbjct: 371 FR 372
>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
glaber]
Length = 567
Score = 246 bits (629), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 122 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 181
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 182 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 235
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 236 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 295
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 296 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 355
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 356 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 397
>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Meleagris gallopavo]
Length = 579
Score = 246 bits (629), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 136/288 (47%), Positives = 180/288 (62%), Gaps = 30/288 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK KSY LPTTS+VI F+NEAWSTLLRTV SV+ SP LLKEIILVDD S++V
Sbjct: 125 CKTKSYNYRKLPTTSVVIAFYNEAWSTLLRTVHSVLETSPSVLLKEIILVDDLSDKVYLK 184
Query: 60 I-----------IDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ +I E + + + F L + H + V + P+++
Sbjct: 185 TDLEKYISSLKRVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLER 244
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I++ V+CP+ID I TFEY + +++ GGF+W+L F+W+ VP E +RR
Sbjct: 245 IAE------NETVVICPVIDTIDWNTFEYYMQSAEPMIGGFDWRLTFQWHSVPKHERLRR 298
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ + P+R+PTMAGGLFA+ K YF LG+YD GMD+WGGENLE+SFRVWQCGG+LEI P
Sbjct: 299 KSE-TDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHP 357
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 358 CSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 400
>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
Length = 622
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 137/306 (44%), Positives = 185/306 (60%), Gaps = 18/306 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL EIILVDD S+
Sbjct: 166 CKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFSDMAHLKA 225
Query: 61 IDVISDQTFE--YITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYI 112
I + F I + G +++ K +V+ C ++ + + I
Sbjct: 226 SLEIYMRQFPKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWMEPLLDRI 285
Query: 113 --TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCP+IDVI D TFEY A GGF+W L F W+ +P ++ R+G
Sbjct: 286 KKNPKTVVCPVIDVIDDNTFEYHYSKAYFTNVGGFDWSLQFNWHAIPEKD--RKGRRDID 343
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+++PTMAGGLF+ID+ +F +LGSYD G+DIWGGENLE+SF+ W CGGILEI+PCSHVGH
Sbjct: 344 PVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEIVPCSHVGH 403
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM---NPGKSASVSTCAAHFRM 285
+FR +SPY + GV+ ++ N+ R+AEVWMDE++ +YY N G VS+ A
Sbjct: 404 IFRKRSPYKWLSGVN-VLKRNSVRLAEVWMDEYKKYYYERINNNLGDFGDVSSRKALREK 462
Query: 286 LSYSSW 291
L S+
Sbjct: 463 LQCKSF 468
>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Ornithorhynchus anatinus]
Length = 628
Score = 246 bits (628), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 183 CQRKQWRVDLPATSVVITFHNEARSALLRTVASVLKKSPPHLVKEIILVDDYSNDPEDGA 242
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 243 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQARVLTFLDSHCECNEHWLEPLLE 296
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 297 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 356
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEI+PCS VGH
Sbjct: 357 PIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGH 416
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 417 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 458
>gi|170046214|ref|XP_001850669.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
gi|167869055|gb|EDS32438.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
Length = 576
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 140/285 (49%), Positives = 175/285 (61%), Gaps = 23/285 (8%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------ 53
C++K +P++ LP TS++I FHNEA STLLRT+ SV+NRSP L+ EIILVDD S
Sbjct: 133 CRRK-WPSYSLPPTSVIITFHNEARSTLLRTIVSVLNRSPEHLIHEIILVDDFSDFPEDG 191
Query: 54 -ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPI-IDVISD 106
E + VI ++ E + +D+ L + P+ + V D
Sbjct: 192 QELAKIHKVKVIRNEKREGLVRSRVKGADVATAKLLTFLDSHCECNVDWLEPLLVRVQED 251
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
T VVCP+IDVIS TF+YI AS GGF+W L F+W + E R D
Sbjct: 252 PT-------RVVCPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSNAERHERQKDP 304
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
++P+RTP +AGGLF IDK YF +LG YD MDIWGGENLE+SFRVWQCGG LEIIPCS V
Sbjct: 305 TTPIRTPMIAGGLFVIDKAYFEKLGKYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRV 364
Query: 227 GHVFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PYTFPGG S I N R AEVWMD+++ +YYA P
Sbjct: 365 GHVFRKRHPYTFPGGGSGNIFAKNTRRAAEVWMDDYKQYYYAAVP 409
>gi|344278311|ref|XP_003410938.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Loxodonta africana]
Length = 572
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 127 CQRKQWRVGLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 186
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 187 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 240
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 241 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 300
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 301 PIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 360
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 361 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 402
>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Cavia porcellus]
Length = 531
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 86 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 145
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 146 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 199
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 200 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 259
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 260 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 319
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 320 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 361
>gi|66507571|ref|XP_394527.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis mellifera]
gi|380015445|ref|XP_003691712.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis florea]
Length = 571
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 134/291 (46%), Positives = 172/291 (59%), Gaps = 36/291 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S
Sbjct: 128 CRMKQWRRDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGE 187
Query: 54 ERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------P 99
E + VI ++ E + S ++R + TV+ P
Sbjct: 188 ELSRIHKVRVIRNEKREGLMRS---------RVRGADAATATVLTFLDSHCECNADWLEP 238
Query: 100 IIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
+++ +++ VVCP+IDVIS TF+YI AS GGF+W L F+W + E
Sbjct: 239 LLERVAED------PTRVVCPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTER 292
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
R D + +RTP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LE
Sbjct: 293 QARQKDPTQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLE 352
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGHVFR + PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 353 IIPCSRVGHVFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 403
>gi|354468855|ref|XP_003496866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Cricetulus griseus]
gi|344247257|gb|EGW03361.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Cricetulus
griseus]
Length = 535
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 90 CQRKQWRGDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 149
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 150 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 203
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 204 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 263
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 264 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 323
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 324 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 365
>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
Length = 504
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/288 (48%), Positives = 174/288 (60%), Gaps = 24/288 (8%)
Query: 1 CKKKSYPTF-----LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-- 53
C++ S+ LP TS++I FHNEA STLLRTV SV+NRSP L+ EIILVDD S
Sbjct: 54 CRRSSWSDLSTIAHLPATSVIITFHNEARSTLLRTVVSVLNRSPERLIHEIILVDDYSDF 113
Query: 54 -----ERVVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTVVC---PIIDV 103
E + +I + E + S +T L + H + V P++
Sbjct: 114 PEDGQELAKIQKVRLIRNSKREGLVRSRVTGAAAATAKVLTFLDSHCECNVNWLEPLLAR 173
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+++ VVCP+IDVIS TF+YI AS GGF+W L F+W + E R
Sbjct: 174 VAED------PTRVVCPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSNAERKARQ 227
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D ++P+RTP +AGGLF IDK YF LG+YD MDIWGGENLE+SFRVWQCGG LEIIPC
Sbjct: 228 RDPTAPIRTPMIAGGLFVIDKAYFERLGTYDTQMDIWGGENLEISFRVWQCGGSLEIIPC 287
Query: 224 SHVGHVFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGHVFR + PYTFPGG S I N R AEVWMDE++ +YYA P
Sbjct: 288 SRVGHVFRKRHPYTFPGGGSGNIFAKNTRRAAEVWMDEYKKYYYAAVP 335
>gi|383847543|ref|XP_003699412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Megachile rotundata]
Length = 571
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 169/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S
Sbjct: 128 CRMKQWRRDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGE 187
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI ++ E + +D L + P+++ +++
Sbjct: 188 ELSRIHKVRVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAED- 246
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS TF+YI AS GGF+W L F+W + E + R D +
Sbjct: 247 -----PTRVVCPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQSERLARQKDPTQ 301
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 302 AIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 361
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 362 VFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 403
>gi|397508104|ref|XP_003824510.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
paniscus]
Length = 533
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 88 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 147
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
V I+ + + +D G ++R + + V+ C + + E
Sbjct: 148 VLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 201
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 202 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 261
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 262 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 321
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 322 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 363
>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 571
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQARVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
Length = 526
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 81 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 140
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 141 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 194
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 195 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 254
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 255 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 314
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 315 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 356
>gi|327262105|ref|XP_003215866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Anolis carolinensis]
Length = 575
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 130 CQRKQWRIDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 189
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + I +D G ++R + + V+ C + + E
Sbjct: 190 LLGKIEKVR------ILRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 243
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 244 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 303
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 304 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 363
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 364 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 405
>gi|426256000|ref|XP_004021634.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Ovis
aries]
Length = 674
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 129/284 (45%), Positives = 174/284 (61%), Gaps = 20/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 227 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 286
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPI----------IDVISD 106
+ I+ + + +D G ++R + + V+ + ++ + +
Sbjct: 287 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 340
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
+ E VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+
Sbjct: 341 RVAEGSDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNP 400
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEI+PCS V
Sbjct: 401 VAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRV 460
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 461 GHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 504
>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Saccoglossus kowalevskii]
Length = 524
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 174/286 (60%), Gaps = 24/286 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C+ K Y LP+TSI+I F E+WSTL+R+V SVINRSP L+KEIILVDD S R +
Sbjct: 67 CQDKLYSDSLPSTSIIICFTEESWSTLVRSVHSVINRSPPQLIKEIILVDDFSSREYLKA 126
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + I + G +LR + V+ P++ I
Sbjct: 127 PLDKYMKRFPQVKILRLENREGLIRGRLRGTEIAQGEVLTFLDSHIECGVGWLEPMLQRI 186
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ + VV P+ID I F Y AS++ GGF+W++ F+W +P EM RR
Sbjct: 187 KEDR------RNVVAPMIDGIDATKFSY-AASNLIRGGFSWEMQFKWKPIPDYEMKRRK- 238
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + P+R+PTMAGGLFAIDK YF E+G+YD G++IWG ENLE+SF++W CGG LE+IPCS
Sbjct: 239 DETWPIRSPTMAGGLFAIDKSYFLEIGTYDPGLEIWGAENLELSFKIWMCGGNLEMIPCS 298
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVGHVFR PY FP G K + N RVAEVWMDE++D +YA+ P
Sbjct: 299 HVGHVFRASQPYKFPEGNIKTFMRNNMRVAEVWMDEYKDIFYALKP 344
>gi|340712798|ref|XP_003394942.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus terrestris]
Length = 571
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 168/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S
Sbjct: 128 CRMKQWRRDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGE 187
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI ++ E + +D L + P+++ +++
Sbjct: 188 ELSRIHKVRVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAED- 246
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS TF+YI AS GGF+W L F+W + E R D +
Sbjct: 247 -----PTRVVCPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTERQARQKDPTQ 301
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 302 AIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 361
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 362 VFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 403
>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
Length = 522
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 180/288 (62%), Gaps = 30/288 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK KSY LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S++V
Sbjct: 68 CKAKSYNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPSVLLKEIILVDDLSDKVYLK 127
Query: 60 I-----------IDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ +I E + + + F L + H + V + P+++
Sbjct: 128 TDLEKYISSLKRVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLER 187
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I++ +VCP+ID I +TFEY + ++ GGF+W+L F+W+ VP E +RR
Sbjct: 188 IAEN------ETVIVCPVIDTIDWKTFEYYMQTAEPMIGGFDWRLTFQWHSVPKHERLRR 241
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ + P+R+PTMAGGLFA+ K YF LG+YD GMD+WGGENLE+SFRVWQCGG+LEI P
Sbjct: 242 KSE-TDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHP 300
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 301 CSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 343
>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
Length = 570
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 125 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 184
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 185 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 238
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 239 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 298
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 299 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 358
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 359 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 400
>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
Length = 604
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/292 (44%), Positives = 180/292 (61%), Gaps = 25/292 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVV 57
CK ++YP LP T+++I FHNEA +TLLRTVWS+++RSP +L+ EI+L+DDAS E +
Sbjct: 143 CKDRTYPLDKLPDTTVIIPFHNEARTTLLRTVWSILDRSPPSLINEILLIDDASTMEHLK 202
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ + ++ + G K+ + K VV P+++
Sbjct: 203 APLDEELATIPKTRVLRLSERSGLIRAKVFGAEQAKGKVVTFLDSHCECNVGWLEPLLER 262
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I Y+ TVV P+ID I +TF Y + + +T G F W L F W +P E +R
Sbjct: 263 I------YLDRTTVVTPVIDNIDKKTFAYTGSPTVITRGIFTWSLTFSWLDLPWFEQKKR 316
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D +PL +PTMAGGLF++D++YF+E+GSYD GMD+WGGENLE+SFR+WQCGG LE IP
Sbjct: 317 -KDPIAPLPSPTMAGGLFSMDREYFFEIGSYDMGMDVWGGENLEISFRIWQCGGTLEFIP 375
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
CS VGHV+RD PY FP G + + N RVAEVWMDE+++ YY + P A
Sbjct: 376 CSRVGHVYRDFHPYKFPSGAVQTINKNLNRVAEVWMDEYKELYYGVRPHHRA 427
>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
Length = 575
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK + +P LPTT+I+I FHNE S LLRTV S +NRSP LLKEIILVDD S
Sbjct: 130 CKSQVWPHDLPTTTIIICFHNEGRSALLRTVISALNRSPPHLLKEIILVDDFSSDPKDGR 189
Query: 56 --VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ P + +I + E + +++ G L K + P++ I +
Sbjct: 190 RLLKLPKVKLIRNTKREGLIRSRVKGANLARGEVLTFLDSHCECNKNWLEPLLLRIKE-- 247
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ KT+V PIIDVI+ TF+Y+ +S GGF W LNF+W +PP + R G +
Sbjct: 248 ----SPKTIVSPIIDVINLDTFDYLGSSADLRGGFGWNLNFKWDFLPPHILAERQGKPTL 303
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+++P +AGGLF++ K +F LG YD MD+WGGENLE+SFR WQCGG +EIIPCS VGH
Sbjct: 304 PIKSPVIAGGLFSVAKKWFETLGKYDMQMDVWGGENLEISFRTWQCGGAMEIIPCSRVGH 363
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR++ PY FPGG + N R EVWMD+++ +YYA P
Sbjct: 364 VFRNRHPYQFPGGSMNVFQKNTRRAVEVWMDDYKRYYYAAVP 405
>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 548
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 103 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 162
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 163 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 216
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 217 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 276
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 277 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 336
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 337 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 378
>gi|350409232|ref|XP_003488663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus impatiens]
Length = 571
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 168/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S
Sbjct: 128 CRMKQWRRDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGE 187
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + VI ++ E + +D L + P+++ +++
Sbjct: 188 ELSRIHKVRVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAED- 246
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+IDVIS TF+YI AS GGF+W L F+W + E R D +
Sbjct: 247 -----PTRVVCPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTERQARQKDPTQ 301
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 302 AIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 361
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 362 VFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 403
>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
Length = 536
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 91 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 150
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 151 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 204
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 205 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 264
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 265 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 324
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 325 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 366
>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
musculus]
gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 570
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 125 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 184
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 185 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 238
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 239 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 298
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 299 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 358
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 359 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 400
>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
caballus]
Length = 539
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 94 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 153
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 154 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 207
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 208 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 267
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 268 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 327
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 328 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 369
>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
And Ea2
gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
Length = 501
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 56 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 115
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 116 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 169
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 170 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 229
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 230 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 289
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 290 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 331
>gi|268580247|ref|XP_002645106.1| Hypothetical protein CBG16794 [Caenorhabditis briggsae]
Length = 568
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 123/281 (43%), Positives = 180/281 (64%), Gaps = 25/281 (8%)
Query: 8 TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCPIIDVISD 66
T LP SI+I FHNEAW+T++RT+ S+ NRSPR L++EI+LVDD S++ + +D+
Sbjct: 127 TELPRASIIITFHNEAWTTIIRTLHSISNRSPRHLIEEIVLVDDYSDKYWLKGPLDIYVR 186
Query: 67 QTFE---YITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTF 109
Q FE ++T G +L K ++ P+I ++D
Sbjct: 187 Q-FEIPVHVTHLPERSGLIRARLTGAKIAKGPILLFLDSHIEVSEGWLEPLISRVADDR- 244
Query: 110 EYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
++ PIID ISD+ F + T WGGF+W L+F+W+ + + R ++ P
Sbjct: 245 -----TRIIAPIIDNISDEDFGFSTGRTDLWGGFSWILSFKWFDMNGNDTQRLIAKKAEP 299
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+RTPT+AGGLFAI+++YFYE+G+YDEGM++WGGEN+E+SFR+W CGG +EI PCSHVGHV
Sbjct: 300 IRTPTIAGGLFAINREYFYEMGAYDEGMEVWGGENVEISFRIWMCGGSMEIHPCSHVGHV 359
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FR K+PY+F V+ ++ N AR AEVWMDE+++F++ M P
Sbjct: 360 FRTKTPYSFTKEVNFVIRRNQARTAEVWMDEYKEFFFKMVP 400
>gi|355767580|gb|EHH62635.1| hypothetical protein EGM_21033, partial [Macaca fascicularis]
Length = 453
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 8 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 67
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 68 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 121
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 122 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 181
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 182 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 241
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 242 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 283
>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
Length = 544
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 99 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 158
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 159 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 212
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 213 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 272
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 273 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 332
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 333 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 374
>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 88 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 147
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 148 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 201
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 202 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 261
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 262 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 321
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 322 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 363
>gi|395836156|ref|XP_003791031.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Otolemur garnettii]
Length = 571
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|440891991|gb|ELR45390.1| Polypeptide N-acetylgalactosaminyltransferase 2, partial [Bos
grunniens mutus]
Length = 530
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 85 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 144
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 145 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 198
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 199 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 258
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 259 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 318
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 319 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 360
>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
Length = 569
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 124 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 183
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 184 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 237
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 238 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 297
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 298 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 357
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 358 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEFKHFYYAAVP 399
>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
[Macaca mulatta]
Length = 554
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 109 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 168
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 169 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 222
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 223 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 282
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 283 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 342
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 343 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 384
>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
gorilla gorilla]
Length = 533
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 88 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 147
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 148 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 201
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 202 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 261
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 262 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 321
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 322 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 363
>gi|332812181|ref|XP_003308857.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
troglodytes]
gi|410227516|gb|JAA10977.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410264536|gb|JAA20234.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410296424|gb|JAA26812.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 131 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 190
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 191 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 244
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 245 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 304
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 305 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 364
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 365 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 406
>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 504
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 59 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 118
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 119 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 172
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 173 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 232
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 233 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 292
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 293 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEFKHFYYAAVP 334
>gi|26338209|dbj|BAC32790.1| unnamed protein product [Mus musculus]
Length = 570
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/289 (46%), Positives = 174/289 (60%), Gaps = 32/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 125 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 184
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIID 102
+ I+ + + +D G ++R + + V+ P+++
Sbjct: 185 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHYECNERWLEPLLE 238
Query: 103 -VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
V D+T VV PIIDVI+ F+Y+ AS GGF+W L F+W + P +
Sbjct: 239 RVAEDRT-------RVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRS 291
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R G+ +P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEII
Sbjct: 292 RQGNPVAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEII 351
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCS VGHVFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 352 PCSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 400
>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
Length = 571
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
4 [Pan troglodytes]
Length = 533
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 88 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 147
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 148 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 201
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 202 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 261
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 262 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 321
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 322 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 363
>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
Length = 533
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 88 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 147
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 148 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 201
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 202 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 261
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 262 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 321
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 322 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 363
>gi|291402210|ref|XP_002717436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oryctolagus cuniculus]
Length = 571
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|300797173|ref|NP_001180032.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Bos
taurus]
gi|296472282|tpg|DAA14397.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Bos
taurus]
Length = 571
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|386780726|ref|NP_001248284.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Macaca
mulatta]
gi|384941838|gb|AFI34524.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
gi|387540526|gb|AFJ70890.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
Length = 571
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Papio anubis]
Length = 571
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|4758412|ref|NP_004472.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Homo
sapiens]
gi|51315838|sp|Q10471.1|GALT2_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|971461|emb|CAA59381.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|26996816|gb|AAH41120.1| GALNT2 protein [Homo sapiens]
gi|119590317|gb|EAW69911.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_c [Homo sapiens]
gi|239740418|gb|ACS13744.1| polypeptide N-acetylgalactosaminyltransferase 2 [Homo sapiens]
gi|307686451|dbj|BAJ21156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [synthetic
construct]
Length = 571
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
Length = 530
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 85 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 144
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 145 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 198
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 199 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 258
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 259 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 318
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 319 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 360
>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_a [Homo sapiens]
Length = 508
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 88 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 147
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 148 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 201
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 202 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 261
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 262 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 321
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 322 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 363
>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
lupus familiaris]
Length = 588
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 143 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 202
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 203 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 256
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 257 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 316
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEI+PCS VGH
Sbjct: 317 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGH 376
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 377 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 418
>gi|158261119|dbj|BAF82737.1| unnamed protein product [Homo sapiens]
Length = 571
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
catus]
Length = 653
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 208 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 267
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 268 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 321
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 322 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 381
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEI+PCS VGH
Sbjct: 382 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGH 441
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 442 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 483
>gi|410342331|gb|JAA40112.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410342333|gb|JAA40113.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 131 CQRKQWRGGLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 190
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 191 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 244
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 245 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 304
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 305 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 364
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 365 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 406
>gi|405973911|gb|EKC38600.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Crassostrea gigas]
Length = 581
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 134/283 (47%), Positives = 174/283 (61%), Gaps = 20/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+++ + + L TS++I FHNEA STLLRT+ SV +RSP+ L+ EIILVDD S
Sbjct: 135 CREEQHDSNLDPTSVIITFHNEARSTLLRTIVSVFSRSPKHLITEIILVDDFSDDPSDGQ 194
Query: 54 ERVVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTVVC---PIIDVIS-DQ 107
E V + ++ + E + S + L + H + V P++D I D+
Sbjct: 195 ELAVIKRVKILRNDKREGLMRSRVKGADAARAPILTFLDSHCECNVGWLEPLLDRIKGDR 254
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VV PIIDVI+ FEYI AS GGF+W L F+W + P E +R G+
Sbjct: 255 T-------RVVSPIIDVINMDNFEYIGASADLKGGFDWNLVFKWDYMTPEERNKRAGNPI 307
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGGLF+I+K +F ELG YD MD+WGGENLE+SFRVWQC G LEIIPCS VG
Sbjct: 308 QPIRTPMIAGGLFSIEKKWFEELGKYDRNMDVWGGENLEISFRVWQCHGSLEIIPCSRVG 367
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PYTFPGG + N R AEVWMD +++FYYA P
Sbjct: 368 HVFRKQHPYTFPGGSGNVFARNTRRAAEVWMDNYKEFYYAAVP 410
>gi|350592744|ref|XP_001927809.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Sus
scrofa]
Length = 571
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
Length = 618
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 128 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 187
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 188 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 241
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 242 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 301
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 302 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 361
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++ FYYA P
Sbjct: 362 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKHFYYAAVP 403
>gi|441612314|ref|XP_004088076.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Nomascus leucogenys]
Length = 570
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 125 CQRKQWRVDLPPTSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 184
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 185 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 238
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 239 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 298
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 299 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 358
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 359 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 400
>gi|332265851|ref|XP_003281927.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 111 CQRKQWRVDLPPTSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 170
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 171 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 224
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 225 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 284
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 285 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 344
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 345 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 386
>gi|332265853|ref|XP_003281928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
2 [Nomascus leucogenys]
Length = 571
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 131/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPPTSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
Length = 415
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 134/287 (46%), Positives = 178/287 (62%), Gaps = 29/287 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K YP LP +SI+I FHNEAWSTLLRTV SVINR+P LL+EI+L+DDAS R
Sbjct: 48 CKAKKYPLHLPKSSIIICFHNEAWSTLLRTVHSVINRTPPRLLEEILLIDDASNR----- 102
Query: 61 IDVISDQTFEYITASDMT--------WGGFNWKLREKNRHKKTVVCPIIDVISDQT---F 109
D + ++ EY+ + G +L+ K +++ +D + +
Sbjct: 103 -DELKEKLEEYVAKLKVVRIIRLSKRQGLIRARLKGAAAAKGSILT-FLDAHCECSKGWL 160
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASD-MTWGGFNWKLNFRWYRVPPREMMRR 162
E + AK VV P+ID ISD TF Y + G F W+L F W VP EM RR
Sbjct: 161 EPLAAKIAENSSNVVMPVIDEISDTTFYYHAVPEPFHRGVFRWRLEFGWKPVPQYEMERR 220
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + +RTP MAGGLF+IDK+YF ++G+YD GMDIWGGENLE+SFR+W CGG +E++P
Sbjct: 221 K-DEADGIRTPVMAGGLFSIDKNYFEKIGTYDTGMDIWGGENLEISFRIWMCGGAIEMLP 279
Query: 223 CSHVGHVFRDKSPYTF---PGGVSKIVLHNAARVAEVWMDEWRDFYY 266
CS VGHVFR + PY+F PG + +V +N RVA+VWMDE++ +Y
Sbjct: 280 CSRVGHVFRPRFPYSFPARPGHNTDVVSNNLMRVADVWMDEYKKHFY 326
>gi|126307024|ref|XP_001369295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Monodelphis domestica]
Length = 571
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 126 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 185
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 186 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 239
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 240 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 299
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 300 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 359
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PY+FPGG + N R AEVWMDE+++FYYA P
Sbjct: 360 VFRKQHPYSFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 401
>gi|395531657|ref|XP_003767891.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Sarcophilus harrisii]
Length = 542
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 97 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 156
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 157 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 210
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 211 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVA 270
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 271 PIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 330
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PY+FPGG + N R AEVWMDE+++FYYA P
Sbjct: 331 VFRKQHPYSFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 372
>gi|344235750|gb|EGV91853.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Cricetulus griseus]
Length = 797
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 134/292 (45%), Positives = 175/292 (59%), Gaps = 19/292 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 97 CPSLSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 156
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 157 LLTRIPKVK---CLRNDKREGLIRSRVRGADVAGATVLT-FLDSHCEVNIEWLQPMLQRV 212
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 213 MEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTKPI 271
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 272 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 331
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVST 278
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T
Sbjct: 332 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVAT 383
>gi|348534088|ref|XP_003454535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oreochromis niloticus]
Length = 559
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K + + LP +S+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S+ P
Sbjct: 114 CRHKQWKSDLPASSVVITFHNEARSALLRTVVSVLKKSPPHLVKEIILVDDYSDN---PE 170
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI------SDQTFEYITA 114
+ + + + G +R + R P++ + +D E +
Sbjct: 171 DGALLGKIEKVRVLRNDRREGL---MRSRVRGADAATAPVLTFLDSHCECNDHWLEPLLE 227
Query: 115 KT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + + R G+ +
Sbjct: 228 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTQEQRRARQGNPIA 287
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK+YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 288 PIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 347
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 348 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 389
>gi|345488662|ref|XP_003425959.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Nasonia vitripennis]
Length = 572
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 169/284 (59%), Gaps = 22/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S+
Sbjct: 129 CRLKQWRQDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHP--ED 186
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISD 106
D +S + ++ G ++R + V+ P+++ +++
Sbjct: 187 GDELSRIHKVRVIRNEKREGLMRSRVRGADAATANVLTFLDSHCECNADWLEPLLERVAE 246
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VVCP+IDVIS F+YI AS GGF+W L F+W + E R D
Sbjct: 247 D------PSRVVCPVIDVISMDNFQYIGASADLRGGFDWSLVFKWEYLSQSERQARQKDP 300
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ +RTP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS V
Sbjct: 301 TQAIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRV 360
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 361 GHVFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 404
>gi|291225677|ref|XP_002732827.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Saccoglossus kowalevskii]
Length = 633
Score = 243 bits (620), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 133/286 (46%), Positives = 173/286 (60%), Gaps = 27/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
C + Y LP+TS+VI F NEAWSTLLRTV+SVI+RSP LL EIILVDD S
Sbjct: 167 CAYQVYSNNLPSTSVVICFFNEAWSTLLRTVYSVIDRSPANLLHEIILVDDYSSSTYLKD 226
Query: 57 ---------VCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIID 102
+ I+ +I ++ E + + M G L + P+++
Sbjct: 227 YLDDFIKTNLFQIVKIIHNKKREGLIRARMIGAAAATGDVVMFLDSHCEVSTQWLEPLLE 286
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I TVVCPIID+I+ TFEY S + GGFNW L+F+W +P + +
Sbjct: 287 RIK------FDPHTVVCPIIDIINADTFEY-QQSPLVRGGFNWGLHFKWDTIPSSQFKGK 339
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D P+R+PTMAGGLFA+D+ YF+ELG YD+GMDIWGGENLE+SFR+WQCGG LEIIP
Sbjct: 340 E-DYIKPVRSPTMAGGLFAMDRKYFHELGEYDDGMDIWGGENLEISFRIWQCGGTLEIIP 398
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
CS VGHVFR + PY P G + N+ RVA VWMDE+++ Y+ +
Sbjct: 399 CSRVGHVFRKRRPYGSPNG-EDTMSKNSLRVAHVWMDEYKEHYFEL 443
>gi|327274386|ref|XP_003221958.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Anolis carolinensis]
Length = 608
Score = 243 bits (620), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 132/282 (46%), Positives = 179/282 (63%), Gaps = 15/282 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCP 59
CK K YP LP+ SI+I F+NEA+S LLRTV SV++R+P LL EIILVDD SE V +
Sbjct: 141 CKGKKYPLDLPSASIIICFYNEAFSALLRTVHSVLDRTPSHLLHEIILVDDNSELVDLKE 200
Query: 60 IIDVISDQTFE---YITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+DV + + + G ++ + V+ C + ++
Sbjct: 201 DLDVYLRKNLPNNVKLVRNGKREGLIRGRMIGASHATGKVLVFLDSHCEVNELWLQPLLT 260
Query: 111 YI--TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I + KTVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ G ++
Sbjct: 261 PIRESRKTVVCPVIDIISADTLTY-SSSPVVRGGFNWGLHFKWDLVPLSELEGPEG-ATA 318
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+++PTMAGGLFA+D++YF ELG YD GMDIWGGENLE+SFR+W CGG L IIPCS VGH
Sbjct: 319 PIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLLIIPCSRVGH 378
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+FR + PY PGG + HN+ R+A VWMDE++D Y+A+ P
Sbjct: 379 IFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKDQYFALRP 419
>gi|291243602|ref|XP_002741690.1| PREDICTED: polypeptide GalNAc transferase 5-like [Saccoglossus
kowalevskii]
Length = 753
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 131/294 (44%), Positives = 177/294 (60%), Gaps = 31/294 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----V 56
C+ + YP LP TS++IVFHNEAW+TLLRTV SVI+RSP LL+EI+LVDDAS +
Sbjct: 289 CEHREYPHILPKTSVIIVFHNEAWTTLLRTVISVIDRSPWQLLEEILLVDDASTSEKYWL 348
Query: 57 VCPIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PII 101
+ + ++ + + G +LR + V+ P++
Sbjct: 349 QSELDEYVAKLPVITRVIRTGKRVGLIQGRLRGVEEARGEVLTFLDSHCECNIGWLEPLL 408
Query: 102 -DVISDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPRE 158
++++D+T TVV P +DVISD+TF Y I GGF W ++F+WY +P RE
Sbjct: 409 SEIVNDRT-------TVVAPNLDVISDKTFGYTFIKPEQTMIGGFGWLVDFKWYSLPKRE 461
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+R D S PLRTPT+AGGLFAID DYF+ +G YD G D WG ENLE+SFRVWQCGG L
Sbjct: 462 RLRVNNDMSRPLRTPTIAGGLFAIDADYFHRIGLYDPGFDTWGAENLELSFRVWQCGGTL 521
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSK--IVLHNAARVAEVWMDEWRDFYYAMNP 270
EI+PCSHVGHVFR PY + + + N R+ +VWMD+ + F+ A+ P
Sbjct: 522 EIVPCSHVGHVFRSSIPYKYKDNKNPGLTIAKNNMRLMDVWMDDLKYFFLAILP 575
>gi|410912128|ref|XP_003969542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Takifugu rubripes]
Length = 558
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 173/286 (60%), Gaps = 26/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----V 56
C+ K + + LP +S+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S+
Sbjct: 113 CRHKQWKSDLPASSVVITFHNEARSALLRTVVSVLKKSPPHLVKEIILVDDYSDNPEDGA 172
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI------SDQTFE 110
+ ID + + +D G +R + R P++ + +D E
Sbjct: 173 LLGKIDKLR------VLRNDRREG----LMRSRVRGADAATAPVLTFLDSHCECNDHWLE 222
Query: 111 YITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ + VV PIIDVI+ F+Y+ AS GGF+W L F+W + + R G
Sbjct: 223 PLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTLDQRRARQG 282
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
+ +P++TP +AGGLF +DK+YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS
Sbjct: 283 NPIAPIKTPMIAGGLFVMDKEYFEQLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCS 342
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 343 RVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 388
>gi|224044641|ref|XP_002188932.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Taeniopygia guttata]
Length = 608
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 134/287 (46%), Positives = 182/287 (63%), Gaps = 25/287 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++KSYP LP+ S+VI F+NEA S LLRTV SV++R+P LL EIILVDD SE +
Sbjct: 141 CREKSYPADLPSASVVICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDNSE-----L 195
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLRE---KNR-----HKKTVVCPIIDV---ISDQTF 109
D+ D + T T + RE + R H V +D +++
Sbjct: 196 ADLKKDLSEYVKTQLPRTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHCEVNEMWL 255
Query: 110 EYITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + A +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 256 QPLLAPIREDPRTVVCPVIDIISADTLTY-SSSPVVRGGFNWGLHFKWDLVPLAELEGPE 314
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
G ++P+++PTMAGGLFA+D++YF ELG YD GMDIWGGENLE+SFR+W CGG L IIPC
Sbjct: 315 G-ATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIPC 373
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y+A+ P
Sbjct: 374 SRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYFALRP 419
>gi|291391583|ref|XP_002712189.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Oryctolagus cuniculus]
Length = 941
Score = 243 bits (619), Expect = 1e-61, Method: Composition-based stats.
Identities = 120/294 (40%), Positives = 172/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 487 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDCSTK----- 541
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 542 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 595
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 596 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFLWPMNFGWKT 649
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 650 IPPDVVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 709
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 710 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 763
>gi|8918932|dbj|BAA97985.1| unnamed protein product [Mus musculus]
Length = 558
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 170/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 113 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDKREGLIRSRVRRADVAGATVLT-FLDSHCEVNVEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 MEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DLTKPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|50510795|dbj|BAD32383.1| mKIAA1130 protein [Mus musculus]
Length = 655
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 170/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 210 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 269
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 270 LLTRIPKVK---CLRNDKREGLIRSRVRGADVAGATVLT-FLDSHCEVNVEWLQPMLQRV 325
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 326 MEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTKPI 384
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 385 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 444
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 445 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 484
>gi|403300209|ref|XP_003940844.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Saimiri
boliviensis boliviensis]
Length = 724
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 171/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 279 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 338
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 339 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 392
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 393 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 452
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 453 PIKTPMIAGGLFVMDKFYFEALGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 512
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 513 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 554
>gi|348539520|ref|XP_003457237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oreochromis niloticus]
Length = 619
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 132/291 (45%), Positives = 175/291 (60%), Gaps = 33/291 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++KSYP LP+ S+VI F NEA S LLRTV SV++R+P LL EIILVDD SE
Sbjct: 117 CREKSYPVALPSASVVICFFNEALSALLRTVHSVLDRTPAYLLHEIILVDDHSE------ 170
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------HKKTVVCPIIDVISDQT 108
++ + D+ Y+ A G +R + R H V +D +
Sbjct: 171 LEELKDELDRYVRAE---LQGKVQLVRNQRREGLIRGRMIGASHATGEVLVFLDSHCEVN 227
Query: 109 FEYITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
++ +TVVCP+ID+IS T Y + S + GGFNW L+F+W VPP E+
Sbjct: 228 QAWLQPLLAPIQKDHRTVVCPVIDIISADTLAY-SPSPIVRGGFNWGLHFKWDPVPPSEL 286
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
G S P+R+PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 287 SGPEGA-SGPIRSPTMAGGLFAMNRKYFNELGQYDAGMDIWGGENLEISFRIWMCGGQLF 345
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY PGG + HN+ R+A VWMD +++ Y ++ P
Sbjct: 346 IIPCSRVGHIFRKRRPYGSPGGHDTMA-HNSLRLAHVWMDGYKEQYLSLRP 395
>gi|148670721|gb|EDL02668.1| mCG7620, isoform CRA_b [Mus musculus]
Length = 667
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 170/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 222 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 281
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 282 LLTRIPKVK---CLRNDKREGLIRSRVRGADVAGATVLT-FLDSHCEVNVEWLQPMLQRV 337
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 338 MEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTKPI 396
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 397 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 456
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 457 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 496
>gi|357620060|gb|EHJ72385.1| hypothetical protein KGM_13871 [Danaus plexippus]
Length = 600
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 130/274 (47%), Positives = 164/274 (59%), Gaps = 9/274 (3%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LPT S++IVFHNEAWSTL+RTV SVI RSP LLKEIILVDDASER + + D +++
Sbjct: 160 LPTASVIIVFHNEAWSTLMRTVMSVILRSPDMLLKEIILVDDASERKYLGKELDDAVANL 219
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAKTV-VCP 120
I S G +L V+ C + + + + V +CP
Sbjct: 220 DKVVILRSLNRTGLVGARLMGAKTATGNVLVFLDAHCEVTKGWLEPLLDRAGSDDVFICP 279
Query: 121 IIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLF 180
ID++SD T Y + D WG F+W+L+FRW MM + S P TP MAGGLF
Sbjct: 280 HIDLLSDDTLAYTKSIDAHWGAFSWRLHFRWLMPSNEIMMNKSRYPSKPFPTPAMAGGLF 339
Query: 181 AIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPG 240
A+ K F+ LG YDE M IWGGENLE+S+R WQCG +EI CS VGH+FR SPY +PG
Sbjct: 340 AVRKSLFWRLGGYDEEMSIWGGENLELSWRAWQCGARVEITHCSRVGHIFRRHSPYKYPG 399
Query: 241 GVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
GV K++ N AR A VWMDEW DF++ NP +A
Sbjct: 400 GVFKVLNTNLARAATVWMDEWADFFFKFNPSVAA 433
>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Ailuropoda melanoleuca]
Length = 578
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 138/294 (46%), Positives = 176/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSRKFDYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I+ TVVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
Length = 491
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/294 (46%), Positives = 176/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 63 CKSRKFDYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 119
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 120 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 170
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I+ TVVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 171 CNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 230
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 231 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 289
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 290 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 338
>gi|291190646|ref|NP_001167159.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
gi|223648406|gb|ACN10961.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
Length = 560
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----V 56
C+ K + T LP +S++I FHNEA S LLRTV SV+ +SP L+KEIILVDD S+
Sbjct: 115 CQHKQWKTELPASSVIITFHNEARSALLRTVVSVLKKSPPHLVKEIILVDDYSDNPEDGA 174
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ I + +D G ++R + +V+ C + + E
Sbjct: 175 LLGKIEKIR------VLRNDRREGLMRSRVRGADTATASVLTFLDSHCECNEHWLEPLLE 228
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + + R G+ ++
Sbjct: 229 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTVEQRRVRQGNPTA 288
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DKDYF LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 289 PIKTPMIAGGLFVMDKDYFELLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 348
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 349 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEFKNFYYAAVP 390
>gi|190358441|ref|NP_001121823.1| polypeptide N-acetylgalactosaminyltransferase 2 [Danio rerio]
Length = 559
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----V 56
C+ K + T LP +S+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S+
Sbjct: 114 CRHKQWRTDLPASSVVITFHNEARSALLRTVISVLKKSPPHLVKEIILVDDYSDNPEDGA 173
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + +V+ C + + E
Sbjct: 174 LLGKIEKVR------VLRNDRREGLMRSRVRGADAATASVLTFLDSHCECNEHWLEPLLE 227
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + + R G+ +
Sbjct: 228 RVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTLEQRRARQGNPIA 287
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DKDYF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 288 PIKTPMIAGGLFVMDKDYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 347
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMD++++FYYA P
Sbjct: 348 VFRKQHPYTFPGGSGTVFARNTRRAAEVWMDDFKNFYYAAVP 389
>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
Length = 494
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 125/286 (43%), Positives = 170/286 (59%), Gaps = 26/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--- 57
C+ ++YP+ LP TSI+I FHNEA STLLRTV S++N++P L+ EIILVDD S+
Sbjct: 48 CRYEAYPSTLPATSIIITFHNEARSTLLRTVKSILNKTPPNLVNEIILVDDFSDDAEDGL 107
Query: 58 ----CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI- 112
P + V+ + + + S + + + K+ V +D + +++
Sbjct: 108 LLMGLPKVKVLRNNKRQGLIRSRV----------KGSDTAKSDVLTFLDSHCECNTDWLQ 157
Query: 113 --------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
K VV PIIDVI+ F YI AS GGF+W L+F+W + P + R
Sbjct: 158 PLLKRVVQNKKAVVSPIIDVINMDDFSYIGASADIKGGFDWSLHFKWDNLTPEQKQSRRS 217
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
+P++TP +AGGLF + K +F E+G YD MDIWGGEN E+SFR WQCGG +EIIPCS
Sbjct: 218 TPIAPIKTPMIAGGLFVVTKSWFEEMGKYDTMMDIWGGENFEISFRTWQCGGSMEIIPCS 277
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVFR + PYTFP G + + N R AEVWMDE++ FYYA P
Sbjct: 278 RVGHVFRKRHPYTFPDGNANTYMKNTRRTAEVWMDEYKRFYYAARP 323
>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
2 [Sus scrofa]
Length = 578
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS+VI F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSKKFDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|351702714|gb|EHB05633.1| Polypeptide N-acetylgalactosaminyltransferase 14 [Heterocephalus
glaber]
Length = 553
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/282 (46%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 102 CMLLVYHTALPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPDDCK 161
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ V P + + + + + S D+ G L + + P++ + +
Sbjct: 162 QLVRLPKVKCLRNSERQGLVRSRMRGADIAQGATLTFLDSHCEVNRDWLEPLLHRVKE-- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+FRW ++ P + RR D +
Sbjct: 220 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFRWEQLSPEQKARRL-DPTE 274
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 275 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 334
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 335 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 376
>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Sus scrofa]
gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Sus scrofa]
Length = 582
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS+VI F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 128 CKSKKFDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 184
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 185 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 235
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 355 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 403
>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
Length = 688
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/289 (45%), Positives = 179/289 (61%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK + Y T LPTT ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 200 CKDSTQYLTNLPTTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 259
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 260 KQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSAVLTYLDSHCECT 309
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD T EY +S + GGF+W L F W+ VP R
Sbjct: 310 EGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHYRDSSGVNVGGFDWNLQFSWHSVPER 369
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + + P+ +PTMAGGLFAID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 370 ER-KRHNNSAEPVYSPTMAGGLFAIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 428
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMD++ +YY
Sbjct: 429 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLRKNSVRLAEVWMDDYAQYYY 476
>gi|124487253|ref|NP_001074890.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Mus musculus]
gi|341940755|sp|Q9JJ61.2|GLTL1_MOUSE RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|52851357|dbj|BAD52071.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74218446|dbj|BAE23810.1| unnamed protein product [Mus musculus]
gi|115527273|gb|AAI10635.1| Galntl1 protein [Mus musculus]
gi|115528977|gb|AAI25016.1| Galntl1 protein [Mus musculus]
Length = 558
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 170/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 113 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDKREGLIRSRVRGADVAGATVLT-FLDSHCEVNVEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 MEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTKPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|432852860|ref|XP_004067421.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Oryzias latipes]
Length = 556
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 172/286 (60%), Gaps = 26/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----V 56
CK K + + LP +S+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S+
Sbjct: 111 CKHKQWNSDLPASSVVITFHNEARSALLRTVVSVLKKSPPQLVKEIILVDDYSDNSEDGA 170
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI------SDQTFE 110
+ I+ + + +D G +R + R P++ + +D E
Sbjct: 171 LLGKIEKVR------VLRNDRREG----LMRSRVRGADAATAPVLTFLDSHCECNDHWLE 220
Query: 111 YITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ + VV PIIDVI+ F+Y+ AS GGF+W L F+W + + R G
Sbjct: 221 PLLERVAEDKTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTLEQRRARQG 280
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
+ +P++TP +AGGLF +DK+YF LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS
Sbjct: 281 NPIAPIKTPMIAGGLFVMDKEYFELLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCS 340
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 341 RVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 386
>gi|241998138|ref|XP_002433712.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215495471|gb|EEC05112.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 653
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/278 (46%), Positives = 176/278 (63%), Gaps = 15/278 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCP 59
C+ + + LPT S+V+ F+NEAWSTLLRTV +V+ R+PR LL E+ILVDD S +V + P
Sbjct: 179 CRSEEHGAELPTASVVVCFYNEAWSTLLRTVHTVLGRTPRHLLHEVILVDDNSTQVDLGP 238
Query: 60 -IIDVISDQTFEY---ITASDMTWGGFNWKLREKNRHKKTVV-----CPIIDVISDQTFE 110
+ + +S Q + I D +N + +V C + + E
Sbjct: 239 QLAEYVSSQLPSHVRLIRTRDREGLIRARMFGARNASGEVLVFLDSHCEVNVGWLEPLLE 298
Query: 111 YITAK--TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I A TV CPIID+I+ TFEY TAS + GGFNW L+F+W PP + R+G +
Sbjct: 299 RIRANRATVTCPIIDIINADTFEY-TASPIVRGGFNWGLHFKW-ESPPAGLARKGRGAIA 356
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+ +PTMAGGLFA+D+ +F+ LG YD+GMDIWGGENLE+SFR+W CGG LEIIPCS VGH
Sbjct: 357 PIPSPTMAGGLFAMDRKFFHRLGEYDDGMDIWGGENLEISFRIWMCGGQLEIIPCSRVGH 416
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
VFR + PY P G + N+ RVA VWMD+++ +Y+
Sbjct: 417 VFRRRRPYGSPNGEDTLT-KNSLRVAHVWMDDYKKYYF 453
>gi|432950788|ref|XP_004084611.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11-like [Oryzias
latipes]
Length = 574
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/292 (44%), Positives = 175/292 (59%), Gaps = 35/292 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C+K+SYP LP+ S+VI F NEA S LLRTV SV++R+P LL EIILVDD SE
Sbjct: 108 CRKRSYPQALPSASVVICFFNEALSALLRTVHSVLDRTPAYLLHEIILVDDQSELEELKE 167
Query: 55 -------RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
+ + ++ ++ E + M H V +D +
Sbjct: 168 GLDRCVREELQGKVRLVRNRKREGLIRGRMIGAA----------HATGDVLVFLDSHCEV 217
Query: 108 TFEYITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
+++ +TVVCPIID+IS T Y ++S + GGFNW L+F+W VPP E
Sbjct: 218 NQDWLQPLLAPIQKDRRTVVCPIIDIISADTLTY-SSSPIVRGGFNWGLHFKWDPVPPSE 276
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ G + P+R+PTMAGGLFA++++YF ELG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 277 ISGPEGA-AGPIRSPTMAGGLFAMNREYFNELGRYDPGMDIWGGENLEISFRIWMCGGQL 335
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y ++ P
Sbjct: 336 LIIPCSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYLSLRP 386
>gi|68534728|gb|AAH98578.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|158260513|dbj|BAF82434.1| unnamed protein product [Homo sapiens]
Length = 558
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADMAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|348513278|ref|XP_003444169.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Oreochromis niloticus]
Length = 584
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 134/288 (46%), Positives = 172/288 (59%), Gaps = 30/288 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C+ K + LPTTS++I F+NEAWSTLLRT+ SV+ +P LLKEIILVDD S+R +
Sbjct: 130 CRSKKFDYRHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILVDDFSDRGYLK 189
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D ISD + ++ G +L V+ P+++
Sbjct: 190 SKLADYISDLQRVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPLLER 249
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTW-GGFNWKLNFRWYRVPPREMMRR 162
IS+ A T+VCP+ID I TFE+ +D GGF+W+L F+W+ VP E RR
Sbjct: 250 ISE------NASTIVCPVIDTIDWNTFEFYMQTDEPMIGGFDWRLTFQWHSVPEMERKRR 303
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
R P+R+PTMAGGLFA+ K YF LG+YD GMD+WGGENLE+SFRVWQCGG LEI P
Sbjct: 304 KS-RIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLELSFRVWQCGGSLEIHP 362
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF K+PY P L N R AEVWMD ++ +Y NP
Sbjct: 363 CSHVGHVFPKKAPYARPN-----FLQNTVRAAEVWMDSYKKHFYNRNP 405
>gi|296215364|ref|XP_002754093.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Callithrix jacchus]
Length = 558
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
glaber]
Length = 582
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 138/294 (46%), Positives = 177/294 (60%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K+Y LPTTS+VI F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 128 CKSKTYDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 185
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI++ + +L N+ + V +I DV++
Sbjct: 186 ----LKAQLETYISSLERV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 235
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 QERDRRT-SRIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMD++++ +Y NP
Sbjct: 355 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDDYKEHFYNRNP 403
>gi|449270901|gb|EMC81545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Columba livia]
Length = 608
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 132/287 (45%), Positives = 179/287 (62%), Gaps = 25/287 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++KSYP+ LP+ S++I F+NEA S LLRTV SV++R+P LL EIILVDD SE +
Sbjct: 141 CREKSYPSDLPSASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDNSE-----L 195
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLRE---KNR-----HKKTVVCPIIDVISDQTFEYI 112
D+ D T T + RE + R H V +D + ++
Sbjct: 196 ADLKKDLDEYVKTQLPKTTKLVRNEKREGLIRGRMIGASHATGQVLVFLDSHCEVNEMWL 255
Query: 113 TA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 256 QPLLTPIREDRRTVVCPVIDIISADTLTY-SSSPVVRGGFNWGLHFKWDLVPLSELEGPE 314
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
G ++P+++PTMAGGLFA+D++YF ELG YD GMDIWGGENLE+SFR+W CGG L IIPC
Sbjct: 315 G-ATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIPC 373
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y+A+ P
Sbjct: 374 SRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYFALRP 419
>gi|118085566|ref|XP_418541.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Gallus
gallus]
Length = 608
Score = 241 bits (614), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 133/288 (46%), Positives = 180/288 (62%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++KSYP+ LP S++I F+NEA S LLRTV SV++R+P LL EIILVDD SE
Sbjct: 141 CREKSYPSDLPFASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDNSE------ 194
Query: 61 IDVISDQTFEYI-TASDMTWGGFNWKLRE---KNR-----HKKTVVCPIIDVISDQTFEY 111
+D + EY+ T T + RE + R H V +D + +
Sbjct: 195 LDDLKKDLVEYVKTRLPKTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHCEVNEMW 254
Query: 112 ITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 255 LQPLLTPIKEDRRTVVCPVIDIISADTLTY-SSSPVVRGGFNWGLHFKWDLVPLSELEGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+D++YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EG-ATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y+A+ P
Sbjct: 373 CSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYFALRP 419
>gi|403264517|ref|XP_003924524.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Saimiri boliviensis boliviensis]
Length = 558
Score = 241 bits (614), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 113 CPSMSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|332020473|gb|EGI60888.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Acromyrmex
echinatior]
Length = 442
Score = 241 bits (614), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 131/279 (46%), Positives = 166/279 (59%), Gaps = 18/279 (6%)
Query: 4 KSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-------ERV 56
K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S E
Sbjct: 2 KQWRQDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELS 61
Query: 57 VCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEY 111
+ VI ++ E + +D L + P+++ +++
Sbjct: 62 RIHKVRVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAED---- 117
Query: 112 ITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
VVCP+IDVIS TF+YI AS GGF+W L F+W + E R D + +R
Sbjct: 118 --PTRVVCPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTERQARQKDPTQAIR 175
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR
Sbjct: 176 TPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFR 235
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 236 KRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 274
>gi|426223372|ref|XP_004005849.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Ovis
aries]
Length = 552
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 168/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y LP TSI+I FHNEA STLLRT+ S++NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCADLPPTSIIIAFHNEARSTLLRTIRSILNRTPMNLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIIHLDTFNYIESASELRGGFDWSLHFQWEQLTPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF +DK +FY LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVMDKSWFYYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYASRP 375
>gi|270265820|ref|NP_065743.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|270265827|ref|NP_001161840.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|332842578|ref|XP_522885.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|51316024|sp|Q8N428.2|GLTL1_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|51490858|emb|CAD44534.1| polypeptide N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|112180422|gb|AAH36812.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|112818460|gb|AAI22546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|119601392|gb|EAW80986.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|119601394|gb|EAW80988.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|164691113|dbj|BAF98739.1| unnamed protein product [Homo sapiens]
gi|410265456|gb|JAA20694.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|297695402|ref|XP_002824932.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pongo abelii]
Length = 558
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|410214072|gb|JAA04255.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410214074|gb|JAA04256.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295440|gb|JAA26320.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295442|gb|JAA26321.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410336845|gb|JAA37369.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDLEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|344288741|ref|XP_003416105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Loxodonta africana]
Length = 552
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CNLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSSDPDDCK 160
Query: 56 --VVCPIIDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDVISDQT 108
+ P + + + + + S + G L + + P++ + +
Sbjct: 161 LLIKLPKVKCVRNNERQGLVRSRIQGAGIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN EMSFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDSEMDIWGGENFEMSFRVWMCGGSLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYIFPDGNTNTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|332228990|ref|XP_003263671.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Nomascus leucogenys]
Length = 558
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/305 (44%), Positives = 181/305 (59%), Gaps = 19/305 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFRML 286
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T + +
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKM 407
Query: 287 SYSSW 291
+ S+
Sbjct: 408 NCKSF 412
>gi|354472196|ref|XP_003498326.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Cricetulus griseus]
Length = 513
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 168/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 68 CPSLSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 127
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 128 LLTRIPKVK---CLRNDKREGLIRSRVRGADVAGATVLT-FLDSHCEVNIEWLQPMLQRV 183
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 184 MEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTKPI 242
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 243 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 302
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 303 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 342
>gi|62122367|dbj|BAD93178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|119601393|gb|EAW80987.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_b
[Homo sapiens]
gi|168269696|dbj|BAG09975.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
[synthetic construct]
Length = 542
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 135/305 (44%), Positives = 181/305 (59%), Gaps = 19/305 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFRML 286
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T + +
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKM 407
Query: 287 SYSSW 291
+ S+
Sbjct: 408 NCKSF 412
>gi|410897032|ref|XP_003962003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Takifugu rubripes]
Length = 624
Score = 240 bits (613), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 177/286 (61%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDASE + + D+
Sbjct: 178 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASED------EALKDELD 231
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + +R++ R K + ++ V + T ++ A
Sbjct: 232 EYLKRLSIVQ-----VVRQRER-KGLITARLLGASVATGDTLTFLDAHCECFNGWLEPLL 285
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE++ S + G F+W L F W +P E RR
Sbjct: 286 ARIAENHSAVVSPDITTIDLNTFEFVKPSPYGQNHNRGNFDWSLAFGWESLPDHEKRRRK 345
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I KDYFY++GSYD+ M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 346 -DETYPIKTPTFAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSFRVWQCGGQLEIIPC 404
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMD++++ +Y N
Sbjct: 405 SIVGHVFRTKSPHSFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRN 449
>gi|307214182|gb|EFN89299.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Harpegnathos
saltator]
Length = 442
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 131/279 (46%), Positives = 166/279 (59%), Gaps = 18/279 (6%)
Query: 4 KSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-------ERV 56
K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S E
Sbjct: 2 KQWRRDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELS 61
Query: 57 VCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEY 111
+ VI ++ E + +D L + P+++ +++
Sbjct: 62 RIHKVRVIRNEKREGLMRSRVRGADAATANVLTFLDSHCECNADWIEPLLERVAED---- 117
Query: 112 ITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
VVCP+IDVIS TF+YI AS GGF+W L F+W + E R D + +R
Sbjct: 118 --PTRVVCPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQIERQARQKDPTQAIR 175
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
TP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR
Sbjct: 176 TPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFR 235
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 236 KRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 274
>gi|402876549|ref|XP_003902024.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Papio
anubis]
Length = 558
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/305 (44%), Positives = 181/305 (59%), Gaps = 19/305 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFRML 286
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T + +
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKM 407
Query: 287 SYSSW 291
+ S+
Sbjct: 408 NCKSF 412
>gi|380786811|gb|AFE65281.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Macaca mulatta]
Length = 558
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/305 (44%), Positives = 181/305 (59%), Gaps = 19/305 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFRML 286
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T + +
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKM 407
Query: 287 SYSSW 291
+ S+
Sbjct: 408 NCKSF 412
>gi|397507535|ref|XP_003824250.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Pan
paniscus]
Length = 529
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 84 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 143
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 144 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 199
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 200 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 258
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 259 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 318
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 319 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 358
>gi|355693388|gb|EHH27991.1| hypothetical protein EGK_18322, partial [Macaca mulatta]
Length = 499
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/305 (44%), Positives = 181/305 (59%), Gaps = 19/305 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 54 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 113
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 114 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 169
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 170 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 228
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 229 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 288
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFRML 286
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T + +
Sbjct: 289 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKM 348
Query: 287 SYSSW 291
+ S+
Sbjct: 349 NCKSF 353
>gi|440907821|gb|ELR57918.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Bos
grunniens mutus]
Length = 509
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 125/282 (44%), Positives = 168/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y LP TSI+I FHNEA STLLRT+ S++NR+P L++EIILVDD S
Sbjct: 58 CTLLVYCADLPPTSIIIAFHNEARSTLLRTIRSILNRTPMNLIQEIILVDDFSNDPEDCK 117
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 118 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 176
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 177 -----YTRVVCPVIDIIHLDTFNYIESASELRGGFDWSLHFQWEQLTPEQKARRL-DPTE 230
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF +DK +FY LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 231 PIRTPIIAGGLFVMDKSWFYYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 290
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 291 VFRKKHPYIFPDGNANTYIKNTKRTAEVWMDEYKQYYYASRP 332
>gi|426233584|ref|XP_004010796.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Ovis
aries]
Length = 557
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 129/283 (45%), Positives = 165/283 (58%), Gaps = 21/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD S
Sbjct: 113 CPSMSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 55 ------RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID-VISDQ 107
+V C D + +D+ F L + P++ V D
Sbjct: 173 LLTRIPKVKCLRNDRREGLIRSRVRGADVAAAAFFTFLDSHCEVNTEWLQPMLQRVKEDH 232
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D +
Sbjct: 233 T-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPT 284
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 285 KPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 344
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 345 HVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 582
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/294 (45%), Positives = 176/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K++ LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 128 CKSKTFNYRQLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 184
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q Y++ D +L N+ + V +I DV++
Sbjct: 185 ---YLKTQLETYVSNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 235
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I+ ++CP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CNSGWLEPLLERISEDETVIICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 355 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 403
>gi|397513815|ref|XP_003827203.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Pan paniscus]
Length = 532
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SVINR+P L++EIILVDD S
Sbjct: 81 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSNDPDDCK 140
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 141 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 198
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 199 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 253
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 254 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 313
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 314 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 355
>gi|297298138|ref|XP_001104403.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Macaca
mulatta]
Length = 558
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
Length = 693
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 133/299 (44%), Positives = 178/299 (59%), Gaps = 30/299 (10%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK + Y + LP T ++I FHNEAWS LLRTV SV++RSP L+ EIILVDD S+ +
Sbjct: 207 CKDSARYLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPSELIGEIILVDDYSDMTHLK 266
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D +D I G +L K V+ P++D
Sbjct: 267 KKLEDYFADYPMVKIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPLLDR 326
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQT--FEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ + TVVCP+IDVI D T F Y +S + GGF+W L F W+ VP RE +
Sbjct: 327 IARNS------TTVVCPVIDVIDDATLEFHYRDSSGVNVGGFDWNLQFSWHSVPEREK-K 379
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R S P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 380 RHNSTSEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 439
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVS 277
PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMD++ +YY M+ G VS
Sbjct: 440 PCSHVGHIFRKRSPYKWRTGVN-VLKKNSVRLAEVWMDDYSKYYYQRIGMDKGDFGDVS 497
>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
Length = 455
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 125/283 (44%), Positives = 163/283 (57%), Gaps = 18/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
C+ Y LPTTS++I FHNEA S LLRT+ SV+NRSP LLKEIILVDD S+
Sbjct: 56 CRSLEYKHKLPTTSVIITFHNEARSALLRTIRSVLNRSPSELLKEIILVDDFSDNANDGR 115
Query: 57 ---VCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ P + + + E + +D+ G L + + P++ ++
Sbjct: 116 LLKILPKVKTLRNNKREGLIRSRVRGADLAKGDVLTFLDSHCEVNERWLEPLLSRVAQ-- 173
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV PIIDVI TF YI +S GGF W LNF+W + E +R +
Sbjct: 174 ----NETIVVSPIIDVIHMDTFNYIGSSADLKGGFGWNLNFKWDSMTSEEQSQRAAHPTR 229
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF+I K++F + G YD GMD+WGGENLE+S R+W CGG LEI+PCS VGH
Sbjct: 230 PIKTPMIAGGLFSISKNWFIKSGKYDMGMDVWGGENLEISLRIWMCGGSLEIVPCSRVGH 289
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR + PYTFPGG + N R AE WMD + FYY PG
Sbjct: 290 VFRKRHPYTFPGGGGFVFAKNTRRAAEAWMDGYAKFYYKREPG 332
>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
grunniens mutus]
Length = 566
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 173/294 (58%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 112 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 168
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q Y++ D +L N+ + V +I DV++
Sbjct: 169 ---YLKTQLETYVSNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 219
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I V+CP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 220 CNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 279
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 280 HERDRRKS-RIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 338
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 339 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 387
>gi|332227141|ref|XP_003262749.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Nomascus leucogenys]
Length = 532
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 81 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 140
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ V P + + + + I +D+ G L + + P++ + +
Sbjct: 141 QLVKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 198
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 199 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 253
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 254 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 313
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 314 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 355
>gi|6329812|dbj|BAA86444.1| KIAA1130 protein [Homo sapiens]
Length = 575
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 146 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 205
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 206 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 261
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 262 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 320
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 321 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 380
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 381 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 420
>gi|194225134|ref|XP_001495036.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Equus caballus]
Length = 619
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 134/305 (43%), Positives = 181/305 (59%), Gaps = 19/305 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 175 CPSVSYSVDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 234
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 235 LLTRIPKVK---CLRNDRREGLIRSRVRGADVATAAVLT-FLDSHCEVNTEWLQPMLQRV 290
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS + GGF+W L+F+W ++P + + R D + P+
Sbjct: 291 KEDHTRVVSPIIDVISLDNFAYLAASAILRGGFDWSLHFKWEQIPLEQKIART-DPTKPI 349
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 350 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 409
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFRML 286
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T + +
Sbjct: 410 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKM 469
Query: 287 SYSSW 291
S S+
Sbjct: 470 SCKSF 474
>gi|194222233|ref|XP_001490001.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Equus
caballus]
Length = 539
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 136/303 (44%), Positives = 166/303 (54%), Gaps = 70/303 (23%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV+SVINRSP LL E+ILVDDASER
Sbjct: 105 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 164
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ + ++ I A + T G L
Sbjct: 165 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 225 IKEDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 263
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+ +G + A+ Y S+ +
Sbjct: 264 FRWYPVPQREMDRRKGDRTLPV--SCFSGNMTALPTGLLYNSCSFSQ------------- 308
Query: 209 FRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
+WQCGG LEI+ CSHVGHVFR +PYTFPGG ++ N R+AEVWMDE++DF+Y +
Sbjct: 309 --IWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYII 366
Query: 269 NPG 271
+PG
Sbjct: 367 SPG 369
>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
Length = 578
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 173/294 (58%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q Y++ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYVSNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I V+CP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRKS-RIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
aries]
Length = 582
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 173/294 (58%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 128 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 184
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q Y++ D +L N+ + V +I DV++
Sbjct: 185 ---YLKTQLEAYVSNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 235
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I V+CP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CNTGWLEPLLERIHKDETVVICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 HERDRRKS-RIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 355 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 403
>gi|149634819|ref|XP_001513114.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Ornithorhynchus anatinus]
Length = 608
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 132/288 (45%), Positives = 178/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
CK+KSYP LP S+VI F+NEA+S LLRT+ SV++R+P LL EIILVDD SE
Sbjct: 141 CKEKSYPPHLPAASVVICFYNEAFSALLRTIHSVLDRTPAHLLHEIILVDDNSELDDLKS 200
Query: 55 ------RVVCPI-IDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIID 102
R+ P I VI ++ E + M G L + P++
Sbjct: 201 GLDEYIRLHLPRNIQVIRNEKREGLIRGRMIGAAQATGEVLVFLDSHCEVNAMWLQPLLV 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+I T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 PIREDR------RTVVCPVIDIIGADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA++++YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 G-RATAPIKSPTMAGGLFAMNREYFRELGQYDSGMDIWGGENLEISFRIWMCGGQLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y+A+ P
Sbjct: 373 CSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYFALRP 419
>gi|170591418|ref|XP_001900467.1| Polypeptide N-acetylgalactosaminyltransferase [Brugia malayi]
gi|158592079|gb|EDP30681.1| Polypeptide N-acetylgalactosaminyltransferase, putative [Brugia
malayi]
Length = 575
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 185/316 (58%), Gaps = 33/316 (10%)
Query: 1 CKKKSY--PTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
C+ K+Y + LPTTS++IV+HNEA+STL+RTV SVI RSPR LKEIILVDD S R
Sbjct: 77 CRTKTYLPSSELPTTSVIIVYHNEAFSTLMRTVMSVIQRSPRENLKEIILVDDFSTRTFL 136
Query: 59 PI-IDVISDQ--TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PII 101
+ ++ Q T I ++ G +L N + V+ P++
Sbjct: 137 KVELEKFVAQLGTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLL 196
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I + K VVCP+ID+I+D+TF Y + ++ GGFNW L FRWY +P +
Sbjct: 197 ARIKE------NRKAVVCPVIDIINDRTFAYQKSIELFRGGFNWNLQFRWYALPSEMIKS 250
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R D + P+ +PTMAGGLF+ID+ YF E+G+YD MDIWGGEN+E+S RV+ EI+
Sbjct: 251 RSDDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRVF------EIL 304
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLH-NAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
PCSHVGHVFR SP+ FPG S +L+ N RVAEVWMDEW+ +Y P + V
Sbjct: 305 PCSHVGHVFRRTSPHDFPGRKSGTILNSNLLRVAEVWMDEWKFHFYRTAPRRFGCVVNSR 364
Query: 281 AHFRMLSYSSWFSGSI 296
S+ WF ++
Sbjct: 365 KRLHCKSF-KWFLDNV 379
>gi|62630154|gb|AAX88899.1| unknown [Homo sapiens]
Length = 452
Score = 239 bits (611), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 1 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 60
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 61 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 119
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 120 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 173
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 174 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 233
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 234 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 275
>gi|397513817|ref|XP_003827204.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Pan paniscus]
Length = 517
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SVINR+P L++EIILVDD S
Sbjct: 66 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSNDPDDCK 125
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 126 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 183
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 184 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 238
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 239 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 298
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 299 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 340
>gi|397513819|ref|XP_003827205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Pan paniscus]
Length = 557
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SVINR+P L++EIILVDD S
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSNDPDDCK 165
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 166 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 224
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 225 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 278
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 279 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 338
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 339 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 380
>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
Length = 578
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLESYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N ARVAEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARVAEVWMDEYKEHFYNRNP 399
>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
Length = 536
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 167/286 (58%), Gaps = 22/286 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
CK + +P LP TS+VIVFHNEA STLLRTV SV++RS L+ +IILVDD +S + P
Sbjct: 81 CKSQVFPKDLPQTSVVIVFHNEALSTLLRTVHSVLDRSAPDLIHQIILVDDFSSIKGHDP 140
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVIS 105
+ I+D + + G ++ +R +V P++D +
Sbjct: 141 LKKYIADLKKVILVRNPKREGLIRSRIIGYSRATAPIVTFLDAHCEVTIGWLEPLLDRV- 199
Query: 106 DQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG-FNWKLNFRWYRVPPREMMRRGG 164
+ VVCP IDVI D+TF+Y S G FNW + FRW P +E RR
Sbjct: 200 -----HQNRSVVVCPEIDVIDDKTFQYRAGSSGDIRGVFNWDMKFRWRLTPSQEQKRRNN 254
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
R+PTMAGGLFAID+ YF E+G YD MDIWGGENLE+SFR+WQCGG LEI+PCS
Sbjct: 255 YNVLFARSPTMAGGLFAIDRQYFQEIGLYDSQMDIWGGENLELSFRIWQCGGQLEIMPCS 314
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVGHVFR+ PY FP + N+ R AEVWMD +++F Y P
Sbjct: 315 HVGHVFRNVIPYKFPKDAGLTINKNSVRTAEVWMDGYKEFVYQRQP 360
>gi|426335179|ref|XP_004029110.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Gorilla gorilla gorilla]
Length = 532
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 81 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 140
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 141 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 198
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 199 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 253
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 254 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 313
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 314 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 355
>gi|397513813|ref|XP_003827202.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Pan paniscus]
Length = 552
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SVINR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|441661684|ref|XP_004091530.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Nomascus leucogenys]
Length = 535
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 84 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 143
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ V P + + + + I +D+ G L + + P++ + +
Sbjct: 144 QLVKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 201
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 202 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 256
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 257 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 316
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 317 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 358
>gi|300794826|ref|NP_001179661.1| polypeptide N-acetylgalactosaminyltransferase 14 [Bos taurus]
gi|296482443|tpg|DAA24558.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Bos
taurus]
Length = 552
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 125/282 (44%), Positives = 168/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y LP TSI+I FHNEA STLLRT+ S++NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCADLPPTSIIIAFHNEARSTLLRTIRSILNRTPMNLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIIHLDTFNYIESASELRGGFDWSLHFQWEQLTPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF +DK +FY LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVMDKSWFYYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYIFPDGNANTYIKNTKRTAEVWMDEYKQYYYASRP 375
>gi|432096766|gb|ELK27344.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Myotis
davidii]
Length = 507
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 170/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y LP TSI+I FHNEA STLLRT+ SV+NR+P L+KEIILVDD S
Sbjct: 56 CTLLMYCRDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMNLIKEIILVDDFSNDPGDCE 115
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
E + P + + + E I +D+ G L + + P++ + +
Sbjct: 116 ELIKLPKVKCLRNDQREGLVRSRIRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 173
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + +R D S
Sbjct: 174 -DYTR---VVCPVIDIINLDTFSYIESATELRGGFDWSLHFQWEQLSPEQKAQRL-DPSE 228
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF +DK +F LG YD MDIWGGEN EMSFRVW CGG LEI+PCS VGH
Sbjct: 229 PIRTPIIAGGLFVMDKSWFNFLGKYDMDMDIWGGENFEMSFRVWMCGGSLEIVPCSRVGH 288
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ ++YA P
Sbjct: 289 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYFYAARP 330
>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 578
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 138/296 (46%), Positives = 177/296 (59%), Gaps = 46/296 (15%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 181
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 182 ----LKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLYCHCE 231
Query: 106 ----------DQTFEYITAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRV 154
++ Y TA VVCP+ID I TFE Y+ + GGF+W+L F+W+ V
Sbjct: 232 CNSGWLEPLLERIGRYETA--VVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSV 289
Query: 155 PPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQC 214
P +E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQC
Sbjct: 290 PKQERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQC 348
Query: 215 GGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GG LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 349 GGKLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|332227139|ref|XP_003262748.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Nomascus leucogenys]
Length = 552
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ V P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLVKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|359465585|ref|NP_001240756.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 3 [Homo
sapiens]
gi|119620894|gb|EAX00489.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_d [Homo sapiens]
gi|193783719|dbj|BAG53701.1| unnamed protein product [Homo sapiens]
Length = 532
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 81 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 140
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 141 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 198
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 199 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 253
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 254 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 313
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 314 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 355
>gi|426377334|ref|XP_004055422.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Gorilla gorilla gorilla]
Length = 598
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 153 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 212
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + TV+ +D + E++
Sbjct: 213 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAATVLT-FLDSHCEVNTEWLPPMLQRV 268
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 269 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 327
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 328 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 387
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 388 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 427
>gi|426335181|ref|XP_004029111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Gorilla gorilla gorilla]
Length = 517
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 66 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 125
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 126 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 183
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 184 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 238
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 239 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 298
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 299 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 340
>gi|291167742|ref|NP_001094333.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Rattus norvegicus]
Length = 558
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + +V+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDKREGLIRSRVRGADVAGASVLT-FLDSHCEVNVEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 MEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTKPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
norvegicus]
Length = 578
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 176/294 (59%), Gaps = 42/294 (14%)
Query: 1 CK-KKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK KK + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+R+
Sbjct: 124 CKAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRI--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKAQLEAYISNLDRV------RLTRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYIT--AKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I+ +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRTS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMD++++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFSKRAPYARPN-----FLQNTAREAEVWMDDYKEHFYNRNP 399
>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 603
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/283 (47%), Positives = 172/283 (60%), Gaps = 17/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK K YP LP TS++I FHNEAWSTLLRTV SVINR+P LKEIILVDDAS + +
Sbjct: 155 CKVKKYPVDLPPTSVIICFHNEAWSTLLRTVHSVINRTPPQYLKEIILVDDASTSDDLKQ 214
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTF---EYITAK 115
+ D I + I G +L E + K + +D + T E + AK
Sbjct: 215 RLDDYIPNLKIVSIVRLRDRQGLIRARL-EGAKKAKGPILTFLDAHCECTLGWAEPLLAK 273
Query: 116 ------TVVCPIIDVISDQTFEYITASD-MTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P+ID IS+ F Y + G F W+L F W +P E RR + S
Sbjct: 274 IKEDRQNVVMPVIDEISETNFNYNAVPEPFQRGVFKWRLEFTWRPIPSYEEQRRKHE-SD 332
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
++TP MAGGLF+I++DYFYE+GSYD GMDIWGGEN+E+SFR+W CGG +E++PCS VGH
Sbjct: 333 GIKTPVMAGGLFSINRDYFYEMGSYDTGMDIWGGENIEISFRIWMCGGSIEMLPCSRVGH 392
Query: 229 VFRDKSPYTFP---GGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
VFR + PY+FP GG +V N RVA+VWMDE+ +Y +
Sbjct: 393 VFRPRFPYSFPNRRGGDGDVVSRNLMRVADVWMDEYAKHFYNI 435
>gi|189217666|ref|NP_001121278.1| uncharacterized protein LOC100158361 [Xenopus laevis]
gi|115528277|gb|AAI24896.1| LOC100158361 protein [Xenopus laevis]
Length = 600
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 138/294 (46%), Positives = 180/294 (61%), Gaps = 39/294 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C KK+YP LP SIVI F+NEA S LLRTV SV++R+P LL EIILVDD SE +
Sbjct: 133 CSKKTYPADLPLASIVICFYNEASSALLRTVHSVLDRTPAQLLHEIILVDDNSE-----L 187
Query: 61 IDVISDQTFEYITASDMTWGGFNWKL-REKNR------------HKKTVVCPIIDV---I 104
D+ D +Y +++ KL R K R H V +D +
Sbjct: 188 DDLKKD--LDYYMQENLSK---KVKLVRNKRREGLIRGRMVGASHATGDVLVFLDSHCEV 242
Query: 105 SDQTFEYITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
++ + + A KTVVCP+ID+IS T Y + S + GGFNW L+F+W VP E
Sbjct: 243 NEMWLQPLLAPIRENPKTVVCPVIDIISADTLIY-SQSPVVRGGFNWGLHFKWDPVPLSE 301
Query: 159 MMRRGGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+ GG ++P R+PTMAGGLFA+D++YF LG YD GMDIWGGENLE+SFR+W CGG
Sbjct: 302 L---GGPEGFTAPFRSPTMAGGLFAMDREYFNTLGQYDSGMDIWGGENLEISFRIWMCGG 358
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
L I+PCS VGH+FR + PY PGG + HN+ R+A VWMDE++D Y+A+ P
Sbjct: 359 SLLIVPCSRVGHIFRKRRPYGSPGGHDTMA-HNSLRLAHVWMDEYKDQYFALRP 411
>gi|395820104|ref|XP_003783415.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Otolemur garnettii]
Length = 582
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 128 CKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 185
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ + +L N+ + V +I DV++
Sbjct: 186 ----LKTQLETYISNLERV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 235
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 355 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 403
>gi|296212534|ref|XP_002752871.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Callithrix jacchus]
Length = 578
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 181
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 182 ----LKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|426335183|ref|XP_004029112.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Gorilla gorilla gorilla]
Length = 557
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 165
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 166 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 224
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 225 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 278
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 279 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 338
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 339 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 380
>gi|297265738|ref|XP_001104879.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Macaca mulatta]
Length = 532
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 81 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 140
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 141 QLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 198
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 199 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 253
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 254 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 313
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 314 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 355
>gi|359465583|ref|NP_001240755.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 2 [Homo
sapiens]
gi|10434341|dbj|BAB14227.1| unnamed protein product [Homo sapiens]
gi|119620892|gb|EAX00487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_b [Homo sapiens]
Length = 557
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 165
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 166 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 224
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 225 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 278
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 279 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 338
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 339 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 380
>gi|221042368|dbj|BAH12861.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 66 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 125
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 126 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 183
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 184 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 238
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 239 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 298
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 299 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 340
>gi|355689604|gb|AER98888.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Mustela putorius
furo]
Length = 306
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/280 (46%), Positives = 170/280 (60%), Gaps = 32/280 (11%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERVVCPIIDVIS 65
LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S + + I+ +
Sbjct: 2 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDSEDGALLGKIEKVR 61
Query: 66 DQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIID-VISDQTFE 110
+ +D G ++R + + V+ P+++ V D+T
Sbjct: 62 ------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRT-- 113
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +P+
Sbjct: 114 -----RVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPI 168
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
+TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEI+PCS VGHVF
Sbjct: 169 KTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGHVF 228
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 229 RKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 268
>gi|301763305|ref|XP_002917071.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ailuropoda melanoleuca]
Length = 555
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 129/283 (45%), Positives = 164/283 (57%), Gaps = 21/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD S
Sbjct: 111 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 170
Query: 55 ------RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID-VISDQ 107
+V C D + +DM L + P++ V D
Sbjct: 171 LLTRIPKVKCLRNDRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQPMLQRVKEDH 230
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D +
Sbjct: 231 T-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPT 282
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 283 KPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 342
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 343 HVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 385
>gi|197099330|ref|NP_001124852.1| polypeptide N-acetylgalactosaminyltransferase 14 [Pongo abelii]
gi|55726129|emb|CAH89838.1| hypothetical protein [Pongo abelii]
Length = 552
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGHANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
Length = 499
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 138/282 (48%), Positives = 172/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKS--YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RV 56
CK+K YP LPTTS++I FHNEA S LLRTV SV+N SP L+ +IILVDD SE +
Sbjct: 50 CKRKHKLYPRALPTTSVIICFHNEALSVLLRTVHSVLNESPPRLIADIILVDDYSEYDDL 109
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA-- 114
P+ID IS + G +LR + V+ +D + T ++
Sbjct: 110 KQPLIDHISMLNKVKLIRMPSRQGLVPARLRGAEEARGEVLT-FLDSHCEATPGWLEPLL 168
Query: 115 -------KTVVCPIIDVISDQTFEYITASDMTW--GGFNWKLNFRWYRVPPREMMRRGGD 165
+ VVCP+I+VI+ F Y ASD+ GGF W L F W +P E RR D
Sbjct: 169 VRIAEDRRNVVCPVIEVINADDFRY-QASDVIHERGGFTWDLFFTWKAIPEAEKKRRK-D 226
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
+ +R+PTMAGGLFAI K YFY+LGSYD M+IWGGENLEMSFR+W CGG LEI+PCS
Sbjct: 227 ETDYIRSPTMAGGLFAIHKKYFYDLGSYDSKMEIWGGENLEMSFRIWMCGGQLEIVPCSR 286
Query: 226 VGHVFRD-KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
VGHVFR SPY FP G + + N R+AEVWMDE++D YY
Sbjct: 287 VGHVFRKYTSPYKFPKGTTTTLARNFNRLAEVWMDEYKDHYY 328
>gi|148671130|gb|EDL03077.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_a [Mus
musculus]
Length = 529
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/288 (45%), Positives = 180/288 (62%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C++KSYPT LPT SIVI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 62 CRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 121
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI + E + M L + H + V P++
Sbjct: 122 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 181
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 182 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSELGGP 234
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L I+P
Sbjct: 235 DG-ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 293
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 294 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 340
>gi|296224175|ref|XP_002757934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Callithrix jacchus]
Length = 552
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCQ 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|355751232|gb|EHH55487.1| hypothetical protein EGM_04701, partial [Macaca fascicularis]
Length = 516
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 65 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 124
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 125 QLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 183
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 184 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 237
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 238 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 297
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 298 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 339
>gi|281349386|gb|EFB24970.1| hypothetical protein PANDA_005243 [Ailuropoda melanoleuca]
Length = 553
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 129/283 (45%), Positives = 164/283 (57%), Gaps = 21/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD S
Sbjct: 109 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 168
Query: 55 ------RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID-VISDQ 107
+V C D + +DM L + P++ V D
Sbjct: 169 LLTRIPKVKCLRNDRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQPMLQRVKEDH 228
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D +
Sbjct: 229 T-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPT 280
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 281 KPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 340
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 341 HVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 383
>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
Length = 578
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 QERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|297265736|ref|XP_002799240.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Macaca
mulatta]
Length = 517
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 66 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 125
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 126 QLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 183
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 184 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 238
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 239 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 298
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 299 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 340
>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
Length = 575
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 121 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 177
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 178 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 228
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 229 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPK 288
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 289 QERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 347
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 348 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 396
>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4 [Equus caballus]
Length = 703
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS+VI F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 249 CKSQKFNYRKLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 305
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 306 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 356
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I+ VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 357 CNSGWLEPLLERISKDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 416
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+ +PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 417 HERDRR-KSRIDPISSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGG 475
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 476 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 524
>gi|444509912|gb|ELV09433.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Tupaia chinensis]
Length = 566
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 167/280 (59%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 121 CPSMSYSVDLPATSVIITFHNEARSTLLRTVRSVLNRTPANLIQEIILVDDFSSDPEDCL 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 181 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAAAVLT-FLDSHCEVNTEWLQPMLQRV 236
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 237 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLDQKMTRT-DPTRPI 295
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 296 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 355
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 356 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 395
>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
troglodytes]
gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
paniscus]
gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
Length = 578
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 QERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|355565588|gb|EHH22017.1| hypothetical protein EGK_05198 [Macaca mulatta]
Length = 557
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 165
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 166 QLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 224
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 225 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 278
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 279 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 338
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 339 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 380
>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
Length = 578
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 QERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|55742075|ref|NP_001006904.1| polypeptide N-acetylgalactosaminyltransferase 11 [Xenopus
(Silurana) tropicalis]
gi|49522064|gb|AAH75106.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus (Silurana) tropicalis]
Length = 563
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 132/290 (45%), Positives = 176/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C KK+YP LP SIVI F+NEA+S LLRTV SV++R+P LL EIILVDD SE
Sbjct: 96 CAKKTYPPDLPMASIVICFYNEAFSALLRTVHSVLDRTPAQLLHEIILVDDNSELDDLKK 155
Query: 55 -------RVVCPIIDVISDQTFEYITASDMTW-----GGFNWKLREKNRHKKTVVCPIID 102
+ + ++ ++ E + M G L + + P++
Sbjct: 156 DLDGYMQENLSKKVKLVRNKQREGLIRGRMVGASHATGDVLVFLDSHCEVNEMWLQPLLA 215
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 216 PIKE------NPRTVVCPVIDIISADTLIY-SSSPVVRGGFNWGLHFKWDPVPLAEL--- 265
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG S+P R+PTMAGGLFA+D++YF LG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 266 GGPEGFSAPFRSPTMAGGLFAMDREYFNMLGQYDSGMDIWGGENLEISFRIWMCGGSLLI 325
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+PCS VGH+FR + PY PGG + HN+ R+A VWMDE++D Y+A+ P
Sbjct: 326 VPCSRVGHIFRKRRPYGSPGGHDTMA-HNSLRLAHVWMDEYKDQYFALRP 374
>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
Length = 578
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 176/294 (59%), Gaps = 42/294 (14%)
Query: 1 CK-KKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK KK + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+R+
Sbjct: 124 CKAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRI--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKAQLEAYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYIT--AKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I+ +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRTS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMD++++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDDYKEHFYNRNP 399
>gi|426335177|ref|XP_004029109.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Gorilla gorilla gorilla]
Length = 552
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|21450297|ref|NP_659157.1| polypeptide N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|51316059|sp|Q921L8.1|GLT11_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|15030306|gb|AAH11428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|18204499|gb|AAH21504.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|21529335|emb|CAC79626.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Mus
musculus]
gi|21707973|gb|AAH34185.1| Galnt11 protein [Mus musculus]
gi|23274082|gb|AAH36143.1| Galnt11 protein [Mus musculus]
gi|23274085|gb|AAH36145.1| Galnt11 protein [Mus musculus]
gi|33321872|gb|AAQ06668.1| UDP-GalNAc:polypeptide N-Acetylgalactosaminyltransferase T11 [Mus
musculus]
gi|74149639|dbj|BAE36442.1| unnamed protein product [Mus musculus]
gi|148671131|gb|EDL03078.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_b [Mus
musculus]
Length = 608
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/288 (45%), Positives = 180/288 (62%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C++KSYPT LPT SIVI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 141 CRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 200
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI + E + M L + H + V P++
Sbjct: 201 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSELGGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L I+P
Sbjct: 314 DG-ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|109102570|ref|XP_001104659.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Macaca mulatta]
Length = 557
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 165
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 166 QLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 224
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 225 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 278
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 279 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 338
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 339 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 380
>gi|60498976|ref|NP_078848.2| polypeptide N-acetylgalactosaminyltransferase 14 isoform 1 [Homo
sapiens]
gi|51316071|sp|Q96FL9.1|GLT14_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
gi|14714999|gb|AAH10659.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Homo
sapiens]
gi|21749654|dbj|BAC03634.1| unnamed protein product [Homo sapiens]
gi|28268674|dbj|BAC56889.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Homo sapiens]
gi|37182635|gb|AAQ89118.1| RRLT2434 [Homo sapiens]
gi|119620891|gb|EAX00486.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_a [Homo sapiens]
gi|325463357|gb|ADZ15449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[synthetic construct]
gi|345500006|emb|CAA70505.4| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 14 [Homo
sapiens]
Length = 552
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|47226381|emb|CAG09349.1| unnamed protein product [Tetraodon nigroviridis]
Length = 631
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 174/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDASE + + D
Sbjct: 159 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEIILVDDASED------EALKDGLD 212
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + ++ + K + ++ V + T ++ A
Sbjct: 213 EYLKRLSIV------RVVRQRERKGLITARLLGASVATGDTLTFLDAHCECFNGWLEPLL 266
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE++ S + G F+W L F W +P E RR
Sbjct: 267 ARIAKNRTAVVSPDITTIDLNTFEFMKPSPYGQNHNRGNFDWSLAFGWESLPDHEKKRRK 326
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I KDYFY++GSYD+ M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 327 -DETYPIKTPTFAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSFRVWQCGGQLEIIPC 385
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMD++++ +Y N
Sbjct: 386 SIVGHVFRTKSPHSFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRN 430
>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
gorilla gorilla]
Length = 578
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 QERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLRNTARAAEVWMDEYKEHFYNRNP 399
>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Nomascus leucogenys]
Length = 578
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 181
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 182 ----LKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 QERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
Length = 578
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 176/294 (59%), Gaps = 42/294 (14%)
Query: 1 CK-KKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK KK + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+R+
Sbjct: 124 CKAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIY-- 181
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ + +L N+ + V +I DV++
Sbjct: 182 ----LKAQLETYISNLERV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYIT--AKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I+ +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRTS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
Length = 578
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 175/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRV--- 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 181 ---YLKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 QERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|26352932|dbj|BAC40096.1| unnamed protein product [Mus musculus]
Length = 608
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/288 (45%), Positives = 180/288 (62%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C++KSYPT LPT SIVI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 141 CRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 200
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI + E + M L + H + V P++
Sbjct: 201 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSELGGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L I+P
Sbjct: 314 DG-ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|109102562|ref|XP_001105195.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
5 [Macaca mulatta]
Length = 552
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|403307061|ref|XP_003944030.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Saimiri boliviensis boliviensis]
Length = 552
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 168/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCTELPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCQ 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFHWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|443683118|gb|ELT87486.1| hypothetical protein CAPTEDRAFT_155466 [Capitella teleta]
Length = 644
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 172/284 (60%), Gaps = 26/284 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C + T PTT ++I FHNEAWSTLLRT+ SVINRSP L+ EIILVDDAS + +
Sbjct: 195 CPSINQSTLSPTT-VIICFHNEAWSTLLRTLHSVINRSPSHLIMEIILVDDASTFDYLGE 253
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + +S Y+ + + G +L + K V+ P++ I
Sbjct: 254 PLENHLSQLENVYLLRTKIREGLIRARLLGVSYAKGDVLVFLDSHCECAEGWLPPLLLAI 313
Query: 105 -SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+D+T +VCP++DVI QTFEY A + G F+W L F W +P EM RR
Sbjct: 314 EADRT-------KIVCPLVDVIEFQTFEYRAAKEELHGAFDWNLQFIWKDLPEHEMKRRT 366
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
+ +R PT+ GGLFA+D+ YF +GSYD GMDIWG ENLE+SFRVW CGG LEI PC
Sbjct: 367 SP-ADNIRAPTIIGGLFAVDRLYFKRIGSYDSGMDIWGSENLELSFRVWMCGGSLEISPC 425
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
S VGHVFR + PY FP G + + +NA R AEVW+D+++ F+YA
Sbjct: 426 SRVGHVFRTRIPYGFPNGGKRTIRNNAMRAAEVWLDDYKKFFYA 469
>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 648
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 132/284 (46%), Positives = 178/284 (62%), Gaps = 26/284 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
CK + Y LP+TS+++ FHNEAWS LLRTV S+I+RSP LL EIILVDD S
Sbjct: 190 CKDERYLKDLPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIILVDDYSDMPHLKQ 249
Query: 54 --ERVVC--PIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDVI 104
E V P + ++ Q E + + + L + H + + P++D I
Sbjct: 250 KLEDYVAHFPKVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCECTEGWLEPLLDRI 309
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ + TVVCP+IDVISD TFEY + + GGF+W L F W+ VP RE RR
Sbjct: 310 AR------NSTTVVCPVIDVISDSTFEYHYRDSGGVNVGGFDWNLQFSWHAVPERERQRR 363
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
P+ +PTMAGGLF+IDK +F +LG+YD G DIWGGENLE+SF+ W CGG LEI+P
Sbjct: 364 K-HSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVP 422
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
CSHVGH+FR +SPY + GV+ ++ N+ R+AEVW+DE++ +YY
Sbjct: 423 CSHVGHIFRKRSPYKWRSGVN-VLRRNSVRLAEVWLDEYKQYYY 465
>gi|348573294|ref|XP_003472426.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Cavia
porcellus]
Length = 556
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 168/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + ++ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAAAILT-FLDSHCEVNVEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
anubis]
Length = 578
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 181
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 182 ----LKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 HERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 641
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/291 (45%), Positives = 177/291 (60%), Gaps = 32/291 (10%)
Query: 2 KKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVCP 59
K K+YP LPT S++I FHNEA+S LLRTV SV+NR+P LL +IILVDD SE + P
Sbjct: 177 KHKTYPRKLPTASVIICFHNEAYSVLLRTVHSVLNRTPPDLLTDIILVDDKSEYENLKRP 236
Query: 60 IIDVISDQTFEY---------------ITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ D ++ + + I +D++ G L P++ I
Sbjct: 237 LDDHVAQLSKKIKIIRNAKRSGLIRSRINGADLSRGDVLIFLDSHCETTPGWAEPLLARI 296
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTW-GGFNWKLNFRWYRVPPREMMRRG 163
++++ VV PII+VI+ T +Y A++ GGF+W L ++W +P E R
Sbjct: 297 AEKS------SNVVVPIIEVINADTLQYAAAANPDQRGGFSWDLFYKWKPIPLDEQHLR- 349
Query: 164 GDRSSPL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
SP+ RTPTMAGGLFAID+ YFY++G+YDE MDIWGGENLEMSFR+W CGG ++I
Sbjct: 350 ---KSPIDVIRTPTMAGGLFAIDRKYFYDMGTYDEEMDIWGGENLEMSFRIWMCGGRIDI 406
Query: 221 IPCSHVGHVFRD-KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR SPY FP GV K + N R+AEVW+DE+++ YY P
Sbjct: 407 IPCSRVGHIFRKFTSPYKFPDGVEKTLSKNLNRLAEVWLDEYKELYYQKRP 457
>gi|395828928|ref|XP_003787614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Otolemur garnettii]
Length = 678
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 171/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYYTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L + + P++ I +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRIKE-- 218
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 219 -DYTR---VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|432107114|gb|ELK32537.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Myotis davidii]
Length = 518
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 70 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 129
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 130 LLTRIPKVK---CLRNDRREGLIRSRVRGADVATAAVLT-FLDSHCEVNTEWLQPLLQRV 185
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 186 QEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPTKPI 244
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 245 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 304
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 305 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 344
>gi|387017710|gb|AFJ50973.1| Polypeptide N-acetylgalactosaminyltransferase 11-like [Crotalus
adamanteus]
Length = 608
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 132/292 (45%), Positives = 181/292 (61%), Gaps = 35/292 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK+K YP LP+ SI+I F+NEA+S LLRT+ SV++R+P LL EIILVDD SE +
Sbjct: 141 CKEKIYPHDLPSASIIICFYNEAFSALLRTIHSVLDRTPSHLLHEIILVDDRSE-----L 195
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------HKKTVVCPIIDVISDQT 108
D+ D Y+T +R +NR H V +D +
Sbjct: 196 ADLKEDLDI-YLTKDLPNKVKL---VRNENREGLIRGRMVGASHATGKVLVFLDSHCEVN 251
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
++ + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP EM
Sbjct: 252 EMWLQPLLTPIQESRRTVVCPVIDIISADTLTY-SSSPVVRGGFNWGLHFKWDLVPLLEM 310
Query: 160 MRRGGDRS-SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
G +++ +P+++PTMAGGLFA+D++YF LG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 311 --EGPEQATAPIKSPTMAGGLFAMDREYFNALGQYDSGMDIWGGENLEISFRIWMCGGKL 368
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y+A+ P
Sbjct: 369 VIIPCSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYFALRP 419
>gi|348574564|ref|XP_003473060.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Cavia porcellus]
Length = 552
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 168/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLGYHTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPDDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ V P + + + + + S M G L + + P++ + +
Sbjct: 161 QLVRLPKVKCLRNGERQGLVRSRMRGAEIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+FRW ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFRWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|403272081|ref|XP_003927917.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Saimiri
boliviensis boliviensis]
Length = 578
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 181
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 182 ----LKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 291
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 292 YERDRRIS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
Length = 721
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 178/289 (61%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK ++ Y T LP T ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 197 CKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 256
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 257 RQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSPVLTYLDSHCECT 306
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD+T EY + + GGF+W L F W+ VP R
Sbjct: 307 EGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPER 366
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 367 ER-KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 425
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 426 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYYY 473
>gi|47228512|emb|CAG05332.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/304 (43%), Positives = 177/304 (58%), Gaps = 46/304 (15%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ ++YP LP S+VI F NEA S LLRTV SV++R+P LL EIILVDD SE
Sbjct: 116 CRDRTYPGDLPRASVVICFFNEALSALLRTVHSVLDRTPPFLLHEIILVDDYSE------ 169
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH-----------KKTVVCP---IIDVISD 106
++ + Y+ A G LR + R + + V P I+D+ S
Sbjct: 170 LEELKGDLDRYVQAE---LRGKVRVLRNQKREGLIRGRMIGAAQASGVSPDPQILDLCSG 226
Query: 107 QTFEYITA--------------------KTVVCPIIDVISDQTFEYITASDMTWGGFNWK 146
+ ++ + +TVVCP+ID+IS T Y + S + GGFNW
Sbjct: 227 EVLVFLDSHCEVNQMWLQPLLAPIRQDRRTVVCPVIDIISADTLSY-SPSPIVRGGFNWG 285
Query: 147 LNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLE 206
L+F+W VPP E+ G P+R+PTMAGGLFAI++ YF E+G YD GMDIWGGENLE
Sbjct: 286 LHFKWDPVPPAELKSPQGP-VGPIRSPTMAGGLFAINRKYFNEIGQYDAGMDIWGGENLE 344
Query: 207 MSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
+SFR+W CGG L IIPCS VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y
Sbjct: 345 ISFRIWMCGGQLFIIPCSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYL 403
Query: 267 AMNP 270
+M P
Sbjct: 404 SMRP 407
>gi|326437922|gb|EGD83492.1| hypothetical protein PTSG_04099 [Salpingoeca sp. ATCC 50818]
Length = 699
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 126/286 (44%), Positives = 174/286 (60%), Gaps = 23/286 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C+++ +P LP +++I F NEAWSTLLRTVWSV++R+P LLKEI+LVDDAS+
Sbjct: 260 CRQQEHPRDLPQATVIICFVNEAWSTLLRTVWSVLDRTPPHLLKEILLVDDASDQEHLLD 319
Query: 55 RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA 114
++ + D + D+ + S G ++ H +D + ++
Sbjct: 320 KLEVEVRDNLPDKV--KLVRSPKRLGLIRARVLGAE-HATADYMVFLDSHCEANLGWLEP 376
Query: 115 --------KT-VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGD 165
KT VVCP ID IS QT +Y+ + G F+W L+F W + + G
Sbjct: 377 LLAWMAKDKTRVVCPTIDRISAQTMDYVGGGASSRGTFHWTLDFTWEYA----VRQHGET 432
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
+ P+++PTMAGGLF I++DYFYELG+YD GMD WGGENLEMSFR+WQCGG L IIPCS
Sbjct: 433 PADPIKSPTMAGGLFGINRDYFYELGTYDMGMDGWGGENLEMSFRIWQCGGSLHIIPCSR 492
Query: 226 VGHVFRDKSPYTFPGG-VSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGH+FRD PY P V++ L N+ R+AEVWMDE++D +Y + P
Sbjct: 493 VGHIFRDWHPYAIPNSTVNETFLKNSIRLAEVWMDEYKDIFYDIKP 538
>gi|300797404|ref|NP_001179787.1| polypeptide N-acetylgalactosaminyltransferase 3 [Bos taurus]
Length = 633
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 173/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWETLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I KDYF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|296490594|tpg|DAA32707.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Bos
taurus]
gi|440907905|gb|ELR57989.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Bos grunniens
mutus]
Length = 633
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 173/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWETLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I KDYF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
catus]
Length = 582
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 135/294 (45%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 128 CKSQKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 185
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 186 ----LKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 235
Query: 106 ------DQTFEYITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CNSGWLEPLLERIGKDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMD++++ +Y NP
Sbjct: 355 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDQYKEHFYNRNP 403
>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
Length = 647
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 178/289 (61%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK ++ Y T LP T ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 198 CKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 257
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 258 RQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSPVLTYLDSHCECT 307
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD+T EY + + GGF+W L F W+ VP R
Sbjct: 308 EGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPER 367
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 368 ER-KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 427 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYYY 474
>gi|417402722|gb|JAA48197.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 557
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 167/280 (59%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSLSYSADLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVASAAVLT-FLDSHCEVNTEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPTKPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
Length = 650
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 178/289 (61%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK ++ Y T LP T ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 198 CKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 257
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 258 RQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSPVLTYLDSHCECT 307
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD+T EY + + GGF+W L F W+ VP R
Sbjct: 308 EGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPER 367
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 368 ER-KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 427 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYYY 474
>gi|225007540|ref|NP_001070030.2| polypeptide N-acetylgalactosaminyltransferase 11 [Danio rerio]
Length = 590
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/289 (46%), Positives = 176/289 (60%), Gaps = 32/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C+ ++Y LPT SIVI F NEA+S LLRTV SV++R+P LL EIILVDD SE
Sbjct: 127 CRDRAYSVSLPTASIVICFFNEAFSALLRTVHSVLDRTPNYLLHEIILVDDHSELDDLKE 186
Query: 55 -------RVVCPIIDVISDQTFE------YITASDMTWGGFNWKLREKNRHKKTVVCPII 101
+ + + V+ ++ E I AS T G L + + P++
Sbjct: 187 DLDSYVQQHLQKKVKVVRNEKREGLIRGRMIGASHAT-GEVLVFLDSHCEVNEAWLQPLL 245
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I + KTVVCP+ID+IS T Y T S + GGFNW L+F+W VP E+
Sbjct: 246 TPIKE------NRKTVVCPVIDIISADTLVY-TPSPIVRGGFNWGLHFKWDPVPMSELNS 298
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
G +R+PTMAGGLFA+D++YFYELG YD GMDIWGGENLE+SFR+W CGG L I+
Sbjct: 299 PDG----AIRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISFRIWMCGGQLLIV 354
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCS VGH+FR + PY PGG + HN+ R+A VWMD++++ Y+A+ P
Sbjct: 355 PCSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDDYKEQYFALRP 402
>gi|115313271|gb|AAI24298.1| Zgc:153274 [Danio rerio]
Length = 590
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 133/289 (46%), Positives = 176/289 (60%), Gaps = 32/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C+ ++Y LPT SIVI F NEA+S LLRTV SV++R+P LL EIILVDD SE
Sbjct: 127 CRDRAYSVSLPTASIVICFFNEAFSALLRTVHSVLDRTPNYLLHEIILVDDHSELDDLKE 186
Query: 55 -------RVVCPIIDVISDQTFE------YITASDMTWGGFNWKLREKNRHKKTVVCPII 101
+ + + V+ ++ E I AS T G L + + P++
Sbjct: 187 DLDSYVQQHLQKKVKVVRNEKREGLIRGRMIGASHAT-GEVLVFLDSHCEVNEAWLQPLL 245
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I + KTVVCP+ID+IS T Y T S + GGFNW L+F+W VP E+
Sbjct: 246 TPIKE------NRKTVVCPVIDIISADTLVY-TPSPIVRGGFNWGLHFKWDPVPMSELNS 298
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
G +R+PTMAGGLFA+D++YFYELG YD GMDIWGGENLE+SFR+W CGG L I+
Sbjct: 299 PDG----AIRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISFRIWMCGGQLLIV 354
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCS VGH+FR + PY PGG + HN+ R+A VWMD++++ Y+A+ P
Sbjct: 355 PCSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDDYKEQYFALRP 402
>gi|440897357|gb|ELR49068.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos grunniens mutus]
Length = 557
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAAAVLT-FLDSHCEVNTEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPTKPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEFKQYYYEARP 387
>gi|395849607|ref|XP_003797413.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Otolemur garnettii]
Length = 558
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 174/308 (56%), Gaps = 25/308 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C SY LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD S
Sbjct: 113 CPSVSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 55 ------RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID-VISDQ 107
+V C D + +D+ L + P++ V D
Sbjct: 173 LLTRIPKVKCLRNDRREGLIRSRVRGADVATAAILTFLDSHCEVNTEWLQPMLQRVKEDH 232
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D +
Sbjct: 233 T-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPT 284
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 285 RPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 344
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHF 283
HVFR + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T
Sbjct: 345 HVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQR 404
Query: 284 RMLSYSSW 291
+ ++ S+
Sbjct: 405 KKMNCKSF 412
>gi|449275388|gb|EMC84260.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Columba livia]
Length = 632
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 133/286 (46%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTSI+IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 183 LPTTSIIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDAS------VDEYLHDKLD 236
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 237 EYVKQFQIV------KVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLL 290
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S G F+W L+F W +P E RR
Sbjct: 291 ARIAENPVAVVSPDIASIDLNTFEFTKPSPYGHGHNRGNFDWSLSFGWESLPKHENKRRK 350
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT AGGLF+I KDYF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 351 -DETYPIRTPTFAGGLFSISKDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 409
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 410 SVVGHVFRSKSPHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRN 454
>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
Length = 650
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 178/289 (61%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK ++ Y T LP T ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 198 CKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 257
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 258 RQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSPVLTYLDSHCECT 307
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD+T EY + + GGF+W L F W+ VP R
Sbjct: 308 EGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPER 367
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 368 ER-KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 427 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYYY 474
>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
Length = 650
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 178/289 (61%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK ++ Y T LP T ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 198 CKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 257
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 258 RQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSPVLTYLDSHCECT 307
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD+T EY + + GGF+W L F W+ VP R
Sbjct: 308 EGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPER 367
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 368 ER-KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 427 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYYY 474
>gi|345782166|ref|XP_540140.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Canis
lupus familiaris]
Length = 552
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/286 (44%), Positives = 169/286 (59%), Gaps = 27/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTMLVYCADLPPTSIIITFHNEARSTLLRTIRSVLNRTPMNLIQEIILVDDFSNDPDDCL 160
Query: 54 ERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT 113
+ + P + I + + + S ++R N K T + +D + +++
Sbjct: 161 QLIKLPKVKCIRNSERQGLVRS---------RIRGANVAKGTTLT-FLDSHCEVNRDWLQ 210
Query: 114 A---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
VVCP+ID+IS F YI ++ GGF+W L+F+W ++ P + RR
Sbjct: 211 PLLHRVKEDYTRVVCPVIDIISLDNFNYIESAAELRGGFDWSLHFQWEQLSPEQKARRL- 269
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + P+RTP +AGGLF +DK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS
Sbjct: 270 DPAEPIRTPIIAGGLFVMDKSWFNYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCS 329
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 330 RVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|155371981|ref|NP_001094597.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos taurus]
gi|151554939|gb|AAI47930.1| GALNTL1 protein [Bos taurus]
gi|296482974|tpg|DAA25089.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 1 [Bos
taurus]
Length = 557
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 169/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAAAVLT-FLDSHCEVNTEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPTKPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEFKQYYYEARP 387
>gi|345803601|ref|XP_537492.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Canis lupus
familiaris]
Length = 557
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 168/280 (60%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSADLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVATAAVLT-FLDSHCEVNTEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPTKPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Hydra magnipapillata]
Length = 548
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 133/298 (44%), Positives = 176/298 (59%), Gaps = 23/298 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C K YP LP TS++I FHNEA S LLRTV SVIN +P +L I+LVDDAS +
Sbjct: 128 CSSKQYPAELPNTSVIICFHNEATSALLRTVHSVINETPPNILSNIVLVDDASVGAALKK 187
Query: 59 PIIDVISD------QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI 112
P+ + I++ + + + G +L+ TV+ +D + T ++
Sbjct: 188 PLRNYINELNRKLGEEMVILYRNAKRQGLVRSRLKGAELASGTVLT-FLDSHCEATEGWV 246
Query: 113 T---------AKTVVCPIIDVIS--DQTFEYITASDMTW-GGFNWKLNFRWYRVPPREMM 160
+ VVCP+I+VI D +++ +T GGF W L F W + E
Sbjct: 247 EPLLFRIKEDKRNVVCPVIEVIDAVDLSYKKTELDRITQVGGFTWDLFFNWKEITEDEKR 306
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
R D + PL++PTMAGGLFAIDK YFYE+GSYD M+IWGGENLEMSFR+W CGG LEI
Sbjct: 307 LRA-DGTQPLKSPTMAGGLFAIDKSYFYEIGSYDNQMEIWGGENLEMSFRIWMCGGKLEI 365
Query: 221 IPCSHVGHVFR-DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVS 277
IPCS VGH+FR + SPY+FP GVSK + N R+AEVWMDE+++ YY P + V
Sbjct: 366 IPCSRVGHIFRKENSPYSFPNGVSKTLAKNFNRLAEVWMDEYKELYYRRKPPEDKLVK 423
>gi|149031398|gb|EDL86388.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_c [Rattus norvegicus]
Length = 560
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 131/290 (45%), Positives = 180/290 (62%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ KSYPT LPT S+VI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 93 CRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 152
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI + E + M L + H + V P++
Sbjct: 153 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 212
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP ++
Sbjct: 213 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSDL--- 262
Query: 163 GGDRSS--PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG S+ P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 263 GGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFI 322
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 323 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 371
>gi|334310655|ref|XP_001378662.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Monodelphis domestica]
Length = 563
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 132/286 (46%), Positives = 170/286 (59%), Gaps = 18/286 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C Y + LPTTSIVI FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 121 CTSVHYASDLPTTSIVITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R ++ +D + E++
Sbjct: 181 LLTRIPKVK---CLRNDRREGLIRSRVRGAEVATADILT-FLDSHCEVNSEWLQPMLQRV 236
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 237 KEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DPTQPI 295
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 296 RTPVIAGGIFVIDKAWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 355
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS 273
R + PY FP G + + N R AEVWMDE++ +YY P GKS
Sbjct: 356 RKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKS 401
>gi|345323153|ref|XP_001510349.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Ornithorhynchus anatinus]
Length = 479
Score = 237 bits (605), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/283 (44%), Positives = 167/283 (59%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--- 57
C Y + LP+TSI+I FHNEA STLLRT+ SV+NR+P L+ EIILVDD S+
Sbjct: 28 CTASRYRSDLPSTSIIITFHNEARSTLLRTIRSVLNRTPMHLVHEIILVDDFSDDPDDCQ 87
Query: 58 ----CPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + + + E I +D+ G L K + P++ I +
Sbjct: 88 LLGPLPKVKCLRNGQREGLIRSRIRGADLAKAGVLTFLDSHCEVNKDWLLPLLQRIKED- 146
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P+ID+I+ TF Y+ AS GGF+W L+F+W ++ P + +R D +
Sbjct: 147 -----PTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQLSPEQKAKRT-DPTQ 200
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 201 PIKTPIIAGGLFVIDKSWFNHLGKYDTAMDIWGGENFEISFRVWMCGGTLEIVPCSRVGH 260
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 261 VFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKQYYYAARPA 303
>gi|149639508|ref|XP_001513185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Ornithorhynchus anatinus]
Length = 634
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 173/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS+VIVFHNEAWSTLLRTV+SV+ SP LLKEIILVDDAS + D + D+
Sbjct: 185 LPTTSVVIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDAS------VDDYLHDKLD 238
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 239 EYVKQFQIV------KVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLL 292
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 293 ARIAENYTAVVSPDIASIDLNTFEFSKPSPYGNNHNRGNFDWSLSFGWESLPEHEKQRRK 352
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 353 -DETYPIRTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 411
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 412 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEFKEIFYRRN 456
>gi|348518337|ref|XP_003446688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oreochromis niloticus]
Length = 598
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 126/283 (44%), Positives = 165/283 (58%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C Y + LP+TSI+I FHNEA STLLRT+ SV+NR+P L+ EIILVDD S+
Sbjct: 147 CTTLHYDSELPSTSIIITFHNEARSTLLRTIKSVLNRTPVHLIYEIILVDDFSDDESDCQ 206
Query: 56 --VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + + E + +D L K + P++ I +
Sbjct: 207 LLTKLPKVKCFRNNKREGLIRSRVRGTDAARAKVLTFLDSHCEVNKDWLPPLLQRIKED- 265
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P+ID+I+ TF Y+ AS GGF+W L+F+W ++ P + RR D +
Sbjct: 266 -----PSRVVSPVIDIINMDTFAYVAASADLRGGFDWSLHFKWEQLSPEQRARRT-DPTQ 319
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF ID+ +F LG YD MDIWGGEN E+SFRVWQCGG LEI+PCS VGH
Sbjct: 320 PIKTPIIAGGLFVIDRAWFNHLGKYDTAMDIWGGENFEISFRVWQCGGSLEILPCSRVGH 379
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR K PY FP G + + N R AEVWMD++R FYY+ P
Sbjct: 380 VFRKKHPYVFPEGNANTYIKNTRRTAEVWMDDFRLFYYSARPA 422
>gi|354478256|ref|XP_003501331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Cricetulus griseus]
gi|344235668|gb|EGV91771.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Cricetulus
griseus]
Length = 608
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 129/288 (44%), Positives = 179/288 (62%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ KSYP LPT S+VI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 141 CRGKSYPADLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 200
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI ++ E + M L + H + V P++
Sbjct: 201 ELDEYIQRYLPAKVKVIRNRKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSELGGA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 DG-ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
Length = 684
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 128/289 (44%), Positives = 178/289 (61%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK ++ Y T LP T ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 198 CKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 257
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 258 RQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSPVLTYLDSHCECT 307
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVI+D+T EY + + GGF+W L F W+ VP R
Sbjct: 308 EGWLEPLLDRIARNSSTVVCPVIDVINDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPER 367
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 368 ER-KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 427 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYYY 474
>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Anolis carolinensis]
Length = 634
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + D+
Sbjct: 185 LPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDAS------VDDYLQDKLD 238
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAKT----------- 116
+Y+ + K+ + K + ++ + + +T ++ A
Sbjct: 239 DYVKQFHIV------KVVRQKERKGLITARLLGASIATGETLTFLDAHCECFYGWLEPLL 292
Query: 117 ---------VVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S G F+W L+F W +P E +R
Sbjct: 293 ARIAENNTYVVSPDISSIDLNTFEFSKPSPYGQSHNRGNFDWSLSFGWESLPEHESKKR- 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I KDYFY +GSYDE M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 352 KDETYPIKTPTFAGGLFSISKDYFYNIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIIPC 411
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 412 SVVGHVFRSKSPHSFPKG-TQVITRNQVRLAEVWMDEYKNIFYRRN 456
>gi|410930313|ref|XP_003978543.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Takifugu rubripes]
Length = 500
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 130/294 (44%), Positives = 173/294 (58%), Gaps = 30/294 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP+TSI+I FHNEA STLLRTV SV+ RSP +L++EIIL+DD +S+R C
Sbjct: 131 CASMSYDAELPSTSIIITFHNEARSTLLRTVKSVLMRSPPSLVQEIILIDDFSSDRDCCH 190
Query: 60 IIDVISDQTFEYIT--ASDMTWGGFNWKLREKNRHKKTVVC--------------PIID- 102
++ + F + + G ++R N +++ P+I
Sbjct: 191 LLT----EPFPPVKFYSPSRREGLIRSRVRGANAASASILTFLDSHCEVNTDWLQPMIQR 246
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
V D T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R
Sbjct: 247 VKEDHT-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMAR 299
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P+RTP +AGG+F +D+ +F LG YD MDIWGGEN E+SFRVW CGG LEI+P
Sbjct: 300 S-DPTQPIRTPVIAGGIFVMDRSWFNRLGQYDTRMDIWGGENFELSFRVWMCGGSLEILP 358
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
CS VGHVFR + PY FP G + + N R AEVWMDE++ +YY+ P V
Sbjct: 359 CSRVGHVFRKRHPYDFPEGNALTYIKNTRRAAEVWMDEYKQYYYSARPSAQGKV 412
>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
Length = 658
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/289 (43%), Positives = 174/289 (60%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK + Y + LP T ++I FHNEAW+ LLRTV SV++RSP L+ IILVDD S+
Sbjct: 206 CKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGRIILVDDYSDMPHLK 265
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I + E + + + +H K V +D + T
Sbjct: 266 TQLEDYFAAYPKVQIIRGKKREGLIRARLLGA----------QHAKAPVLTYLDSHCECT 315
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD T EY +S + GGF+W L F W+ VP R
Sbjct: 316 EGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHYRDSSGVNVGGFDWNLQFSWHAVPER 375
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID++YF LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 376 EK-KRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSFKTWMCGGT 434
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 435 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLRKNSVRLAEVWMDEYSQYYY 482
>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
norvegicus]
gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
Length = 633
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + ++
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDDYLHEKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I +DYF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|148230993|ref|NP_001087490.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus laevis]
gi|51261644|gb|AAH80006.1| MGC81846 protein [Xenopus laevis]
Length = 603
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 177/284 (62%), Gaps = 19/284 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
C KK+YP LP SIVI F+NEA+S LLRTV SV++R+P LL EIILVDD SE +
Sbjct: 136 CSKKTYPADLPHASIVICFYNEAFSALLRTVHSVLDRTPAQLLHEIILVDDNSELDDLKK 195
Query: 59 PIIDVISDQTFEYI--TASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ + + + E + + G ++ +R V+ C + ++
Sbjct: 196 DLDNYMQENLSEKVKLVRNKQREGLIRGRMVGASRATGDVLVFLDSHCEVNEMWLQPLLA 255
Query: 111 YI--TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR-- 166
I KTVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ GG
Sbjct: 256 PIRENPKTVVCPVIDIISSDTLIY-SSSPVVRGGFNWGLHFKWDPVPLSEL---GGPEGY 311
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
++P R+PTMAGGLF +D++YF LG YD GMDIWGGENLE+SFR+W CGG L I+PCS V
Sbjct: 312 TAPFRSPTMAGGLFVMDREYFNTLGHYDSGMDIWGGENLEISFRIWMCGGSLLIVPCSRV 371
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GH+FR + PY PGG + +N+ R+A VWMDE++D Y+A+ P
Sbjct: 372 GHIFRKRRPYGSPGGHDTMA-YNSLRLAHVWMDEYKDQYFALRP 414
>gi|344273523|ref|XP_003408571.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Loxodonta africana]
Length = 555
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 174/308 (56%), Gaps = 25/308 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C SY LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD S
Sbjct: 113 CPSVSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 172
Query: 55 ------RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID-VISDQ 107
+V C D + +D+ L + P++ V D
Sbjct: 173 LLTRIPKVKCLRNDQREGLIRSRVRGADVAVAAILTFLDSHCEVNTEWLQPMLQRVKEDH 232
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D +
Sbjct: 233 T-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKISRT-DPT 284
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 285 KPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 344
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHF 283
HVFR + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T
Sbjct: 345 HVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQR 404
Query: 284 RMLSYSSW 291
+ ++ S+
Sbjct: 405 KKMNCKSF 412
>gi|395846631|ref|XP_003796006.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Otolemur garnettii]
Length = 943
Score = 236 bits (603), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 172/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ + LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD C
Sbjct: 489 CAEQLVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD------CST 542
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D EY++ LR K RH V C
Sbjct: 543 KDYLKDNLDEYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 597
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ + V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 598 VGWLEPLLERV------YLSRQKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWKT 651
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YFYELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 652 IPPDVVAKNKIKETDIIRCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWM 711
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 712 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 765
>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
Length = 658
Score = 236 bits (603), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 127/289 (43%), Positives = 174/289 (60%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK + Y + LP T ++I FHNEAW+ LLRTV SV++RSP L+ IILVDD S+
Sbjct: 206 CKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGRIILVDDYSDMPHLK 265
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I + E + + + +H K V +D + T
Sbjct: 266 TQLEDYFAAYPKVQIIRGKKREGLIRARLLGA----------QHAKAPVLTYLDSHCECT 315
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD T EY +S + GGF+W L F W+ VP R
Sbjct: 316 EGWLEPLLDRIARNSTTVVCPVIDVISDDTLEYHYRDSSGVNVGGFDWNLQFSWHAVPER 375
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID++YF LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 376 EK-KRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSFKTWMCGGT 434
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 435 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLRKNSVRLAEVWMDEYSQYYY 482
>gi|404434384|ref|NP_001258248.1| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|404501473|ref|NP_955425.2| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|149031397|gb|EDL86387.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Rattus norvegicus]
Length = 609
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 131/290 (45%), Positives = 180/290 (62%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ KSYPT LPT S+VI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 142 CRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 201
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI + E + M L + H + V P++
Sbjct: 202 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 261
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP ++
Sbjct: 262 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSDL--- 311
Query: 163 GGDRSS--PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG S+ P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 312 GGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFI 371
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 372 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 420
>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
aries]
Length = 633
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWETLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I KDYF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|167523942|ref|XP_001746307.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775069|gb|EDQ88694.1| predicted protein [Monosiga brevicollis MX1]
Length = 2376
Score = 236 bits (603), Expect = 7e-60, Method: Composition-based stats.
Identities = 128/280 (45%), Positives = 170/280 (60%), Gaps = 27/280 (9%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ +Y LP S++ VF+NEA STLLR++ SVI R+P +LL EIILVDDAS+ +
Sbjct: 1022 CRALTYDLATLPDMSVIFVFYNEARSTLLRSIRSVIIRTPPSLLHEIILVDDASDDELPA 1081
Query: 60 IIDVISDQTFEYI-------------TASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD 106
D+ + +YI +D G L + P++ I++
Sbjct: 1082 --DIKAMDKIKYIRLPSRQGLIRARTAGADAATGEVLCFLDSHIEVNRDWAEPLLQRINE 1139
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VV PIIDVISD F Y +AS + GGF+W L F+W VP + + D
Sbjct: 1140 DPLH------VVTPIIDVISDSNFRY-SASPVVRGGFDWGLTFKWKSVPRSQ---QSSDP 1189
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
++P+ +PTMAGGLFA+ + FYELG+YD GMDIWG ENLEMSFR+WQCG LEI+PCS V
Sbjct: 1190 TAPIASPTMAGGLFAMKRTTFYELGTYDLGMDIWGAENLEMSFRIWQCGARLEIMPCSRV 1249
Query: 227 GHVFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFY 265
GHVFR PY+FPGG S + L N+ R+AEVWMDE+ +F+
Sbjct: 1250 GHVFRKHHPYSFPGGGSGHVFLRNSLRLAEVWMDEYAEFF 1289
>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
Length = 624
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 130/290 (44%), Positives = 177/290 (61%), Gaps = 37/290 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL+E++LVDD S+
Sbjct: 165 CKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSD------ 218
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLR-EKNRHKKTVVCPIIDVISDQTFEYITAK---- 115
+D EY++ +GG LR EK V + + Y+ +
Sbjct: 219 MDHTKRPLEEYMS----QFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECM 274
Query: 116 ----------------TVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPR 157
TVVCP+IDVI D TFEY + + GGF+W L F W+ +P R
Sbjct: 275 EGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPER 334
Query: 158 EMMRRGGDRS-SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+ R+ R P+R+PTMAGGLF+IDK+YF +LG+YD G DIWGGENLE+SF++W CGG
Sbjct: 335 D--RKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGG 392
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 393 TLEIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 441
>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
Length = 623
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 178/280 (63%), Gaps = 17/280 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL+E++LVDD S+
Sbjct: 165 CKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSDMDHTKR 224
Query: 59 PIIDVISDQTFEY-ITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
P+ + +S + I + G +LR V+ C ++ + +
Sbjct: 225 PLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDR 284
Query: 112 IT--AKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
I TVVCP+IDVI D TFEY + + GGF+W L F W+ +P R+ R+ R
Sbjct: 285 IKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--RKNRTRP 342
Query: 168 -SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+R+PTMAGGLF+IDK+YF +LG+YD G DIWGGENLE+SF++W CGG LEI+PCSHV
Sbjct: 343 IDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPCSHV 402
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
GHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 403 GHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 441
>gi|348585731|ref|XP_003478624.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Cavia porcellus]
Length = 937
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 174/294 (59%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WSTLLR+V SV+NRSP+ L+KEI+LVDD S +
Sbjct: 483 CAEQLVHNQLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPQHLIKEILLVDDFSTK----- 537
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D+ +Y++ LR K RH V C
Sbjct: 538 -DYLKDKLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 591
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 592 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGVFVWPMNFGWRT 645
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 646 IPPEVVAKNRIKETDVIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 705
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EI+PCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 706 CGGEIEIVPCSRVGHIFRNDNPYSFPKDRLKTVERNLVRVAEVWLDEYKELFYG 759
>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
Length = 630
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 130/292 (44%), Positives = 180/292 (61%), Gaps = 41/292 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + Y LP TS+++ FHNEAWS LLRTV SV+ R+P LL+EI+LVDD S+
Sbjct: 169 CKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPEHLLEEIVLVDDFSD------ 222
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT------- 113
+D EY++ +GG LR + R + ++ + + T E +T
Sbjct: 223 MDHTKRPLEEYMS----QFGGKVKILRMEKR--EGLIRARLRGAAIATGEVLTYLDSHCE 276
Query: 114 ----------------AKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVP 155
TVVCP+IDVI D TFEY + + GGF+W L F W+ +P
Sbjct: 277 CMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIP 336
Query: 156 PREMMRRGGDRS-SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQC 214
R+ R+ R+ P+R+PTMAGGLF+IDK YF +LG+YD G DIWGGENLE+SF++W C
Sbjct: 337 ERD--RKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMC 394
Query: 215 GGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
GG LEI+PCSHVGHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 395 GGTLEIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 445
>gi|51315700|sp|Q6P6V1.1|GLT11_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|38303875|gb|AAH62004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Rattus norvegicus]
Length = 608
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 131/290 (45%), Positives = 180/290 (62%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+ KSYPT LPT S+VI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 141 CRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 200
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI + E + M L + H + V P++
Sbjct: 201 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP ++
Sbjct: 261 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSDL--- 310
Query: 163 GGDRSS--PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG S+ P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|322785490|gb|EFZ12159.1| hypothetical protein SINV_06585 [Solenopsis invicta]
Length = 466
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 139/297 (46%), Positives = 176/297 (59%), Gaps = 30/297 (10%)
Query: 4 KSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-------ERV 56
K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S E
Sbjct: 2 KQWRQDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELS 61
Query: 57 VCPIIDVISDQTFEYI-----------TASDMTW------GGFNW--KLREKNRHKKT-V 96
+ VI ++ E + TAS +T+ +W L E+ T V
Sbjct: 62 RIHKVRVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAEDPTRV 121
Query: 97 VCPIIDVISDQTFEYIT--AKTVVCPIIDVISDQT-FEYITASDMTWGGFNWKLNFRWYR 153
VCP+IDVIS TF+YI + + I + D+ F ++ AS GGF+W L F+W
Sbjct: 122 VCPVIDVISMDTFQYIEICLRCNLKRISETRRDKILFRFLGASADLRGGFDWSLVFKWEY 181
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+ E R D + +RTP +AGGLF I+K YF +LG YD MD+WGGENLE+SFRVWQ
Sbjct: 182 LSQGERQARQKDPTQSIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQ 241
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CGG LEIIPCS VGHVFR + PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 242 CGGSLEIIPCSRVGHVFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 298
>gi|350593559|ref|XP_003133495.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Sus
scrofa]
Length = 633
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I KDYF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|291410883|ref|XP_002721722.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 1,
partial [Oryctolagus cuniculus]
Length = 499
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 132/305 (43%), Positives = 180/305 (59%), Gaps = 19/305 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 54 CPSMSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 113
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + ++ +D + E++
Sbjct: 114 LLTRIPKVK---CLRNDRREGLIRSRVRGADVAAAAILT-FLDSHCEVNTEWLQPMLQRV 169
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 170 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKITRT-DPTRPI 228
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 229 RTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 288
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFRML 286
R + PY FP G + + N R AEVWMDE++ +YY P GK+ SV+T + +
Sbjct: 289 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKM 348
Query: 287 SYSSW 291
+ S+
Sbjct: 349 NCKSF 353
>gi|196001851|ref|XP_002110793.1| hypothetical protein TRIADDRAFT_11844 [Trichoplax adhaerens]
gi|190586744|gb|EDV26797.1| hypothetical protein TRIADDRAFT_11844, partial [Trichoplax
adhaerens]
Length = 490
Score = 236 bits (602), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 129/294 (43%), Positives = 173/294 (58%), Gaps = 38/294 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE-----R 55
CK + +P LPTTS+V+VFHNEAWSTLLRTV S+++RSP LL EIIL DD S+
Sbjct: 48 CKDQIFPLHLPTTSVVVVFHNEAWSTLLRTVHSILSRSPPDLLHEIILQDDYSDPIGHAE 107
Query: 56 VVCPI---------IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD 106
+ P+ + + ++ E + S + GF+ H V +D +
Sbjct: 108 LFMPLELYTSKLEKVKIFRNEKHEGLIRSRLN--GFS--------HATAPVVTFLDAHCE 157
Query: 107 QTFE---------YITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPP 156
T Y+ TVVCP IDVI D+TF+Y + G FNW+L FRW +PP
Sbjct: 158 VTTGWLEPLLERIYLNETTVVCPEIDVIDDRTFQYQFGPPALMRGVFNWQLYFRWALIPP 217
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR P+ +PTMAGGLFAI K +F LG+YD+ D+WGGEN+E+SF+ W CGG
Sbjct: 218 EEHKRRKSP-IDPVWSPTMAGGLFAISKKFFKRLGTYDDQFDVWGGENMEISFKAWLCGG 276
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI+PCS VGHVFR PY F G + N+ RVAEVW+D++++F+Y + P
Sbjct: 277 KLEIVPCSRVGHVFRHNQPYKFGGN---FLSRNSQRVAEVWLDDYKEFFYQVQP 327
>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
Length = 626
Score = 236 bits (602), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 178/280 (63%), Gaps = 17/280 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL+E++LVDD S+
Sbjct: 165 CKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSDMDHTKR 224
Query: 59 PIIDVISDQTFEY-ITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
P+ + +S + I + G +LR V+ C ++ + +
Sbjct: 225 PLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDR 284
Query: 112 IT--AKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
I TVVCP+IDVI D TFEY + + GGF+W L F W+ +P R+ R+ R
Sbjct: 285 IKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--RKNRTRP 342
Query: 168 -SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+R+PTMAGGLF+IDK+YF +LG+YD G DIWGGENLE+SF++W CGG LEI+PCSHV
Sbjct: 343 IDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPCSHV 402
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
GHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 403 GHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 441
>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
Length = 597
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/297 (44%), Positives = 182/297 (61%), Gaps = 51/297 (17%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + Y LP TS+++ FHNEAWS LLRTV SV+ R+P LL+EI+LVDD S+
Sbjct: 173 CKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPDHLLEEIVLVDDFSD------ 226
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------------------HKKTV 96
+D EY++ +GG LR + R H + +
Sbjct: 227 MDHTKRPLEEYMS----QFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECM 282
Query: 97 ---VCPIIDVIS-DQTFEYITAKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFR 150
+ P++D I D T TVVCP+IDVI D TFEY + + GGF+W L F
Sbjct: 283 EGWIEPLLDRIKRDPT-------TVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFN 335
Query: 151 WYRVPPREMMRRGGDRS-SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSF 209
W+ +P R+ R+ R+ P+R+PTMAGGLF+IDK YF +LG+YD G DIWGGENLE+SF
Sbjct: 336 WHSIPERD--RKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSF 393
Query: 210 RVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
++W CGG LEI+PCSHVGHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 394 KIWMCGGTLEIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 449
>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Takifugu rubripes]
Length = 583
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C+ K++ LPTTS++I F+NEAWSTLLRT+ SV+ +P LLKEIIL+DD S+R +
Sbjct: 130 CRSKTFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILIDDFSDRAYLK 189
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ D IS+ + + G +L V+ C + + E
Sbjct: 190 SQLADYISNLERVRLIRTKKREGLVRARLIGATYATGEVLTFLDCHCECVPGWIEPLLER 249
Query: 112 I--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I + T+VCP+ID I TFE Y+ + GGF+W+L F+W+ VP RE RR
Sbjct: 250 IGENSSTIVCPVIDTIDWNTFEFYMQTEEPMIGGFDWRLTFQWHSVPERERKRRKSP-VD 308
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+R+PTMAGGLFA++K++F LG+YD GM++WGGENLE+SFRVWQCGG LEI PCSHVGH
Sbjct: 309 PIRSPTMAGGLFAVNKNFFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGH 368
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VF K+PY P L N R AEVWMD ++ +Y NP
Sbjct: 369 VFPKKAPYARPN-----FLQNTVRAAEVWMDSYKQHFYNRNP 405
>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryctolagus cuniculus]
Length = 579
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 135/294 (45%), Positives = 173/294 (58%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K++ LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+R
Sbjct: 125 CKSKTFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRAY-- 182
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L + + V +I DV++
Sbjct: 183 ----LKTQLETYISNLDRV------RLIRTKKREGLVRARLIGATFATGDVLTFLDCHCE 232
Query: 106 ------DQTFEYIT--AKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 233 CNSGWLEPLLERIERDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPK 292
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 293 HERDRRKS-RIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 351
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMD++++ +Y NP
Sbjct: 352 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDDYKEHFYNRNP 400
>gi|354468358|ref|XP_003496633.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Cricetulus griseus]
Length = 541
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 170/282 (60%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 90 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIQEIILVDDFSNDPEDCK 149
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 150 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE-- 207
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + R D +
Sbjct: 208 -DYTR---VVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSPEQKALRL-DPTE 262
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 263 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGH 322
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 323 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 364
>gi|350582569|ref|XP_003481303.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Sus scrofa]
Length = 552
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 125/282 (44%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y LP TSI+I FHNEA STLLRTV S++NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCADLPPTSIIITFHNEARSTLLRTVRSILNRTPMNLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRIRGADAAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I TF+YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIIHLDTFDYIESATELRGGFDWSLHFQWEQLTPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF +DK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVMDKSWFDYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYASRP 375
>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Anolis carolinensis]
Length = 583
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 172/282 (60%), Gaps = 18/282 (6%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ K+Y LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S++V
Sbjct: 129 CRSKTYDYRRLPTTSVIIAFYNEAWSTLLRTIHSVLESSPSVLLKEIILVDDLSDKVYLK 188
Query: 60 --IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ IS+ + ++ G +L V+ C + + +
Sbjct: 189 GELEKYISNLQRVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVPGWLEPLLQR 248
Query: 112 ITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ ++CP+ID I TFE Y+ + GGF+W+L F+W+ VP E RR +
Sbjct: 249 VAENESVIICPVIDTIDWNTFEFYMQPGEPMIGGFDWRLTFQWHSVPDYERQRRK-SKVD 307
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+R+PTMAGGLFA+ K YF LG+YD GMD+WGGENLE+SFRVWQCGGILEI PCSHVGH
Sbjct: 308 PIRSPTMAGGLFAVSKKYFEYLGTYDMGMDVWGGENLELSFRVWQCGGILEIHPCSHVGH 367
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VF ++PY P L N AR AEVWMD++++ +Y NP
Sbjct: 368 VFPKRAPYARPN-----FLQNTARAAEVWMDDYKEHFYNRNP 404
>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Taeniopygia guttata]
Length = 631
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/286 (46%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTSI+IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 182 LPTTSIIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDAS------VDEYLHDKLD 235
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 236 EYVKQFQIV------KVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLL 289
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S G F+W L+F W +P E RR
Sbjct: 290 ARIAENPVAVVSPDIASIDLNTFEFSKPSPYGHSHNRGNFDWSLSFGWESLPKHENKRRK 349
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT AGGLF+I KDYF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 350 -DETYPIRTPTFAGGLFSISKDYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 408
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 409 SVVGHVFRSKSPHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRN 453
>gi|260836359|ref|XP_002613173.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
gi|229298558|gb|EEN69182.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
Length = 539
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 167/282 (59%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK ++ L TS++I FHNEA STLLRT+ SV+ RSP L++EIILVDD S++ P
Sbjct: 93 CKTHTWKEDLLPTSVIITFHNEARSTLLRTITSVLLRSPPHLIQEIILVDDYSDK---PD 149
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYITA 114
+ Q + + G +R + + PI+ + E +
Sbjct: 150 DGLELAQIQKVKILRNERREGL---MRSRVKGADAATAPILTFLDSHCECNQHWLEPMLE 206
Query: 115 KT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VVCPIIDVI+ F+Y+ AS GGF+W L F+W + + R D +
Sbjct: 207 RVMEDRTRVVCPIIDVINMDNFQYVGASADLRGGFDWNLVFKWDYMTANQRNARRSDPIA 266
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F ELG YD MD+WGGENLE+SFRVWQC G LEIIPCS VGH
Sbjct: 267 PIRTPMIAGGLFMIDKSWFDELGKYDMMMDVWGGENLEISFRVWQCQGSLEIIPCSRVGH 326
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR + PYTFPGG + N R AEVWMDE++++YYA P
Sbjct: 327 VFRKQHPYTFPGGSGNVFTRNTRRAAEVWMDEYKEYYYAAVP 368
>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
musculus]
Length = 633
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + ++
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDDYLHEKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
laevis]
gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
Length = 582
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K++ LPTTS++I F+NEA STLLRT+ SV+ SP LL+EIILVDD S++V
Sbjct: 128 CKSKTFSYRKLPTTSVIIAFYNEALSTLLRTIHSVLESSPAVLLREIILVDDFSDKVY-- 185
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS--DQTFE 110
+ Q +YI D +L + + V II DV++ D E
Sbjct: 186 ----LKSQLEDYIGGLDRV------RLIRTTKREGLVRARIIGATYAIGDVLTFLDCHCE 235
Query: 111 YITA-------------KTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+T VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CVTGWLEPLLERIGENETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHAVPE 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 KERQRRKS-RIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF K+PY P L N AR AEVWMD +++ +Y NP
Sbjct: 355 TLEIEPCSHVGHVFPKKAPYARPN-----FLQNTARAAEVWMDGYKELFYNRNP 403
>gi|194220840|ref|XP_001500424.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Equus
caballus]
Length = 539
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 123/282 (43%), Positives = 169/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L+KEIILVDD S
Sbjct: 88 CTTLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMNLIKEIILVDDFSNDPDDCN 147
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + ++ + I +D G + + + P++ + +
Sbjct: 148 QLIKLPKVKCLRNENRQGLVRSRIRGADFAEGAILTFMDSHCEVNRDWLQPLLHRVKE-- 205
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ F YI ++ GGF+W L+F+W ++ P + +R D +
Sbjct: 206 -DYTR---VVCPVIDIINLDNFNYIESATELRGGFDWSLHFQWEQLSPEQKAQRL-DPAE 260
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF ++K +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 261 PIRTPVIAGGLFVMNKSWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 320
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R EVWMDE++ +YYA P
Sbjct: 321 VFRKKHPYVFPDGNANTYIKNTKRTVEVWMDEYKQYYYAARP 362
>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
abelii]
Length = 578
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/294 (45%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + + LPTTS++I F+NEAWSTLLRT+ SV+ SP LLKEIILVDD S+RV
Sbjct: 124 CKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVY-- 181
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q YI+ D +L N+ + V +I DV++
Sbjct: 182 ----LKTQLETYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCE 231
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I +VCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 232 CNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVP- 290
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
++ R R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 291 KQKRDRQISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGG 350
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF ++PY P L N AR AEVWMDE+++ +Y NP
Sbjct: 351 KLEIHPCSHVGHVFPKRAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNP 399
>gi|417403505|gb|JAA48553.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 633
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 174/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV+ SP LLKE+ILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAVLLKEVILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRKRKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDMNTFEFNKPSPYGINHNRGNFDWSLSFGWEALPDHERQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + ++
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDDYLHEKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|291386971|ref|XP_002709979.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oryctolagus cuniculus]
Length = 551
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 168/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y LP TSI+I FHNEA STLLRTV S++NR+P L++EIILVDD S
Sbjct: 100 CALLVYCKDLPPTSIIITFHNEARSTLLRTVRSILNRTPMHLIQEIILVDDFSSDPDDCN 159
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + I +D+ G L K + P++ + +
Sbjct: 160 QLIKLPKVKCLRNNERQGLVRSRIRGADIAQGATLTFLDSHCEVNKDWLQPLLHRVKE-- 217
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F W ++ P + RR D +
Sbjct: 218 -DYTR---VVCPVIDIINLDTFNYIESASELRGGFDWSLHFHWEQLSPEQKARRL-DPTE 272
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW C G LEIIPCS VGH
Sbjct: 273 PIRTPVIAGGLFVIDKAWFDYLGKYDTDMDIWGGENFEISFRVWMCRGSLEIIPCSRVGH 332
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMD+++ +YYA P
Sbjct: 333 VFRKKHPYAFPNGNTNTYIKNTKRTAEVWMDDYKQYYYAARP 374
>gi|349732170|ref|NP_001231847.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1-like [Sus
scrofa]
Length = 557
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/283 (45%), Positives = 164/283 (57%), Gaps = 21/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD S
Sbjct: 113 CPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 172
Query: 55 ------RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID-VISDQ 107
+V C D + +D+ G L + P++ V D
Sbjct: 173 LLTRIPKVKCLRNDRREGLIRSRVRGADVAAAGVLTFLDSHCEVNTEWLQPMLQRVKEDH 232
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + D +
Sbjct: 233 T-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIA-WTDPT 284
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 285 KPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 344
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 345 HVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
Length = 623
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 177/280 (63%), Gaps = 17/280 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL+E++LVDD S+
Sbjct: 165 CKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSDMDHTKR 224
Query: 59 PIIDVISDQTFEY-ITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
P+ + +S + I + G +LR V+ C ++ + +
Sbjct: 225 PLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDR 284
Query: 112 IT--AKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
I TVVCP+IDVI D TFEY + + GGF+W L F W+ +P R+ R+ R
Sbjct: 285 IKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--RKNRTRP 342
Query: 168 -SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+R+PTMAGGLF+IDK YF +LG+YD G DIWGGENLE+SF++W CGG LEI+PCSHV
Sbjct: 343 IDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPCSHV 402
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
GHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 403 GHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 441
>gi|313246954|emb|CBY35800.1| unnamed protein product [Oikopleura dioica]
Length = 696
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/283 (46%), Positives = 172/283 (60%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C K+ Y LP T+++I +HNEA STLLRTV SV++RSP L+KEIILVDD S+ +
Sbjct: 248 CSKQKYQVEQLPDTTVIITYHNEAHSTLLRTVISVLHRSPPNLIKEIILVDDFSKNPNIG 307
Query: 58 CPI-----IDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
P+ + I + E + + + G L + + P++ I +
Sbjct: 308 PPLTKIKKVKAIRNPKREGLIRSRVRGAAIATGKVLTFLDSHVEANEGWLEPLLGRIHE- 366
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
+ VV PIIDVI F Y+ AS GGFNW L F+W + +E R +
Sbjct: 367 -----SRTAVVSPIIDVIGMDDFHYVGASADLKGGFNWDLVFKWDYMSEQERRERRRAPT 421
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
SP+RTP +AGGLF+IDK++F+ELG YD MD+WGGENLE+SFRVWQC G LEIIPCS VG
Sbjct: 422 SPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEISFRVWQCHGTLEIIPCSRVG 481
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR K PYTFPGG + N R AEVWMDE+++FY+A P
Sbjct: 482 HVFRKKHPYTFPGGSGNVFAKNTRRAAEVWMDEYKEFYFAAVP 524
>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
Length = 624
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 177/280 (63%), Gaps = 17/280 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL+E++LVDD S+
Sbjct: 165 CKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSDMDHTKR 224
Query: 59 PIIDVISDQTFEY-ITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
P+ + +S + I + G +LR V+ C ++ + +
Sbjct: 225 PLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDR 284
Query: 112 IT--AKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
I TVVCP+IDVI D TFEY + + GGF+W L F W+ +P R+ R+ R
Sbjct: 285 IKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--RKNRTRP 342
Query: 168 -SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+R+PTMAGGLF+IDK YF +LG+YD G DIWGGENLE+SF++W CGG LEI+PCSHV
Sbjct: 343 IDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPCSHV 402
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
GHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 403 GHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 441
>gi|410962531|ref|XP_003987822.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1,
partial [Felis catus]
Length = 553
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 126/280 (45%), Positives = 167/280 (59%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C +Y LP TS++I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 109 CPSVAYSADLPATSVIITFHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFSSDPEDCL 168
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + V+ +D + E++
Sbjct: 169 LLTRIPKVK---CLRNDRREGLIRSRVRGADVATAAVLT-FLDSHCEVNTEWLQPMLQRV 224
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 225 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKIART-DPTKPI 283
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 284 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 343
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 344 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 383
>gi|410955524|ref|XP_003984401.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Felis
catus]
Length = 552
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 125/282 (44%), Positives = 166/282 (58%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-------AS 53
C Y LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CTLLVYCADLPPTSIIITFHNEARSTLLRTIRSVLNRTPMNLIQEIILVDDFSNDPDDCS 160
Query: 54 ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + I + + I + + G L + + P++ + +
Sbjct: 161 QLIKLPKVKCIRNTERQGLVRSRIRGASVAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+IS F YI ++ GGF+W L+F+W ++ P + RR D +
Sbjct: 220 -----YTRVVCPVIDIISLDNFNYIESAAELRGGFDWSLHFQWEQLSPEQKARRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF +DK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIRTPIIAGGLFVMDKSWFEYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|431904511|gb|ELK09894.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Pteropus alecto]
Length = 557
Score = 235 bits (600), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 166/280 (59%), Gaps = 15/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP TS VI FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 113 CPSVSYSVDLPATSFVITFHNEARSTLLRTVKSVLNRTPPNLIQEIILVDDFSSDPEDCL 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R + ++ +D + E++
Sbjct: 173 LLTRIPKVK---CLRNDRREGLIRSRVRGADVASAAILT-FLDSHCEVNTEWLQPMLQRV 228
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D + P+
Sbjct: 229 KEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKISRT-DPTRPI 287
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 288 RTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 347
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
R + PY FP G + + N R AEVWMDE++ +YY P
Sbjct: 348 RKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARP 387
>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
Length = 626
Score = 235 bits (600), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 177/280 (63%), Gaps = 17/280 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL+E++LVDD S+
Sbjct: 165 CKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSDMDHTKR 224
Query: 59 PIIDVISDQTFEY-ITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
P+ + +S + I + G +LR V+ C ++ + +
Sbjct: 225 PLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDR 284
Query: 112 IT--AKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
I TVVCP+IDVI D TFEY + + GGF+W L F W+ +P R+ R+ R
Sbjct: 285 IKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--RKNRTRP 342
Query: 168 -SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+R+PTMAGGLF+IDK YF +LG+YD G DIWGGENLE+SF++W CGG LEI+PCSHV
Sbjct: 343 IDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPCSHV 402
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
GHVFR +SPY + GV+ ++ N+ R+AEVW+D+++ +YY
Sbjct: 403 GHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYY 441
>gi|395504161|ref|XP_003756425.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Sarcophilus harrisii]
Length = 563
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 169/286 (59%), Gaps = 18/286 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C Y + LP TSIVI FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 121 CTSVHYASDLPATSIVITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPEDCL 180
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA----- 114
++ I +D G ++R ++ +D + E++
Sbjct: 181 LLTRIPKIK---CLRNDRREGLIRSRVRGAEVATADILT-FLDSHCEVNSEWLQPMLQRV 236
Query: 115 ----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 237 KEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DPTQPI 295
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVF
Sbjct: 296 RTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVF 355
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS 273
R + PY FP G + + N R AEVWMDE++ +YY P GKS
Sbjct: 356 RKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKS 401
>gi|313233395|emb|CBY24510.1| unnamed protein product [Oikopleura dioica]
Length = 679
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/283 (46%), Positives = 172/283 (60%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C K+ Y LP T+++I +HNEA STLLRTV SV++RSP L+KEIILVDD S+ +
Sbjct: 231 CSKQKYQVEQLPDTTVIITYHNEAHSTLLRTVISVLHRSPPNLIKEIILVDDFSKNPNIG 290
Query: 58 CPI-----IDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
P+ + I + E + + + G L + + P++ I +
Sbjct: 291 PPLTKIKKVKAIRNPKREGLIRSRVRGAAIATGKVLTFLDSHVEANEGWLEPLLGRIHE- 349
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
+ VV PIIDVI F Y+ AS GGFNW L F+W + +E R +
Sbjct: 350 -----SRTAVVSPIIDVIGMDDFHYVGASADLKGGFNWDLVFKWDYMSEQERRERRRAPT 404
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
SP+RTP +AGGLF+IDK++F+ELG YD MD+WGGENLE+SFRVWQC G LEIIPCS VG
Sbjct: 405 SPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEISFRVWQCHGTLEIIPCSRVG 464
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR K PYTFPGG + N R AEVWMDE+++FY+A P
Sbjct: 465 HVFRKKHPYTFPGGSGNVFAKNTRRAAEVWMDEYKEFYFAAVP 507
>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/286 (45%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + ++
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDDYLHEKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGNNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|363731300|ref|XP_419370.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Gallus
gallus]
Length = 552
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/283 (45%), Positives = 169/283 (59%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE-----R 55
C Y LP TSIVI FHNEA STLLRT+ SV+NR+P L+ EIILVDD S+ R
Sbjct: 101 CTTLHYRQDLPPTSIVITFHNEARSTLLRTIRSVMNRTPVHLIHEIILVDDFSDDPDDCR 160
Query: 56 VVC--PIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
++ P + + ++ E I +D+ G L K + P++ I +
Sbjct: 161 LLAKLPKVKCLRNRQREGLIRSRIQGADVAQAGVLTFLDSHCEVNKDWLLPLLQRIKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P+ID+I+ TF Y+ AS GGF+W L+F+W ++ P + +R D +
Sbjct: 220 -----PTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQLSPEQKAKRL-DPTK 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIKTPIIAGGLFVIDKAWFNHLGKYDNAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKQYYYAARPA 376
>gi|410916145|ref|XP_003971547.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Takifugu rubripes]
Length = 579
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 129/288 (44%), Positives = 167/288 (57%), Gaps = 22/288 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C Y + LP TSI+I FHNEA STLLRTV SV+NR+P L+ EIILVDD S+
Sbjct: 128 CATIRYDSDLPPTSIIITFHNEARSTLLRTVRSVLNRTPVHLIHEIILVDDFSDDESDCQ 187
Query: 56 --VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ P + + + E + +D L K + P++ I
Sbjct: 188 LLIKLPKVRCVRNPQREGLIRSRVRGADSAKAAVLTFLDSHCEVNKDWLPPLLQRIKQD- 246
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P+ID+I+ TF Y+ AS GGF+W L+F+W ++ P + RR D +
Sbjct: 247 -----PTRVVSPVIDIINMDTFAYVAASADLRGGFDWSLHFKWEQLSPEQRARRT-DPAQ 300
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF ID+ +F LG YD MDIWGGEN E+SFRVWQCGG LEI+PCS VGH
Sbjct: 301 PIKTPIIAGGLFVIDRSWFNHLGKYDTAMDIWGGENFEISFRVWQCGGSLEILPCSRVGH 360
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS 273
VFR K PY FP G + + N R AEVWMD++ FYY+ P GKS
Sbjct: 361 VFRKKHPYVFPEGNANTYIKNTRRTAEVWMDDFSLFYYSARPAARGKS 408
>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 514
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 122/290 (42%), Positives = 180/290 (62%), Gaps = 38/290 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ + Y + LP+TSI++ FHNEAWS L+RTV S++NRSP L+ +IILVDD S+
Sbjct: 62 CRDQVYSSKLPSTSIIVCFHNEAWSVLIRTVHSILNRSPAHLIHDIILVDDFSD------ 115
Query: 61 IDVISDQTFEYITA--------SDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ ++ D Y++A ++ G +L + V+
Sbjct: 116 LQLLKDPLERYLSAFPKVRIVRAEKREGLIRARLLGASHSTAPVLTFLDSHVECTQGWLE 175
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYIT--ASDMTWGGFNWKLNFRWYRVPP 156
P++D I+ + + VV P+ID+I+D T EY ++D+ GGF+W L F W+ +P
Sbjct: 176 PLLDRIA------VNSTNVVSPVIDIIADDTLEYNAKESADVNVGGFDWSLQFSWHSIPE 229
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
R +++ G R P+ TPTMAGGLF+ID+ +F LG YD G DIWGGENLE+SF+ W CGG
Sbjct: 230 R-ILKSGYKRWQPVETPTMAGGLFSIDRKFFERLGMYDPGFDIWGGENLELSFKTWMCGG 288
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEIIPCSHVGH+FR +SPY + GV+ ++ N+ R+A+VWMDE+ ++Y+
Sbjct: 289 RLEIIPCSHVGHIFRKRSPYKWRSGVN-VLRRNSIRLAKVWMDEYANYYF 337
>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 657
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/275 (46%), Positives = 169/275 (61%), Gaps = 26/275 (9%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP--IIDVISDQ 67
LP TS+VI FHNEAWS LLRTV SV++RSP LLKEIILVDD S+ + + D +S
Sbjct: 180 LPQTSVVICFHNEAWSVLLRTVHSVLDRSPPNLLKEIILVDDFSDMIHLKKQLEDYMSHY 239
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
I + G +L R V P++D I+
Sbjct: 240 PKVKIIRASKREGLIRARLLGATRATAPVTTFLDSHCECTVGWLEPLLDRIAKD------ 293
Query: 114 AKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +R + + P+
Sbjct: 294 PTTVVCPVIDVIDDTTLEYNFRDSGGVNVGGFDWNLQFNWHAVPEREK-KRHKNTAEPVW 352
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
+PTMAGGLFAIDK++F +G+YD G DIWGGENLE+SF+ W CGG LEI+PCSHVGH+FR
Sbjct: 353 SPTMAGGLFAIDKNFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFR 412
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
+SPY + GV+ ++ N+ R+AEVW+D++ +YY
Sbjct: 413 RRSPYKWRSGVN-VLKRNSVRLAEVWLDDYAKYYY 446
>gi|443704264|gb|ELU01402.1| hypothetical protein CAPTEDRAFT_127533 [Capitella teleta]
Length = 390
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 123/287 (42%), Positives = 173/287 (60%), Gaps = 23/287 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ KSY + LP S++I F E+WSTLLR+V SV+NR+P LL+EIILVDD S+R
Sbjct: 56 CRDKSYDYSSLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQR---- 111
Query: 60 IIDVISDQTFEYIT----ASDMTWGGFNWKLREKNRHKKTVVCPIIDVI----------S 105
+ + Y+T + + + +R + R + P++ + +
Sbjct: 112 --GHLHAKLDNYLTRLPKVTLIRFPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWA 169
Query: 106 DQTFEYIT--AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + I+ + +V P+ID IS + F YI S GGFNW + F+W VP E R G
Sbjct: 170 EPLLQRISHNRRVIVAPVIDAISSRDFSYIPISANQRGGFNWAMLFKWMPVPNYEKSRTG 229
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GD ++P+RTPT+AGGLFAI + +F LG YD G+DIWG ENLE+SF+ W CGG +E+IPC
Sbjct: 230 GDPTAPVRTPTIAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPC 289
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGHV+R PY+FPGG K+ + N RVA VWMD + + +Y M P
Sbjct: 290 SRVGHVYRSTQPYSFPGGNVKVFMRNNLRVANVWMDGYVNLFYLMKP 336
>gi|443726011|gb|ELU13353.1| hypothetical protein CAPTEDRAFT_91056 [Capitella teleta]
Length = 426
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 123/287 (42%), Positives = 169/287 (58%), Gaps = 23/287 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C+ KSY + LP S++I F E+WSTLLR+V SV+NR+P LL+EI+LVDD S+R +
Sbjct: 92 CRDKSYDYSSLPKMSVIICFTEESWSTLLRSVHSVLNRTPPDLLEEILLVDDFSQREHLH 151
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D ++ + G +LR + V+ P++
Sbjct: 152 AKLDDYLTRLPKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQR 211
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
IS + +V P+ID IS + F YI S GGFNW + F+W VP E R G
Sbjct: 212 ISHNR------RVIVAPVIDAISSRDFSYIPISANQRGGFNWAMLFKWMPVPDYEKSRTG 265
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GD ++P+RTPT+AGGLFAI + +F LG YD G+ IWG ENLE+SF+ W CGG +E+IPC
Sbjct: 266 GDPTAPVRTPTIAGGLFAIHQGFFRSLGFYDPGLHIWGSENLELSFKAWMCGGSMEMIPC 325
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ VGHV+R PY+FPGG K+ + N RVA VWMD++ D +Y M P
Sbjct: 326 ARVGHVYRSTQPYSFPGGNVKVFMRNNLRVANVWMDDYVDLFYLMKP 372
>gi|355689595|gb|AER98885.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mustela putorius
furo]
Length = 452
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/284 (47%), Positives = 168/284 (59%), Gaps = 41/284 (14%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++I F+NEAWSTLLRTV SV+ SP LLKEIILVDD S+RV + Q
Sbjct: 8 LPTTSVIIAFYNEAWSTLLRTVHSVLETSPAVLLKEIILVDDLSDRV------YLKTQLE 61
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS-------------DQTF 109
YI+ D +L N+ + V +I DV++ +
Sbjct: 62 TYISNLDRV------RLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLL 115
Query: 110 EYIT--AKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
E I+ VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP E RR R
Sbjct: 116 ERISYDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERDRRKS-R 174
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE PCSHV
Sbjct: 175 IDPIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHV 234
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVF ++PY+ L N R AEVWMDE+++ YY NP
Sbjct: 235 GHVFPKQAPYS-----RNKALANCVRAAEVWMDEFKELYYHRNP 273
>gi|403258871|ref|XP_003921965.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Saimiri
boliviensis boliviensis]
Length = 633
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/297 (44%), Positives = 174/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAVLLKEIILVDDAS------VDDYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGASVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSHHNRGNFDWSLSFGWETLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|62148926|dbj|BAD93347.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-3 [Rattus
norvegicus]
Length = 633
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP L KEIILVDDAS + D + ++
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILPKEIILVDDAS------VDDYLHEKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I +DYF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR+KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRNKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|301783121|ref|XP_002926975.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Ailuropoda melanoleuca]
gi|281344477|gb|EFB20061.1| hypothetical protein PANDA_016676 [Ailuropoda melanoleuca]
Length = 632
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 236
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 237 EYIKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 290
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 291 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQRRK 350
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 351 -DETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 409
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 410 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 454
>gi|344268422|ref|XP_003406059.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Loxodonta africana]
Length = 939
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 173/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ + LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 485 CAEQLVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 539
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 540 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 593
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 594 IGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGVFVWPMNFGWRT 647
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 648 IPPDVVAKNRIKETDVIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 707
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PYTFP K V N RVAEVW+DE+++ +Y
Sbjct: 708 CGGEIEIIPCSRVGHIFRNDNPYTFPKDRMKTVERNLVRVAEVWLDEYKELFYG 761
>gi|74004468|ref|XP_535940.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Canis lupus familiaris]
Length = 632
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 236
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 237 EYIKQFSIV------KIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 290
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 291 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQRRK 350
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 351 -DETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 409
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 410 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 454
>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
Length = 545
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 124/288 (43%), Positives = 175/288 (60%), Gaps = 20/288 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCP 59
CK K YP +LP TS+++ F +EA+S ++R+V S+INR+P LL E+ILVDD S R +
Sbjct: 88 CKTKKYPEYLPPTSVIMCFTDEAFSAVMRSVHSIINRTPPHLLAEVILVDDNSTRAELKG 147
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE--------- 110
+D + + + +R + R + V P++ + D E
Sbjct: 148 HLDDYVRRQVGWDKVKVVHLEKREGLIRCRLRGAEKAVGPVLTFL-DAHIECNVGWVEPL 206
Query: 111 ----YITAKTVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ VV PII+ I D+TFEY ++ GGF+W+L+F W +P E+ R
Sbjct: 207 LHRIWENRSNVVMPIIEAIDDKTFEYHGGVQSSRYAQRGGFSWELHFDWRVIPEYEIKRW 266
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GD ++P+R+PTMAGGLF+IDK YFYELG+YD+ MD WGGENLE+SF++W CGG LE P
Sbjct: 267 KGDETTPIRSPTMAGGLFSIDKSYFYELGTYDDKMDTWGGENLELSFKIWMCGGTLEQPP 326
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGHVFR +PY+ P G K + N RV EVW+D ++D +YA+NP
Sbjct: 327 CSKVGHVFRSSAPYSNPSG-PKTFIRNTLRVVEVWLDSYKDLFYALNP 373
>gi|410968769|ref|XP_003990872.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3 [Felis catus]
Length = 633
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|148706465|gb|EDL38412.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_a [Mus
musculus]
Length = 515
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/282 (45%), Positives = 169/282 (59%), Gaps = 16/282 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 63 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 122
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 123 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEVL 182
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 183 QDYTR---VVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSLEQKALRL-DPTE 238
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 239 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 298
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 299 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 340
>gi|443720685|gb|ELU10336.1| hypothetical protein CAPTEDRAFT_176696 [Capitella teleta]
Length = 587
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 125/285 (43%), Positives = 171/285 (60%), Gaps = 26/285 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ + YP LP+ S+VI F+NEAWS LLRTV S+I+R+P LL EIILVDD S+
Sbjct: 111 CQSEEYPAELPSASVVICFYNEAWSVLLRTVHSIIDRTPSALLHEIILVDDFSD------ 164
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYI 112
+D +++Q Y++ RE + H V +D + E+I
Sbjct: 165 LDHLAEQLDAYVSEHLPQTKLVRNTRREGLIRARVIGSEHATGEVLVFLDSHCEVNVEWI 224
Query: 113 ---------TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
K V PIID+I TF Y +S + GGFNW L +RW ++P ++R+
Sbjct: 225 QPLLSHIHGNHKRVAVPIIDIIDQDTFRY-ESSPLVRGGFNWGLFYRWDQIP-ESLLRKQ 282
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D P++TPTMAGGLFA+++ YF +LG YD GMD+WGGENLE+SFRVWQCGG + I+PC
Sbjct: 283 EDYVKPIKTPTMAGGLFAMNRKYFNDLGRYDTGMDVWGGENLEISFRVWQCGGSMHILPC 342
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
S VGH+FR + PY P GV I N+ RVA VWMDE+ +++ +
Sbjct: 343 SRVGHIFRKRRPYGSPVGVDTIT-KNSLRVAHVWMDEYIKYFFQV 386
>gi|410909548|ref|XP_003968252.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Takifugu rubripes]
Length = 580
Score = 234 bits (598), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/291 (45%), Positives = 171/291 (58%), Gaps = 33/291 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ + YP LP S+VI F NEA S LLRTV SV++R+ LL EIILVDD SE
Sbjct: 114 CRDRIYPRDLPPASVVICFFNEALSALLRTVHSVLDRTAPFLLHEIILVDDYSE------ 167
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------HKKTVVCPIIDV---IS 105
++ + Y+ A G LR + R H V +D ++
Sbjct: 168 LEELKGDLDRYVQAE---LQGKVKVLRNQRREGLIRGRMIGAAHASGQVLVFLDSHCEVN 224
Query: 106 DQTFEYITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
E + A +TVVCP+ID+IS T Y + S + GGFNW L+F+W VPP E+
Sbjct: 225 QMWLEPLLASIHEDRRTVVCPVIDIISADTLSY-SPSPIVRGGFNWGLHFKWDPVPPSEL 283
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
G P+R+PTMAGGLFAI++ YF E+G YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 284 KSPKGP-VDPIRSPTMAGGLFAINRKYFNEMGQYDAGMDIWGGENLEISFRIWMCGGQLL 342
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY PGG + HN+ R+A VWMDE+++ Y +M P
Sbjct: 343 IIPCSRVGHIFRKRRPYGSPGGQDTMA-HNSLRLAHVWMDEYKEQYLSMRP 392
>gi|47216191|emb|CAG01225.1| unnamed protein product [Tetraodon nigroviridis]
Length = 586
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 129/284 (45%), Positives = 169/284 (59%), Gaps = 23/284 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C SY LP+TSI+I FHNEA STLLRTV SV+ RSP +L++EIIL+DD +S+ C
Sbjct: 152 CASISYDPELPSTSIIITFHNEARSTLLRTVKSVLMRSPPSLIQEIILIDDFSSDPEDCQ 211
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT------ 113
++ I + ++ G +R + R PI+ + D E T
Sbjct: 212 LLVHIP----KVRCLRNVRREGL---IRSRVRGANAASAPILTFL-DSHCEVNTDWLQPM 263
Query: 114 -------AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D
Sbjct: 264 IQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMARS-DP 322
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ P+RTP +AGG+F +DK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS V
Sbjct: 323 TQPIRTPVIAGGIFVMDKSWFNRLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSRV 382
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PY FP G + + N R AEVWMDE++ +YY+ P
Sbjct: 383 GHVFRKRHPYEFPEGNALTYIRNTRRAAEVWMDEYKQYYYSARP 426
>gi|432934600|ref|XP_004081948.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Oryzias
latipes]
Length = 600
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 131/294 (44%), Positives = 173/294 (58%), Gaps = 55/294 (18%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS
Sbjct: 153 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDAS---------------- 196
Query: 70 EYITASDMTWGGFNWKLRE-------KNRHKKTVVCPII---DVISDQTFEYITAK---- 115
+ G F L++ + R +K ++ + V + T ++ A
Sbjct: 197 ---VDGKNSMGPFRTYLKKLSIVRVVRQRERKGLITARLLGASVATGDTLTFLDAHCECF 253
Query: 116 ----------------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVP 155
VV P I I TFE++ S + G F+W L+F W +P
Sbjct: 254 NGWLEPLLARIAQNYTAVVSPDISTIDLNTFEFMKPSPYGQNHNRGNFDWGLSFGWESLP 313
Query: 156 PREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCG 215
E RR D + P++TPT AGGLF+I K+YFY++GSYDE M+IWGGEN+EMSFRVWQCG
Sbjct: 314 DHEKQRRK-DETYPIKTPTFAGGLFSISKEYFYQIGSYDEEMEIWGGENIEMSFRVWQCG 372
Query: 216 GILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
G LEIIPCS VGHVFR KSP+TFP G ++++ N R+AEVWMD++++ +Y N
Sbjct: 373 GQLEIIPCSVVGHVFRTKSPHTFPKG-TQVIARNQVRLAEVWMDDYKEIFYRRN 425
>gi|13929126|ref|NP_113984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Rattus norvegicus]
gi|51315691|sp|O88422.1|GALT5_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|3510639|gb|AAC69708.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T5 [Rattus
norvegicus]
gi|149047792|gb|EDM00408.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
gi|149047793|gb|EDM00409.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
Length = 930
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 124/297 (41%), Positives = 170/297 (57%), Gaps = 50/297 (16%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WS LLR+V SV+NRSP L+KEI+LVDD S
Sbjct: 476 CAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFS------- 528
Query: 61 IDVISDQTFEYITAS-DMTWGGFNWK--LREKNRH---------------------KKTV 96
T +Y+ A+ D F LR K RH V
Sbjct: 529 -------TKDYLKANLDKYMSQFPKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHV 581
Query: 97 VC------PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFR 150
C P+++ + Y+ K V CP+I+VI+D+ Y+T + G F W +NF
Sbjct: 582 ECNVGWLEPLLERV------YLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFG 635
Query: 151 WYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFR 210
W +PP + + G + +R P MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF+
Sbjct: 636 WRTIPPDVIAKNGIKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFK 695
Query: 211 VWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
VW CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 696 VWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 752
>gi|158749624|ref|NP_766443.2| polypeptide N-acetylgalactosaminyltransferase 5 [Mus musculus]
gi|341940730|sp|Q8C102.2|GALT5_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|148694985|gb|EDL26932.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 125/297 (42%), Positives = 169/297 (56%), Gaps = 50/297 (16%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WS LLR+V SV+NRSP L+KEI+LVDD S
Sbjct: 476 CAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFS------- 528
Query: 61 IDVISDQTFEYITAS-DMTWGGFNWK--LREKNRH---------------------KKTV 96
T EY+ A D F LR K RH V
Sbjct: 529 -------TKEYLKADLDKYMSQFPKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHV 581
Query: 97 VC------PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFR 150
C P+++ + Y+ K V CP+I+VI+D+ Y+T + G F W +NF
Sbjct: 582 ECNVGWLEPLLERV------YLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFG 635
Query: 151 WYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFR 210
W +PP + + G + +R P MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF+
Sbjct: 636 WKTIPPDVVAKNGIKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFK 695
Query: 211 VWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
VW CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+D++R+ +Y
Sbjct: 696 VWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDDYRELFYG 752
>gi|109732606|gb|AAI16333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 125/297 (42%), Positives = 169/297 (56%), Gaps = 50/297 (16%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WS LLR+V SV+NRSP L+KEI+LVDD S
Sbjct: 476 CAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFS------- 528
Query: 61 IDVISDQTFEYITAS-DMTWGGFNWK--LREKNRH---------------------KKTV 96
T EY+ A D F LR K RH V
Sbjct: 529 -------TKEYLKADLDKYMSQFPKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHV 581
Query: 97 VC------PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFR 150
C P+++ + Y+ K V CP+I+VI+D+ Y+T + G F W +NF
Sbjct: 582 ECNVGWLEPLLERV------YLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFG 635
Query: 151 WYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFR 210
W +PP + + G + +R P MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF+
Sbjct: 636 WKTIPPDVVAKNGIKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFK 695
Query: 211 VWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
VW CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+D++R+ +Y
Sbjct: 696 VWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDDYRELFYG 752
>gi|444724231|gb|ELW64842.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Tupaia chinensis]
Length = 654
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA+S LLRTV SVI+R+P LL E+ILV
Sbjct: 141 CKGKSYPADLPVASVVICFYNEAFSALLRTVHSVIDRTPARLLHEVILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI ++ E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNKKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S GGFNW L+F+W VP E+
Sbjct: 261 AIREDR------RTVVCPVIDIISADTLAY-SSSPAVRGGFNWGLHFKWDLVPLSELAGA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GG ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 GGA-TAPIKSPTMAGGLFAMNRQYFSELGQYDSGMDIWGGENLEISFRIWMCGGQLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|118093614|ref|XP_422023.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Gallus
gallus]
Length = 632
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 173/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDAS------VDEYLHDKLD 236
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 237 EYVKQFQIV------KVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLL 290
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 291 ARIAENSVAVVSPDIASIDLNTFEFSKPSPYGHNHNRGNFDWSLSFGWESLPKYENKRRK 350
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT AGGLF+I K+YF +GSYD+ M+IWGGEN+EMSFRVWQCGG+LEI+PC
Sbjct: 351 -DETYPIRTPTFAGGLFSISKEYFEHIGSYDDEMEIWGGENIEMSFRVWQCGGLLEIMPC 409
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 410 SVVGHVFRSKSPHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRN 454
>gi|449274705|gb|EMC83783.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Columba livia]
Length = 502
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 135/308 (43%), Positives = 177/308 (57%), Gaps = 25/308 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C Y T LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 60 CTSVRYDTDLPATSLIITFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFSSDPEDCQ 119
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII---DVISDQTFEYITA-- 114
++ I T + +R + R + I+ D + E++
Sbjct: 120 LLTKIPKVKCLRNTRREGL-------IRSRVRGAEVATADILTFLDSHCEVNSEWLQPML 172
Query: 115 -------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D +
Sbjct: 173 QRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DPT 231
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 232 QSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 291
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHF 283
HVFR + PY FP G + + N R AEVWMDE++ +YY P GKS SV+
Sbjct: 292 HVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKSFGSVAERVEQR 351
Query: 284 RMLSYSSW 291
R L+ S+
Sbjct: 352 RKLNCKSF 359
>gi|432882423|ref|XP_004074023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryzias latipes]
Length = 584
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 133/294 (45%), Positives = 172/294 (58%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ K + LPTTS++I F+NEAWSTLLRT+ SV+ +P LLKEIIL+DD S+R
Sbjct: 130 CRNKKFDYRHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILIDDYSDR---- 185
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS------- 105
+ Q EYI+ +L N+ + V +I DV++
Sbjct: 186 --GYLKSQLAEYISNLQRV------RLIRTNKREGLVRARLIGATYATGDVLTFLDCHCE 237
Query: 106 ------DQTFEYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
+ E I A T+VCP+ID I +FE Y+ + GGF+W+L F+W+ VP
Sbjct: 238 CVPGWIEPLLERIAENASTIVCPVIDTIDWNSFEFYMQTGEPMIGGFDWRLTFQWHSVPE 297
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E RR R+ P R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 298 SERKRRKS-RTDPFRSPTMAGGLFAVSKVYFEYLGTYDMGMEVWGGENLELSFRVWQCGG 356
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF K+PY P L N R AEVWMD ++ +Y NP
Sbjct: 357 SLEIHPCSHVGHVFPKKAPYARPN-----FLQNTVRAAEVWMDSYKHHFYNRNP 405
>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Acyrthosiphon pisum]
Length = 674
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 128/285 (44%), Positives = 177/285 (62%), Gaps = 27/285 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC- 58
CKK Y LP TS+++ FHNEAWS LLRTV S+++RSP L++EIILVDD S+
Sbjct: 182 CKKPGRYLDNLPQTSVIVCFHNEAWSVLLRTVHSILDRSPEHLIREIILVDDFSDMPHLK 241
Query: 59 ----------PIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
P I ++ + E + + + + L + H + + P++D
Sbjct: 242 TQLEEYSENYPKIKIVRAKKREGLIRARLMGARYASAPVLTYLDSHCECTEGWLEPLLDR 301
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQT--FEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ + A TVVCP+IDVI D T F Y A + GGF+W L F W+ VP +E +
Sbjct: 302 IARE------ASTVVCPVIDVIDDSTLEFHYRDAGGVNVGGFDWNLQFNWHVVPDKEK-K 354
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLFAIDK +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 355 RHKNAAEPVWSPTMAGGLFAIDKKFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 414
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMD++ +YY
Sbjct: 415 PCSHVGHIFRKRSPYKWRTGVN-VLKKNSIRLAEVWMDDYAKYYY 458
>gi|327262637|ref|XP_003216130.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Anolis carolinensis]
Length = 500
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 168/286 (58%), Gaps = 19/286 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L+ EIILVDD S+
Sbjct: 77 CTTLHYRTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPVHLVHEIILVDDFSDDPDDCR 136
Query: 56 --VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ P + + ++ E I ++M L K + P++ I +
Sbjct: 137 LLIKLPKVKCLRNRRREGLIRSRIRGAEMAEAEVLTFLDSHCEVNKDWLLPLLQRIKED- 195
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P+ID+I+ TF Y+ AS GGF+W L+F+W ++ P++ +R D +
Sbjct: 196 -----PSHVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQLSPKQKAKRT-DPTE 249
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 250 PIKTPIIAGGLFVIDKAWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGH 309
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 310 VFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEYKQYYYAARPAAQG 355
>gi|291391661|ref|XP_002712292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oryctolagus cuniculus]
Length = 633
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 129/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRT+ SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTIHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 493
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 124/277 (44%), Positives = 162/277 (58%), Gaps = 24/277 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TS+VI FHNEA S LLRT+ SV+NRSP L++EIILVDD S+ D +
Sbjct: 120 LPATSVVITFHNEARSALLRTIVSVLNRSPAELIEEIILVDDFSD-------DPSDGEEL 172
Query: 70 EYITASDMTWGGFNWKL-REKNRHKKTVVCPIIDVISDQTFEYITA-------------K 115
I + L R + R + P++ + D E +
Sbjct: 173 AKIQKIRLLRNTQREGLVRSRVRGARAAKAPVLTFL-DSHCECNQGWLPPLLRRVKEDPR 231
Query: 116 TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTM 175
VVCP+IDVI+ ++F+Y AS GGFNW L F+W + +E R + + P+RTP +
Sbjct: 232 RVVCPVIDVINLESFKYFGASSDLRGGFNWNLVFKWEFLSNKEREERANNPTLPIRTPMI 291
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF +D+ F LG+YD MDIWGGENLE+SFR WQCGG LEI+PCS VGHVFR + P
Sbjct: 292 AGGLFVVDRAQFERLGAYDTAMDIWGGENLELSFRAWQCGGSLEILPCSRVGHVFRKQHP 351
Query: 236 YTFPGGVSKIVLH--NAARVAEVWMDEWRDFYYAMNP 270
Y+FPGG + N R AEVWMD+++ +YYA P
Sbjct: 352 YSFPGGSGNVFARQANTRRAAEVWMDDYKKYYYATVP 388
>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
(Silurana) tropicalis]
gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
Length = 582
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 174/294 (59%), Gaps = 42/294 (14%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK K++ LPTTS+VI F+NEA STLLRT+ SV+ SP LL+EIILVDD S++V
Sbjct: 128 CKSKTFNYRKLPTTSVVIAFYNEALSTLLRTIHSVLETSPAVLLREIILVDDFSDKVY-- 185
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS--DQTFE 110
+ Q +YI D +L + + V II DV++ D E
Sbjct: 186 ----LKSQLEDYIGGLDRV------RLIRTTKREGLVRARIIGATYAIGDVLTFLDCHCE 235
Query: 111 YITA-------------KTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPP 156
I+ VVCP+ID I TFE Y+ + GGF+W+L F+W+ VP
Sbjct: 236 CISGWLEPLLQRIGENETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHAVPE 295
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR R P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG
Sbjct: 296 KERQRRKS-RIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGG 354
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEI PCSHVGHVF K+PY P L N AR AEVWMD +++ +Y NP
Sbjct: 355 TLEIEPCSHVGHVFPKKAPYARPN-----FLQNTARAAEVWMDGYKELFYNRNP 403
>gi|432096894|gb|ELK27469.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Myotis davidii]
Length = 940
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 173/288 (60%), Gaps = 32/288 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C K+ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAKQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---KKTVVCPII---DVIS--DQTFE-- 110
D + D +Y++ L K RH + + I DV++ D E
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILHLKERHGLIRARLAGAQIATGDVLTFLDSHVECN 594
Query: 111 -----------YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
Y++ K V CP+I+VI+D+ Y+T + G F W +NF W +PP +
Sbjct: 595 IGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVI 654
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
+ + +R P MAGGLF+IDK+YFYELG+YD G+D+WGGEN+E+SF+VW CGG +E
Sbjct: 655 AKNRIKETDVIRCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIE 714
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
IIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 715 IIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|405959954|gb|EKC25926.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 569
Score = 234 bits (597), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/285 (44%), Positives = 171/285 (60%), Gaps = 27/285 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C +Y LP SI++ FHNEAWS L+R+V+S++NR+P +LLKE+ILVDD S E +
Sbjct: 114 CHNLTYSENLPEVSIIVTFHNEAWSVLIRSVYSILNRTPDSLLKEVILVDDFSSLEHLKE 173
Query: 59 PIIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ + Q + + A++ G +LR V+ P+ID
Sbjct: 174 PLDQFMEQFQKVKIVRATERQ-GLIRARLRGYREAVGDVLVFLDSHIECAEGWFEPLIDP 232
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TV+ P+IDVI +TF+Y AS GGF+W L F W+ VP E R
Sbjct: 233 IAR------NWSTVMTPVIDVIDKETFQYGFQAASATNVGGFDWSLMFTWHFVPETEQKR 286
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R P+R+PTMAGGLFAI + YF +G+YDEGMDIWGGENLE+SFR+W CGG L
Sbjct: 287 RQNKHYLPVRSPTMAGGLFAISRKYFEHIGTYDEGMDIWGGENLELSFRIWMCGGTLLTA 346
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
PCSHVGHVFR PY+F G +V +N R+AEVW+D+++ +YY
Sbjct: 347 PCSHVGHVFRHTPPYSF-GPKKNVVKNNLVRMAEVWLDDFKYYYY 390
>gi|410968689|ref|XP_003990834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Felis catus]
Length = 939
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 172/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 485 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 539
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 540 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 593
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 594 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 647
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 648 IPPDVVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 707
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PYTFP K V N RVAEVW+DE+++ +Y
Sbjct: 708 CGGEIEIIPCSRVGHIFRNDNPYTFPKDRMKTVERNLVRVAEVWLDEYKELFYG 761
>gi|432110716|gb|ELK34193.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Myotis davidii]
Length = 466
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 134/291 (46%), Positives = 176/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LPTTSI+I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 10 CKEKKYDYDQLPTTSIIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 69
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + K V+ P++
Sbjct: 70 ERLENELSKLPKVRLIRANKREGLVRARLLGASAAKGQVLTFLDCHCECHEGWLEPLLQR 129
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + + GGF+W+L F W+ VP RE MR
Sbjct: 130 IQEE------ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPERERMRM 183
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 184 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLE 239
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ K L N+ R AEVWMDE+++ YY NP
Sbjct: 240 THPCSHVGHVFPKQAPYS-----RKKALANSVRAAEVWMDEFKELYYHRNP 285
>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 683
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 180/319 (56%), Gaps = 61/319 (19%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
CK + Y LP+TS+++ FHNEAWS LLRTV S+I+RSP LL EIILVDD S
Sbjct: 190 CKDERYLKDLPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIILVDDYSDMPHLKQ 249
Query: 54 --ERVVC--PIIDVISDQTFEYI--------------------TASDMTWGGFNWKLREK 89
E V P + ++ Q E + + + T G L
Sbjct: 250 KLEDYVAHFPKVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCECTEGWLEPLLDRI 309
Query: 90 NRHKKTVVC--------------------PIIDVISDQTFEYITAKTVVCPIIDVISDQT 129
R+ TV P++D I+ + TVVCP+IDVISD T
Sbjct: 310 ARNSTTVXATAPVLTYLDSHCECTEGWLEPLLDRIAR------NSTTVVCPVIDVISDST 363
Query: 130 FEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYF 187
FEY + + GGF+W L F W+ VP RE RR P+ +PTMAGGLF+IDK +F
Sbjct: 364 FEYHYRDSGGVNVGGFDWNLQFSWHAVPERERQRRK-HSWDPVWSPTMAGGLFSIDKAFF 422
Query: 188 YELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVL 247
+LG+YD G DIWGGENLE+SF+ W CGG LEI+PCSHVGH+FR +SPY + GV+ ++
Sbjct: 423 EKLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VLR 481
Query: 248 HNAARVAEVWMDEWRDFYY 266
N+ R+AEVW+DE++ +YY
Sbjct: 482 RNSVRLAEVWLDEYKQYYY 500
>gi|397507787|ref|XP_003824367.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
paniscus]
Length = 633
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|348585909|ref|XP_003478713.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Cavia porcellus]
Length = 633
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + ++
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDDYLHEKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQKERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 292 ARIADNYTAVVSPDIASIDLNTFEFNKPSPYGTNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
Length = 650
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 129/289 (44%), Positives = 176/289 (60%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK ++ Y T LP T ++I FHNEAW+ LLRTV SV++RSP L+ +IILVDD S+
Sbjct: 198 CKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLK 257
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + +I Q E + + + G N H K+ V +D + T
Sbjct: 258 RQLEDYFAAYPKVQIIRGQKREGLIRARIL--GAN--------HAKSPVLTYLDSHCECT 307
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVISD+T EY + + GGF+W L F W+ VP R
Sbjct: 308 EGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPER 367
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E +R + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 368 ER-KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI+PCSHVGH+FR +SPY + GV+ + N+ R+AEVWMDE+ YY
Sbjct: 427 LEIVPCSHVGHIFRKRSPYKWRSGVN-VPKKNSVRLAEVWMDEYSQCYY 474
>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
Length = 647
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 129/283 (45%), Positives = 173/283 (61%), Gaps = 23/283 (8%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK+ Y T LP TS++I FHNEAWS LLRTV SV++RSP L+KE+ILVDD S+ P
Sbjct: 195 CKEPGRYGTDLPATSVIICFHNEAWSVLLRTVHSVLDRSPEHLVKEVILVDDFSD---MP 251
Query: 60 IIDVISDQTFE-YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT----- 113
+ FE Y + +R + + P++ + D E T
Sbjct: 252 HTQKQLEDYFEAYPRVKIIRAPKREGLIRARLLGARYATAPVLTYL-DSHCECTTGWLEP 310
Query: 114 --------AKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +R
Sbjct: 311 LLDRIARNSTTVVCPVIDVIDDNTMEYHYRDSGGVNVGGFDWNLQFNWHAVPDREK-KRH 369
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
+ P+ +PTMAGGLF+IDK++F LG+YD G DIWGGENLE+SF+ W CGG LEI+PC
Sbjct: 370 KSTAEPVFSPTMAGGLFSIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPC 429
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
SHVGH+FR +SPY + GV+ ++ N+ R+AEVW+DE+ +YY
Sbjct: 430 SHVGHIFRKRSPYKWRTGVN-VIKRNSVRLAEVWLDEYAKYYY 471
>gi|32698686|ref|NP_055383.1| polypeptide N-acetylgalactosaminyltransferase 5 [Homo sapiens]
gi|51315940|sp|Q7Z7M9.1|GALT5_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|30841528|gb|AAP34404.1| GalNAc-T5 [Homo sapiens]
gi|119631854|gb|EAX11449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|148745655|gb|AAI42677.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|158257740|dbj|BAF84843.1| unnamed protein product [Homo sapiens]
Length = 940
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SVINRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
Length = 685
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 185/308 (60%), Gaps = 52/308 (16%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
CK + Y LP TS+++ FHNEAWS LLRTV SV+ R+P LL+EI+LVDD S+
Sbjct: 168 CKVEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPEHLLEEIVLVDDFSDMDHTKR 227
Query: 59 PIIDVISD--------------------------QTFEYITASD-----MTWGGFNWKLR 87
P+ + +S T E +T D M ++R
Sbjct: 228 PLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECMEGKETENRVR 287
Query: 88 EKNRH-KKTVVCPIIDVIS-DQTFEYITAKTVVCPIIDVISDQTFEYITASD--MTWGGF 143
+N+ KK + P++D I D T TVVCP+IDVI D TFEY + + GGF
Sbjct: 288 TRNKKCKKRWIEPLLDRIKRDPT-------TVVCPVIDVIDDNTFEYHHSKAYFTSVGGF 340
Query: 144 NWKLNFRWYRVPPREMMRRGGDRS-SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGG 202
+W L F W+ +P R+ R+ R+ P+R+PTMAGGLF+IDK YF +LG+YD G DIWGG
Sbjct: 341 DWGLQFNWHSIPERD--RKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGG 398
Query: 203 ENLEMSFRV----WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWM 258
ENLE+SF+V W CGG LEI+PCSHVGHVFR +SPY + GV+ ++ N+ R+AEVW+
Sbjct: 399 ENLELSFKVRKCIWMCGGTLEIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWL 457
Query: 259 DEWRDFYY 266
D+++ +YY
Sbjct: 458 DDYKTYYY 465
>gi|431909863|gb|ELK12965.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Pteropus alecto]
Length = 543
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 135/291 (46%), Positives = 176/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+ Y LPTTS+VI F+NEAWSTLLRT++SV+ SP TLL+E+ILVDD S+R +
Sbjct: 87 CKETKYDYDHLPTTSVVIAFYNEAWSTLLRTIYSVLETSPDTLLEEVILVDDYSDREHLK 146
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + + G +L + K V+ P++
Sbjct: 147 ERLANELSGLPKVRLIRAHKREGLVRARLLGASAAKGDVLTFLDCHCECHEGWLEPLLQR 206
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE MR
Sbjct: 207 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPHIGGFDWRLVFTWHVVPTRERMRM 260
Query: 163 GGDRSSPL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 261 ----RSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLE 316
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
I PCSHVGHVF ++PY+ K L N+ R AEVWMDE+++ YY NP
Sbjct: 317 IHPCSHVGHVFPKQAPYS-----RKKALANSVRAAEVWMDEFKELYYHRNP 362
>gi|153266878|ref|NP_004473.2| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
gi|209572629|sp|Q14435.2|GALT3_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|62822129|gb|AAY14678.1| unknown [Homo sapiens]
gi|109731077|gb|AAI13568.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|109731742|gb|AAI13566.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|119631729|gb|EAX11324.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3), isoform
CRA_b [Homo sapiens]
gi|313883200|gb|ADR83086.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3)
[synthetic construct]
Length = 633
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|114581503|ref|XP_515871.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
troglodytes]
gi|410331347|gb|JAA34620.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Pan
troglodytes]
Length = 633
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|332234083|ref|XP_003266237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Nomascus leucogenys]
Length = 633
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Callithrix jacchus]
Length = 633
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 129/286 (45%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAVLLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGASVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSHHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|432098371|gb|ELK28171.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Myotis davidii]
Length = 633
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 129/286 (45%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ +P LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSTPAILLKEIILVDDAS------VAEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFPIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNRGNFDWSLSFGWEALPDHERQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|348519859|ref|XP_003447447.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oreochromis niloticus]
Length = 624
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 176/286 (61%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS + D + D+
Sbjct: 178 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDAS------VDDELKDKLD 231
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ ++ ++ + K + ++ V + T ++ A
Sbjct: 232 DYLKQLNIV------RVMRQRERKGLITARLLGASVATGDTLTFLDAHCECFNGWLEPLL 285
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE++ S + G F+W L+F W +P E RR
Sbjct: 286 ARIAENYTAVVSPDITTIDLNTFEFMKPSPYGQNHNRGNFDWSLSFGWESLPDHEKRRRK 345
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YFY +GSYDE M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 346 -DETYPIKTPTFAGGLFSISKEYFYRIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIIPC 404
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMD++++ +Y N
Sbjct: 405 SIVGHVFRTKSPHTFPKG-TQVIARNQVRLAEVWMDDYKEIFYRRN 449
>gi|109099998|ref|XP_001096023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Macaca mulatta]
gi|297264195|ref|XP_002798936.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Macaca mulatta]
gi|355564937|gb|EHH21426.1| hypothetical protein EGK_04492 [Macaca mulatta]
gi|355750584|gb|EHH54911.1| hypothetical protein EGM_04018 [Macaca fascicularis]
Length = 633
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|345304811|ref|XP_001505904.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ornithorhynchus anatinus]
Length = 555
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 126/283 (44%), Positives = 166/283 (58%), Gaps = 21/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C YP+ LP TSIVI FHNEA STLLRTV SV+NR+P L++EIILVDD +++ C
Sbjct: 113 CTSAHYPSDLPVTSIVITFHNEARSTLLRTVKSVLNRTPANLVREIILVDDFSADPEDCQ 172
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII---DVISDQTFEYITA-- 114
++ I + + +R + R + I+ D + E++
Sbjct: 173 LLTRIPKVKCLHNNQREGL-------IRSRVRGAEVATADILTFLDSHCEVNSEWLQPLL 225
Query: 115 -------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D +
Sbjct: 226 QRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DPT 284
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 285 QSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 344
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PY FP G + + N R AEVWMD+++ +YY P
Sbjct: 345 HVFRKRHPYDFPEGNALTYIKNTKRAAEVWMDDYKQYYYEARP 387
>gi|189066640|dbj|BAG36187.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEEQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|26325284|dbj|BAC26396.1| unnamed protein product [Mus musculus]
Length = 930
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 124/297 (41%), Positives = 169/297 (56%), Gaps = 50/297 (16%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WS LLR+V SV+NRSP L+KEI+LVDD S
Sbjct: 476 CAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFS------- 528
Query: 61 IDVISDQTFEYITAS-DMTWGGFNWK--LREKNRH---------------------KKTV 96
T EY+ A D F LR K RH V
Sbjct: 529 -------TKEYLKADLDKYMSQFPKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHV 581
Query: 97 VC------PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFR 150
C P+++ + Y+ K V CP+I+VI+D+ Y+T + G F W +NF
Sbjct: 582 ECNVGWLEPLLERV------YLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFG 635
Query: 151 WYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFR 210
W +PP + + G + +R P MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF+
Sbjct: 636 WKTIPPDVVAKNGIKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFK 695
Query: 211 VWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
VW CGG +E+IPCS VGH+FR+ +PY+FP K V N RVAEVW+D++R+ +Y
Sbjct: 696 VWMCGGEIELIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDDYRELFYG 752
>gi|402888519|ref|XP_003907606.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Papio
anubis]
Length = 633
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|297668747|ref|XP_002812581.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Pongo abelii]
gi|297668749|ref|XP_002812582.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Pongo abelii]
gi|297668751|ref|XP_002812583.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
3 [Pongo abelii]
Length = 633
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|1617312|emb|CAA63371.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
(GalNAc-T3) [Homo sapiens]
Length = 633
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 175/297 (58%), Gaps = 40/297 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLD 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYVKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCA 280
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N + V A
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKA 466
>gi|326434666|gb|EGD80236.1| polypeptide N-acetylgalactosaminyltransferase 13 [Salpingoeca sp.
ATCC 50818]
Length = 641
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 128/285 (44%), Positives = 171/285 (60%), Gaps = 31/285 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCPIIDVISDQT 68
+P TS++I + NEAWSTLLRTVWSV+NRSP L++EIIL+DDAS+ + +D +
Sbjct: 201 MPKTSVIICYVNEAWSTLLRTVWSVLNRSPPELIEEIILLDDASDAEWLGEKLDTYVREH 260
Query: 69 FE---YITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEY 111
F I S G +L K V+ PI+D+I+
Sbjct: 261 FPSHVRIVRSPDRLGLIRARLLGAKHAKGPVMTFLDSHCEANQGWLEPILDIIA------ 314
Query: 112 ITAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
TVV P+ID I +T EY T++ + G F+W L+F W ++R G + P
Sbjct: 315 TNRTTVVTPVIDTIDHRTMEYAKWTSNIPSVGTFDWTLDFNW----KSGVLRPGQKLTDP 370
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+ +PTMAGGLFAID+DYFYE+GSYDE MD WGGEN+EMSFR+WQCGG L PCSHVGH+
Sbjct: 371 IDSPTMAGGLFAIDRDYFYEIGSYDEDMDGWGGENVEMSFRIWQCGGRLVTAPCSHVGHI 430
Query: 230 FRDKSPYTFPG-GVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKS 273
FRD PY PG G+ + N+ R+AEVWMD+++ F+Y P +
Sbjct: 431 FRDTHPYKVPGKGIHHTFMKNSMRLAEVWMDDYKQFFYDTKPKRE 475
>gi|327281948|ref|XP_003225707.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Anolis carolinensis]
Length = 574
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 177/307 (57%), Gaps = 23/307 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVC- 58
C Y LP+TSI+I FHNEA STLLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 132 CASIHYGADLPSTSIIITFHNEARSTLLRTVTSVLNRTPANLIQEIILVDDFSSDPEDCQ 191
Query: 59 -----PIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + + + E + +DM L + P++ + +
Sbjct: 192 LLTKIPKVKCLRNNRREGLIRSRVRGADMATADILTFLDSHCEVNSEWLQPMLQRVKE-- 249
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+Y VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + + R D +
Sbjct: 250 -DYTR---VVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKLSRT-DPTQ 304
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 305 SIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGH 364
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAHFR 284
VFR + PY FP G + + N R AEVWMDE++ +YY P GKS S++ R
Sbjct: 365 VFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKSFGSIADRVDQRR 424
Query: 285 MLSYSSW 291
L+ S+
Sbjct: 425 KLNCKSF 431
>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 533
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 130/283 (45%), Positives = 174/283 (61%), Gaps = 23/283 (8%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK+ Y LP TS+VI FHNEAWS LLRTV SVI+RSP L+KEIILVDD S+ ++
Sbjct: 40 CKQPGRYLEDLPQTSVVICFHNEAWSVLLRTVHSVIDRSPAHLIKEIILVDDFSDMPHLM 99
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEY 111
+ D +S I + G +R + K V P++ + E
Sbjct: 100 QQLDDYMSSLPKVRIVRATQREG----LIRARLLGAKYVTAPVLTYLDSHCECTEGWLEP 155
Query: 112 ITAK------TVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + VVCP+IDVI D T EY ++ + GGF+W L F W+ VP RE R
Sbjct: 156 LLDRIARNKTNVVCPVIDVIDDNTLEYHYRDSTSVNVGGFDWNLQFNWHPVPARERARHK 215
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
+ P+ +PTMAGGLFAIDK++F LG+YD G DIWGGENLE+SF+ W CGG LEI+PC
Sbjct: 216 -HTAEPVWSPTMAGGLFAIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPC 274
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
SHVGH+FR +SPY + GV+ ++ N+ R+AEVW+D++ +YY
Sbjct: 275 SHVGHIFRKRSPYKWRTGVN-VLKKNSVRLAEVWLDDYSKYYY 316
>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
Length = 670
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 131/299 (43%), Positives = 176/299 (58%), Gaps = 30/299 (10%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK + Y + LP T ++I FHNEAWS L+RTV SV++RSP L+ EIILVDD S+ +
Sbjct: 218 CKDSALYLSNLPKTDVIICFHNEAWSVLIRTVHSVLDRSPPELIGEIILVDDFSDMPHLK 277
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D + I G +L K V+ P++D
Sbjct: 278 KQLEDYFASYPKVKIVRGPQREGLIRARLLGAEYAKSPVITYLDSHCECAEGWLEPLLDR 337
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQT--FEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ + TVVCP+IDVI D T F Y +S + GGF+W L F W+ VP RE +
Sbjct: 338 IARNS------TTVVCPVIDVIDDTTLEFHYRDSSGVNVGGFDWNLQFSWHAVPEREK-K 390
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R S P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 391 RHNSTSEPVYSPTMAGGLFSIDRKFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 450
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVS 277
PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMD++ +YY M+ G VS
Sbjct: 451 PCSHVGHIFRKRSPYKWRTGVN-VLKKNSVRLAEVWMDDYAKYYYQRIGMDKGDFGDVS 508
>gi|348519900|ref|XP_003447467.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oreochromis niloticus]
Length = 777
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 123/289 (42%), Positives = 167/289 (57%), Gaps = 34/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C++ LP+TS++ F +E WSTLLR+V SV+NRSP LLKEIILVDD S +
Sbjct: 325 CEQSLVHDDLPSTSVIFCFVDEVWSTLLRSVHSVLNRSPPHLLKEIILVDDFSTK----- 379
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
D + Q +Y I G +L K V+
Sbjct: 380 -DYLKKQLDDYMAQFPKVRIVRLKERQGLIRARLAGAAVAKGEVLTFLDSHIECNVGWLE 438
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
P+++ + Y+ K V CP+I+VISD+ Y+ + G F W L F W VPP +
Sbjct: 439 PLLERV------YLDRKKVPCPVIEVISDKDMSYMMVDNFQRGIFKWPLVFGWSAVPPED 492
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ + S P+R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF++W CGG +
Sbjct: 493 IKKFNLTISDPIRCPVMAGGLFSIDKQYFFELGTYDPGLDVWGGENMEISFKIWMCGGEI 552
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
EIIPCS VGH+FR ++PY FP K V N ARVAEVW+DE++D +Y
Sbjct: 553 EIIPCSRVGHIFRGQNPYKFPKDRQKTVERNLARVAEVWLDEYKDLFYG 601
>gi|113677422|ref|NP_001038460.1| polypeptide N-acetylgalactosaminyltransferase 14 [Danio rerio]
Length = 554
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 124/283 (43%), Positives = 164/283 (57%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C Y LP+T+IVI FHNEA STLLRTV SV+NR+P L+ EIILVDD SE
Sbjct: 103 CTTLHYDPDLPSTTIVITFHNEARSTLLRTVRSVLNRTPVHLIHEIILVDDFSEDPNDCL 162
Query: 56 --VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + + ++ E + +D L K + P++ + +
Sbjct: 163 LLTKLPKVKCLRNKHREGLIRSRVRGADAAGAQILTFLDSHCEVNKDWLPPLLQRVKED- 221
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+V P+ID+I+ TF Y+ AS GGF+W L+F+W ++ + +R D +
Sbjct: 222 -----PTSVASPVIDIINMDTFAYVAASSDLRGGFDWSLHFKWEQLSAEKRAKRA-DPTE 275
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF ID+ +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 276 PIKTPIIAGGLFVIDRSWFNRLGKYDTAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGH 335
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR K PY FP G + + N R AEVWMDE++ FYY+ P
Sbjct: 336 VFRKKHPYIFPEGNANTYIKNTRRTAEVWMDEFKLFYYSARPA 378
>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
Length = 693
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 130/299 (43%), Positives = 177/299 (59%), Gaps = 30/299 (10%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK + Y + LP T ++I FHNEAWS LLRTV SV++RSP L+ +IILVDD S+ +
Sbjct: 208 CKDSARYLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPPELIGQIILVDDYSDMPHLK 267
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D + I G +L K V+ P++D
Sbjct: 268 KQLEDYFASYPMVQIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPLLDR 327
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQT--FEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ + TVVCP+IDVI D T F Y +S + GGF+W L F W+ VP RE R
Sbjct: 328 IARNS------TTVVCPVIDVIDDTTLEFHYRDSSGVNVGGFDWNLQFSWHAVPEREK-R 380
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+++F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 381 RHNNTAEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 440
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVS 277
PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMD++ +Y M+ G VS
Sbjct: 441 PCSHVGHIFRKRSPYKWRTGVN-VLKKNSVRLAEVWMDDYSKYYLQRIGMDKGDYGDVS 498
>gi|417403257|gb|JAA48441.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 608
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 125/288 (43%), Positives = 178/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK ++YP LP S+VI F+NEA S LLRTV SV++R+P LL+E+ILV
Sbjct: 141 CKDETYPEDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLREVILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWG----GFNWKLREKNRHKKTV-VCPIID 102
D+ + + I VI + E + M G + + T+ + P++
Sbjct: 201 QLDEFVQTQLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNTMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W +PP E+
Sbjct: 261 TIQEDR------RTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLIPPSELGGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
GG ++P+++PTMAGGLFA+++DYF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 GGA-TAPIKSPTMAGGLFAMNRDYFDELGRYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGRDTMA-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
Length = 644
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 127/285 (44%), Positives = 177/285 (62%), Gaps = 27/285 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + Y T LP T ++I FHNEAWS LLRTV SV++RSP L+ ++ILVDD S+
Sbjct: 192 CKDTARYLTDLPKTDVIICFHNEAWSVLLRTVHSVLDRSPEHLIGKVILVDDYSD----- 246
Query: 60 IIDVISDQTFEYITA---SDMTWGGFNWKLREKN----RHKKTVVCPIIDVISDQTFEYI 112
+ + Q +Y TA + G L ++ K+ V +D + T ++
Sbjct: 247 -MPHLKKQLEDYFTAYPKVQIVRGAKREGLIRARILGAQYAKSPVLTYLDSHCECTEGWL 305
Query: 113 ---------TAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMR 161
+ TVVCP+IDVI+D T EY ++ + GGF+W L F W+ VP RE +
Sbjct: 306 EPLLDRIARNSTTVVCPVIDVINDDTLEYHYRDSTGVNVGGFDWNLQFSWHAVPEREK-K 364
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + P+ +PTMAGGLF+ID+D+F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 365 RHNSSAEPVYSPTMAGGLFSIDRDFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 424
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
PCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMD++ +YY
Sbjct: 425 PCSHVGHIFRKRSPYKWRSGVN-VLRKNSVRLAEVWMDDYAQYYY 468
>gi|126341064|ref|XP_001364304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Monodelphis domestica]
Length = 609
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 129/288 (44%), Positives = 176/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
CK+KSYP+ LP SIVI F+NEA+S LLRTV SVI+R+P LL EIILVDD SE
Sbjct: 142 CKEKSYPSDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDDLKG 201
Query: 55 -------RVVCPIIDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIID 102
+ + I V+ ++ E + M L K + P++
Sbjct: 202 ELDKYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQPLLV 261
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 262 PIQEDR------RTVVCPVIDIISADTLMY-SSSPIVRGGFNWGLHFKWDLVPFSELEGP 314
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G +P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 315 EG-AIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 373
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + +N+ R+A VW+DE+++ Y+++ P
Sbjct: 374 CSRVGHIFRKRRPYGSPEGQDTMT-YNSLRLAHVWLDEYKEQYFSLRP 420
>gi|443715013|gb|ELU07165.1| hypothetical protein CAPTEDRAFT_143879 [Capitella teleta]
Length = 390
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 172/287 (59%), Gaps = 23/287 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ KS+ + LP S++I F E+WSTLLR+V SV+NR+P LL+EIILVDD S+R
Sbjct: 56 CRDKSFDYSSLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQR---- 111
Query: 60 IIDVISDQTFEYIT----ASDMTWGGFNWKLREKNRHKKTVVCPIIDVI----------S 105
+ + Y+T + + +R + R + P++ + +
Sbjct: 112 --GHLHAKLDNYLTRLPKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWA 169
Query: 106 DQTFEYIT--AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + I+ + +V P+ID IS + F YI S GGFNW + F+W VP E R G
Sbjct: 170 EPLLQRISHNRRVIVAPVIDAISSRDFSYIPISANQRGGFNWAMLFKWMPVPNYEKSRTG 229
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GD ++P+RTPT+AGGLFAI + +F LG YD G+DIWG ENLE+SF+ W CGG +E+IPC
Sbjct: 230 GDPTAPVRTPTIAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPC 289
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGHV+R PY+FPGG K+ + N RVA VWMD + + +Y M P
Sbjct: 290 SRVGHVYRSTQPYSFPGGNVKVFMRNNLRVANVWMDGYVNLFYLMKP 336
>gi|426221067|ref|XP_004004733.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Ovis
aries]
Length = 938
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 122/294 (41%), Positives = 174/294 (59%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 484 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 538
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH-----------KKT----------VVC- 98
D + D +Y++ LR K RH K T V C
Sbjct: 539 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECN 592
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 593 IGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 646
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 647 IPPDVVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 706
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 707 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLGRVAEVWLDEYKELFYG 760
>gi|395515411|ref|XP_003761898.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Sarcophilus harrisii]
Length = 590
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 133/281 (47%), Positives = 174/281 (61%), Gaps = 18/281 (6%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C++K Y LP TS+VI F+NEAWSTLLRTV+SV+ SP LLKEIILVDD S+R +
Sbjct: 135 CREKKYDYENLPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLKEIILVDDYSDREHLK 194
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
P+ + +S + ++ G +L + ++ C D + E
Sbjct: 195 EPLENHLSGLRKVRLIRANKREGLVRARLLGASIATGEILTFLDCHCECHDGWLEPLLER 254
Query: 112 ITAK--TVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I + VVCP+IDVI TFEY+ S D GGF+W+L F W+ VP +E RR +
Sbjct: 255 IHEEESAVVCPVIDVIDWNTFEYLGNSGDPQIGGFDWRLVFTWHSVPEKEQKRRRS-KID 313
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+R+PTMAGGLFA++K YF LGSYD GM++WGGENLE SFR+WQCGG LEI PCSHVGH
Sbjct: 314 VIRSPTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGH 373
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
VF ++PY+ L N+ R AEVWMDE+++ YY N
Sbjct: 374 VFPKQAPYS-----RSKALANSVRAAEVWMDEFKEIYYHRN 409
>gi|114581297|ref|XP_525944.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Pan troglodytes]
gi|410296312|gb|JAA26756.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
gi|410333399|gb|JAA35646.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
Length = 940
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|327279823|ref|XP_003224655.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Anolis carolinensis]
Length = 941
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 121/280 (43%), Positives = 167/280 (59%), Gaps = 34/280 (12%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTSI++ F +E WSTLLR+V SV+NRSP L+KEIILVDD S + + + D+
Sbjct: 496 LPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPQLIKEIILVDDFSTK------EYLKDKLD 549
Query: 70 EY--------ITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQ 107
+Y I +G +L K V+ P+++ I
Sbjct: 550 KYMAQFPKVRILHLKERYGLIRARLAGAEIAKGDVLTFLDSHVECNVGWLEPLLERI--- 606
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
++ K V CP+I+VISD+ Y+T + G FNW +NF W +PP + + +
Sbjct: 607 ---HLNRKKVPCPVIEVISDKDMSYMTVDNFQRGIFNWPMNFGWKPIPPDVIEKNKIKET 663
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
+R P MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF+VW CGG +EIIPCS VG
Sbjct: 664 DVIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIIPCSRVG 723
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
H+FR +PY+FP V N ARVAEVW+D+++D +Y
Sbjct: 724 HIFRSDNPYSFPKDRLTTVERNLARVAEVWLDDYKDLFYG 763
>gi|219804492|ref|NP_001137331.1| polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
gi|296490560|tpg|DAA32673.1| TPA: polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
Length = 940
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 122/294 (41%), Positives = 174/294 (59%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH-----------KKT----------VVC- 98
D + D +Y++ LR K RH K T V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 IGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EI+PCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIVPCSRVGHIFRNDNPYSFPKDRMKTVERNLGRVAEVWLDEYKELFYG 762
>gi|426337441|ref|XP_004032714.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gorilla
gorilla gorilla]
Length = 940
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|440896773|gb|ELR48609.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Bos grunniens
mutus]
Length = 940
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 122/294 (41%), Positives = 174/294 (59%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH-----------KKT----------VVC- 98
D + D +Y++ LR K RH K T V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 IGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVVAKNKIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EI+PCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIVPCSRVGHIFRNDNPYSFPKDRMKTVERNLGRVAEVWLDEYKELFYG 762
>gi|221042448|dbj|BAH12901.1| unnamed protein product [Homo sapiens]
Length = 527
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 60 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ R
Sbjct: 180 AIREDRH------TVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGRA 232
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 233 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 291
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 292 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 338
>gi|260787295|ref|XP_002588689.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
gi|229273857|gb|EEN44700.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
Length = 415
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 124/291 (42%), Positives = 173/291 (59%), Gaps = 26/291 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C ++ LPTTSI++ F E+WSTLLR+V SVINRSP L++EI+L+DDAS R +
Sbjct: 79 CAEQEVADDLPTTSIIMCFCEESWSTLLRSVHSVINRSPPHLVEEILLIDDASRRSHLKQ 138
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ +S + G +L+ TV+ P++D I
Sbjct: 139 KLDQYMSKFPQVRVVHLKERAGLIRARLKGAELATGTVLTFLDSHIECNVGWLEPLLDRI 198
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ VVCP ID +++ TF Y A++ GGF+W+L F+W +P E RR
Sbjct: 199 REDR------TRVVCPSIDRVNEATFAYEVANENVRGGFDWELFFQWVSLPAVEAKRRTH 252
Query: 165 D--RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ + +R+PTMAGGLF+ID+ +FYELG YD G IWGGENLE+SF++W CGG LEI+P
Sbjct: 253 NVFQHEVIRSPTMAGGLFSIDRGFFYELGGYDPGFQIWGGENLELSFKIWMCGGSLEILP 312
Query: 223 CSHVGHVFRDKSPYTFPGGVS--KIVLHNAARVAEVWMDEWRDFYYAMNPG 271
CS VGHVFR PY + S ++V HN R+AEVW+DE++ YYA++PG
Sbjct: 313 CSRVGHVFRKSQPYNYSNATSIMEVVHHNNVRLAEVWLDEYKKIYYALHPG 363
>gi|334333371|ref|XP_001365881.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Monodelphis domestica]
Length = 757
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 133/281 (47%), Positives = 174/281 (61%), Gaps = 18/281 (6%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS+VI F+NEAWSTLLRTV+SV+ SP LLKE+ILVDD S+R +
Sbjct: 148 CKEKKYDYENLPQTSVVIAFYNEAWSTLLRTVYSVLETSPDILLKEVILVDDYSDREHLK 207
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ + +SD + ++ G +L + ++ C D + E
Sbjct: 208 EQLENHLSDLPKVRLIRANKREGLVRARLLGASIATGEILTFLDCHCECHDGWLEPLLER 267
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I + VVCP+IDVI TFEY+ A + GGF+W+L F W+ VP RE RR +
Sbjct: 268 IHEEESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPQREQKRRRS-QID 326
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+R+PTMAGGLFA++K YF LGSYD GM++WGGENLE SFR+WQCGG LEI PCSHVGH
Sbjct: 327 VIRSPTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGH 386
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
VF ++PY+ L N+ R AEVWMDE+++ YY N
Sbjct: 387 VFPKQAPYS-----RNKALANSVRAAEVWMDEYKEIYYHRN 422
>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
Length = 621
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 170/276 (61%), Gaps = 28/276 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TS++I FHNEAWS LLRTV SV++RSP L+KE+ILVDD S+ +D + Q
Sbjct: 144 LPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSD------MDHLKQQLV 197
Query: 70 EYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYITA------- 114
+Y AS+ K RE H + V +D + T ++
Sbjct: 198 DYF-ASEPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPLLDRIAR 256
Query: 115 --KTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
TVVCP+IDVI D T EY + + GGF+W L F W+ VP E +R + + P+
Sbjct: 257 DPTTVVCPVIDVIDDTTLEYHFHDSGGVNVGGFDWNLQFNWHAVPEHEK-KRHKNPAEPV 315
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
+PTMAGGLF+IDK +F LG+YD G DIWGGENLE+SF+ W CGG LEI+PCSHVGH+F
Sbjct: 316 YSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIF 375
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
R +SPY + GV+ ++ N+ R+AEVW+DE+ +YY
Sbjct: 376 RKRSPYKWRSGVN-VLRRNSVRLAEVWLDEYAKYYY 410
>gi|395732382|ref|XP_002812541.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Pongo abelii]
Length = 967
Score = 233 bits (594), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 513 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 567
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 568 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 621
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 622 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 675
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 676 IPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 735
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 736 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 789
>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
Length = 586
Score = 233 bits (594), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 127/276 (46%), Positives = 170/276 (61%), Gaps = 28/276 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TS++I FHNEAWS LLRTV SV++RSP L+KE+ILVDD S+ +D + Q
Sbjct: 144 LPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSD------MDHLKQQLV 197
Query: 70 EYITASDMTWGGFNWKLREK--------NRHKKTVVCPIIDVISDQTFEYITA------- 114
+Y AS+ K RE H + V +D + T ++
Sbjct: 198 DYF-ASEPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPLLDRIAR 256
Query: 115 --KTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
TVVCP+IDVI D T EY + + GGF+W L F W+ VP E +R + + P+
Sbjct: 257 DPTTVVCPVIDVIDDTTLEYHFHDSGGVNVGGFDWNLQFNWHAVPEHEK-KRHKNPAEPV 315
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
+PTMAGGLF+IDK +F LG+YD G DIWGGENLE+SF+ W CGG LEI+PCSHVGH+F
Sbjct: 316 YSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIF 375
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
R +SPY + GV+ ++ N+ R+AEVW+DE+ +YY
Sbjct: 376 RKRSPYKWRSGVN-VLRRNSVRLAEVWLDEYAKYYY 410
>gi|6525067|gb|AAF15313.1|AF154107_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 5 [Homo
sapiens]
Length = 610
Score = 233 bits (593), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SVINRSP L+KEI+LVDD S +
Sbjct: 156 CAEQLVXNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTK----- 210
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 211 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 264
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 265 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 318
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 319 IPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 378
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 379 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 432
>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
Length = 621
Score = 233 bits (593), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 170/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS+VIVFHNEAWSTLLRTV+SV++ SP LL+EIILVDDAS + + D+
Sbjct: 175 LPTTSVVIVFHNEAWSTLLRTVYSVLHASPAALLREIILVDDAS------TDEYLKDELD 228
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
Y+ + ++ + K + ++ V S + ++ A
Sbjct: 229 RYVKQLQIV------RVVRQAERKGLITARLLGASVASGEVLTFLDAHCECFHGWLEPLL 282
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + G F+W L F W VPPRE RR
Sbjct: 283 SRIAEEPTAVVSPDITTIDLNTFEFSKPVQYGKQHSRGNFDWSLTFGWEVVPPRERQRRK 342
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLFAI + YF +GSYD+ M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 343 -DETVPIKSPTFAGGLFAISRSYFEHIGSYDDQMEIWGGENVEMSFRVWQCGGQLEIIPC 401
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMD++++ +Y N
Sbjct: 402 SVVGHVFRSKSPHTFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRN 446
>gi|153792142|ref|NP_001093363.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Xenopus laevis]
gi|148744516|gb|AAI42582.1| LOC100101309 protein [Xenopus laevis]
Length = 563
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 178/314 (56%), Gaps = 37/314 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C Y LP+TS++I FHNEA STLLRT+ SV+ RSP L++EIILVDD S
Sbjct: 121 CTSVHYDNDLPSTSVIITFHNEARSTLLRTIKSVLIRSPGNLIQEIILVDDFSTDPDDCQ 180
Query: 56 --VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDV------ISDQ 107
P + + + E + +R + R + P++ ++++
Sbjct: 181 LLTKIPKVKCLRNNRREGL-------------IRSRVRGAELAAAPVLTFLDSHCEVNNE 227
Query: 108 TFEYITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
+ + + VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M
Sbjct: 228 WLQPLLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMS 287
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R D +S +RTP +AGG+F IDK +F +LG YD MDIWGGEN E+SFRVW CGG LEI+
Sbjct: 288 RT-DPTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIV 346
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVS 277
PCS VGHVFR + PY FP G + + N R EVWMDE++ +YY P GKS SV+
Sbjct: 347 PCSRVGHVFRKRHPYEFPDGNALTYIKNTKRTVEVWMDEYKQYYYQARPSAIGKSYGSVA 406
Query: 278 TCAAHFRMLSYSSW 291
A + LS S+
Sbjct: 407 DRAELRKKLSCKSF 420
>gi|6688167|emb|CAB65104.1| GalNAc-T5 [Homo sapiens]
Length = 668
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SVINRSP L+KEI+LVDD S +
Sbjct: 214 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTK----- 268
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 269 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 322
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 323 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 376
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 377 IPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 436
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 437 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 490
>gi|242020636|ref|XP_002430758.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515955|gb|EEB18020.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 623
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 125/294 (42%), Positives = 181/294 (61%), Gaps = 32/294 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE-----R 55
CKKK Y LPT S++I F+NE ++TLLR+++SV+ R+P LLKEIILV+D S+ R
Sbjct: 148 CKKKKYSKNLPTASVIICFYNEHFTTLLRSIYSVLERTPSYLLKEIILVNDFSDLAGLHR 207
Query: 56 VVCPIIDV-ISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PI 100
+ ++ +D+ + S G ++ + V+ P+
Sbjct: 208 NISNYVNTNFTDKV--KLFKSKKRLGLIRARIFGSRKASGDVLVFLDSHIEVNVNWLQPL 265
Query: 101 IDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM 160
+ I D + K VV PIID+I+ TF+Y ++S + GGFNW L+F+W +P + +
Sbjct: 266 LSRIVD------SKKNVVVPIIDIINADTFKY-SSSPLVRGGFNWGLHFKWENLP-KSTL 317
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
+ D P+ +PTMAGGLFAI++ YF ELG YD GM+IWGGENLE+SFR+W CGG LE+
Sbjct: 318 KSNEDFVKPILSPTMAGGLFAINRAYFKELGEYDNGMNIWGGENLEISFRIWMCGGNLEL 377
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP-GKS 273
IPCS VGHVFR + PY P G ++ N+ RVA VWMD++++F+Y +P GK+
Sbjct: 378 IPCSRVGHVFRKRRPYGSPNG-EDTMMRNSLRVANVWMDDYKEFFYKQHPEGKT 430
>gi|395539756|ref|XP_003771832.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11 [Sarcophilus
harrisii]
Length = 970
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/290 (45%), Positives = 178/290 (61%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
CK+KSYPT LP SIVI F+NEA+S LLRTV SVI+R+P LL EIILVDD SE
Sbjct: 141 CKEKSYPTGLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDDLKG 200
Query: 55 -------RVVCPIIDVISDQTFEYITASDMTWG--GFNWKLREKNRH---KKTVVCPIID 102
+ + I V+ ++ E + M G L + H K + P++
Sbjct: 201 ELDDYVQKYLPGKIQVVRNEKGEGLIXGRMIGAAHGTGEVLVFLDSHCEVNKMWLQPLLV 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP +
Sbjct: 261 PIHED------HRTVVCPVIDIISADTLMY-SSSPIVCGGFNWDLHFKWDLVP---FSKL 310
Query: 163 GGDRSS--PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG + P+++P MAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGPEGAIAPIKSPAMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + +N+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-NNSLRMAHVWLDEYKEQYFSLRP 419
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 129/286 (45%), Positives = 173/286 (60%), Gaps = 31/286 (10%)
Query: 5 SYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE---------- 54
SYPT LP SIVI F+NEA+S LLRTV SVI+R+P LL EIILVDD SE
Sbjct: 507 SYPTGLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDD 566
Query: 55 ---RVVCPIIDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDVISD 106
+ + I V+ ++ E + M L K + P++ I +
Sbjct: 567 YVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQPLLVPIHE 626
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
+TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ GG
Sbjct: 627 D------HRTVVCPVIDIISADTLMY-SSSPIVRGGFNWGLHFKWDLVPFSEL---GGPE 676
Query: 167 SS--PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
+ P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L IIPCS
Sbjct: 677 GAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCS 736
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 737 RVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 781
>gi|224051278|ref|XP_002200509.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Taeniopygia guttata]
Length = 570
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 130/289 (44%), Positives = 169/289 (58%), Gaps = 24/289 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C Y T LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 128 CTSVRYDTDLPATSLIITFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFSSDPEDCQ 187
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII---DVISDQTFEYITA-- 114
++ I T + +R + R + I+ D + E++
Sbjct: 188 LLTKIPKVKCLRNTHREGL-------IRSRVRGAEVATADILTFLDSHCEVNSEWLQPML 240
Query: 115 -------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D +
Sbjct: 241 QRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DPT 299
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
+RTP +AGG+F IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VG
Sbjct: 300 QSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVG 359
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS 273
HVFR + PY FP G + + N R AEVWMDE++ +YY P GKS
Sbjct: 360 HVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKS 408
>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
Length = 671
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/289 (44%), Positives = 172/289 (59%), Gaps = 35/289 (12%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y LP TS++I FHNEAWS LLRTV SV++RSP L+KE+ILVDD S+
Sbjct: 219 CKEPGRYREDLPPTSVIICFHNEAWSVLLRTVHSVLDRSPEHLVKEVILVDDFSDMPHTQ 278
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ P + ++ E + + + RH V +D + T
Sbjct: 279 KQLEEYFLAYPRVKIVRAAKREGLIRARLLGA----------RHATAPVLTYLDSHCECT 328
Query: 109 FEYI---------TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + TVVCP+IDVI D T EY + + GGF+W L F W+ VP R
Sbjct: 329 TGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEYHYRDSGGVNVGGFDWNLQFNWHAVPER 388
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E R+ + P+ +PTMAGGLFAID+ +F LG+YD G DIWGGENLE+SF+ W CGG
Sbjct: 389 EK-RKHKSAAEPVWSPTMAGGLFAIDRVFFERLGTYDSGFDIWGGENLELSFKTWMCGGS 447
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEIIPCSHVGH+FR +SPY + GV+ ++ N+ R+AEVWMDE+ +YY
Sbjct: 448 LEIIPCSHVGHIFRKRSPYKWRTGVN-VIKRNSVRLAEVWMDEYAQYYY 495
>gi|431894831|gb|ELK04624.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Pteropus alecto]
Length = 939
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 118/286 (41%), Positives = 172/286 (60%), Gaps = 28/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C K+ LPTTS+++ F +E WSTL+R+V SV+NRSP L+KEI+LVDD S +
Sbjct: 485 CAKQLVHNNLPTTSVIMCFVDEVWSTLVRSVHSVLNRSPPHLIKEILLVDDFSTK----- 539
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD-QTF---------- 109
D + D +Y++ +LRE++ + + + D TF
Sbjct: 540 -DYLKDNLDKYMSQFPKVRI---LRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNIG 595
Query: 110 --------EYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
Y++ K V CP+I+VI+D+ Y+T + G F W +NF W +PP + +
Sbjct: 596 WLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVVAK 655
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
+ +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW CGG +EII
Sbjct: 656 NRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEII 715
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
PCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 716 PCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 761
>gi|10437774|dbj|BAB15105.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ R
Sbjct: 261 AIREDR------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGRA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|193784963|dbj|BAG54116.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ R
Sbjct: 261 AIREDRH------TVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGRA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 749
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 124/287 (43%), Positives = 163/287 (56%), Gaps = 24/287 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TS++I F E+WSTLLRTV SV+NRSP L+ E++LVDD S+R
Sbjct: 300 CKTKEYSDDLPRTSVIICFTEESWSTLLRTVHSVLNRSPPELIAEVLLVDDFSQRDYLKE 359
Query: 56 ------VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + V+ E + ++M G L + P++ I
Sbjct: 360 PLDEYMKKLPKVKVVRLPKREGLIRARLIGAEMAQGPVLTFLDSHVECNVGWLEPLLQRI 419
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
D VVCP ID I +FEY + G FNW++ F W +P E RR
Sbjct: 420 HDD------PTNVVCPAIDAIDATSFEYAGSGATIIGAFNWEMKFTWNGIPEYEARRRD- 472
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D S P+R+P MAGGLF+IDKD+FY +G+YD G DIWG ENLE+SF++W CGG LEIIPCS
Sbjct: 473 DESWPIRSPAMAGGLFSIDKDFFYRIGTYDPGFDIWGAENLELSFKIWMCGGSLEIIPCS 532
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDE-WRDFYYAMNP 270
V H+FR + PY FP G K + N R+ VW+DE +RD +Y++ P
Sbjct: 533 RVAHIFRKQQPYKFPDGNVKTFMRNTMRLVAVWVDEPYRDIFYSLKP 579
>gi|153792095|ref|NP_071370.2| polypeptide N-acetylgalactosaminyltransferase 11 [Homo sapiens]
gi|51316030|sp|Q8NCW6.2|GLT11_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|5630076|gb|AAD45821.1|AC006017_1 N-acetylgalactosaminyltransferase; similar to Q10473 (PID:g1709559)
[Homo sapiens]
gi|51105934|gb|EAL24518.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Homo
sapiens]
gi|119574361|gb|EAW53976.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Homo sapiens]
gi|189442406|gb|AAI67834.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[synthetic construct]
gi|345500003|emb|CAC79625.3| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 608
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ R
Sbjct: 261 AIREDR------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGRA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|149730635|ref|XP_001491185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Equus
caballus]
Length = 940
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 172/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGVFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDIVAKNRIKDTDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
Length = 398
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/285 (45%), Positives = 166/285 (58%), Gaps = 20/285 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE-----R 55
C SYPT LPT S++I+FHNEAWSTLLRTV SV+ RSP LL+EI+LVDD S
Sbjct: 37 CLSLSYPTKLPTASVIIIFHNEAWSTLLRTVHSVLARSPPYLLREIVLVDDHSRLDTYGH 96
Query: 56 VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI--- 112
+ + IS T + + G +L + K V+ +D + ++
Sbjct: 97 LGSKLESYISQFTKVQLIRAPKREGLIRARLIGAKQAKGEVLV-FLDSHCEANLGWLEPL 155
Query: 113 ------TAKTVVCPIIDVISDQTFEYITASDMTWGG-FNWKLNFRWYRVPPREMMRRGGD 165
VV P I+VI +TF Y G FNW+L F+W +P E RR D
Sbjct: 156 LARIGENRSIVVTPDIEVIDLRTFGYTHEHGANNRGIFNWELTFKWRGIPEYERRRRKSD 215
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
S P+R+PTMAGGLFAIDK YFYE+GSYD M WGGEN+E+SFR+W CGG LEIIPCS
Sbjct: 216 -SDPIRSPTMAGGLFAIDKSYFYEIGSYDTEMSFWGGENVEISFRIWMCGGSLEIIPCSK 274
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVFR+ PY G + N R+AEVWMD+++ +YAM P
Sbjct: 275 VGHVFRESQPYKIGEGA---IDRNNMRLAEVWMDDYKKIFYAMRP 316
>gi|332233960|ref|XP_003266176.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Nomascus
leucogenys]
Length = 940
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWKT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDIIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|301614636|ref|XP_002936794.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Xenopus (Silurana) tropicalis]
Length = 625
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 129/286 (45%), Positives = 173/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTSI+IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS + + + D+
Sbjct: 176 LPTTSIIIVFHNEAWSTLLRTVYSVMHTSPAILLKEIILVDDAS------VDEYLKDELD 229
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + T ++ A
Sbjct: 230 EYVKQLQIV------KVVRQKERKGLITARLLGASVATGDTLTFLDAHCECYYGWLEPLL 283
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
+VV P I I TF++ S + G F+W L+F W +P E RR
Sbjct: 284 ASIAENYTSVVSPDITGIDLNTFQFSNPSPYGNNHNRGNFDWTLSFGWESLPSSEKTRR- 342
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 343 KDETYPIKTPTFAGGLFSISKAYFEHIGSYDEQMEIWGGENIEMSFRVWQCGGQLEILPC 402
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G +++++ N R+AEVWMD+ ++ +Y N
Sbjct: 403 SVVGHVFRSKSPHTFPKG-TQVIVRNQVRLAEVWMDDLKEIFYRRN 447
>gi|432097047|gb|ELK27545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Myotis davidii]
Length = 558
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
CK K YPT LP S+VI F+NEA S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 126 CKDKIYPTDLPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDSSDFDDLKG 185
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
++ + I +I + E + M L + H + V P++
Sbjct: 186 ELDEFVQKHLPGKIKLIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 245
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 246 AIREDR------RTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELEGP 298
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 299 EG-ATAPIKSPTMAGGLFAMNRSYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 357
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 358 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 404
>gi|296204771|ref|XP_002749473.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Callithrix jacchus]
Length = 940
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 541 -DYLKDDLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|432936506|ref|XP_004082149.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oryzias latipes]
Length = 533
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/284 (44%), Positives = 164/284 (57%), Gaps = 23/284 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C SY LP+T+++I FHNEA STLLRTV SV+ RSP +L++E++L+DD S
Sbjct: 94 CAALSYDADLPSTTVIITFHNEARSTLLRTVKSVLMRSPPSLIQEVLLIDDFSS------ 147
Query: 61 IDVISDQTFEYITASDMTWGGFNWKL-REKNRHKKTVVCPIIDVISDQTFEYIT------ 113
D+ Q I L R + + + PI+ + D E T
Sbjct: 148 -DLEDCQLLAQIPKVRCLRNSRREGLIRSRVKGANSASAPILTFL-DSHCEVNTDWLQPM 205
Query: 114 -------AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D
Sbjct: 206 IQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMARS-DP 264
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ P+RTP +AGG+F +DK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS V
Sbjct: 265 TLPIRTPVIAGGIFVMDKSWFNHLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSRV 324
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR + PY FP G + + N R AEVWMDE++ FYY+ P
Sbjct: 325 GHVFRKRHPYDFPEGNALTYIKNTRRAAEVWMDEYKQFYYSARP 368
>gi|326917280|ref|XP_003204928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Meleagris gallopavo]
Length = 528
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 176/288 (61%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y + LP TS+VI F+NEAWSTLLRTV SV+ SP LL+E+ILVDD S++ +
Sbjct: 70 CKEKKYDYYSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKDHLK 129
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ + ++ + ++ G +L + K ++ P+++
Sbjct: 130 EPLENYVAGLRKVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLLER 189
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I+++ VVCP+IDVI TFEY+ A + GGF+W+L F W+ P RE RR
Sbjct: 190 IAEEE------SAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTTPEREQKRR 243
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ +R+PTMAGGLF++ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI P
Sbjct: 244 KS-KIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHP 302
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 303 CSHVGHVFPKQAPYS-----RSKALANSVRAAEVWMDEYKELYYHRNP 345
>gi|307183874|gb|EFN70488.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Camponotus
floridanus]
Length = 451
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 130/288 (45%), Positives = 166/288 (57%), Gaps = 27/288 (9%)
Query: 4 KSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-------ERV 56
K + LP TS++I FHNEA STLLRTV SV+NRSP L+KEIILVDD S E
Sbjct: 2 KQWRQDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELS 61
Query: 57 VCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEY 111
+ VI ++ E + +D L + P+++ +++
Sbjct: 62 RIHKVRVIRNEKREGLMRSRVRGADAATANVLTFLDSHCECNADWLEPLLERVAED---- 117
Query: 112 ITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
VVCP+IDVIS TF+YI AS GGF+W L F+W + E R D + +R
Sbjct: 118 --PTRVVCPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQAERQARQKDPTQAIR 175
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENL---------EMSFRVWQCGGILEIIP 222
TP +AGGLF I+K YF +LG YD MD+WGGENL ++SFRVWQCGG LEIIP
Sbjct: 176 TPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLGIVIQFHVQKISFRVWQCGGSLEIIP 235
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGHVFR + PY+FPGG + N R AEVWMD+++ FYY P
Sbjct: 236 CSRVGHVFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVP 283
>gi|326922813|ref|XP_003207639.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Meleagris
gallopavo]
Length = 632
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDAS------VDEYLHDKLD 236
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + K + ++ V + +T ++ A
Sbjct: 237 EYMKQFQIV------KVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLL 290
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 291 ARIAENSVAVVSPDIASIDLNTFEFSKPSPYGHNHNRGNFDWSLSFGWESLPKYENKRRK 350
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT AGGLF+I K YF +GSYD+ M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 351 -DETYPIRTPTFAGGLFSISKKYFEHIGSYDDEMEIWGGENIEMSFRVWQCGGQLEIMPC 409
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 410 SVVGHVFRSKSPHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRN 454
>gi|403258971|ref|XP_003922013.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Saimiri boliviensis boliviensis]
Length = 967
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 118/286 (41%), Positives = 171/286 (59%), Gaps = 28/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 513 CTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 567
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD-QTF---------- 109
D + D +Y++ +LRE++ + + + D TF
Sbjct: 568 -DYLKDNLDKYMSQFPKVRI---LRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVG 623
Query: 110 --------EYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
Y++ K V CP+I+VI+D+ Y+T + G F W +NF W +PP + +
Sbjct: 624 WLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAK 683
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
+ +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW CGG +EII
Sbjct: 684 NRIKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEII 743
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
PCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 744 PCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 789
>gi|21707970|gb|AAH34184.1| Galnt11 protein [Mus musculus]
Length = 411
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 128/280 (45%), Positives = 174/280 (62%), Gaps = 27/280 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C++KSYPT LPT SIVI F+NEA+S LLRTV SV++R+P LL EIILVDD+S
Sbjct: 141 CRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 200
Query: 54 ------ERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
+R + + VI + E + M L + H + V P++
Sbjct: 201 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 IILED------PHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPVSELGGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+R+PTMAGGLFA+++ YF +LG YD GMDIWGGENLE+SFR+W CGG L I+P
Sbjct: 314 DG-ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWR 262
CS VGH+FR + PY P G + HN+ R+A VW+DE++
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYK 411
>gi|47225457|emb|CAG11940.1| unnamed protein product [Tetraodon nigroviridis]
Length = 534
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 130/243 (53%), Positives = 149/243 (61%), Gaps = 53/243 (21%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K YP LP TS+VIVFHNEAWSTLLRTV SVI+RSP TLL+EIILVDDASER
Sbjct: 130 CKNKLYPDNLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKR 189
Query: 56 --------VVCPIIDVISDQTFEYITAS-------------------DMTWGGFNWKLRE 88
+ P+ V +Q I A + T G L
Sbjct: 190 PLEQYVRRLEVPVRVVRMEQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLAR 249
Query: 89 KNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLN 148
+ +KTVVCPIIDVIS D TFEY+ SDMT+GGFNWKLN
Sbjct: 250 IKKDRKTVVCPIIDVIS---------------------DDTFEYMAGSDMTYGGFNWKLN 288
Query: 149 FRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMS 208
FRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+YD GMDIWGGENLE+S
Sbjct: 289 FRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEIS 348
Query: 209 FRV 211
FR+
Sbjct: 349 FRL 351
>gi|301758254|ref|XP_002914993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Ailuropoda melanoleuca]
Length = 540
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/291 (45%), Positives = 174/291 (59%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LPTTS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 85 CKEKKYDYNNLPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 144
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + K V+ P++
Sbjct: 145 ERLANELSGLPKVRLIRANRREGLVRARLLGASAAKGEVLTFLDCHCECHEGWLEPLLQR 204
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + GGF+W+L F W+ VP RE MR
Sbjct: 205 IHEEE------SAVVCPVIDVIDWNTFEYLGNPGEPQIGGFDWRLVFTWHVVPERERMRM 258
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 259 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLE 314
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 315 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 360
>gi|403258969|ref|XP_003922012.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Saimiri boliviensis boliviensis]
Length = 940
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 118/286 (41%), Positives = 171/286 (59%), Gaps = 28/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD-QTF---------- 109
D + D +Y++ +LRE++ + + + D TF
Sbjct: 541 -DYLKDNLDKYMSQFPKVRI---LRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVG 596
Query: 110 --------EYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
Y++ K V CP+I+VI+D+ Y+T + G F W +NF W +PP + +
Sbjct: 597 WLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAK 656
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
+ +R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW CGG +EII
Sbjct: 657 NRIKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEII 716
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
PCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 717 PCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|334330196|ref|XP_003341314.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Monodelphis
domestica]
Length = 631
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 130/286 (45%), Positives = 170/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++I+FHNEAWSTLLRTV SV+ SP LLKEIILVDDASE D + D+
Sbjct: 182 LPTTSVIIIFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASED------DYLHDKLD 235
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EYI + K+ + + + ++ V + +T ++ +
Sbjct: 236 EYIKQFQIV------KVVRQKERQGLINARLLGASVATAETLTFLDSHCECFYGWLEPLL 289
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 290 SRIAENYTAVVSPDIASIDLTTFEFSKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK 349
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT AGGLF+I K YF +G+YDE M IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 350 -DETYPIRTPTFAGGLFSISKKYFEYIGTYDEEMKIWGGENIEMSFRVWQCGGQLEIMPC 408
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 409 SVVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEFKEIFYRRN 453
>gi|12832954|dbj|BAB22325.1| unnamed protein product [Mus musculus]
Length = 429
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 177/315 (56%), Gaps = 28/315 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQISLEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP-------GKSASVSTCAA 281
VFR K PY FP G + + N R AEVWMDE++ +YYA P G S C
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARPFALERPFGNSHVTQCCRR 393
Query: 282 HFRMLSYSSWFSGSI 296
R+L + F G +
Sbjct: 394 --RILIRGTSFRGVV 406
>gi|395838351|ref|XP_003792079.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Otolemur garnettii]
Length = 608
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 180/290 (62%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK++SYPT LP S+VI F+NEA+S LLRTV SVI+R+P LL E+ILV
Sbjct: 141 CKEQSYPTDLPVASVVICFYNEAFSALLRTVHSVIDRTPVHLLHEVILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGG--FNWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAQATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AIREDQ------QTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 310
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG+ ++P+++PTMAGGLFA+++ YF++LG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGEEGATAPIKSPTMAGGLFAMNRQYFHDLGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|26347119|dbj|BAC37208.1| unnamed protein product [Mus musculus]
Length = 550
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPHTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSLEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|254910954|ref|NP_082140.2| polypeptide N-acetylgalactosaminyltransferase 14 [Mus musculus]
gi|115527999|gb|AAI17801.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Mus musculus]
Length = 550
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSLEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|391346326|ref|XP_003747427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Metaseiulus occidentalis]
Length = 622
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 128/278 (46%), Positives = 162/278 (58%), Gaps = 23/278 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TS+VI FHNEA S LLRTV SV+ R+P LLKEI+LVDDAS+ I +
Sbjct: 166 LPKTSVVITFHNEARSALLRTVVSVLQRTPSHLLKEIVLVDDASDDPTDGIELQMKYDKI 225
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYITAK------TV 117
E IT + +R + K P++ + E + A+ V
Sbjct: 226 ELITNRER-----QGLMRSRVFGAKKAKGPVLTFLDSHCECNEGWIEPLLARIRDEPSKV 280
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
VCP+IDV+S TF Y AS GGF+W L F+W + + + + P++TP MAG
Sbjct: 281 VCPVIDVLSMDTFGYFPASSDLRGGFDWNLVFKWEFITSKPELA-----TDPIKTPAMAG 335
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLFAI K F LGSYD MDIWG ENLEMSFRVWQCG +EI+PCS VGHVFR + PYT
Sbjct: 336 GLFAITKKEFERLGSYDTQMDIWGAENLEMSFRVWQCGSGIEILPCSRVGHVFRKQHPYT 395
Query: 238 FPGGVS-KIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
FPGG S K+ N+ R AEVWMD+++ +YY P +
Sbjct: 396 FPGGGSGKVFARNSRRAAEVWMDDYKKYYYEQVPAAKS 433
>gi|148706466|gb|EDL38413.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_b [Mus
musculus]
Length = 551
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 102 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 161
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 162 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 220
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 221 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSLEQKALRL-DPTE 274
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 275 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 334
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 335 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 376
>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
Length = 658
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 130/292 (44%), Positives = 171/292 (58%), Gaps = 26/292 (8%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE----- 54
CK K +PT L SI+I F NEAWSTLLRTV SV+NRSP L+ EIIL+DD+S+
Sbjct: 205 CKAKQWPTANLLKASIIICFVNEAWSTLLRTVHSVLNRSPADLVHEIILLDDSSDAAWLG 264
Query: 55 -RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT 113
++ I D + D+ +Y+ + G H V +D + ++
Sbjct: 265 DKLTNYIRDNLPDKV-KYVRTQHRS--GLIRARLVGAEHATGDVLLFLDSHCEANLNWLE 321
Query: 114 A---------KTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPREMMRR 162
+TVV P+ID I T EY A+ G F+W ++F W + R
Sbjct: 322 PIMALITEDRRTVVTPVIDSIDHHTMEYSKATQDVPAVGTFDWTMDFNW----KAGVRRA 377
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G D + P+ +PTMAGGLFA++K+YFYELGSYDE MD WGGENLEMSFR+WQCGG L P
Sbjct: 378 GADATDPVDSPTMAGGLFAMEKNYFYELGSYDEKMDGWGGENLEMSFRIWQCGGRLVTAP 437
Query: 223 CSHVGHVFRDKSPYTFPGG-VSKIVLHNAARVAEVWMDEWRDFYYAMNPGKS 273
CSHVGH+FRD PYT PGG + L N+ RVAEVWMD ++ ++ PG++
Sbjct: 438 CSHVGHIFRDSHPYTVPGGSIHDTFLRNSMRVAEVWMDHYKQYFLDTRPGQN 489
>gi|149730677|ref|XP_001496099.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Equus
caballus]
Length = 633
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHGKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQRERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDMNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|108935842|sp|Q8BVG5.2|GLT14_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
Length = 550
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSLEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|392923087|ref|NP_001256888.1| Protein GLY-4, isoform c [Caenorhabditis elegans]
gi|255068800|emb|CBA11615.1| Protein GLY-4, isoform c [Caenorhabditis elegans]
Length = 480
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 122/270 (45%), Positives = 161/270 (59%), Gaps = 18/270 (6%)
Query: 13 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTFEYI 72
T+++I +HNEA S+LLRTV+SV N+SP LL EI+LVDD S+ V I + +
Sbjct: 153 TTVIITYHNEARSSLLRTVFSVFNQSPEELLLEIVLVDDNSQDVE------IGKELAQIQ 206
Query: 73 TASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYITA------KTVVCP 120
+ + +R + + + P++ + E + A K VV P
Sbjct: 207 RITVLRNNQREGLIRSRVKGAQVARAPVLTFLDSHIECNQKWLEPLLARIAENPKAVVAP 266
Query: 121 IIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLF 180
IIDVI+ F Y+ AS GGF+W L FRW + + R ++P+R+PTMAGGLF
Sbjct: 267 IIDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEQLRKERHAHPTAPIRSPTMAGGLF 326
Query: 181 AIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPG 240
AI K++F ELG+YD M++WGGENLEMSFRVWQCGG LEI+PCS VGHVFR K PYTFPG
Sbjct: 327 AISKEWFNELGTYDLDMEVWGGENLEMSFRVWQCGGSLEIMPCSRVGHVFRKKHPYTFPG 386
Query: 241 GVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
G + N R AEVWMDE++ Y P
Sbjct: 387 GSGNVFQKNTRRAAEVWMDEYKAIYLKNVP 416
>gi|118403595|ref|NP_001072369.1| polypeptide N-acetylgalactosaminyltransferase 14 [Xenopus
(Silurana) tropicalis]
gi|111305707|gb|AAI21473.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Xenopus (Silurana)
tropicalis]
Length = 555
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 122/284 (42%), Positives = 166/284 (58%), Gaps = 20/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
C + Y + LP TS++I FHNEA STLLRT+ SV+NR+P L+ EI+LVDD S+ +
Sbjct: 101 CTELHYQSDLPPTSVIITFHNEARSTLLRTIRSVLNRTPMHLIHEILLVDDFSDNLDDCR 160
Query: 57 ---VCPIIDVISDQTFE------YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
P + + ++ E + + + L K + P++ I +
Sbjct: 161 LLSKLPKVRCLRNEQREAGLIRSRVRGAGVAQAAVLTFLDSHCEVNKDWLPPLLHRIKED 220
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
VV P+ID+I+ TF YI AS GGF+W L+F+W ++ + +R D +
Sbjct: 221 ------PTRVVSPVIDIINLDTFAYIAASSDLRGGFDWSLHFKWEQLSAEQKAKRL-DPT 273
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P++TP +AGGLF I+K +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VG
Sbjct: 274 EPIKTPVIAGGLFVIEKSWFNHLGKYDTAMDIWGGENFEISFRVWMCGGSLEIIPCSRVG 333
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
HVFR K PY FP G + + N R AEVWMDE+++ YYA P
Sbjct: 334 HVFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKNHYYAARPA 377
>gi|58865788|ref|NP_001012109.1| polypeptide N-acetylgalactosaminyltransferase 14 [Rattus
norvegicus]
gi|50926091|gb|AAH79128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[Rattus norvegicus]
gi|149050682|gb|EDM02855.1| rCG61782, isoform CRA_b [Rattus norvegicus]
Length = 552
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNSERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSVEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|281341254|gb|EFB16838.1| hypothetical protein PANDA_002911 [Ailuropoda melanoleuca]
Length = 496
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 132/291 (45%), Positives = 174/291 (59%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LPTTS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 41 CKEKKYDYNNLPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 100
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + K V+ P++
Sbjct: 101 ERLANELSGLPKVRLIRANRREGLVRARLLGASAAKGEVLTFLDCHCECHEGWLEPLLQR 160
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + GGF+W+L F W+ VP RE MR
Sbjct: 161 IHEEE------SAVVCPVIDVIDWNTFEYLGNPGEPQIGGFDWRLVFTWHVVPERERMRM 214
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 215 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLE 270
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 271 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 316
>gi|148706467|gb|EDL38414.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_c [Mus
musculus]
Length = 429
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 177/315 (56%), Gaps = 28/315 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSLEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP-------GKSASVSTCAA 281
VFR K PY FP G + + N R AEVWMDE++ +YYA P G S C
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARPFALERPFGNSHVTQCCRR 393
Query: 282 HFRMLSYSSWFSGSI 296
R+L + F G +
Sbjct: 394 --RILIRGTSFRGVV 406
>gi|341894191|gb|EGT50126.1| CBN-GLY-4 protein [Caenorhabditis brenneri]
Length = 584
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 128/284 (45%), Positives = 165/284 (58%), Gaps = 21/284 (7%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ Y F + T+++I +HNEA S+LLRTV+SV N SP LL EI+LVDD S
Sbjct: 137 CRDVDYSQFQMRPTTVIITYHNEARSSLLRTVFSVFNMSPEDLLHEIVLVDDNS------ 190
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKL-REKNRHKKTVVCPIIDVISDQT------FEYI 112
IDV + I + L R + + + PI+ + E +
Sbjct: 191 -IDVDIGKELAQIERVKVLRNNQREGLIRSRVKGAQVAGAPILTFLDSHIECNQKWLEPL 249
Query: 113 TA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
A K VV PIIDVI+ F Y+ AS GGF+W L FRW + + R +
Sbjct: 250 LARIAENPKAVVAPIIDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEQLRTERHKNP 309
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
++P+++PTMAGGLFAI K++F ELG+YD M++WGGENLEMSFRVWQCGG LEI+PCS V
Sbjct: 310 TAPIKSPTMAGGLFAISKEWFEELGTYDLDMEVWGGENLEMSFRVWQCGGSLEILPCSRV 369
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVFR K PYTFPGG + N R AEVWMDE++ Y P
Sbjct: 370 GHVFRKKHPYTFPGGSGNVFQKNTRRAAEVWMDEYKAIYLKNVP 413
>gi|154152027|ref|NP_001093795.1| polypeptide N-acetylgalactosaminyltransferase 12 [Bos taurus]
gi|151553796|gb|AAI48122.1| GALNT12 protein [Bos taurus]
Length = 579
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 134/285 (47%), Positives = 176/285 (61%), Gaps = 24/285 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LPTTS+VI F+NEAWSTLLRTV+SV+ SP TLL+E+ILVDD S+R +
Sbjct: 123 CKEKKYNYDELPTTSVVIAFYNEAWSTLLRTVYSVLETSPDTLLEEVILVDDYSDREHLK 182
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ ++ + ++ G +L + K V+ C + + +
Sbjct: 183 ERLATELAGLPKVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQR 242
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I K VVCP+IDVI TFEY+ A + GGF+W+L F W+ VP +E +R S
Sbjct: 243 IHEKESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTVPEKERIRM----RS 298
Query: 169 PL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
P+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI PCSH
Sbjct: 299 PIDVIRSPTMAGGLFAVSKKYFEYLGSYDIGMEVWGGENLEFSFRIWQCGGTLEIHPCSH 358
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 359 VGHVFPRQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 398
>gi|395844920|ref|XP_003795196.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Otolemur garnettii]
Length = 633
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
+YI + K+ + K + ++ V + +T ++ A
Sbjct: 238 KYIKQFSIV------KIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P +E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGGNHNRGNFDWSLSFGWESLPDQEKQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K YF +GSYD+ M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKKYFEYIGSYDDEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|332870119|ref|XP_003318977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Pan
troglodytes]
Length = 527
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP SIVI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 60 CKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 180 AIREDRH------TVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGA 232
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 233 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 291
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 292 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 338
>gi|311246104|ref|XP_003122084.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Sus
scrofa]
Length = 541
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 136/286 (47%), Positives = 174/286 (60%), Gaps = 26/286 (9%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK+K Y LPTTS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R
Sbjct: 123 CKEKKYDYDHLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 182
Query: 60 ---IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
I++ I A+ G +L + K V+ C + + +
Sbjct: 183 ERLAIELAKLPKVRLIRANKRE-GLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQ 241
Query: 111 YITAK--TVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
I K VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 242 RIHEKESAVVCPVIDVIDWNTFEYMGNSREPQIGGFDWRLVFTWHVVPERERLRM----K 297
Query: 168 SPL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
SP+ R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE SFR+WQCGG LEI PCS
Sbjct: 298 SPIDVIRSPTMAGGLFAVSKKYFEYLGAYDTGMEVWGGENLEFSFRIWQCGGTLEIHPCS 357
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 358 HVGHVFPKQAPYS-----RSKALANSVRAAEVWMDEFKELYYHRNP 398
>gi|52851353|dbj|BAD52069.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
Length = 550
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 166/282 (58%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRHNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSLEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|431894865|gb|ELK04658.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Pteropus alecto]
Length = 633
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 127/286 (44%), Positives = 172/286 (60%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + D+
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHDKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIID--VISDQTFEYITAK------------ 115
EYI + K+ + K + ++ V + +T ++ A
Sbjct: 238 EYIKQFSIV------KIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLL 291
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 292 ARIAENYTAVVSPDIASIDMNTFEFNKPSPYGNNHNRGNFDWSLSFGWESLPDHERQRRK 351
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K+YF +G+YD+ M+IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 352 -DETYPIKTPTFAGGLFSISKEYFEYIGTYDDEMEIWGGENIEMSFRVWQCGGQLEIMPC 410
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMD++++ +Y N
Sbjct: 411 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDDYKEIFYRRN 455
>gi|72000999|ref|NP_507850.2| Protein GLY-4, isoform b [Caenorhabditis elegans]
gi|27151758|emb|CAB81985.3| Protein GLY-4, isoform b [Caenorhabditis elegans]
Length = 453
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 122/270 (45%), Positives = 161/270 (59%), Gaps = 18/270 (6%)
Query: 13 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTFEYI 72
T+++I +HNEA S+LLRTV+SV N+SP LL EI+LVDD S+ V I + +
Sbjct: 153 TTVIITYHNEARSSLLRTVFSVFNQSPEELLLEIVLVDDNSQDVE------IGKELAQIQ 206
Query: 73 TASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYITA------KTVVCP 120
+ + +R + + + P++ + E + A K VV P
Sbjct: 207 RITVLRNNQREGLIRSRVKGAQVARAPVLTFLDSHIECNQKWLEPLLARIAENPKAVVAP 266
Query: 121 IIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLF 180
IIDVI+ F Y+ AS GGF+W L FRW + + R ++P+R+PTMAGGLF
Sbjct: 267 IIDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEQLRKERHAHPTAPIRSPTMAGGLF 326
Query: 181 AIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPG 240
AI K++F ELG+YD M++WGGENLEMSFRVWQCGG LEI+PCS VGHVFR K PYTFPG
Sbjct: 327 AISKEWFNELGTYDLDMEVWGGENLEMSFRVWQCGGSLEIMPCSRVGHVFRKKHPYTFPG 386
Query: 241 GVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
G + N R AEVWMDE++ Y P
Sbjct: 387 GSGNVFQKNTRRAAEVWMDEYKAIYLKNVP 416
>gi|301608339|ref|XP_002933739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Xenopus (Silurana) tropicalis]
Length = 622
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 168/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV+ SP LLKEIILVDDASE + + ++
Sbjct: 174 LPTTSVIIVFHNEAWSTLLRTVYSVLYTSPAILLKEIILVDDASED------EYLKEKLD 227
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ A + K+ + K ++ + + + ++ A
Sbjct: 228 DYVKALQIV------KIARQKERKGLTTARLLGASIATGEVLTFLDAHCECFHGWLEPLL 281
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I +I +FE+ G F+W L F W +P E RR
Sbjct: 282 SRIAEDHTAVVSPDIPIIDLNSFEFHKPVQYGKTHNRGNFDWSLTFGWEAIPAAEKERRK 341
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K YF +GSYDE M+IWGGENLEMSFRVWQCGG LEIIPC
Sbjct: 342 -DETYPIKTPTFAGGLFSISKAYFEHIGSYDEEMEIWGGENLEMSFRVWQCGGQLEIIPC 400
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G ++++ N R+AEVWMD+++ YY N
Sbjct: 401 SVVGHVFRTKSPHTFPKG-TQVIFRNLVRLAEVWMDDYKLLYYQRN 445
>gi|72000997|ref|NP_001024216.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
gi|51316004|sp|Q8I136.2|GALT4_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
Short=pp-GaNTase 4; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 4; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|3047189|gb|AAC13670.1| GLY4 [Caenorhabditis elegans]
gi|11064525|emb|CAC14394.1| Protein GLY-4, isoform a [Caenorhabditis elegans]
Length = 589
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 122/270 (45%), Positives = 161/270 (59%), Gaps = 18/270 (6%)
Query: 13 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTFEYI 72
T+++I +HNEA S+LLRTV+SV N+SP LL EI+LVDD S+ V I + +
Sbjct: 153 TTVIITYHNEARSSLLRTVFSVFNQSPEELLLEIVLVDDNSQDVE------IGKELAQIQ 206
Query: 73 TASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYITA------KTVVCP 120
+ + +R + + + P++ + E + A K VV P
Sbjct: 207 RITVLRNNQREGLIRSRVKGAQVARAPVLTFLDSHIECNQKWLEPLLARIAENPKAVVAP 266
Query: 121 IIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLF 180
IIDVI+ F Y+ AS GGF+W L FRW + + R ++P+R+PTMAGGLF
Sbjct: 267 IIDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEQLRKERHAHPTAPIRSPTMAGGLF 326
Query: 181 AIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPG 240
AI K++F ELG+YD M++WGGENLEMSFRVWQCGG LEI+PCS VGHVFR K PYTFPG
Sbjct: 327 AISKEWFNELGTYDLDMEVWGGENLEMSFRVWQCGGSLEIMPCSRVGHVFRKKHPYTFPG 386
Query: 241 GVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
G + N R AEVWMDE++ Y P
Sbjct: 387 GSGNVFQKNTRRAAEVWMDEYKAIYLKNVP 416
>gi|363734725|ref|XP_001231965.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 1
[Gallus gallus]
Length = 563
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 132/309 (42%), Positives = 176/309 (56%), Gaps = 27/309 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C Y T LP TS++I FHNEA S LLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 121 CTSVRYDTDLPATSLIITFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPEDCQ 180
Query: 60 IIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII---DVISDQTFEYITA- 114
++ I + I + +R + R + I+ D + E++
Sbjct: 181 LLTRIPKVKCLRNIRREGL--------IRSRVRGAEAATADILTFLDSHCEVNSEWLQPM 232
Query: 115 --------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D
Sbjct: 233 LQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DP 291
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ +RTP +AGG+F I+K +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS V
Sbjct: 292 TQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRV 351
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAH 282
GHVFR + PY FP G + + N R AEVWMDE++ +YY P GKS S++
Sbjct: 352 GHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKSYGSIADRVEQ 411
Query: 283 FRMLSYSSW 291
R L+ S+
Sbjct: 412 RRKLNCKSF 420
>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
gallus]
Length = 590
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 127/288 (44%), Positives = 175/288 (60%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK K Y + LP TS+VI F+NEAWSTLLRTV SV+ SP LL+E+ILVDD S++ +
Sbjct: 132 CKGKKYDYYSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKDHLK 191
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ + ++ + ++ G +L + + ++ P+++
Sbjct: 192 EPLENYVAGLRKVRLIRANKREGLVRARLLGASIARGDILTFLDCHCECHEGWLEPLLER 251
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I+++ VVCP+IDVI TFEY+ A + GGF+W+L F W+ P RE RR
Sbjct: 252 IAEEE------SAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTTPEREQKRR 305
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ +R+PTMAGGLF++ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI P
Sbjct: 306 KS-KIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHP 364
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 365 CSHVGHVFPKQAPYS-----RSKALANSVRAAEVWMDEYKELYYHRNP 407
>gi|345797223|ref|XP_545481.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Canis
lupus familiaris]
Length = 602
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 172/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 148 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 202
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 203 -DYLKDDLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 256
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 257 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 310
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 311 IPPDVVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 370
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 371 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 424
>gi|426358557|ref|XP_004046575.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Gorilla gorilla gorilla]
Length = 527
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 127/288 (44%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 60 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 180 AIREDRH------TVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGA 232
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 233 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 291
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 292 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 338
>gi|397525624|ref|XP_003832760.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Pan
paniscus]
Length = 940
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 170/294 (57%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+I K YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDTIRCPVMAGGLFSIHKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|149050681|gb|EDM02854.1| rCG61782, isoform CRA_a [Rattus norvegicus]
Length = 397
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 126/282 (44%), Positives = 167/282 (59%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C Y T LP TSI+I FHNEA STLLRT+ SV+NR+P L++EIILVDD S
Sbjct: 101 CSLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIQEIILVDDFSNDPEDCK 160
Query: 54 ERVVCPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
+ + P + + + + + S D+ G L + + P++ + +
Sbjct: 161 QLIKLPKVKCLRNSERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ + R D +
Sbjct: 220 -----YTRVVCPVIDIINLDTFNYIESASELRGGFDWSLHFQWEQLSVEQKALRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+RTP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIRTPIIAGGLFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 375
>gi|348510947|ref|XP_003443006.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oreochromis niloticus]
Length = 567
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 127/286 (44%), Positives = 169/286 (59%), Gaps = 27/286 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C +Y T LP+TSIVI FHNEA STLLRT+ SV+ RSP +L++EIIL+DD +S+ C
Sbjct: 128 CAALTYDTDLPSTSIVITFHNEARSTLLRTIKSVLMRSPPSLIQEIILIDDFSSDPEDCQ 187
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIID-VI 104
++ I + G ++R N +++ P+I V
Sbjct: 188 LLAQIPKVR---CLRNGRREGLIRSRVRGANMASASILTFLDSHCEVNTDWLQPMIQRVK 244
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
D T VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R
Sbjct: 245 EDHT-------RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMARS- 296
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + +RTP +AGG+F +D+ +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS
Sbjct: 297 DPTQAIRTPVIAGGIFVMDRSWFNHLGQYDTHMDIWGGENFELSFRVWLCGGSLEILPCS 356
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVFR + PY FP G + + N R AEVWMDE++ +YY+ P
Sbjct: 357 RVGHVFRKRHPYDFPEGNALTYIKNTRRAAEVWMDEYKQYYYSARP 402
>gi|351695439|gb|EHA98357.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
glaber]
Length = 608
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 178/290 (61%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+KSYPT LP S+VI F+NEA+S LLRTV SV++R+P LL EIILV
Sbjct: 141 CKEKSYPTDLPVASVVICFYNEAFSALLRTVHSVLDRTPAYLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I +I + E + M L + H + V P++
Sbjct: 201 ELDEYIQKYLPAKIKLIRNPRREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
V+ + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 VV------HGDPHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 310
Query: 163 GGDRSS--PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG S+ P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGADSATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|281341921|gb|EFB17505.1| hypothetical protein PANDA_013078 [Ailuropoda melanoleuca]
Length = 936
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 483 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 537
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + +Y++ LR K RH V C
Sbjct: 538 -DYLKGNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 591
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 592 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 645
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 646 IPPDVVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 705
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 706 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 759
>gi|114616856|ref|XP_001143140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Pan troglodytes]
gi|114616860|ref|XP_001143304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
4 [Pan troglodytes]
gi|410221964|gb|JAA08201.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410256658|gb|JAA16296.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301646|gb|JAA29423.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301648|gb|JAA29424.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410348810|gb|JAA41009.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
Length = 608
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP SIVI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AIREDR------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|296210174|ref|XP_002751861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Callithrix jacchus]
Length = 607
Score = 230 bits (586), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 126/288 (43%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL E+ILV
Sbjct: 140 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEVILVDDDSDFDDLKG 199
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I +I + E + M L + H + V P++
Sbjct: 200 ELDEYVQKYLPGKIKIIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 259
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+FRW VP E+
Sbjct: 260 AIREDQ------HTVVCPVIDIISADTLAY-SSSPIVRGGFNWGLHFRWDLVPLSELGGA 312
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 313 EGA-TTPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 371
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 372 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 418
>gi|397469939|ref|XP_003806595.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Pan paniscus]
gi|397469941|ref|XP_003806596.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Pan paniscus]
Length = 608
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP SIVI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 TIREDR------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|395519661|ref|XP_003763961.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Sarcophilus harrisii]
Length = 631
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 127/286 (44%), Positives = 171/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++I+FHNEAWSTLLRTV SV+ SP LLKEIILVDDASE + + D+
Sbjct: 182 LPTTSVIIIFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASED------EYLHDKLD 235
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
EY+ + K+ + + + ++ V + +T ++ +
Sbjct: 236 EYVKQFQIV------KIVRQKERQGLINARLLGASVATAETLTFLDSHCECFYGWLEPLL 289
Query: 116 --------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 290 ARIAENYTAVVSPDIASIDLNTFEFNKPSPYGYNHNRGNFDWSLSFGWESLPEHERQRRK 349
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+RTPT AGGLF+I K+YF +G+YDE M IWGGEN+EMSFRVWQCGG LEI+PC
Sbjct: 350 -DETYPIRTPTFAGGLFSISKEYFEYIGTYDEEMKIWGGENIEMSFRVWQCGGQLEIMPC 408
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 409 SVVGHVFRSKSPHSFPKG-TQVIARNQVRLAEVWMDEFKEIFYRRN 453
>gi|363734723|ref|XP_003641443.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 2
[Gallus gallus]
Length = 557
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 132/309 (42%), Positives = 176/309 (56%), Gaps = 27/309 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C Y T LP TS++I FHNEA S LLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 115 CTSVRYDTDLPATSLIITFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPEDCQ 174
Query: 60 IIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII---DVISDQTFEYITA- 114
++ I + I + +R + R + I+ D + E++
Sbjct: 175 LLTRIPKVKCLRNIRREGL--------IRSRVRGAEAATADILTFLDSHCEVNSEWLQPM 226
Query: 115 --------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D
Sbjct: 227 LQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DP 285
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ +RTP +AGG+F I+K +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS V
Sbjct: 286 TQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRV 345
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAH 282
GHVFR + PY FP G + + N R AEVWMDE++ +YY P GKS S++
Sbjct: 346 GHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKSYGSIADRVEQ 405
Query: 283 FRMLSYSSW 291
R L+ S+
Sbjct: 406 RRKLNCKSF 414
>gi|440897124|gb|ELR48890.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Bos
grunniens mutus]
Length = 499
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 134/285 (47%), Positives = 176/285 (61%), Gaps = 24/285 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LPTTS+VI F+NEAWSTLLRTV+SV+ SP TLL+E+ILVDD S+R +
Sbjct: 43 CKEKKYNYDELPTTSVVIAFYNEAWSTLLRTVYSVLETSPDTLLEEVILVDDYSDREHLK 102
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ ++ + ++ G +L + K V+ C + + +
Sbjct: 103 ERLATELAGLPKVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQR 162
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I K VVCP+IDVI TFEY+ A + GGF+W+L F W+ VP +E +R S
Sbjct: 163 IHEKESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTVPEKERIRM----RS 218
Query: 169 PL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
P+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI PCSH
Sbjct: 219 PIDVIRSPTMAGGLFAVSKKYFEYLGSYDIGMEVWGGENLEFSFRIWQCGGTLEIHPCSH 278
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 279 VGHVFPRQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 318
>gi|301776863|ref|XP_002923851.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Ailuropoda melanoleuca]
Length = 937
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 171/294 (58%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 483 CAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 537
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + +Y++ LR K RH V C
Sbjct: 538 -DYLKGNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 591
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 592 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 645
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + +R P MAGGLF+IDK+YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 646 IPPDVVAKNRIKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWM 705
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 706 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 759
>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Anolis carolinensis]
Length = 592
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 129/282 (45%), Positives = 175/282 (62%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVV 57
C +K Y + LP TS++I F+NEAWSTLLRTV SV+ SP LL+EIILVDD S E +
Sbjct: 134 CTEKRYDYYNLPKTSVIIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLK 193
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ + +++ + ++ G +L + K V+ C + + E
Sbjct: 194 EKLENYVANLRKVRLIRANKREGLVRARLLGASIAKGDVLTFLDCHCECHEEWLEPLLER 253
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I + VVCP+IDVI TFEY+ A + GGF+W+L F W+ VP RE +R ++
Sbjct: 254 IKEEPSAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPEREQKQRRS-KTD 312
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+R+PTMAGGLFA++K+YF LGSYD GM++WGGENLE SFR+WQCGG LEI PCSHVGH
Sbjct: 313 VIRSPTMAGGLFAVNKNYFSYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGH 372
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VF ++PY+ L N+ R AEVWMD +++ YY NP
Sbjct: 373 VFPKQAPYS-----RAKALANSVRAAEVWMDSYKELYYHRNP 409
>gi|354482531|ref|XP_003503451.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Cricetulus griseus]
Length = 929
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 119/294 (40%), Positives = 168/294 (57%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTSI++ F +E WS LLR+V S++NRSP L+KEI+LVDD S +
Sbjct: 475 CAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSILNRSPPHLIKEILLVDDFSTK----- 529
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ LR K RH V C
Sbjct: 530 -DYLKDNLDKYMSQFPKVR-----ILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 583
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y+ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 584 VGWLEPLLERV------YLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFGWRT 637
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + G + +R P M GLF+IDK YFYELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 638 IPPDVVAKSGIKETDIIRCPVMGCGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWM 697
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 698 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 751
>gi|426222263|ref|XP_004005316.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Ovis aries]
Length = 467
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 134/285 (47%), Positives = 176/285 (61%), Gaps = 24/285 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LPTTS+VI F+NEAWSTLLRTV+SV+ SP TLL+E+ILVDD S+R +
Sbjct: 11 CKEKKYNYDQLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDTLLEEVILVDDYSDREHLK 70
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ ++ + ++ G +L + K V+ C + + +
Sbjct: 71 ERLATELAGLPKVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQR 130
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I K VVCP+IDVI TFEY+ A + GGF+W+L F W+ VP +E +R S
Sbjct: 131 IHEKESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHMVPEKERIRM----RS 186
Query: 169 PL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
P+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI PCSH
Sbjct: 187 PIDVIRSPTMAGGLFAVSKKYFEYLGSYDIGMEVWGGENLEFSFRIWQCGGTLEIHPCSH 246
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 247 VGHVFPRQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 286
>gi|359320847|ref|XP_532008.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Canis
lupus familiaris]
Length = 578
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 132/291 (45%), Positives = 174/291 (59%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LPTTS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 123 CKEKKYDYENLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 182
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + K V+ P++
Sbjct: 183 ERLANELSGLPKVRLIRANKREGLVRARLLGASAAKGEVLTFLDCHCECHEGWLEPLLQR 242
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + GGF+W+L F W+ VP RE MR
Sbjct: 243 IHEEE------SAVVCPVIDVIDWNTFEYLGNPREPQIGGFDWRLVFTWHVVPERERMRM 296
Query: 163 GGDRSSPL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 297 ----RSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLE 352
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMD++++ YY NP
Sbjct: 353 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDDFKELYYHRNP 398
>gi|358412070|ref|XP_870404.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Bos taurus]
gi|359064998|ref|XP_002687097.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Bos
taurus]
Length = 606
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 176/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP SIVI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 139 CKDKSYPADLPVASIVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKG 198
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 199 ELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 258
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 259 AIREDR------RTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 308
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA++++YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 309 GGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 368
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 369 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 417
>gi|355748155|gb|EHH52652.1| hypothetical protein EGM_13122 [Macaca fascicularis]
Length = 608
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 125/292 (42%), Positives = 172/292 (58%), Gaps = 35/292 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
D+ ++ + I VI + E + M H V +D +
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAA----------HATGEVLVFLDSHCEV 250
Query: 108 TFEYITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
++ TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E
Sbjct: 251 NMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSE 309
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 310 LGEAEGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKL 368
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 369 FIIPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|355564907|gb|EHH21396.1| hypothetical protein EGK_04452 [Macaca mulatta]
Length = 940
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 118/294 (40%), Positives = 170/294 (57%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ L K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + ++ P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|449271781|gb|EMC82021.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Columba
livia]
Length = 314
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 174/288 (60%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVV 57
C++K Y + LP TS+VI F+NEAWSTLLRTV SV+ SP LL+EIILVDD S E +
Sbjct: 1 CREKKYDYYSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLK 60
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + ++ + ++ G +L + K ++ P++
Sbjct: 61 ETLENYVAGLRKVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLLAR 120
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I+++ VVCP+IDVI TFEY+ A + GGF+W+L F W+ P RE RR
Sbjct: 121 IAEEE------SAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHSTPEREQKRR 174
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
++ +R+PTMAGGLF++ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI P
Sbjct: 175 KS-KTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHP 233
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 234 CSHVGHVFPKQAPYS-----RSKALANSVRAAEVWMDEYKELYYHRNP 276
>gi|109068965|ref|XP_001105286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
6 [Macaca mulatta]
gi|355561195|gb|EHH17881.1| hypothetical protein EGK_14364 [Macaca mulatta]
Length = 608
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 125/292 (42%), Positives = 172/292 (58%), Gaps = 35/292 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
D+ ++ + I VI + E + M H V +D +
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAA----------HATGEVLVFLDSHCEV 250
Query: 108 TFEYITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
++ TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E
Sbjct: 251 NMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSE 309
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 310 LGEAEGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKL 368
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 369 FIIPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|113931290|ref|NP_001039091.1| polypeptide N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|89268082|emb|CAJ83416.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|111305589|gb|AAI21348.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|134026192|gb|AAI35810.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
Length = 562
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 124/295 (42%), Positives = 169/295 (57%), Gaps = 36/295 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C + LP+TS++I FHNEA STLLRT+ SV+ RSP L++EIILVDD S
Sbjct: 120 CTSVHHDNDLPSTSVIITFHNEARSTLLRTIKSVLIRSPGNLIQEIILVDDFSTDPDDCQ 179
Query: 56 --VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDV------ISDQ 107
P + + + E + +R + R + P++ ++++
Sbjct: 180 LLTKIPKVKCLRNNRREGL-------------IRSRVRGAELAAAPVLTFLDSHCEVNNE 226
Query: 108 TFEYITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
+ + + VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M
Sbjct: 227 WLQPLLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMS 286
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R D +S +RTP +AGG+F IDK +F +LG YD MDIWGGEN E+SFRVW CGG LEI+
Sbjct: 287 RT-DPTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIV 345
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS 273
PCS VGHVFR + PY FP G + + N R EVWMDE++ +YY P GKS
Sbjct: 346 PCSRVGHVFRKRHPYEFPDGNALTYIKNTKRTVEVWMDEYKQYYYQARPSAIGKS 400
>gi|426358553|ref|XP_004046573.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Gorilla gorilla gorilla]
gi|426358555|ref|XP_004046574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Gorilla gorilla gorilla]
Length = 608
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 127/288 (44%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AIREDR------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|380786043|gb|AFE64897.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|383411811|gb|AFH29119.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|384942402|gb|AFI34806.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
Length = 608
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 125/292 (42%), Positives = 172/292 (58%), Gaps = 35/292 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
D+ ++ + I VI + E + M H V +D +
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAA----------HATGEVLVFLDSHCEV 250
Query: 108 TFEYITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
++ TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E
Sbjct: 251 NMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSE 309
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 310 LGEAEGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKL 368
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 369 FIIPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|355750550|gb|EHH54877.1| hypothetical protein EGM_03977 [Macaca fascicularis]
Length = 940
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/294 (40%), Positives = 170/294 (57%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ L K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + ++ P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|426362479|ref|XP_004048391.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Gorilla gorilla gorilla]
Length = 633
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 177 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 236
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 237 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 296
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 297 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 350
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 351 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 406
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 407 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 452
>gi|402888383|ref|XP_003907542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Papio
anubis]
Length = 940
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 118/294 (40%), Positives = 170/294 (57%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 486 CTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ L K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + ++ P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|393912281|gb|EFO21646.2| glycosyl transferase [Loa loa]
Length = 470
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 127/273 (46%), Positives = 165/273 (60%), Gaps = 18/273 (6%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-----VCPI--ID 62
LP+TS++I +HNEA STLLRTV SV RSP LL EIILVDD S+ + + PI +
Sbjct: 144 LPSTSVIITYHNEARSTLLRTVVSVFLRSPPQLLHEIILVDDFSDDIAMGTDLLPIENVI 203
Query: 63 VISDQTFEYITASDMTWGGFNWK--LREKNRHKKTVVC---PIIDVISDQTFEYITAKTV 117
VI + E + S + L + H + V P++ + + +TV
Sbjct: 204 VIRNTKREGLIRSRVKGSALARASVLTFLDSHCECNVNWLEPLLARVKENH------RTV 257
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
V P+IDVI TF+Y+ AS GGF W L F+W + + R ++P+RTP +AG
Sbjct: 258 VAPVIDVIDRDTFKYVAASADLRGGFEWNLVFKWEYLTGKLRDERHARPTAPIRTPVIAG 317
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLF I KD+F +LG+YDE MDIWGGENLE+SFRVWQCGG LEIIPCS VGHVFR + PYT
Sbjct: 318 GLFMIQKDWFEKLGTYDEEMDIWGGENLELSFRVWQCGGSLEIIPCSRVGHVFRKQHPYT 377
Query: 238 FPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FPGG + N R AEVW+ +++ Y P
Sbjct: 378 FPGGSGNVFQKNTRRAAEVWLGDYKHLYLRKVP 410
>gi|348568069|ref|XP_003469821.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Cavia porcellus]
Length = 608
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 126/292 (43%), Positives = 175/292 (59%), Gaps = 35/292 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK++SYP LP S+VI F+NEA+S LLRTV SV++R+P LL EIILV
Sbjct: 141 CKEQSYPADLPVASVVICFYNEAFSALLRTVHSVLDRTPAYLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDV---I 104
D+ ++ + I VI + E + M H V +D +
Sbjct: 201 ELDEYVQKSLPTKIKVIRNAKREGLIRGRMIGAA----------HATGEVLVFLDSHCEV 250
Query: 105 SDQTFEYITA------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
++ + + A TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E
Sbjct: 251 NEMWLQPLLATIRGDPHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSE 309
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ G ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 310 LGGEDG-ATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISFRIWMCGGKL 368
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY P G + HN+ R+A VW+DE++D Y+++ P
Sbjct: 369 FIIPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKDQYFSLRP 419
>gi|156364641|ref|XP_001626455.1| predicted protein [Nematostella vectensis]
gi|156213331|gb|EDO34355.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 125/286 (43%), Positives = 170/286 (59%), Gaps = 29/286 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
C+ K+YP LP SIVI F+NEAW+ LLRT+ SV++R+P L EIILVDD S +
Sbjct: 45 CRGKTYPKNLPVASIVICFYNEAWTILLRTIHSVLDRTPHQFLHEIILVDDFSNMLELKS 104
Query: 57 -------VCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI 104
P I ++ + E I ++ G L + P++ I
Sbjct: 105 KLDRYLSTMPKIRIVRNNKREGLIRGRIIGAEAATGQVLVFLDSHCEVNINWLQPLLQHI 164
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
D K V CP+IDVIS TFEY ++S M GGFNW L+F W +PP ++ +
Sbjct: 165 HDDQ------KAVACPVIDVISSDTFEY-SSSPMVRGGFNWGLHFTWEPIPP-SLLVKPE 216
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D P+R+PTMAGGLFA+D++YF +LG YD GMDIWG ENLE+SFR+W CGG L+I+PCS
Sbjct: 217 DYVKPIRSPTMAGGLFAVDREYFTQLGKYDSGMDIWGAENLEISFRIWMCGGSLDILPCS 276
Query: 225 HVGHVFRDKSPYTFPGGVSK--IVLHNAARVAEVWMDEWRDFYYAM 268
VGH+FR PY G SK + N+ R+AEVW+D ++ ++Y +
Sbjct: 277 RVGHLFRRFRPY---GSDSKGDTMSRNSMRLAEVWLDGYKKYFYQI 319
>gi|312080021|ref|XP_003142423.1| glycosyl transferase [Loa loa]
Length = 447
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 127/273 (46%), Positives = 165/273 (60%), Gaps = 18/273 (6%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-----VCPI--ID 62
LP+TS++I +HNEA STLLRTV SV RSP LL EIILVDD S+ + + PI +
Sbjct: 144 LPSTSVIITYHNEARSTLLRTVVSVFLRSPPQLLHEIILVDDFSDDIAMGTDLLPIENVI 203
Query: 63 VISDQTFEYITASDMTWGGFNWK--LREKNRHKKTVVC---PIIDVISDQTFEYITAKTV 117
VI + E + S + L + H + V P++ + + +TV
Sbjct: 204 VIRNTKREGLIRSRVKGSALARASVLTFLDSHCECNVNWLEPLLARVKENH------RTV 257
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
V P+IDVI TF+Y+ AS GGF W L F+W + + R ++P+RTP +AG
Sbjct: 258 VAPVIDVIDRDTFKYVAASADLRGGFEWNLVFKWEYLTGKLRDERHARPTAPIRTPVIAG 317
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLF I KD+F +LG+YDE MDIWGGENLE+SFRVWQCGG LEIIPCS VGHVFR + PYT
Sbjct: 318 GLFMIQKDWFEKLGTYDEEMDIWGGENLELSFRVWQCGGSLEIIPCSRVGHVFRKQHPYT 377
Query: 238 FPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
FPGG + N R AEVW+ +++ Y P
Sbjct: 378 FPGGSGNVFQKNTRRAAEVWLGDYKHLYLRKVP 410
>gi|224047294|ref|XP_002195048.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Taeniopygia guttata]
Length = 552
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 126/283 (44%), Positives = 164/283 (57%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--- 57
C Y LP TS++I FHNEA STLLRT+ SV+NR+P L+ EIILVDD S+
Sbjct: 101 CTTLHYSQDLPPTSVIITFHNEARSTLLRTIRSVLNRTPVHLVHEIILVDDFSDDPDDCR 160
Query: 58 ----CPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + + + E I +D+ L K + P++ I +
Sbjct: 161 LLGQLPKVKCLRNGRREGLIRSRIRGADVAKASVLTFLDSHCEVNKDWLLPLLQRIKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P+ID+I+ TF Y+ AS GGF+W L+F+W ++ P + +R D +
Sbjct: 220 -----PTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQLSPEQKAKRL-DPTE 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPCS VGH
Sbjct: 274 PIKTPIIAGGLFVIDKAWFNHLGKYDSAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
VFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 334 VFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKQYYYAARPA 376
>gi|440895697|gb|ELR47827.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Bos grunniens
mutus]
Length = 606
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 128/290 (44%), Positives = 176/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 139 CKDKSYPADLPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKG 198
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 199 ELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 258
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 259 AIREDR------QTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 308
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA++++YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 309 GGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 368
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 369 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 417
>gi|90078941|dbj|BAE89150.1| unnamed protein product [Macaca fascicularis]
Length = 311
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 98/139 (70%), Positives = 119/139 (85%)
Query: 133 ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGS 192
+ SDMT+GGFNWKLNFRWY VP REM RR GDR+ P+RTPTMAGGLF+ID+DYF E+G+
Sbjct: 1 MAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRDYFQEIGT 60
Query: 193 YDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAAR 252
YD GMDIWGGENLE+SFR+WQCGG LEI+ CSHVGHVFR +PYTFPGG +I+ N R
Sbjct: 61 YDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTFPGGTGQIINKNNRR 120
Query: 253 VAEVWMDEWRDFYYAMNPG 271
+AEVWMDE+++F+Y ++PG
Sbjct: 121 LAEVWMDEFKNFFYIISPG 139
>gi|410953276|ref|XP_003983298.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Felis catus]
Length = 527
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 128/290 (44%), Positives = 175/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 60 CKDKSYPADLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKG 119
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
++ ++ + I VI + E + M L + H + V P++
Sbjct: 120 ELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 179
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 180 AIRED------PRTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 229
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+R+PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 230 GGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 289
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 290 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 338
>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 641
Score = 229 bits (583), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 127/283 (44%), Positives = 172/283 (60%), Gaps = 23/283 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + + LP TS++I FHNEAWS L+RTV SVI+RSP+ LLKEIILVDD S+ + +
Sbjct: 190 CKTQKFRRDLPQTSVIICFHNEAWSVLMRTVHSVIDRSPKNLLKEIILVDDFSD--MKHL 247
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVIS--------------D 106
+ + D T + + +R + K P++ + D
Sbjct: 248 KEQLEDYTRKLGIVKIVRASKREGLIRARLLGAKFATAPVLTYLDSHCECSTGWLEPLLD 307
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYI---TASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ E T VVCP+IDVISD TFEY + GGF+W L F W+ +P R+ R
Sbjct: 308 RIAEADT--NVVCPVIDVISDSTFEYPHRRAGYTVNVGGFDWNLQFSWHSLPQRDKDARK 365
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
S+ + +PTMAGGLF+I K YF +LG YD G DIWG ENLE+SF+VW CGG LEI+PC
Sbjct: 366 QSWSA-VPSPTMAGGLFSISKAYFEKLGLYDSGFDIWGAENLELSFKVWMCGGRLEIVPC 424
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
SHVGHVFR +SPY + GV+ ++ N+ R+A+VWMDE+ +Y+
Sbjct: 425 SHVGHVFRKRSPYKWLKGVN-VLKKNSVRLAKVWMDEYAQYYF 466
>gi|402865473|ref|XP_003896947.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Papio
anubis]
Length = 608
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 125/292 (42%), Positives = 172/292 (58%), Gaps = 35/292 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+VI F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
D+ ++ + I VI + E + M H V +D +
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAA----------HATGEVLVFLDSHCEV 250
Query: 108 TFEYITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
++ TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E
Sbjct: 251 NMMWLQPLLAAIREDRHTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSE 309
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L
Sbjct: 310 LGGAEGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKL 368
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IIPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 369 FIIPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|311275138|ref|XP_003134591.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Sus
scrofa]
Length = 608
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 174/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSY T LP S++I F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 141 CKDKSYRTDLPVASVIICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYIQKYLTGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y +AS + GGFNW L+FRW VP E+
Sbjct: 261 AIREDR------HTVVCPVIDIISADTLAY-SASPVVRGGFNWGLHFRWDLVPLSELEGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA++++YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EG-ATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|326920610|ref|XP_003206562.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Meleagris gallopavo]
Length = 509
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 131/309 (42%), Positives = 175/309 (56%), Gaps = 27/309 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C Y LP TS++I FHNEA S LLRTV SV+NR+P L++EIILVDD +S+ C
Sbjct: 67 CTSVRYDADLPATSLIITFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFSSDPEDCQ 126
Query: 60 IIDVISD-QTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII---DVISDQTFEYITA- 114
++ I + I + +R + R + I+ D + E++
Sbjct: 127 LLTKIPKVKCLRNIRREGL--------IRSRVRGAEVATADILTFLDSHCEVNSEWLQPM 178
Query: 115 --------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D
Sbjct: 179 LQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMSRT-DP 237
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ +RTP +AGG+F I+K +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS V
Sbjct: 238 TQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRV 297
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAAH 282
GHVFR + PY FP G + + N R AEVWMDE++ +YY P GKS S++
Sbjct: 298 GHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSAIGKSYGSIADRVEQ 357
Query: 283 FRMLSYSSW 291
R L+ S+
Sbjct: 358 RRKLNCKSF 366
>gi|395740752|ref|XP_003780818.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12, partial [Pongo
abelii]
Length = 538
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 82 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 141
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 142 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 201
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 202 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 255
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 256 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 311
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 312 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 357
>gi|62148924|dbj|BAD93346.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 581
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 125 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 184
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 185 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 244
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 245 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 298
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 299 ----QSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 354
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 355 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 400
>gi|391345232|ref|XP_003746894.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Metaseiulus occidentalis]
Length = 585
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 127/289 (43%), Positives = 174/289 (60%), Gaps = 32/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCP 59
CK++ Y LP S++I F+NEAWSTL+RTV SV++RSP LL+EIILVDD S+ + P
Sbjct: 119 CKQQKYSKDLPRASVIICFYNEAWSTLIRTVNSVLDRSPSALLQEIILVDDLSDIAELEP 178
Query: 60 I---------IDVISDQTFEYITASDMTWGG---------FNWKLREKNRHKKTVVCPII 101
+ + VI + E + + M + + R + ++ PI
Sbjct: 179 LAGFVQKHEKVRVIRTREREGLIRARMIGAHNSTGDVLVFLDSHVEVNERWLQPLLVPIQ 238
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
+QT TV CP+ID+I+ TFEY + S + GGFNW ++FRW +P + +
Sbjct: 239 ---QNQT-------TVTCPVIDIINADTFEY-SPSPLVKGGFNWGMHFRWDNLP-KGYFK 286
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
+R +PL +PTMAGGLFAI KD F LG YD GMD+WGGENLE+SFR+W CGG L+I+
Sbjct: 287 SEKERIAPLPSPTMAGGLFAIHKDEFRRLGEYDWGMDVWGGENLELSFRIWMCGGSLKIM 346
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCS VGHVFR + PY G + N+ RVA VWMD+++ +YY M P
Sbjct: 347 PCSRVGHVFRKRRPYGASNGEDTLA-KNSLRVANVWMDDYKKYYYRMRP 394
>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Nasonia vitripennis]
Length = 646
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 188/321 (58%), Gaps = 34/321 (10%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+ +
Sbjct: 159 CKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYSDMPHLK 218
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D + + I + G +L K V+ P++D
Sbjct: 219 RQLEDYMMNYPKVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGWLEPLLDR 278
Query: 104 IS-DQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMM 160
I+ +QT TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE
Sbjct: 279 IARNQT-------TVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK- 330
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
+R + + P+ +PTMAGGLFAID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI
Sbjct: 331 KRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEI 390
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVS 277
+PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + G VS
Sbjct: 391 VPCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVS 449
Query: 278 TCAAHFRMLSYSS--WFSGSI 296
A + L S W+ +I
Sbjct: 450 DRKALRKNLGCKSFKWYLDNI 470
>gi|194225536|ref|XP_001494993.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Equus
caballus]
Length = 460
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 131/288 (45%), Positives = 173/288 (60%), Gaps = 30/288 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CKKK+Y LPTTS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 4 CKKKTYDYERLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 63
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ +S + ++ G +L + K V+ P++
Sbjct: 64 ERLASELSRLPKVRLIRANKREGLVRARLLGASAAKGDVLTFLDCHCECHEGWLEPLLQR 123
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 124 IHEE------ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPERERLRM 177
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE P
Sbjct: 178 RSP-TDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHP 236
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY+ L N+ R AEVWMD +++ YY NP
Sbjct: 237 CSHVGHVFPKQAPYS-----RSKALANSVRAAEVWMDGYKELYYHRNP 279
>gi|410953274|ref|XP_003983297.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Felis catus]
Length = 608
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 128/290 (44%), Positives = 175/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 141 CKDKSYPADLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
++ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AIRED------PRTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 310
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+R+PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGPEGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis mellifera]
Length = 635
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 186/313 (59%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y T LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+
Sbjct: 148 CKEPGRYLTDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDFSDMPHLQ 207
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ P + +I Q E + + + L + H + + P++D
Sbjct: 208 RQLEDYMMNYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 267
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 268 IAR------NPTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-K 320
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 321 RHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 380
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + GK VS
Sbjct: 381 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSE 439
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 440 RKALRKRLGCKSF 452
>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Nasonia vitripennis]
Length = 610
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 188/321 (58%), Gaps = 34/321 (10%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+ +
Sbjct: 160 CKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYSDMPHLK 219
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D + + I + G +L K V+ P++D
Sbjct: 220 RQLEDYMMNYPKVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGWLEPLLDR 279
Query: 104 IS-DQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMM 160
I+ +QT TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE
Sbjct: 280 IARNQT-------TVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK- 331
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
+R + + P+ +PTMAGGLFAID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI
Sbjct: 332 KRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEI 391
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVS 277
+PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + G VS
Sbjct: 392 VPCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVS 450
Query: 278 TCAAHFRMLSYSS--WFSGSI 296
A + L S W+ +I
Sbjct: 451 DRKALRKNLGCKSFKWYLDNI 471
>gi|22122074|dbj|BAC07181.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Homo sapiens]
Length = 581
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 125 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 184
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 185 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 244
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 245 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 298
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 299 ----QSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 354
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 355 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 400
>gi|119579301|gb|EAW58897.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12) [Homo
sapiens]
Length = 517
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 61 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 120
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 121 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 180
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 181 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 234
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 235 ----QSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 290
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 291 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 336
>gi|391348383|ref|XP_003748427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 648
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/292 (44%), Positives = 171/292 (58%), Gaps = 30/292 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK +YP LPT S+VI+F +E +STLLRT+ S INRSP LL+EIILVDD S+
Sbjct: 181 CKAVTYPMAELPTASVVIIFTDEIFSTLLRTIVSTINRSPNHLLREIILVDDFSQS---- 236
Query: 60 IIDVISDQTFEYITA---SDMTW-------GGFNWKLREKNRHKKTVVCPIIDVISDQTF 109
+ + D+ YIT +D+ G R K V +D + T
Sbjct: 237 --EDLKDRLQRYITHHFRADVVRLIRLPERSGLIRARLAGARAAKGDVLIFLDSHCETTP 294
Query: 110 EYITA---------KTVVCPIIDVISDQTFEYITASDMTW--GGFNWKLNFRWYRVPPRE 158
++ + VVCP+ID+I D+T +Y+ A + GGFNWK F W+ +P
Sbjct: 295 GWLEPLLEPIRRDRRAVVCPVIDIIDDKTLQYVAAEGDRFQIGGFNWKGEFSWHNIPAAW 354
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
R + P+R+PTMAGGLFAI+++YF+E GSYDE MD WGGENLEMSFR+WQCGG +
Sbjct: 355 RKNRTSI-AEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHI 413
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
I PCSHVGH+FRD PY FP G + N R EVWMDE++ ++Y P
Sbjct: 414 VIAPCSHVGHIFRDYHPYKFPKGKDTNAI-NTKRAVEVWMDEFKKYFYQTRP 464
>gi|112807221|ref|NP_078918.3| polypeptide N-acetylgalactosaminyltransferase 12 [Homo sapiens]
gi|84028209|sp|Q8IXK2.3|GLT12_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
AltName: Full=Polypeptide GalNAc transferase 12;
Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 12;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 12
gi|25815116|emb|CAC80100.2| UDP-GalNAc-transferase 12 [Homo sapiens]
gi|151555601|gb|AAI48740.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
[synthetic construct]
gi|151556464|gb|AAI48410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
[synthetic construct]
gi|261858036|dbj|BAI45540.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [synthetic
construct]
Length = 581
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 125 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 184
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 185 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 244
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 245 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 298
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 299 ----QSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 354
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 355 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 400
>gi|397499979|ref|XP_003820707.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
paniscus]
Length = 568
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 129/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 112 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 171
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 172 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 231
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + + GGF+W+L F W+ VP RE +R
Sbjct: 232 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 285
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 286 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 341
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 342 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 387
>gi|395519600|ref|XP_003763931.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Sarcophilus harrisii]
Length = 945
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 119/288 (41%), Positives = 170/288 (59%), Gaps = 32/288 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C + LPTTSI++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S +
Sbjct: 491 CADQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK----- 545
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---KKTVVCPII---DVIS--DQTFE-- 110
+ DQ +Y++ L K RH + + I DV++ D E
Sbjct: 546 -GYLKDQLDKYMSQFPKVR-----VLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECN 599
Query: 111 -----------YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
Y+ K V CP+I++I+D+ Y+T + G F W +NF W ++PP +
Sbjct: 600 VGWLEPLLERVYLNKKKVACPVIEIINDKDLSYMTVDNFQRGIFVWPMNFSWKKIPPEII 659
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
+ + +R P MAGGLF+IDK YF+ELG+YD G+++WGGEN+E+SF+VW CGG +E
Sbjct: 660 KQNKIKETDVIRCPVMAGGLFSIDKKYFFELGTYDPGLEVWGGENMELSFKVWMCGGEIE 719
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
IIPCS VGH+FR +PY+FP K + N RVAEVW+DE+++ +Y
Sbjct: 720 IIPCSRVGHIFRKDNPYSFPENRIKTIERNLIRVAEVWLDEYKELFYG 767
>gi|380021258|ref|XP_003694487.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis florea]
Length = 537
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/313 (42%), Positives = 186/313 (59%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y T LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+
Sbjct: 50 CKEPGRYLTDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDFSDMPHLQ 109
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ P + +I Q E + + + L + H + + P++D
Sbjct: 110 RQLEDYMMNYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 169
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 170 IAR------NPTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-K 222
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 223 RHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 282
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + GK VS
Sbjct: 283 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSE 341
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 342 RKALRKRLGCKSF 354
>gi|109099754|ref|XP_001087663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Macaca mulatta]
Length = 940
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 117/294 (39%), Positives = 170/294 (57%), Gaps = 44/294 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTTS+++ F +E WSTLLR+V SV+NRSP L++EI+LVDD S +
Sbjct: 486 CTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIEEILLVDDFSTK----- 540
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------------------KKTVVC- 98
D + D +Y++ L K RH V C
Sbjct: 541 -DYLKDNLDKYMSQFPKVR-----ILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECN 594
Query: 99 -----PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
P+++ + Y++ K V CP+I+VI+D+ Y+T + G F W +NF W
Sbjct: 595 VGWLEPLLERV------YLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRT 648
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+PP + + + ++ P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW
Sbjct: 649 IPPDVIAKNRIKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWM 708
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CGG +EIIPCS VGH+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 709 CGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
>gi|358332241|dbj|GAA27774.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 584
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 122/286 (42%), Positives = 169/286 (59%), Gaps = 29/286 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C+ + +P P T++VI FHNE WSTLLR+V SV++ P LLKEI+LVDD S E +
Sbjct: 125 CRTQKFPANQPATAVVICFHNECWSTLLRSVHSVLDTVPENLLKEIVLVDDFSTYEYLKS 184
Query: 59 PI---------IDVISDQTFEYITASDMTWGGFNWKLRE----KNRH---KKTVVCPIID 102
P+ + VI E + + M G N E + H K + P++D
Sbjct: 185 PLDLYMKQLKKVKVIHTDKREGLIRARMI--GMNASTAEILTFLDSHIECNKGWLEPLLD 242
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMM 160
I TVV P+ID I+D TF Y + S + GGF+W + + W+ VPP+ +
Sbjct: 243 CIQK------NQSTVVSPVIDRINDDTFAYEPLLLSQIQVGGFDWDMTYNWH-VPPKRDL 295
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
R G +P+R PT+AGGLF++ +D+F LG YD MD+WGGENLE+SF+ W CGG L++
Sbjct: 296 ERPGAPFTPIRAPTIAGGLFSVHRDFFAYLGYYDPQMDVWGGENLELSFKTWMCGGTLQV 355
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
PCSHVGHVFR KSPY+ + HN R+AEVWMDE++ ++Y
Sbjct: 356 HPCSHVGHVFRTKSPYSAKNNTGDTLRHNLVRLAEVWMDEYKGYFY 401
>gi|332243650|ref|XP_003270991.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Nomascus leucogenys]
Length = 527
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 125/288 (43%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
C++K YP LP+ S+VI F+NEA+S LLRT SVI+R+P LL EIILV
Sbjct: 60 CQEKFYPPDLPSASVVICFYNEAFSALLRTAHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 180 AIREDQH------TVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGA 232
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 233 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 291
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 292 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 338
>gi|194210168|ref|XP_001915003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Equus
caballus]
Length = 609
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 128/290 (44%), Positives = 175/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYPT LP S+VI F+NEA S LLRTV SV++R+P LL E+ILV
Sbjct: 142 CKDKSYPTDLPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEVILVDDDSDFDDLKG 201
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 202 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 261
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
VI + + VVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 262 VIQEDR------RMVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 311
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA+ + YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 312 GGPEGATAPIKSPTMAGGLFAMSRRYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 371
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 372 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAYVWLDEYKEQYFSLRP 420
>gi|348569970|ref|XP_003470770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Cavia porcellus]
Length = 579
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 174/291 (59%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVV 57
CK+K Y LPTTS++I F+NEAWSTLLRTV+SV+ SP L++E+ILVDD S E +
Sbjct: 123 CKEKKYDYENLPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILVEEVILVDDYSDKEHLK 182
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + + G +L + + V+ P++
Sbjct: 183 ERLANELSGLPKVRLIRASKREGLVRARLLGASVARGNVLTFLDCHCECHEGWLEPLLQR 242
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ A + GGF+W+L F W+ VP R+ +R
Sbjct: 243 IHEEE------SAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPERDRLRM 296
Query: 163 GGDRSSPL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 297 ----KSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLE 352
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
I PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 353 IHPCSHVGHVFPKQAPYS-----RSKALANSVRAAEVWMDEFKELYYHRNP 398
>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 624
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 126/274 (45%), Positives = 164/274 (59%), Gaps = 16/274 (5%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LPTTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + P+ +
Sbjct: 178 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEPLEQYVQQL 237
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAK--TVVC 119
+ + G +L + + V+ C + IT VV
Sbjct: 238 RIVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARITEDETAVVS 297
Query: 120 PIIDVISDQTFEYITASDM----TWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTM 175
P I I TFE+ + G F+W L F W +P E RR D + P+++PT
Sbjct: 298 PDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQRRK-DETDPIKSPTF 356
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHVFR KSP
Sbjct: 357 AGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKSP 416
Query: 236 YTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
+TFP G + ++ N R+AEVWMDE+++ +Y N
Sbjct: 417 HTFPKG-TNVIARNQVRLAEVWMDEYKEIFYRRN 449
>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 599
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 126/274 (45%), Positives = 164/274 (59%), Gaps = 16/274 (5%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LPTTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + P+ +
Sbjct: 168 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEPLEQYVQQL 227
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAK--TVVC 119
+ + G +L + + V+ C + IT VV
Sbjct: 228 RIVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARITEDETAVVS 287
Query: 120 PIIDVISDQTFEYITASDM----TWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTM 175
P I I TFE+ + G F+W L F W +P E RR D + P+++PT
Sbjct: 288 PDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQRRK-DETDPIKSPTF 346
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHVFR KSP
Sbjct: 347 AGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKSP 406
Query: 236 YTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
+TFP G + ++ N R+AEVWMDE+++ +Y N
Sbjct: 407 HTFPKG-TNVIARNQVRLAEVWMDEYKEIFYRRN 439
>gi|402890489|ref|XP_003908519.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Papio anubis]
Length = 551
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 122/275 (44%), Positives = 166/275 (60%), Gaps = 19/275 (6%)
Query: 8 TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-------ERVVCPI 60
T L T+ + FHNEA STLLRT+ SV+NR+P L++EIILVDD S + + P
Sbjct: 107 TDLSRTARLXXFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIKLPK 166
Query: 61 IDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITAK 115
+ + + + I +D+ G L + + P++ + + +Y
Sbjct: 167 VKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKE---DYTR-- 221
Query: 116 TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTM 175
VVCP+ID+I+ TF YI ++ GGF+W L+F+W ++ P + RR D + P+RTP +
Sbjct: 222 -VVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRL-DPTEPIRTPII 279
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGHVFR K P
Sbjct: 280 AGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHP 339
Query: 236 YTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
Y FP G + + N R AEVWMDE++ +YYA P
Sbjct: 340 YVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARP 374
>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
Length = 682
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 123/284 (43%), Positives = 173/284 (60%), Gaps = 26/284 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVVC 58
C+K SY LP TS++I FHNEAWS LLRTV SV+ R+P LLKEIILVDD S+ +
Sbjct: 221 CQKASYRNDLPDTSVIICFHNEAWSVLLRTVHSVLERTPDHLLKEIILVDDFSDFDHLKK 280
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ D +S I + G +L+ + V+ P++D I
Sbjct: 281 PLEDYMSQFGKVRIIRLENRMGLIRARLKGASVATGKVLTYLDSHCECMNRWLEPLLDRI 340
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPREMMRR 162
+ + VV P+ID I+ +T +Y +S ++ GGFNW L F W+ +P R+
Sbjct: 341 AQ------NSTNVVTPVIDTINLETLQYHLSSHRRLSVGGFNWGLVFNWHILPDRDYQAM 394
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
R P+ +PTMAGGLF+ID+ YF +LG YD G DIWG ENLE+SF++W CGG LE++P
Sbjct: 395 KS-RIDPIPSPTMAGGLFSIDRGYFEKLGGYDPGFDIWGSENLEISFKIWMCGGRLEVVP 453
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
CSHVGH+FR KSPY + G++ ++ N R+AEVW+D++++ YY
Sbjct: 454 CSHVGHIFRKKSPYKWRKGIN-VLQRNNIRLAEVWLDDYKEIYY 496
>gi|355567593|gb|EHH23934.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
mulatta]
Length = 457
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 1 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 60
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 61 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 120
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 121 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 174
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 175 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 230
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 231 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 276
>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 512
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 128/297 (43%), Positives = 166/297 (55%), Gaps = 49/297 (16%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
C + Y LP+TS++I FHNEA STLLRT+ SV+NRSP L++EIILVDD SE V
Sbjct: 77 CYNQVYHPTLPSTSVIITFHNEARSTLLRTIVSVLNRSPPHLIEEIILVDDFSEDVNTGL 136
Query: 57 ---VCPIIDVISDQTFEYITAS--------------------DMTWGGFNWKLREKNRHK 93
P I +I ++ E + S + G L ++ +
Sbjct: 137 LLTQMPKIKLIRNERREGLVRSRIFGADAAKGEILTFLDSHCECNIGWLEPLLHRVSQDR 196
Query: 94 KTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
VV PIIDVIS +D TF+YI AS GGF+W L+F+W
Sbjct: 197 TIVVSPIIDVIS----------------MD-----TFDYIGASSELRGGFDWSLHFKWDG 235
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
P + +R P++TP +AGGLF+I++ F E G YD+ MDIWGGEN E+SFR W
Sbjct: 236 FTPAQRAKRKSP-IEPIKTPMIAGGLFSINRQRFIETGKYDDQMDIWGGENFEISFRTWM 294
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CGG LEIIPCS VGHVFR + PY FPGG + + N R AEVWMD ++D+YY+ P
Sbjct: 295 CGGSLEIIPCSRVGHVFRKRHPYVFPGGNAMTYMKNTKRAAEVWMDNYKDYYYSARP 351
>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oreochromis niloticus]
Length = 575
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 176/307 (57%), Gaps = 36/307 (11%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC- 58
CK+ Y LPTTS+VI F+NEAWSTLLRTV SV+ SP LLKE++LVDD S++
Sbjct: 117 CKELKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDKAHLK 176
Query: 59 -PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ IS + + G +L + V+ P++
Sbjct: 177 EPLDKYISGLNKVRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPVLHR 236
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ K VVCP+IDVI TF+Y+ A + GGF+W+L F W+ +P E RR
Sbjct: 237 IKEE------PKAVVCPVIDVIDWNTFQYLGHAGEPQIGGFDWRLVFTWHSIPDYEQKRR 290
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ KD+F+ LG+YD GM++WGGENLE SFR+WQCGG LE
Sbjct: 291 ----RSPVDVIRSPTMAGGLFAVRKDFFHYLGTYDTGMEVWGGENLEFSFRIWQCGGSLE 346
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTC 279
+ PCSHVGHVF K+PY+ L N+ R AEVW+DE+++ YY NP
Sbjct: 347 VHPCSHVGHVFPKKAPYS-----RSKALANSVRAAEVWLDEFKEIYYHRNPHARLEAFGD 401
Query: 280 AAHFRML 286
RML
Sbjct: 402 VTERRML 408
>gi|332243648|ref|XP_003270990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Nomascus leucogenys]
Length = 608
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 125/288 (43%), Positives = 175/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
C++K YP LP+ S+VI F+NEA+S LLRT SVI+R+P LL EIILV
Sbjct: 141 CQEKFYPPDLPSASVVICFYNEAFSALLRTAHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AIREDQ------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
Length = 507
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 124/284 (43%), Positives = 164/284 (57%), Gaps = 25/284 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP- 59
C + SYP +P S+VI+FHNEAWS LLRTV SV+NRSP L E+IL+DD S+R
Sbjct: 46 CSRVSYPKVMPNASVVIIFHNEAWSPLLRTVHSVVNRSPPEYLHEVILLDDFSDRAGLGE 105
Query: 60 -----IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD------QT 108
I D D + + A + G +R + K ++ + Q
Sbjct: 106 KLDGYIKDTWPDGIVKVVRAPERQ--GL---IRARVLGAKAATGEVLVFLDSHCECNVQW 160
Query: 109 FEYITAK------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
E + A+ ++CP+IDVI + Y + GGF W L+F W +P RE RR
Sbjct: 161 LEPLVARIKESRSALLCPMIDVIDAKAMSYNGIGAGSVGGFWWSLHFSWRPLPQRERKRR 220
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ +R+PTMAGGLFA D+ YF+E+G YD GMD+WGGENLE+SFRVW CGG LE +P
Sbjct: 221 KSSVET-IRSPTMAGGLFAADRKYFFEIGGYDPGMDVWGGENLEISFRVWMCGGTLEFVP 279
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
CS VGH+FR PYTFPG L N+ R+AEVWMD ++ +Y
Sbjct: 280 CSRVGHIFRSSHPYTFPGNKDTHGL-NSKRLAEVWMDGYKRLFY 322
>gi|292623437|ref|XP_001339749.3| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Danio rerio]
Length = 567
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 130/310 (41%), Positives = 179/310 (57%), Gaps = 29/310 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
C ++ LP TSI+I FHNEA STLLRT+ SV+ RSP L+ EIILVDD +S+ C
Sbjct: 129 CASMTFDPDLPPTSIIITFHNEARSTLLRTIKSVLMRSPPHLILEIILVDDFSSDPEDCR 188
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVIS 105
++ I ++ G ++R + +++ P+I +
Sbjct: 189 LLSQIPKVR---CLRNERREGLIRSRVRGASAASASILTFLDSHCEVNTDWLQPMIQRVK 245
Query: 106 DQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGD 165
+ VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D
Sbjct: 246 EDH------SRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPIEQKMARN-D 298
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
+ P+RTP +AGG+F I+K +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS
Sbjct: 299 PTQPIRTPVIAGGIFVIEKGWFNHLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSR 358
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP---GKS-ASVSTCAA 281
VGHVFR + PY FP G + + N R AEVWMD+++ +YYA P GK+ S++ A
Sbjct: 359 VGHVFRKRHPYDFPEGNALTYIKNTRRAAEVWMDDYKQYYYAARPSAQGKAFGSIADRLA 418
Query: 282 HFRMLSYSSW 291
R L+ +S+
Sbjct: 419 LKRKLNCNSF 428
>gi|302563901|ref|NP_001181506.1| polypeptide N-acetylgalactosaminyltransferase 12 [Macaca mulatta]
Length = 581
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 125 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 184
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 185 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 244
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 245 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 298
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 299 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 354
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 355 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 400
>gi|402896867|ref|XP_003911504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Papio
anubis]
Length = 581
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 125 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 184
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 185 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 244
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 245 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 298
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 299 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 354
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 355 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 400
>gi|431895736|gb|ELK05155.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Pteropus alecto]
Length = 608
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 126/290 (43%), Positives = 175/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK K+YP LP S+VI F+NEA S LLRTV SV++R+P LL E+ILV
Sbjct: 141 CKDKTYPADLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEVILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D ++ + I VI ++ E + M L + H + V P++
Sbjct: 201 ELDAFVQKYLPGKIKVIRNRKREGLIRGRMIGASHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP +
Sbjct: 261 AIQEDR------RTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVP---LPEP 310
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA+++DYF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGPEGATAPIKSPTMAGGLFAMNRDYFSELGQYDRGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|297682043|ref|XP_002818744.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11,
partial [Pongo abelii]
Length = 587
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 126/287 (43%), Positives = 176/287 (61%), Gaps = 29/287 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK+K YP LP S+V+ F+NEA+S LLRTV SVI+R+P LL EIILV
Sbjct: 141 CKEKFYPPDLPAASVVVCFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+ R
Sbjct: 261 AIREDR------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--R 311
Query: 163 GGD-RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
G + ++P+++PTMAGGLFA+++ YF+ELG YD GMDIWGGENLE+SFR+W CGG L II
Sbjct: 312 GAEGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFII 371
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
PCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++
Sbjct: 372 PCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSL 417
>gi|441593636|ref|XP_003260599.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 isoform
2 [Nomascus leucogenys]
Length = 495
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 129/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 39 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 98
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 99 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 158
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + + GGF+W+L F W+ VP RE +R
Sbjct: 159 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPERERIRM 212
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 213 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 268
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 269 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 314
>gi|114625882|ref|XP_001157326.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
troglodytes]
Length = 483
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 27 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 86
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 87 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 146
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 147 IHEE------ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 200
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 201 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 256
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 257 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 302
>gi|301759363|ref|XP_002915525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Ailuropoda melanoleuca]
gi|281339844|gb|EFB15428.1| hypothetical protein PANDA_003531 [Ailuropoda melanoleuca]
Length = 608
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 174/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 141 CKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
++ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AIQQDQ------RTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 310
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|426228257|ref|XP_004008230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Ovis
aries]
Length = 606
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 127/290 (43%), Positives = 175/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 139 CKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKG 198
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 199 ELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 258
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + + VVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 259 AIREDR------RAVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 308
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA++++YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 309 GGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 368
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 369 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 417
>gi|291290949|ref|NP_001167507.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Xenopus
laevis]
gi|83405263|gb|AAI10707.1| Unknown (protein for MGC:130697) [Xenopus laevis]
Length = 622
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 125/286 (43%), Positives = 170/286 (59%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV+ SP LLKEIILVDDASE + + ++
Sbjct: 174 LPTTSVIIVFHNEAWSTLLRTVYSVLYTSPAILLKEIILVDDASED------EYLKEKLD 227
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ A + K+ + K + ++ + + + ++ A
Sbjct: 228 DYVKALQIV------KIARQKERKGLITARLLGASIATGEVLTFLDAHCECFHGWLEPLL 281
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I +FE+ + G F+W L F W +P E +RR
Sbjct: 282 SRIAEDYTAVVSPDITTIDLNSFEFAKPVQYGKTHSRGNFDWSLTFGWEAIPEAEKLRRK 341
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
+ + P++TPT AGGLF+I K YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 342 NE-TYPIKTPTFAGGLFSISKAYFEHIGSYDEDMEIWGGENVEMSFRVWQCGGQLEIIPC 400
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP++FP G ++++ N R+AEVWMD+++ YY N
Sbjct: 401 SVVGHVFRTKSPHSFPKG-TQVISRNQVRLAEVWMDDYKIIYYRRN 445
>gi|355753170|gb|EHH57216.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
fascicularis]
Length = 542
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 130/291 (44%), Positives = 175/291 (60%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 94 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 153
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 154 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQR 213
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ S + GGF+W+L F W+ VP RE +R
Sbjct: 214 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRM 267
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG+LE
Sbjct: 268 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLE 323
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 324 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 369
>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus impatiens]
Length = 637
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 185/313 (59%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+
Sbjct: 150 CKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQ 209
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ P + +I Q E + + + L + H + + P++D
Sbjct: 210 RQLEDYMMNYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 269
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 270 IARD------PTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-K 322
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 323 RHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 382
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + GK VS
Sbjct: 383 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSE 441
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 442 RKALRKKLGCKSF 454
>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 3 [Bombus terrestris]
Length = 637
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 185/313 (59%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+
Sbjct: 150 CKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQ 209
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ P + +I Q E + + + L + H + + P++D
Sbjct: 210 RQLEDYMMNYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 269
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 270 IARD------PTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-K 322
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 323 RHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 382
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + GK VS
Sbjct: 383 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSE 441
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 442 RKALRKKLGCKSF 454
>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus impatiens]
Length = 602
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 185/313 (59%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+
Sbjct: 150 CKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQ 209
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ P + +I Q E + + + L + H + + P++D
Sbjct: 210 RQLEDYMMNYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 269
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 270 IARD------PTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-K 322
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 323 RHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 382
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + GK VS
Sbjct: 383 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSE 441
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 442 RKALRKKLGCKSF 454
>gi|354487360|ref|XP_003505841.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Cricetulus griseus]
Length = 633
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 131/285 (45%), Positives = 171/285 (60%), Gaps = 38/285 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS+VIVF+NEAWSTLLRTV SV+ SP LLKEIILVDDAS + D + ++
Sbjct: 184 LPTTSVVIVFYNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDDYLHEKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRH------------KKTVVCPIIDVISDQTF---EYITA 114
EYI + +R+K R +D + + E + A
Sbjct: 238 EYIKQFSIVK-----IVRQKERKGLITARLLGAAAATAETLTFLDAHCECFYGWLEPLLA 292
Query: 115 K------TVVCPIIDVISDQTFEYITAS----DMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ VV P I I TFE+ S + G F+W L+F W +P E RR
Sbjct: 293 RIAENYTAVVSPDIASIDLNTFEFNKPSPYGNNHNRGNFDWSLSFGWESLPDHEKQRRK- 351
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + P++TPT AGGLF+I ++YF +GSYDE M+IWGGEN+EMSFRVWQCGG LEI+PCS
Sbjct: 352 DETYPIKTPTFAGGLFSISREYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCS 411
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 412 VVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Canis lupus familiaris]
Length = 622
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 126/286 (44%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS+VIVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS + + +Q
Sbjct: 176 LPTTSVVIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDAS------TDEYLKEQLE 229
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ + ++ + K + ++ V Q ++ A
Sbjct: 230 QYVKKLQVV------RVVRQEERKGLITARLLGASVAQAQVLTFLDAHCECFHGWLEPLL 283
Query: 116 --------TVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + G F+W L F W +P E RR
Sbjct: 284 ARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAIPAHEKQRRK 343
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 344 -DETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP GVS ++ N R+AEVWMD +++ +Y N
Sbjct: 403 SVVGHVFRTKSPHTFPKGVS-VIARNQVRLAEVWMDNYKEIFYRRN 447
>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus terrestris]
gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus terrestris]
Length = 602
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 185/313 (59%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+
Sbjct: 150 CKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQ 209
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ P + +I Q E + + + L + H + + P++D
Sbjct: 210 RQLEDYMMNYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 269
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 270 IARD------PTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-K 322
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 323 RHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 382
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + GK VS
Sbjct: 383 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGDVSE 441
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 442 RKALRKKLGCKSF 454
>gi|268569766|ref|XP_002648333.1| C. briggsae CBR-GLY-4 protein [Caenorhabditis briggsae]
Length = 523
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 123/283 (43%), Positives = 162/283 (57%), Gaps = 19/283 (6%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ Y + + T+++I +HNEA S+LLRTV+SV N SP LL EI+LVDD SE V
Sbjct: 68 CRDVDYSKYEMRPTTVIITYHNEARSSLLRTVFSVFNMSPEALLMEIVLVDDNSEDVE-- 125
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYIT 113
I + + + +R + + + PI+ + E +
Sbjct: 126 ----IGKELAQIEKIKVLRNNQREGLIRSRVKGAQVAQAPILTFLDSHIECNQKWLEPLL 181
Query: 114 A------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
+ K VV PIIDVI+ F Y+ AS GGF+W L FRW + R +
Sbjct: 182 SRIAENPKAVVAPIIDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEELRKDRHAHPT 241
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
+P+++PTMAGGLFAI K++F ELG+YD M++WGGENLEMSFRVWQCGG LEI+PCS VG
Sbjct: 242 APIKSPTMAGGLFAISKEWFEELGTYDLDMEVWGGENLEMSFRVWQCGGSLEILPCSRVG 301
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR K YTFPGG + N R AEVWMDE++ Y P
Sbjct: 302 HVFRKKHQYTFPGGSGNVFQKNTRRAAEVWMDEYKAIYLKNVP 344
>gi|345328051|ref|XP_003431229.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Ornithorhynchus anatinus]
Length = 863
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 120/288 (41%), Positives = 168/288 (58%), Gaps = 32/288 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTT+I++ F +E WSTLLR++ SV+NRSP L++EIILVDD S +
Sbjct: 495 CAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTK----- 549
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---KKTVVCPII---DVIS--DQTFE-- 110
+ + D +Y+ L K RH + + I DV++ D E
Sbjct: 550 -EHLKDNLDKYMAQFPKVR-----VLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECN 603
Query: 111 -----------YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
+ K V CP+I+VISD+ Y T + G F W +NF W +PP +
Sbjct: 604 VGWLEPLLERVRLHRKKVACPVIEVISDKDLSYQTVDNFQRGIFTWPMNFGWKSIPPEVI 663
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
+ + +R P MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF+VW CGG +E
Sbjct: 664 EKNKMKETDIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIE 723
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
I+PCS VGH+FR+ +PY+FP K V N RVAEVW+DE++D +Y
Sbjct: 724 IVPCSRVGHIFRNDNPYSFPKDRVKTVERNLVRVAEVWLDEYKDLFYG 771
>gi|312082212|ref|XP_003143351.1| glycosyl transferase [Loa loa]
Length = 580
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 123/267 (46%), Positives = 166/267 (62%), Gaps = 15/267 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + Y LP TS++I FHNEAWS LLRTV SV+ R+P LL EIILVDD S+
Sbjct: 147 CKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFSDMAHLKA 206
Query: 61 IDVISDQTFEY--ITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYI 112
I + F I + G +++ K +V+ C ++ + + I
Sbjct: 207 SLEIYMRQFPKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWMEPLLDRI 266
Query: 113 --TAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KTVVCP+IDVI D TFEY A GGF+W L F W+ +P ++ R+G
Sbjct: 267 KKNPKTVVCPVIDVIDDNTFEYHYSKAYFTNVGGFDWSLQFNWHAIPEKD--RKGRRDID 324
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+++PTMAGGLF+ID+ +F +LGSYD G+DIWGGENLE+SF+ W CGGILEI+PCSHVGH
Sbjct: 325 PVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEIVPCSHVGH 384
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAE 255
+FR +SPY + GV+ ++ N+ R+AE
Sbjct: 385 IFRKRSPYKWLSGVN-VLKRNSVRLAE 410
>gi|260794623|ref|XP_002592308.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
gi|229277524|gb|EEN48319.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
Length = 374
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 123/292 (42%), Positives = 171/292 (58%), Gaps = 27/292 (9%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK K Y + LP S++I FHNEAWSTL+RTV SV+ +P LL E+I+VDD S+ +
Sbjct: 52 CKSKEYDVSRLPAVSVIICFHNEAWSTLMRTVHSVLRTAPSELLTEVIMVDDDSQYDHLK 111
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D ++ + + G +L + + V+ P++D
Sbjct: 112 AQLTDYVAGLPKVKLIRTHQREGLIRARLLGASHARADVLVFLDSHCECNIGWLEPLLDR 171
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I VV P+IDVI +TFEY + + GF+W+L FRW ++P RRG
Sbjct: 172 IVQ------NRSHVVTPVIDVIDFKTFEYRHLAIIQVRGFDWRLIFRWEKIPASYEKRRG 225
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
P+ +PTMAGGLFAIDK+YF+ LG YD GM+IWGGENLE+SFR+WQCGG LEI+PC
Sbjct: 226 LS-VDPILSPTMAGGLFAIDKEYFHHLGLYDTGMEIWGGENLELSFRIWQCGGTLEIMPC 284
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSAS 275
S VGHVFR + PY +++ N RVAEVWMD++++++Y + K S
Sbjct: 285 SRVGHVFRQRFPYQ---TSTEVTTRNLMRVAEVWMDQYKEYFYQIRHIKKKS 333
>gi|291397404|ref|XP_002715111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Oryctolagus cuniculus]
Length = 608
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 127/288 (44%), Positives = 173/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA+S LLRTV SV++R+P LL EIILV
Sbjct: 141 CKDKSYPADLPVASVVICFYNEAFSALLRTVHSVLDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E
Sbjct: 261 AIREDR------HTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEQGGA 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EG-ATAPIKSPTMAGGLFAMNRLYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|149639580|ref|XP_001512277.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Ornithorhynchus anatinus]
Length = 949
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 112/279 (40%), Positives = 164/279 (58%), Gaps = 14/279 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTT+I++ F +E WSTLLR++ SV+NRSP L++EIILVDD S + +
Sbjct: 495 CAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKEH--L 552
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT----------FE 110
D + ++ + + +R + + ++ + E
Sbjct: 553 KDNLDKYMAQFPKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPLLE 612
Query: 111 YIT--AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ K V CP+I+VISD+ Y T + G F W +NF W +PP + + +
Sbjct: 613 RVRLHRKKVACPVIEVISDKDLSYQTVDNFQRGIFTWPMNFGWKSIPPEVIEKNKMKETD 672
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
+R P MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF+VW CGG +EI+PCS VGH
Sbjct: 673 IIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVPCSRVGH 732
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
+FR+ +PY+FP K V N RVAEVW+DE++D +Y
Sbjct: 733 IFRNDNPYSFPKDRVKTVERNLVRVAEVWLDEYKDLFYG 771
>gi|432882425|ref|XP_004074024.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Oryzias latipes]
Length = 549
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 126/278 (45%), Positives = 167/278 (60%), Gaps = 29/278 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC--PIIDVISDQ 67
LPTTS+VI F+NEAWSTLLRTV SV+ SP LLKE++LVDD S+R P+ +S
Sbjct: 101 LPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDRAHLKEPLEKYLSGF 160
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + V+ P++ I ++
Sbjct: 161 RKVRLIRATKREGLVRARLLGASIATGDVLTFLDCHCECHEGWLEPLLHRIKEE------ 214
Query: 114 AKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
VVCP+IDVI TF+Y+ + GGF+W+L F W+ VP E RR + +R+
Sbjct: 215 PTAVVCPVIDVIDWDTFQYLGNPGEPQIGGFDWRLVFTWHSVPDNEQKRRHSP-TDVIRS 273
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
PTMAGGLF+++K+YFY LGSYD GM++WGGENLE SFR+WQCGG LEI PCSHVGHVF
Sbjct: 274 PTMAGGLFSVNKNYFYYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPK 333
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
K+PY+ L N+ R AEVWMD++++ YY NP
Sbjct: 334 KAPYS-----RSKALANSVRAAEVWMDDYKEIYYHRNP 366
>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
Length = 675
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 125/287 (43%), Positives = 168/287 (58%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK+K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ EIILVDD S+R +
Sbjct: 207 CKQKLYAEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDREHLKQ 266
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + + I + G +L K V+ P++D I
Sbjct: 267 PLEEYMVRLPKVRILRTKKREGLIRTRLLGATAAKGEVITFLDSHCEANVNWLPPLLDRI 326
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 327 AQ------NRKTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYKRIPIPP-ELQKE- 378
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 379 -DPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGCMEDIPC 437
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY PGGVS + N RVAEVWMDE+ ++ Y P
Sbjct: 438 SRVGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYIYQRRP 482
>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12 [Taeniopygia
guttata]
Length = 594
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/288 (44%), Positives = 172/288 (59%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVV 57
C++K Y + LP TS+VI F+NEAWSTLLRTV SV+ SP LL+EIILVDD S E +
Sbjct: 136 CREKKYDYYNLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLK 195
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + ++ + ++ G +L + K ++ P++
Sbjct: 196 ETLENYVAGLRKVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLLAR 255
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I+++ VVCP+IDVI TFEY+ A + GGF+ +L F W+ P RE RR
Sbjct: 256 IAEEE------TAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDXRLVFTWHSTPEREQKRR 309
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
++ +R+PTMAGGLF++ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI P
Sbjct: 310 KS-KTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHP 368
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY+ L N+ R AEVWMDE++ YY NP
Sbjct: 369 CSHVGHVFPKQAPYS-----RAKALANSVRAAEVWMDEYKQLYYHRNP 411
>gi|344268030|ref|XP_003405867.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Loxodonta africana]
Length = 633
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 130/285 (45%), Positives = 170/285 (59%), Gaps = 38/285 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV SV+ SP LLKEIILVDDAS + + + +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDAS------VDEYLHGKLE 237
Query: 70 EYITASDMTWGGFNWKLREKNRH------------KKTVVCPIIDVISDQTF---EYITA 114
EYI + +R+K R +D + + E + A
Sbjct: 238 EYIKQFSIVK-----IVRQKERKGLITARLLGAAAATAETLTFLDAHCECFYGWLEPLLA 292
Query: 115 K------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ VV P I I TFE+ S+ G F+W L+F W +P E RR
Sbjct: 293 RIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRK- 351
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + P++TPT AGGLF+I K+YF +G+YDE M+IWGGEN+EMSFRVWQCGG LEI+PCS
Sbjct: 352 DETYPIKTPTFAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCS 411
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
VGHVFR KSP+TFP G ++++ N R+AEVWMDE+++ +Y N
Sbjct: 412 VVGHVFRSKSPHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRN 455
>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Megachile rotundata]
Length = 638
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 184/313 (58%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER---- 55
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+
Sbjct: 151 CKEPGRYLKELPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDYSDMPHLQ 210
Query: 56 -------VVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDV 103
+ P + +I Q E + + + L + H + + P++D
Sbjct: 211 RQLEDYMMNYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 270
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 271 IARD------PTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-K 323
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F LG+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 324 RHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 383
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + G VS
Sbjct: 384 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSD 442
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 443 RKALRKKLGCKSF 455
>gi|296488074|tpg|DAA30187.1| TPA: polypeptide N-acetylgalactosaminyltransferase 11-like [Bos
taurus]
Length = 605
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 175/290 (60%), Gaps = 32/290 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP SIVI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 139 CKDKSYPADLPVASIVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKG 198
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
D+ ++ + I VI + E + M L + H + V P++
Sbjct: 199 ELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLA 258
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 259 AIREDR------RTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL--- 308
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA++++YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 309 GGPEGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 368
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE++ Y+++ P
Sbjct: 369 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKQ-YFSLRP 416
>gi|189237799|ref|XP_001814012.1| PREDICTED: similar to N-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270008127|gb|EFA04575.1| PNR-like protein [Tribolium castaneum]
Length = 614
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 123/294 (41%), Positives = 172/294 (58%), Gaps = 39/294 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK SY LPT +I+I F+NE + TLLRTV S+I+R+P ++LKEI+LVDD S+
Sbjct: 143 CKNISYSADLPTAAIIICFYNEHYYTLLRTVHSIIDRTPASVLKEILLVDDFSD------ 196
Query: 61 IDVISDQTFEYIT----------ASDMTWGGFNWKLREKNRHKKTVVC------------ 98
++ + + YIT ++ G +L R K+ V+
Sbjct: 197 LENLHENLSTYITKNFDDRVKLIKTERREGLIRARLFGARRTKQDVIIFLDSHIEVNVGW 256
Query: 99 --PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPP 156
P++ I D V P+ID+I+ TF Y TAS + GGFNW L+F+W + P
Sbjct: 257 IEPLLQRIKD------NYTNVAMPVIDIINADTFAY-TASPLVRGGFNWGLHFKWENL-P 308
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+ + D P+++PTMAGGLFA+ + YF +LG YD GM+IWGGENLE+SFR+W CGG
Sbjct: 309 KGTLSTKMDFIKPIKSPTMAGGLFAMSRKYFTDLGEYDAGMNIWGGENLEISFRIWMCGG 368
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LE+IPCS VGHVFR + PY P G +LHN+ RVA VWMD +++++ P
Sbjct: 369 RLELIPCSRVGHVFRQRRPYGAPDG-QDTMLHNSLRVANVWMDSYKEYFLNHRP 421
>gi|395728898|ref|XP_002809364.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Pongo abelii]
Length = 532
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/283 (44%), Positives = 169/283 (59%), Gaps = 21/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ +SP L+KEIILVDD S +
Sbjct: 88 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSNDPEDGA 147
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 148 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLE 201
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 202 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 261
Query: 169 PLRTPTMAGGLFAIDKDYFYELG-SYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P++TP +AGGLF +DK YF + G +DE D+WGGE E+SFRVWQCGG LEIIPCS VG
Sbjct: 262 PIKTPMIAGGLFVMDKFYFEDWGVRHDE--DVWGGETXEISFRVWQCGGSLEIIPCSRVG 319
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR + PYTFPGG + N R AEVWMDE+++FYYA P
Sbjct: 320 HVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVP 362
>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 697
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 123/285 (43%), Positives = 174/285 (61%), Gaps = 25/285 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
CK Y + LP+ SI+I FHNEAWS LLR+V SVI+RSP LL+EIILVDD S+R
Sbjct: 240 CKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNLLQEIILVDDFSDRPHLKE 299
Query: 57 -------VCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDVI 104
+ I+ ++ + E + + M + L + H + + P++D I
Sbjct: 300 ALEEYMGMLNIVKIVRTKQREGLIRARMIGAELSTGKVLVFLDSHIECTTGWLEPLLDRI 359
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFE--YITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ + VV P+I I+D+T + ++ A ++ GGF+W L FRW+ R+ R
Sbjct: 360 A------YNSSIVVVPVISTINDKTLKMNFLKADNVQVGGFDWSLTFRWHEQTERDRNRS 413
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G SP+R+PTMAGGLFAI ++YF LG YD GM+IWGGENLE+SF+VW CGGILE +
Sbjct: 414 GAP-YSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKVWMCGGILETVV 472
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CS VGH+FR +SPY + V + N R+A+VW+D+++ FYYA
Sbjct: 473 CSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRFYYA 517
>gi|410910894|ref|XP_003968925.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Takifugu rubripes]
Length = 577
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 123/278 (44%), Positives = 166/278 (59%), Gaps = 29/278 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC--PIIDVISDQ 67
LPTTS++I F+NE WSTLLRTV SV+ SP LLKE++LVDD S+R P+ + IS
Sbjct: 129 LPTTSVIIAFYNEGWSTLLRTVHSVLETSPDILLKEVVLVDDYSDRAHLKEPLENYISGL 188
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + V+ P++ I ++
Sbjct: 189 KKVRLIRATKREGLVRARLLGASITTGDVLTFLDCHCECHEGWLEPLLHRIKEE------ 242
Query: 114 AKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRT 172
VVCP+IDVI F+Y+ A + GGF+W+L F W+ +P E RR + +R+
Sbjct: 243 PSAVVCPVIDVIDWNNFQYLGNAGEPQIGGFDWRLVFTWHSIPEYEQKRRKSP-TDVIRS 301
Query: 173 PTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRD 232
PTMAGGLFA+ K+YF+ LG+YD GM++WGGENLE SFR+WQCGG LE+ PCSHVGHVF
Sbjct: 302 PTMAGGLFAVSKNYFHYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEVHPCSHVGHVFPK 361
Query: 233 KSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
K+PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 362 KAPYS-----RNKALANSVRAAEVWMDEYKEIYYHRNP 394
>gi|402586218|gb|EJW80156.1| glycosyltransferase, partial [Wuchereria bancrofti]
Length = 448
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 122/275 (44%), Positives = 165/275 (60%), Gaps = 22/275 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-----VCPIIDVI 64
LP+TS+VI +HNEA STLLRT+ SV RSP LL EIILVDD S+ + + PI +V+
Sbjct: 144 LPSTSVVITYHNEARSTLLRTIVSVFLRSPPQLLHEIILVDDFSDDITIGTDLLPIENVV 203
Query: 65 SDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI---------TAK 115
+ + G +++ + +V+ +D + ++ +
Sbjct: 204 -------VIRNTKREGLIRSRVKGSTLARASVLT-FLDSHCECNVNWLEPLLARVKENHR 255
Query: 116 TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTM 175
VV P+ID+I TF+YI AS GGF W L F+W + + R ++P+RTP +
Sbjct: 256 AVVAPVIDIIDKDTFKYIAASADLRGGFEWNLIFKWEYLLGKLRDDRHAQPTAPIRTPVI 315
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF I KD+F +LG+YDE MD+WGGENLE+SFRVW CGG LEIIPCS VGHVFR + P
Sbjct: 316 AGGLFMIQKDWFEKLGTYDEEMDVWGGENLELSFRVWLCGGSLEIIPCSRVGHVFRKQHP 375
Query: 236 YTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
YTFPGG S + N RVAEVW+ +++ Y P
Sbjct: 376 YTFPGGSSNVFQKNTRRVAEVWLGDYKHLYLRKVP 410
>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 654
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 123/285 (43%), Positives = 174/285 (61%), Gaps = 25/285 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
CK Y + LP+ SI+I FHNEAWS LLR+V SVI+RSP LL+EIILVDD S+R
Sbjct: 240 CKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNLLQEIILVDDFSDRPHLKE 299
Query: 57 -------VCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDVI 104
+ I+ ++ + E + + M + L + H + + P++D I
Sbjct: 300 ALEEYMGMLNIVKIVRTKQREGLIRARMIGAELSTGKVLVFLDSHIECTTGWLEPLLDRI 359
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFE--YITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ + VV P+I I+D+T + ++ A ++ GGF+W L FRW+ R+ R
Sbjct: 360 A------YNSSIVVVPVISTINDKTLKMNFLKADNVQVGGFDWSLTFRWHEQTERDRNRS 413
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G SP+R+PTMAGGLFAI ++YF LG YD GM+IWGGENLE+SF+VW CGGILE +
Sbjct: 414 GAP-YSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKVWMCGGILETVV 472
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
CS VGH+FR +SPY + V + N R+A+VW+D+++ FYYA
Sbjct: 473 CSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRFYYA 517
>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Cavia porcellus]
Length = 622
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/286 (44%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ SP TLLKEIILVDDAS + + D+
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPATLLKEIILVDDAS------TDEYLKDELE 229
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
Y+ + K+ + K + ++ V + ++ A
Sbjct: 230 RYVQQLQIV------KVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLL 283
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I+ TFE+ + G F+W L F W +P E RR
Sbjct: 284 ARIAENKMAVVSPDIVTINLNTFEFSKPIPEGRIHSRGNFDWILTFGWEALPAHEKQRRK 343
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 344 -DETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G S ++ N R+AEVWMD+++ +Y N
Sbjct: 403 SVVGHVFRTKSPHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRN 447
>gi|312087698|ref|XP_003145574.1| glycosyl transferase [Loa loa]
gi|307759263|gb|EFO18497.1| glycosyl transferase [Loa loa]
Length = 520
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/290 (45%), Positives = 175/290 (60%), Gaps = 37/290 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+K +YP LP S+VI+F +EAWS L+RTV SVINR+P LL+EIILVDD S+R
Sbjct: 63 CRKINYPDNLPVASVVIIFTDEAWSPLMRTVHSVINRTPFKLLQEIILVDDFSQR----- 117
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH----------KKTV--VCPIIDV---IS 105
D + + EYI +G +R + R K+ + V +D +S
Sbjct: 118 -DDLKGRLEEYIK----RFGNKVRLIRARERQGLIRAKLLGAKEAIGDVLIFLDSHCEVS 172
Query: 106 DQTFEYITAK------TVVCPIIDVISDQTFEYITASDM-TWGGFNWKLNFRWYRVPPRE 158
+ E + A+ V+CPIID IS +T Y + + GGF W L+FRW +P
Sbjct: 173 EGWLEPLLARIKENRSVVLCPIIDHISAETLAYSGSDRLANVGGFWWSLHFRWDPLPEEY 232
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
G D + P+R+PTMAGGLFA+D+ YF+E+G YD MDIWGGENLE+SFRVW CGG +
Sbjct: 233 Y---GIDPTKPIRSPTMAGGLFAVDRLYFFEVGGYDPKMDIWGGENLEISFRVWMCGGGI 289
Query: 219 EIIPCSHVGHVFRDKSPY--TFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
E IPCSHVGH+FR PY T PG + N+ R+AEVWMD+++ FYY
Sbjct: 290 EFIPCSHVGHIFRAGHPYNMTGPGNNEDVHGTNSKRLAEVWMDDYKRFYY 339
>gi|432865221|ref|XP_004070476.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oryzias latipes]
Length = 621
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 130/294 (44%), Positives = 175/294 (59%), Gaps = 24/294 (8%)
Query: 1 CKKKSYPT--FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERV 56
C ++ +P LPTTS++IVFHNEAWSTLLRTV+SV++ SP +LLKEIILVDDAS E +
Sbjct: 162 CVERKFPRCPALPTTSVIIVFHNEAWSTLLRTVYSVLHTSPASLLKEIILVDDASVAEHL 221
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPI----------IDVISD 106
+ D + + G +L + V+ + ++ +
Sbjct: 222 KSQLEDFVKHLKIVRVVRQPERKGLITARLLGASIATAEVLTFLDAHCECFHGWLEPLLA 281
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ E TA VV P I I F + T G F+W L F W +P E R
Sbjct: 282 RIVEEPTA--VVSPEITTIDLNNFNFNKPIATNRAYNRGNFDWSLTFGWEAIP-EEARRL 338
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P++TPT AGGLF+I K YF +G+YD+ M+IWGGEN+EMSFRVWQCGG LEIIP
Sbjct: 339 RKDETYPVKTPTFAGGLFSISKKYFEHIGTYDDKMEIWGGENVEMSFRVWQCGGQLEIIP 398
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
CS VGHVFR KSP+TFP G ++++ N R+AEVWMD+++ YY N K+A++
Sbjct: 399 CSVVGHVFRTKSPHTFPKG-TEVITRNQVRLAEVWMDDYKKIYYRRN--KNAAI 449
>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
Length = 621
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/286 (43%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TS++IVFHNEAWSTLLRTV+SV++ SP LL+EIILVDDAS D + DQ
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDAS------TDDYLKDQLE 229
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ + ++ + K + ++ V + ++ A
Sbjct: 230 QYVKKLQVV------RVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLL 283
Query: 116 --------TVVCPIIDVISDQTFEYI----TASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + + G F+W L F W +P E RR
Sbjct: 284 ARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQRRK 343
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 344 -DETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G+S ++ N R+AEVWMD +++ +Y N
Sbjct: 403 SVVGHVFRTKSPHTFPKGIS-VIARNQVRLAEVWMDSYKEIFYRRN 447
>gi|410897066|ref|XP_003962020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Takifugu rubripes]
Length = 600
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 119/274 (43%), Positives = 162/274 (59%), Gaps = 22/274 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LP+TS++ F +E WSTLLR+V SV+NRSP LL+EIILVDD S E + P+ +S
Sbjct: 157 LPSTSVIFCFVDEVWSTLLRSVHSVLNRSPPHLLEEIILVDDFSTKEYLKAPLDKYMSQF 216
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
I G +L K V+ P+++ I Y+
Sbjct: 217 PKVRIIRLRERQGLIRARLAGAAAAKGEVLTFLDSHVECNVGWLEPLLERI------YMD 270
Query: 114 AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTP 173
+ V CP+I+VI+D+ Y+ + G F W L F W +P + + S P+R P
Sbjct: 271 RRKVPCPVIEVINDKDMSYMLVDNFQRGIFRWPLVFGWSPLPEAYIKKHNLTISDPIRCP 330
Query: 174 TMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDK 233
MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF++W CGG +EIIPCS VGH+FR +
Sbjct: 331 VMAGGLFSIDKKYFYELGAYDSGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRGQ 390
Query: 234 SPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
+PY FP K V N ARVAEVW+DE++D +Y
Sbjct: 391 NPYKFPKDRQKTVERNLARVAEVWLDEYKDLFYG 424
>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Ailuropoda melanoleuca]
Length = 622
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/286 (43%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP TS++IVFHNEAWSTLLRTV+SV++ SP LL+EIILVDDAS D + DQ
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDAS------TDDYLKDQLE 229
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ + ++ + K + ++ V + ++ A
Sbjct: 230 QYVKKLQVV------RVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLL 283
Query: 116 --------TVVCPIIDVISDQTFEYI----TASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + + G F+W L F W +P E RR
Sbjct: 284 ARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQRRK 343
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 344 -DETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G+S ++ N R+AEVWMD +++ +Y N
Sbjct: 403 SVVGHVFRTKSPHTFPKGIS-VIARNQVRLAEVWMDSYKEIFYRRN 447
>gi|355689592|gb|AER98884.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mustela putorius
furo]
Length = 609
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/293 (43%), Positives = 175/293 (59%), Gaps = 34/293 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
CK KSYP LP S+VI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 141 CKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTVVC------P 99
++ ++ + I VI + E + M + L + H + V P
Sbjct: 201 ELEEYVQKYLPGKIKVIRNAKREGLIRGRMIGAAHSTGEVLVFLDSHCEVNVMWLMWLQP 260
Query: 100 IIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
++ I +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 LLAAIQQDR------RTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSEL 313
Query: 160 MRRGGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
GG ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG
Sbjct: 314 ---GGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGK 370
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
L IIPCS VGH+FR + PY P G + HN+ R+A VW+D++++ Y+++ P
Sbjct: 371 LFIIPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDDYKEQYFSLRP 422
>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
Length = 614
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/286 (43%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS D + D+
Sbjct: 168 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTE------DYLKDKLE 221
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ + K+ + K + ++ V + ++ A
Sbjct: 222 QYVKELQVV------KVVRQVERKGLITARLLGAKVAQAEVLTFLDAHCECFHGWLEPLL 275
Query: 116 --------TVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + + G F+W L F W +PP E R
Sbjct: 276 ARIAEDKTVVVSPDIVTIDLNTFEFSKPVQSGRVHSRGNFDWSLTFGWETLPPHEKQRHK 335
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 336 -DETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 394
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G++ ++ N R+AEVWMD ++ +Y N
Sbjct: 395 SVVGHVFRTKSPHTFPKGIN-VIARNQVRLAEVWMDSYKQIFYRRN 439
>gi|109476381|ref|XP_001066416.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 576
Score = 224 bits (572), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 128/274 (46%), Positives = 169/274 (61%), Gaps = 21/274 (7%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP TS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R + P+ + +S
Sbjct: 130 LPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKEPLANELSQL 189
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAK--TVVC 119
+ + G +L + + V+ C + + + I K VVC
Sbjct: 190 PKVRLIRASKREGLVRARLLGASAARGEVLTFLDCHCECHEGWLEPLLQRIHEKESAVVC 249
Query: 120 PIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPRE--MMRRGGDRSSPLRTPTMA 176
P+IDVI TFEY+ S + GGF+W+L F W+ VP RE +MR D +R+PTMA
Sbjct: 250 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERKLMRSPID---VIRSPTMA 306
Query: 177 GGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPY 236
GGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE PCSHVGHVF ++PY
Sbjct: 307 GGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPKQAPY 366
Query: 237 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ L N+ R AEVWMD++++ YY NP
Sbjct: 367 S-----RSKALANSVRAAEVWMDDFKELYYHRNP 395
>gi|73979014|ref|XP_539924.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Canis
lupus familiaris]
Length = 608
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 124/288 (43%), Positives = 174/288 (60%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILV----------- 49
C+ KS+P LP S+VI F+NEA S LLRTV SV++R+P LL EIILV
Sbjct: 141 CRDKSFPADLPAASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKG 200
Query: 50 --DDASERVVCPIIDVISDQTFEYITASDMTWGGF--NWKLREKNRHKKTVVC---PIID 102
++ ++ + I VI + E + M L + H + V P++
Sbjct: 201 ELEEYVQKYLPGKIKVIRNIKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I + +TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AIQEDQ------QTVVCPVIDIISADTLAY-SSSPVVRGGFNWGLHFKWDLVPLSELGGP 313
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
G ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L IIP
Sbjct: 314 EGA-TAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 373 CSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|196006600|ref|XP_002113166.1| hypothetical protein TRIADDRAFT_27135 [Trichoplax adhaerens]
gi|190583570|gb|EDV23640.1| hypothetical protein TRIADDRAFT_27135, partial [Trichoplax
adhaerens]
Length = 491
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 118/282 (41%), Positives = 168/282 (59%), Gaps = 23/282 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
CK K Y +P+ S+VI+FHNEA STLLRTV SV++R+P LL EI+LVDD S
Sbjct: 45 CKNKIYRLNMPSVSVVIIFHNEARSTLLRTVQSVLDRTPPHLLSEIVLVDDNSDDATLGQ 104
Query: 54 ERVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV-----CPIIDVISDQT 108
E + P + +I ++ E + S + K+ K ++ C + ++
Sbjct: 105 ELLTLPKVKLIRNKKREGLIRSRV--------FGVKSSQGKAIIFLDSHCEVNQQWAEPL 156
Query: 109 FEYI--TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
E I K +V P++D I TFEY ++ GGF+W L FRW + M+ + D
Sbjct: 157 LEQIVLNPKAIVSPVLDNIDMNTFEYQEGTEDVRGGFDWSLTFRWDYMT-EAMINQRIDP 215
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+SP++TPT+AGG++A+ K +F +LG YD G IWGGENLE+SFR W CGG ++IIPCS V
Sbjct: 216 TSPIKTPTIAGGIYAVSKQWFNDLGEYDMGQKIWGGENLELSFRAWMCGGFMKIIPCSRV 275
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
GHVFR + PY FP G + N RV EVW+DE++ ++Y +
Sbjct: 276 GHVFRLQHPYIFPEGAGRTYYRNLRRVVEVWLDEYKVYFYQI 317
>gi|443704818|gb|ELU01679.1| hypothetical protein CAPTEDRAFT_140956 [Capitella teleta]
Length = 550
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 122/285 (42%), Positives = 168/285 (58%), Gaps = 27/285 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C+ + + +P S++++FHNEAWS LLRTV+S++ RSP L+E+ILVDD S E +
Sbjct: 89 CRAILHSSKMPKASVIVIFHNEAWSVLLRTVYSILERSPPRFLEEVILVDDYSDQEHLHD 148
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ + ++ Q + S+ G +L K V+ P++D I
Sbjct: 149 QLDEFVATQQKVRLVRSEKREGLIRARLIGAEAAKGQVLVFLDSHCECTPGWLEPMLDRI 208
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYI---TASDMTWGGFNWKLNFRWYRVPPREMMR 161
Q + + VV PIIDVI D+T Y + + GGF+W + F W+ +P E R
Sbjct: 209 G-QDWSH-----VVTPIIDVIDDKTLMYNFNPLSRGFSVGGFDWAMGFTWHALPNHEKER 262
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R S P R+PTMAGGLFAID++YFY +GSYD GM+IWGGENLEMSFR+W CGG LE +
Sbjct: 263 RK-KISDPARSPTMAGGLFAIDREYFYHIGSYDPGMEIWGGENLEMSFRIWMCGGTLETL 321
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
PCSHVGH+FR ++P V N+ R AEVWMDE++ YY
Sbjct: 322 PCSHVGHIFRKRNP-NHSAKHGNFVQRNSVRTAEVWMDEYKYLYY 365
>gi|392347955|ref|XP_232988.5| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 579
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 128/274 (46%), Positives = 169/274 (61%), Gaps = 21/274 (7%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP TS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R + P+ + +S
Sbjct: 130 LPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKEPLANELSQL 189
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAK--TVVC 119
+ + G +L + + V+ C + + + I K VVC
Sbjct: 190 PKVRLIRASKREGLVRARLLGASAARGEVLTFLDCHCECHEGWLEPLLQRIHEKESAVVC 249
Query: 120 PIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPRE--MMRRGGDRSSPLRTPTMA 176
P+IDVI TFEY+ S + GGF+W+L F W+ VP RE +MR D +R+PTMA
Sbjct: 250 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERKLMRSPID---VIRSPTMA 306
Query: 177 GGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPY 236
GGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE PCSHVGHVF ++PY
Sbjct: 307 GGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPKQAPY 366
Query: 237 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ L N+ R AEVWMD++++ YY NP
Sbjct: 367 S-----RSKALANSVRAAEVWMDDFKELYYHRNP 395
>gi|327263882|ref|XP_003216746.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Anolis carolinensis]
Length = 536
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 168/290 (57%), Gaps = 40/290 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV+ SP LLKEIILVDDAS + + D+
Sbjct: 175 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDASTD------EYLKDELD 228
Query: 70 EYITASDMTW--------GGFNWKLREKNRHKKTVVC--------------PIIDVISDQ 107
Y+ + G +L + V+ P++ I+++
Sbjct: 229 SYVKQLQIVRVIRQIERKGLITARLLGASVATGDVLTFLDAHCECFHGWLEPLLSRIAEE 288
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + G F+W L F W +P E RR
Sbjct: 289 ------PTAVVSPDITTIDLNTFEFSKPIQYGKQHSRGNFDWSLTFGWEAIPQHEKERRK 342
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLFAI K YF +GSYD+ M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 343 -DETYPIKTPTFAGGLFAISKAYFEHVGSYDDQMEIWGGENVEMSFRVWQCGGQLEIIPC 401
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKS 273
S VGHVFR KSP+TFP G ++++ N R+AEVWMD++++ +Y N S
Sbjct: 402 SVVGHVFRSKSPHTFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRNQQAS 450
>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Takifugu rubripes]
Length = 600
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 125/288 (43%), Positives = 169/288 (58%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK+K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ E+ILVDD S E +
Sbjct: 130 CKQKLYAEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAELILVDDFSDKEHLKV 189
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + + I + G +L + K V+ P++D I
Sbjct: 190 PLEEYMKRMPKVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNWLPPLLDRI 249
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMRR 162
+ K++VCP+IDVI F Y T A D G F+W++ ++ R+P P EM R
Sbjct: 250 AQ------NRKSIVCPMIDVIDHDNFGYDTQAGDAMRGAFDWEMYYK--RIPIPAEMQR- 300
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IP
Sbjct: 301 -DDPSQPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIP 359
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH++R PY PGG+S + N RVAEVWMDE+ ++ Y P
Sbjct: 360 CSRVGHIYRKYVPYKVPGGIS--LAKNLKRVAEVWMDEYAEYVYQRRP 405
>gi|170582702|ref|XP_001896248.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158596593|gb|EDP34915.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 520
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 169/290 (58%), Gaps = 37/290 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C K SY LP S+VI+F +EAWS L+RTV SVINR+P LL+EIILVDD S+R
Sbjct: 63 CHKISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQR----- 117
Query: 61 IDVISDQTFEYIT--------ASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---- 108
D + ++ EYI + G +R K K V ++ +
Sbjct: 118 -DELKEKLEEYIKRFGNKVRLVRALERQGL---IRAKLLGAKEAVGDVLVFLDSHCEVGE 173
Query: 109 --FEYITAK------TVVCPIIDVISDQTFEYITASD--MTWGGFNWKLNFRWYRVPPRE 158
E + A+ V+CPII+ IS +T Y +A+D GGF+W L+F W +P
Sbjct: 174 GWLEPLLARIKDKRSAVLCPIINHISAETLTY-SANDRPTNVGGFSWSLHFLWDPMPKEY 232
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
D + P+R+PTMAGGL A+D+ YF+E+G YD MDIWGGENLEMSFRVW CGG +
Sbjct: 233 F---DADPTEPIRSPTMAGGLLAVDRSYFFEVGGYDPKMDIWGGENLEMSFRVWMCGGSI 289
Query: 219 EIIPCSHVGHVFRDKSPYTF--PGGVSKIVLHNAARVAEVWMDEWRDFYY 266
E IPCSHVGH+FRD PY PG + N+ R+AEVWMD+++ FYY
Sbjct: 290 EFIPCSHVGHIFRDGHPYNMIGPGDNKDVHGTNSKRLAEVWMDDYKKFYY 339
>gi|68392893|ref|XP_688194.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Danio
rerio]
Length = 578
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 126/288 (43%), Positives = 170/288 (59%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C+ Y LPTTS+VI F+NEAWSTLLRTV SV+ SP LL E+ILVDD S+R +
Sbjct: 120 CRNLKYDYLSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDLLLNEVILVDDYSDREHLK 179
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ + I+D + + G +L + V+ P++
Sbjct: 180 EPLENYIADLKKVRLIRARKREGLVRARLLGASIATGEVLTFLDCHCECHEGWLEPLLQR 239
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TF+Y+ + GGF+W+L F W+ +P E RR
Sbjct: 240 IKEE------PSAVVCPVIDVIDWNTFQYLGNPGEPQIGGFDWRLVFTWHSIPEHEQKRR 293
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ +R+PTMAGGLFA++K YF LG+YD GM++WGGENLE SFR+WQCGG LEI P
Sbjct: 294 SA-ATDVVRSPTMAGGLFAVNKKYFLYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEIHP 352
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF K+PY+ L N+ R AEVWMD++++ YY +P
Sbjct: 353 CSHVGHVFPKKAPYS-----RNKALANSVRAAEVWMDDFKEVYYHRSP 395
>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
caballus]
Length = 622
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 127/285 (44%), Positives = 170/285 (59%), Gaps = 38/285 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ +P LL+EIILVDDAS + + +Q
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLREIILVDDAS------TDEYLKEQLE 229
Query: 70 EYITASDMTWGGFNWKLREKNRH------------KKTVVCPIIDVISDQT---FEYITA 114
+Y+ + +R+K R + V +D + E + A
Sbjct: 230 QYVKQLQVVR-----VVRQKERTGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLA 284
Query: 115 K------TVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ VV P I I TFE+ + G F+W L+F W +PP E RR
Sbjct: 285 RIAEDETAVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLSFGWEALPPHEKQRRK- 343
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS
Sbjct: 344 DETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCS 403
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
VGHVFR KSP+TFP G+S ++ N R+AEVWMD +++ +Y N
Sbjct: 404 VVGHVFRTKSPHTFPKGIS-VIARNQVRLAEVWMDGYKEIFYRRN 447
>gi|148356242|ref|NP_001038243.2| polypeptide N-acetylgalactosaminyltransferase 4 precursor [Danio
rerio]
gi|60416047|gb|AAH90692.1| WD repeat domain 51B, like [Danio rerio]
gi|182890540|gb|AAI64662.1| Wdr51bl protein [Danio rerio]
Length = 582
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 128/282 (45%), Positives = 166/282 (58%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK K Y LPTTS+VI F+NEAWSTLLRT+ SV+ +P LLK+IILVDD S+R +
Sbjct: 129 CKAKKYNIRRLPTTSVVIAFYNEAWSTLLRTIHSVLETTPAVLLKDIILVDDFSDRGYLK 188
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ IS+ + + G +L +V+ C + + E
Sbjct: 189 SQLAQYISNLERVRLIRTKKREGLVRARLIGATYATGSVLTFLDCHCECVPGWIEPLLER 248
Query: 112 ITAK--TVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I T++CP+ID I TFE Y+ + GGF+W+L F+W+ VP + R R
Sbjct: 249 IAENETTIICPVIDTIDWNTFEFYMQTEEPMVGGFDWRLTFQWHAVPEIDRKIRKS-RID 307
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+R+PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG LEI PCSHVGH
Sbjct: 308 PIRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGH 367
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VF K+PY L N R AEVWMD ++ +Y NP
Sbjct: 368 VFPKKAPY-----ARSNFLQNTVRAAEVWMDTYKQHFYNRNP 404
>gi|348521382|ref|XP_003448205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oreochromis niloticus]
Length = 620
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 173/290 (59%), Gaps = 22/290 (7%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS E + + + I
Sbjct: 172 LPTTSVIIVFHNEAWSTLLRTVFSVLHTSPAILLKEIILVDDASTAEHLKSRLEEYIRQL 231
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVCPI----------IDVISDQTFEYITAKTV 117
+ G +L + + V+ + ++ + + E TA V
Sbjct: 232 KIVRVVRQPERKGLITARLLGASIAQAEVLTFLDAHCECFHGWLEPLLARIVEEPTA--V 289
Query: 118 VCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTP 173
V P I I +F++ T G F+W L F W +P + R D + P++TP
Sbjct: 290 VSPEISSIDLNSFQFHKPVATNRAYNRGNFDWSLTFGWEAIP-EDAKRLRKDETYPVKTP 348
Query: 174 TMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDK 233
T AGGLFAI K YF +G+YD+ M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHVFR K
Sbjct: 349 TFAGGLFAISKKYFEHIGTYDDQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTK 408
Query: 234 SPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCAAHF 283
SP+TFP G ++++ N R+AEVWMD+++ YY N K+A++ F
Sbjct: 409 SPHTFPKG-TEVITRNQVRLAEVWMDDYKKIYYRRN--KNAAIMASEHRF 455
>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 677
Score = 224 bits (570), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 177/307 (57%), Gaps = 33/307 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK+K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EIILVDD S++
Sbjct: 207 CKQKLYAERLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDKDHLKG 266
Query: 56 ------VVCPIIDVISDQTFEYITASDMTWGGFNWK------LREKNRHKKTVVCPIIDV 103
V P + ++ + E + + + G K L + P++D
Sbjct: 267 ALEEYMVRLPKVRILRTKKREGLIRTRLL-GAAAAKGEVITFLDSHCEANINWLPPLLDR 325
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMR 161
I+ + KT+VCP+IDVI F Y T A D G F+W++ ++ R+P P E+ +
Sbjct: 326 IA------LNRKTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYK--RIPIPAELQK 377
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E I
Sbjct: 378 --NDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDI 435
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCA 280
PCS VGH++R PY PGGVS + N RVAEVWMDE+ ++ Y P + S A
Sbjct: 436 PCSRVGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYVYQRRPEYRHLSAGDVA 493
Query: 281 AHFRMLS 287
A + S
Sbjct: 494 AQKELRS 500
>gi|395824312|ref|XP_003785413.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Otolemur garnettii]
Length = 508
Score = 224 bits (570), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 126/291 (43%), Positives = 173/291 (59%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C++K Y +P TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 52 CREKKYDYANMPKTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 111
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + ++ G +L + + V+ P++
Sbjct: 112 ERLANELSGLPKVRLIRANKREGLVRARLLGASAARGNVLTFLDCHCECHEGWLEPLLQR 171
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + + GGF+W+L F W+ VP RE R
Sbjct: 172 IHEEE------SAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERQRM 225
Query: 163 GGDRSSPL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 226 ----KSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLE 281
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMD++++ YY NP
Sbjct: 282 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDDYKELYYHRNP 327
>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 126/289 (43%), Positives = 174/289 (60%), Gaps = 33/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--- 57
CK K Y + LP S++I FHNE STLLRT+ S+ NRSP +LLKEI+LVDDAS R +
Sbjct: 144 CKSKKYYSDLPDVSVIIPFHNEGLSTLLRTIHSLHNRSPESLLKEIVLVDDASSRPLYKE 203
Query: 58 -------CPIIDVISDQTFEYITAS-----DMTWGGFNWKLREKNRHKKTVVCPIIDVIS 105
P + +I + T + + S + GG L + P++ IS
Sbjct: 204 LESSLAKFPKVKLIRNPTRQGLIRSRVRGVHLAKGGVVVILDSHVEVSTNWLPPLLHPIS 263
Query: 106 DQTFEYITAKTVVCPIIDVISDQTFEYITA-SDMTWGGFNWKLNFRWYRVP-PREMMRRG 163
+ KTVVCP+ID+I ++ F+Y+T D G F+W+L ++ R+P P E +R
Sbjct: 264 ------LDRKTVVCPMIDIIDNENFQYVTQPGDAMRGAFDWELYYK--RIPIPNE--KRP 313
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFAI+++YFYE+G YDEG++IWGGE E+SF+VW CGG + PC
Sbjct: 314 KDPSEPFESPVMAGGLFAIERNYFYEIGLYDEGLEIWGGEQYELSFKVWMCGGRILDSPC 373
Query: 224 SHVGHVFRDKSPYTFP--GGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S +GH++R PYT P GG + +N RVAEVWMDE+ +F+Y P
Sbjct: 374 SRIGHIYRKFVPYTIPNNGGPN----YNYKRVAEVWMDEYAEFFYRRRP 418
>gi|296190391|ref|XP_002743190.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Callithrix jacchus]
Length = 571
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 128/291 (43%), Positives = 173/291 (59%), Gaps = 36/291 (12%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+K Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 115 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 174
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ + +S + + G +L + + V+ P++
Sbjct: 175 ERLANELSGLPKVRLIRASKREGLVRARLLGASVARGDVLTFLDCHCECHEGWLEPLLQR 234
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + + GGF+W+L F W+ VP RE ++
Sbjct: 235 IHEE------ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIQM 288
Query: 163 GGDRSSP---LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
SP +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 289 ----RSPVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLE 344
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 345 THPCSHVGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 390
>gi|354475881|ref|XP_003500155.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Cricetulus griseus]
Length = 559
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 173/284 (60%), Gaps = 22/284 (7%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C+++ Y LP TS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 103 CREEKYDYENLPRTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 162
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ + +S + + G +L + K V+ C + + +
Sbjct: 163 ERLANELSQLPRVRLIRASKREGLVRARLLGASVAKGEVLTFLDCHCECHEGWLEPLLQR 222
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVP--PREMMRRGGDR 166
I K VVCP+IDVI TFEY+ + + GGF+W+L F W+ VP RE+MR D
Sbjct: 223 IHEKESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERELMRSPID- 281
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE PCSHV
Sbjct: 282 --VIRSPTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHV 339
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 340 GHVFPKQAPYS-----RSKALANSVRAAEVWMDEFKELYYHRNP 378
>gi|291382916|ref|XP_002708201.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oryctolagus cuniculus]
Length = 476
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 130/285 (45%), Positives = 170/285 (59%), Gaps = 24/285 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
CK+ Y LP TS++I F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 21 CKEVKYDYDHLPKTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 80
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ + +S + + G +L + K V+ C + +
Sbjct: 81 ERLANELSGLPKVRLIRATKREGLVRARLLGASAAKGDVLTFLDCHCECHEGWLEPLLHR 140
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I K VVCP+IDVI TFEY+ + GGF+W+L F W+ VP RE +R S
Sbjct: 141 IHEKESAVVCPVIDVIDWNTFEYLGNPGEPQIGGFDWRLVFTWHVVPERERLRM----RS 196
Query: 169 PL---RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
P+ R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE PCSH
Sbjct: 197 PIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSH 256
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 257 VGHVFPKQAPYS-----RNKALANSVRAAEVWMDEFKELYYHRNP 296
>gi|344251833|gb|EGW07937.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Cricetulus
griseus]
Length = 457
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 173/284 (60%), Gaps = 22/284 (7%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C+++ Y LP TS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R +
Sbjct: 1 CREEKYDYENLPRTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 60
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
+ + +S + + G +L + K V+ C + + +
Sbjct: 61 ERLANELSQLPRVRLIRASKREGLVRARLLGASVAKGEVLTFLDCHCECHEGWLEPLLQR 120
Query: 112 ITAK--TVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVP--PREMMRRGGDR 166
I K VVCP+IDVI TFEY+ + + GGF+W+L F W+ VP RE+MR D
Sbjct: 121 IHEKESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERELMRSPID- 179
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE PCSHV
Sbjct: 180 --VIRSPTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHV 237
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVF ++PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 238 GHVFPKQAPYS-----RSKALANSVRAAEVWMDEFKELYYHRNP 276
>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
Length = 577
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 170/294 (57%), Gaps = 35/294 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ Y LP+ S+VI+F +EAW+ LLRTV SV+NRSP LL E+IL+DD S+R
Sbjct: 122 CRSVHYDDDLPSASVVIIFTDEAWTPLLRTVHSVVNRSPLHLLHEVILLDDFSQR----- 176
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------KKTVVCPIIDVISDQT--- 108
+ + + EYI +GG +R+K RH +I +
Sbjct: 177 -EELKGKLDEYIK----RFGGIVKLIRKKERHGLIRAKLAGAHEATGEVIVFLDSHCEAN 231
Query: 109 ---FEYITAK------TVVCPIIDVISDQTFEYITASDMTW-GGFNWKLNFRWYRVPPRE 158
E + A+ V+CPIID IS +T +Y +++ GGF W L+FRW + E
Sbjct: 232 EGWLEPLLARIKEKRTAVLCPIIDYISAETMQYSGDANVNAVGGFWWSLHFRWDSIGKAE 291
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
RR P+R+PTMAGGL A +++YF E+G YD GMDIWGGENLE+SFRVW CGG +
Sbjct: 292 RDRR-KSAIEPVRSPTMAGGLLAANREYFLEVGGYDPGMDIWGGENLEISFRVWMCGGSI 350
Query: 219 EIIPCSHVGHVFRDKSPY--TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
E IPCSHVGH+FR PY T PGG + N+ R+AEVWMD+++ YY P
Sbjct: 351 EFIPCSHVGHIFRAGHPYNMTGPGGNLDVHGTNSKRLAEVWMDDYKRLYYLHRP 404
>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
catus]
Length = 622
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 123/286 (43%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS + + +Q
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDAS------TDEYLKEQLD 229
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ + ++ + K + ++ V + ++ A
Sbjct: 230 QYVKKLQIV------RVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLL 283
Query: 116 --------TVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + G F+W L F W +P E RR
Sbjct: 284 ARIAEDETVVVSPDIVTIDLNTFEFSKPVPRGRVHSRGNFDWSLTFGWEALPAHEKQRRK 343
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG +EIIPC
Sbjct: 344 -DETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPC 402
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G+S ++ N R+AEVWMD +++ +Y N
Sbjct: 403 SVVGHVFRTKSPHTFPKGIS-VIARNQVRLAEVWMDSYKEIFYRRN 447
>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
Length = 603
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 125/275 (45%), Positives = 164/275 (59%), Gaps = 18/275 (6%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 157 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 216
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD---QTFEYITAK------TVV 118
+ + G +L + + V+ +D + E + A+ VV
Sbjct: 217 QVVRVVRQEERKGLITARLLGASVAQAEVLT-FLDAHCECFHGRLEPLLARIAEDKTVVV 275
Query: 119 CPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPT 174
P I I TFE+ + G F+W L F W +PP E RR D + P+++PT
Sbjct: 276 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYPIKSPT 334
Query: 175 MAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKS 234
AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHVFR KS
Sbjct: 335 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 394
Query: 235 PYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
P+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 395 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 428
>gi|332019618|gb|EGI60096.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Acromyrmex echinatior]
Length = 566
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 126/309 (40%), Positives = 180/309 (58%), Gaps = 22/309 (7%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+ +
Sbjct: 79 CKESGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSD--MPH 136
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYIT 113
+ + D Y + +R + P++ + E +
Sbjct: 137 LKRQLEDYMMNYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLL 196
Query: 114 AK------TVVCPIIDVISDQTFEY--ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGD 165
+ TVVCP+IDVI D T EY +S + GGF+W L F W+ VP RE +R +
Sbjct: 197 DRIARDPTTVVCPVIDVIDDTTLEYHWRDSSGVNVGGFDWNLQFNWHAVPERER-KRHKN 255
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
+ P+ +PTMAGGLF+ID+ +F +G+YD G DIWGGENLE+SF+ W CGG LEI+PCSH
Sbjct: 256 PAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSH 315
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVSTCAAH 282
VGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + G +S A
Sbjct: 316 VGHIFRKRSPYKWRNGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDISERKAL 374
Query: 283 FRMLSYSSW 291
+ L S+
Sbjct: 375 RKKLGCKSF 383
>gi|395507115|ref|XP_003757873.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Sarcophilus harrisii]
Length = 633
Score = 223 bits (568), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 124/282 (43%), Positives = 163/282 (57%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--- 57
C Y LP TSI+I FHNEA STLLRT+ SV NR+P L+ EIILVDD S+
Sbjct: 210 CATLHYGPNLPPTSIIITFHNEARSTLLRTIRSVSNRTPVHLVHEIILVDDFSDDPDDCQ 269
Query: 58 ----CPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + + ++ E I +D+ L K + P++ I +
Sbjct: 270 LLSKLPKVKCLRNEQREGLIRSRIRGADLAQASILTFLDSHCEVNKDWLLPLLHRIKEDP 329
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF Y+++S GGF+W L+F+W + RE R D
Sbjct: 330 TR------VVCPVIDIINRDTFAYVSSSPDMRGGFDWTLHFKWEELSLREKALRV-DPIQ 382
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP ++GGLF ++K +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 383 PIKTPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEILPCSRVGH 442
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PYTFP G + N R AEVWMDE++ ++YA P
Sbjct: 443 VFRKKHPYTFPEGNLNTYIKNTKRTAEVWMDEFKHYFYAARP 484
>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
Length = 622
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 168/280 (60%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS---------ERVV--C 58
LPTTS+VIVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS ER V
Sbjct: 176 LPTTSVVIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKEKLERYVQQL 235
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDVISDQTFEYIT 113
I+ V+ Q + + + + L + H + + P++ I++
Sbjct: 236 QIVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TF++ + G F+W L F W +P E RR D + P
Sbjct: 290 KTAVVSPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD+++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRN 447
>gi|301611308|ref|XP_002935181.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Xenopus (Silurana) tropicalis]
Length = 532
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 126/275 (45%), Positives = 168/275 (61%), Gaps = 23/275 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP TS++I F+NEAWSTLLRTV SV+ SP LL+EIILVDD S+R + P+ IS
Sbjct: 118 LPKTSVIIAFYNEAWSTLLRTVHSVLETSPDLLLEEIILVDDYSDREHLKEPLEKYISSW 177
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAK--TVVC 119
+ ++ G +L + K V+ C + + E I K V+C
Sbjct: 178 RKVRLIRANKREGLVRARLLGASIAKAEVLTFLDCHCECHEGWLEPLLERIREKESAVIC 237
Query: 120 PIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL---RTPTM 175
P+IDVI TFEY+ A + GGF+W++ F W+ VP E +R SP+ +PTM
Sbjct: 238 PVIDVIDWNTFEYLGNAGEPQIGGFDWRMVFTWHTVPETEQKKR----RSPIDVISSPTM 293
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF++ K YF LGSYD GM++WGGENLE SFR+WQCGG LE+ PCSHVGHVF ++P
Sbjct: 294 AGGLFSVSKKYFEHLGSYDTGMEVWGGENLEFSFRIWQCGGSLEVHPCSHVGHVFPRQAP 353
Query: 236 YTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
Y+ +S N+ R AEVW+DE+++ YY NP
Sbjct: 354 YSRSKALS-----NSVRAAEVWLDEYKEIYYHRNP 383
>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
Length = 622
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|432934421|ref|XP_004081934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oryzias latipes]
Length = 758
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 117/274 (42%), Positives = 161/274 (58%), Gaps = 22/274 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP+TS++ F +E WSTLLR+V SV+NRSP LLKEIILVDD S + + P+ +S
Sbjct: 313 LPSTSVIFCFVDEVWSTLLRSVHSVLNRSPPHLLKEIILVDDFSTKDYLKEPLDKYMSQF 372
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
I G +L V+ P+++ I Y+
Sbjct: 373 PKVRIVRLKERQGLIRARLAGAAVATGEVLTFLDSHVECNVGWLEPLLERI------YLD 426
Query: 114 AKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTP 173
+ V CP+I+VI+D+ Y+ + G F W L F W + + + S P+R P
Sbjct: 427 RRKVPCPVIEVINDKDMSYMLIDNFQRGIFKWPLVFGWNALSEDYIRKHNITVSDPIRCP 486
Query: 174 TMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDK 233
MAGGLF+IDK YFYELG+YD G+D+WGGEN+E+SF++W CGG +EIIPCS VGH+FR +
Sbjct: 487 VMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRGQ 546
Query: 234 SPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
+PY+FP K V N ARVAEVW+DE++D +Y
Sbjct: 547 NPYSFPKDRQKTVERNLARVAEVWLDEYKDLFYG 580
>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
gorilla gorilla]
Length = 622
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Nomascus leucogenys]
Length = 622
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Pan paniscus]
gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pan paniscus]
Length = 622
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
Length = 622
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
Length = 622
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|260814835|ref|XP_002602119.1| hypothetical protein BRAFLDRAFT_125760 [Branchiostoma floridae]
gi|229287425|gb|EEN58131.1| hypothetical protein BRAFLDRAFT_125760 [Branchiostoma floridae]
Length = 1164
Score = 223 bits (567), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 118/262 (45%), Positives = 160/262 (61%), Gaps = 23/262 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LP+TS++I F E+WSTLLRTV SVINRSP L+KEIILVDDAS R + + +
Sbjct: 783 LPSTSVIICFCEESWSTLLRTVHSVINRSPPRLVKEIILVDDASSR------EHLKKKLD 836
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQT 129
+Y+ E+ K + P + + ++ +
Sbjct: 837 DYM---------------ERFPKVKIIHLPERAGLIRARLRGAAVRRLLESKGGISLHLD 881
Query: 130 FEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYE 189
Y ++ MT GGF+W+++FRW VP EM RR +++ P+R+PTMAGGLF+I K +F E
Sbjct: 882 HLYSSSGHMTRGGFDWRMHFRWNTVPDYEMARRKMEKA-PIRSPTMAGGLFSIHKMFFEE 940
Query: 190 LGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHN 249
LG+YD G++IWGGENLE+SF+ W CGG LEI+PCS VGH+FR PY FPGG + V N
Sbjct: 941 LGTYDPGLEIWGGENLELSFKTWMCGGTLEILPCSRVGHIFRQSQPYRFPGGGMQTVQRN 1000
Query: 250 AARVAEVWMDE-WRDFYYAMNP 270
+ RV +VWMDE R +YA+NP
Sbjct: 1001 SLRVVQVWMDERHRKAFYAVNP 1022
>gi|332839183|ref|XP_001147578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
5 [Pan troglodytes]
Length = 638
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASDM----TWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
Length = 622
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Takifugu rubripes]
Length = 552
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 125/288 (43%), Positives = 168/288 (58%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK+K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ E+ILVDD S E +
Sbjct: 82 CKQKLYAEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEVILVDDFSDKEHLKV 141
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + + I + G +L R K V+ P++D I
Sbjct: 142 PLDEYMVRLPKVRILRTKKREGLIRTRLLGAARAKGEVITFLDSHCEANVNWLPPLLDRI 201
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMRR 162
+ KT+VCP+IDVI F Y T A D G F+W++ ++ R+P P E+ +
Sbjct: 202 AQNR------KTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYK--RIPIPLELQKE 253
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E P
Sbjct: 254 --DPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDTP 311
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH++R PY PGGVS + N RVAEVWMDE+ ++ Y P
Sbjct: 312 CSRVGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYIYQRRP 357
>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 6 [Homo sapiens]
Length = 622
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSIPKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Loxodonta africana]
Length = 622
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/286 (42%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ +P LKEIILVDDAS + + +Q
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTAPAIFLKEIILVDDASTE------EYLKEQLD 229
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
+Y+ + ++ + K + ++ V + ++ A
Sbjct: 230 QYVKQLQIV------RVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLL 283
Query: 116 --------TVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I TFE+ + G F+W L F W VP E RR
Sbjct: 284 ARIAEDETVVVSPDIITIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWETVPLHEKQRRK 343
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 344 -DETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G++ ++ N R+AEVWMD++++ +Y N
Sbjct: 403 SVVGHVFRTKSPHTFPKGIN-VIARNQVRLAEVWMDDYKEIFYRRN 447
>gi|344271584|ref|XP_003407617.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12-like [Loxodonta
africana]
Length = 576
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 128/288 (44%), Positives = 169/288 (58%), Gaps = 30/288 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C +K Y LP TS++I F+NEAWSTLLRTV+SV+ S LL+E+ILVDD S+R +
Sbjct: 120 CMEKKYDYENLPRTSVIIAFYNEAWSTLLRTVYSVLETSSDMLLEEVILVDDYSDREHLK 179
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ +S + ++ G +L + K V+ P+++
Sbjct: 180 ERLATELSGLPKVRLIRANKREGLVRARLLGASVAKGNVLTFLDCHCECHEGWLEPLLER 239
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ + GGF+W+L F W+ VP RE RR
Sbjct: 240 IHEEE------SAVVCPVIDVIDWDTFEYLGNPGEPQIGGFDWRLVFTWHTVPERER-RR 292
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE P
Sbjct: 293 MRSPIDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHP 352
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF K+PY+ L N+ R AEVWMDE+++ YY NP
Sbjct: 353 CSHVGHVFPKKAPYS-----RNKALANSVRAAEVWMDEYKELYYHRNP 395
>gi|167515504|ref|XP_001742093.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778717|gb|EDQ92331.1| predicted protein [Monosiga brevicollis MX1]
Length = 548
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 122/280 (43%), Positives = 166/280 (59%), Gaps = 14/280 (5%)
Query: 1 CKKKSYPTF--LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERV 56
C K+ + LPT S++I+F+NEA STL+RTVWSV++R+ TLLKEIILVDD S E +
Sbjct: 199 CLKREHYNIDSLPTVSVIIIFYNEARSTLMRTVWSVLDRTHPTLLKEIILVDDHSSMEHL 258
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
P+ D ++ + D G K+ V+ C + D + +
Sbjct: 259 GQPLEDEVAATPKTKLLRLDKRSGLIRAKVHGALNAVGDVILFLDSHCEVNDGYLEPLLD 318
Query: 111 --YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
Y KTV PIID I +T+E+ T + G F+W L F W + P + R D +
Sbjct: 319 RIYRNRKTVAMPIIDAIDFETWEHRTGL-LERGVFDWTLTFSWKMLSPMQEAERADDPLA 377
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P +P MAGGLFA+D+ +F+E+G+YD GMD WGGEN+EMS R+W CGG++E +PCSHVGH
Sbjct: 378 PFTSPAMAGGLFAMDRKFFFEIGAYDMGMDTWGGENIEMSVRIWTCGGVIEAVPCSHVGH 437
Query: 229 VFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFYYA 267
VFR K+PY F K + N RVA+VWMD+ D YYA
Sbjct: 438 VFRQKTPYKFKDKDPLKTIGRNLNRVADVWMDDHADLYYA 477
>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pongo abelii]
gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
3 [Pongo abelii]
Length = 622
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 123/280 (43%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG +EIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|391347961|ref|XP_003748222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 658
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 126/292 (43%), Positives = 171/292 (58%), Gaps = 30/292 (10%)
Query: 1 CKKKSYPTF-LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ +YP +PT S+VI+F +E +STLLRT+ SVI+RSPR LL+EIILVDD S+
Sbjct: 191 CRAITYPVAEMPTASVVIIFTDEIFSTLLRTIVSVIDRSPRHLLREIILVDDFSQS---- 246
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKN----------RHKKTVVCPIIDVISDQTF 109
+ + D+ YI +L E++ R + V +D + T
Sbjct: 247 --EDLKDRLERYIEHHFRADVVRLIRLPERSGLIRARLVGARAARGDVLIFLDSHCETTP 304
Query: 110 EYITA---------KTVVCPIIDVISDQTFEYITASDMTW--GGFNWKLNFRWYRVPPRE 158
++ + VVCP+IDVI +T +Y+ A + GGFNW+ F W+ +P
Sbjct: 305 GWLEPLLEPIRRDRRAVVCPVIDVIDYRTLQYVAAEGDRFQIGGFNWRGEFTWHNIPS-A 363
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
R + P+R+PTMAGGLFAI+++YF+E GSYDE MD WGGENLEMSFR+WQCGG +
Sbjct: 364 WRRNRVSVAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHI 423
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
I PCSHVGH+FRD PY PGG + N R EVWMDE++ + Y P
Sbjct: 424 VIAPCSHVGHIFRDYQPYKIPGGKDTNAI-NTKRAVEVWMDEFKKYIYQARP 474
>gi|322792015|gb|EFZ16120.1| hypothetical protein SINV_06269 [Solenopsis invicta]
Length = 433
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 183/313 (58%), Gaps = 30/313 (9%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--RVV 57
CK+ Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+ +
Sbjct: 1 CKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLK 60
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+ D + + I ++ G +L K V+ P++D
Sbjct: 61 RQLEDYMMNYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDR 120
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMR 161
I+ TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +
Sbjct: 121 IARD------PTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPERER-K 173
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
R + + P+ +PTMAGGLF+ID+ +F +G+YD G DIWGGENLE+SF+ W CGG LEI+
Sbjct: 174 RHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 233
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY---AMNPGKSASVST 278
PCSHVGH+FR +SPY + GV+ ++ N+ R++EVW+DE+ +YY + G VS
Sbjct: 234 PCSHVGHIFRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGNYGDVSE 292
Query: 279 CAAHFRMLSYSSW 291
A + L S+
Sbjct: 293 RKALRKKLGCKSF 305
>gi|351699369|gb|EHB02288.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Heterocephalus
glaber]
Length = 570
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 129/290 (44%), Positives = 170/290 (58%), Gaps = 34/290 (11%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-------- 51
CK+K Y LPTTS++I F+NEAWSTLLRTV+SV+ SP L++E+ILVDD
Sbjct: 114 CKEKKYDYENLPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILVEEVILVDDYSDKEHLK 173
Query: 52 ---ASERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDV 103
A E P + +I E + + + G L + + P++
Sbjct: 174 ERLAEELSALPKVRLIRATKREGLVRARLLGASVARGDVLTFLDCHCECHEGWLEPLLQR 233
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPRE--MM 160
I ++ VVCP+IDVI TFEY+ S + GGFNW+L F W+ VP R+ +M
Sbjct: 234 IHEKE------SAVVCPVIDVIDWNTFEYLGNSREPQVGGFNWQLVFTWHVVPERDRLLM 287
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
+ D +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE
Sbjct: 288 KSPIDV---IRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLET 344
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGHVF ++PY+ L N+ R AEVWMDE+++ YY P
Sbjct: 345 HPCSHVGHVFPKQAPYS-----RSKALANSVRAAEVWMDEFKELYYHRTP 389
>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Saimiri boliviensis boliviensis]
gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Saimiri boliviensis boliviensis]
Length = 622
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/280 (43%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G + ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKG-TNVIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|5834643|emb|CAB55352.1| N-acetylgalactosaminyltransferase T-6 [Mus musculus]
Length = 623
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 129/281 (45%), Positives = 168/281 (59%), Gaps = 29/281 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------ERV-----VC 58
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS ER+
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKERLEQYVQQL 235
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITAK--- 115
I+ V+ + + + + + + K H P + V S E + A+
Sbjct: 236 QIVRVVRQRERKGLITARLLGASV---AQAKGAH--VSWTPTVSV-SHGWLEPLLARIAE 289
Query: 116 ---TVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VV P I I TF++ + G F+W L F W +P E RR D +
Sbjct: 290 DKTPVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPQHEKQRRK-DETY 348
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGH
Sbjct: 349 PIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGH 408
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
VFR KSP+TFP G S ++ N R+AEVWMD+++ +Y N
Sbjct: 409 VFRTKSPHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRN 448
>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Callithrix jacchus]
Length = 622
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/280 (43%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASMAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPIQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G + ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKG-TNVIARNQVRLAEVWMDSFKKIFYRRN 447
>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
Length = 454
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 121/292 (41%), Positives = 169/292 (57%), Gaps = 37/292 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK++SYP LP S+VIVFHNE WSTL+RTV +V+ RSP +L+EI++VDD S +
Sbjct: 50 CKQRSYPINLPKASVVIVFHNEGWSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNK----- 104
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
D + + +Y I + G ++ N VV
Sbjct: 105 -DFLKQKLDDYTKKLGKIKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLP 163
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
P+++ I+ + +T VCP ID I +TF+Y G FNW+ +++ V P E
Sbjct: 164 PLLERIA------LNRRTAVCPTIDFIDHKTFQYKPMDPYIRGTFNWRFDYKERAVRPEE 217
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
M +R D + +++P MAGGLFAI++++F ELG YD GM IWGGE E+SF++WQCGG L
Sbjct: 218 MAKRR-DPTQEVKSPVMAGGLFAINREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQL 276
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
E IPCS VGHV+R PYT+P + +V N RVAEVWMDE++D+ Y P
Sbjct: 277 ENIPCSRVGHVYRHHVPYTYPKHDATLV--NFRRVAEVWMDEYKDWLYDKRP 326
>gi|334348070|ref|XP_001368069.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Monodelphis domestica]
Length = 708
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 167/288 (57%), Gaps = 30/288 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ KS+ LPTTS++I F+NEAWSTLLRTV SV+ +P LLKEIILVDD S++V
Sbjct: 254 CRLKSFDYRRLPTTSVIIAFYNEAWSTLLRTVHSVLETAPAVLLKEIILVDDLSDKVYLK 313
Query: 60 I-----------IDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDV 103
+ +I + E + + + F L + + P+++
Sbjct: 314 AQLETYISSLQRVRLIRTKKREGLVRARLIGATFATGEVLTFLDCHCECNQGWLEPLLER 373
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++CP+ID I TF+ Y+ + GGF+W L F+W VP E RR
Sbjct: 374 IGQDE------SVIICPVIDTIDWNTFDFYMQEGEPVIGGFDWHLTFQWQPVPEHER-RR 426
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
R+ P+++P MAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG LEI P
Sbjct: 427 WQSRTDPIKSPVMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGALEIHP 486
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY P N R AEVWMD++++ +Y NP
Sbjct: 487 CSHVGHVFPKRAPYARPN-----FRQNTVRAAEVWMDDYKEHFYNRNP 529
>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Takifugu rubripes]
Length = 618
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/298 (42%), Positives = 174/298 (58%), Gaps = 44/298 (14%)
Query: 1 CKKKSYP--TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
C ++ +P LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS
Sbjct: 159 CVERKFPRCPALPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEIILVDDAS----- 213
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT----- 113
+ + +Q E++ + LR+ R K ++ + S+ E +T
Sbjct: 214 -VAGHLKEQLEEFVLQFKIVR-----VLRQPER--KGLITARLLGASEAQGEVLTFLDAH 265
Query: 114 ------------------AKTVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRW 151
VV P I I ++F++ ++ G F+W L F W
Sbjct: 266 CECFHGWLEPLLARIVEEPTAVVSPEITTIDLESFQFNKPAPSSHAFNRGNFDWSLTFGW 325
Query: 152 YRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRV 211
++P R D + P++TPT AGGLF+I K YF +G+YD+ M+IWGGEN+EMSFRV
Sbjct: 326 EQIPEAARKLRK-DETCPVKTPTFAGGLFSILKTYFEHIGTYDDKMEIWGGENIEMSFRV 384
Query: 212 WQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
WQCGG LEIIPCS VGHVFR KSP+TFP G ++++ N R+AEVWMD+++ +Y N
Sbjct: 385 WQCGGQLEIIPCSVVGHVFRTKSPHTFPKG-TEVITRNQVRLAEVWMDDYKKIFYRRN 441
>gi|344276552|ref|XP_003410072.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Loxodonta africana]
Length = 527
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 124/290 (42%), Positives = 174/290 (60%), Gaps = 31/290 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIIL------------ 48
CK+KSYP LP S+VI F+NEA+S LLRTV SV +R+P LL EIIL
Sbjct: 141 CKEKSYPLDLPAASVVICFYNEAFSALLRTVHSVTDRTPAHLLHEIILVDDDSDLDDLKG 200
Query: 49 -VDDASERVVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIID 102
+D+ ++ + VI ++ E + M G L + + P++
Sbjct: 201 ELDEYVQKYLPGKTKVIRNKKREGLIRGRMIGAAQATGEVLVFLDSHCEVNEMWLQPLLA 260
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ + TVVCP+ID+IS T Y ++S + GGFNW L+F+W VP E+
Sbjct: 261 AVRED------PHTVVCPVIDIISADTLLY-SSSPIVRGGFNWGLHFKWDLVPFDEL--- 310
Query: 163 GGDR--SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
GG ++P+++PTMAGGLFA+++ YF ELG YD GMDIWGGENLE+SFR+W CGG L I
Sbjct: 311 GGPEGATAPIKSPTMAGGLFAMNRHYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
IPCS VGH+FR + PY P G + HN+ R+A VW+DE+++ Y+++ P
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEGQDTMT-HNSLRLAHVWLDEYKEQYFSLRP 419
>gi|170587206|ref|XP_001898369.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158594195|gb|EDP32781.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 582
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 119/275 (43%), Positives = 163/275 (59%), Gaps = 22/275 (8%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-----VCPIIDVI 64
LP+TS+VI +HNEA STLLRT+ SV RSP LL EIILVDD S+ + + PI +V+
Sbjct: 144 LPSTSVVITYHNEARSTLLRTIVSVFLRSPPQLLHEIILVDDFSDDITIGTDLLPIENVV 203
Query: 65 SDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI---------TAK 115
+ + G +++ + +V+ +D + ++ +
Sbjct: 204 -------VIRNTKREGLIRSRVKGSTLARASVLT-FLDSHCECNVNWLEPLLARVKENHR 255
Query: 116 TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTM 175
VV P+ID+I TF+Y+ AS GGF W L F+W + + R ++P+RTP +
Sbjct: 256 AVVAPVIDIIDKDTFKYVAASADLRGGFEWNLIFKWEYLLGKLRDDRHAQPTAPIRTPVI 315
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF I KD+F +LG+YDE MD+WGGENLE+SFRVW CGG LEIIPCS VGHVFR + P
Sbjct: 316 AGGLFMIQKDWFEKLGTYDEQMDVWGGENLELSFRVWLCGGSLEIIPCSRVGHVFRKQHP 375
Query: 236 YTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
YTFPGG + N R AEVW+ +++ Y P
Sbjct: 376 YTFPGGNGNVFQKNTRRAAEVWLGDYKYLYLRKVP 410
>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
Length = 619
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/273 (44%), Positives = 169/273 (61%), Gaps = 20/273 (7%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LKEII+VDDAS E + + + +
Sbjct: 171 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAAFLKEIIMVDDASTAEHLHGKLEEYVKAL 230
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVCPI----------IDVISDQTFEYITAKTV 117
+ G +L ++ + ++ + ++ + + E TA V
Sbjct: 231 KIVKVVRQPERKGLITARLLGASKAEGEILTFLDAHCECFHGWLEPLLARIVEEPTA--V 288
Query: 118 VCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTP 173
V P I I TF++ TA G F+W L F W +P E +R D + P++TP
Sbjct: 289 VSPEITTIDLNTFQFHKPVATARAHNRGNFDWSLTFGWEGIPDYENAKRK-DETYPVKTP 347
Query: 174 TMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDK 233
T AGGLF+I K YF ++G+YD+ M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHVFR K
Sbjct: 348 TFAGGLFSISKAYFEKIGTYDDKMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTK 407
Query: 234 SPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
SP+TFP G ++++ N R+AEVWMD+++ +Y
Sbjct: 408 SPHTFPKG-TEVITRNQVRLAEVWMDDYKLIFY 439
>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
anubis]
Length = 637
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 172/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 169 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 228
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 229 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 288
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 289 AR------NRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 339
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 340 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 399
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + S AA
Sbjct: 400 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAAQ 457
Query: 283 FRMLS 287
++ S
Sbjct: 458 KKLRS 462
>gi|427794265|gb|JAA62584.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 591
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 127/285 (44%), Positives = 171/285 (60%), Gaps = 32/285 (11%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------ 53
C+++ + LPT S+V+ F+NEAWS L+RTV S++ R+P LL E+ILVDD S
Sbjct: 115 CRQQEFQEQSLPTASVVVCFYNEAWSALVRTVHSILERTPAALLHELILVDDNSTLPELG 174
Query: 54 ---ERVVCP----IIDVISDQTFEYITASDMTWGGFNWK---LREKNRHKKTVVC---PI 100
R V + +I E + + M +G N L + H + V P+
Sbjct: 175 LQLSRYVASELPSHVRLIRTPAREGLIRARM-YGAHNASGQVLVFLDSHCEVNVGWLEPM 233
Query: 101 IDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMM 160
+ I TV CP+ID+I+ TFEY +AS + GGFNW L+F+W PPR +
Sbjct: 234 LARIG------ANRTTVTCPVIDIINADTFEY-SASPIVRGGFNWGLHFKW-ESPPR--L 283
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
R P+ +PTMAGGLFA+D+ YF+ELG YD+GMDIWGGENLE+SFR+W CGG LEI
Sbjct: 284 RGPQQAIDPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISFRIWMCGGRLEI 343
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
+PCS VGHVFR + PY P G + N+ RVA VWMDE++ +Y
Sbjct: 344 LPCSRVGHVFRRRRPYGSPSGEDTLT-KNSLRVAHVWMDEYKTYY 387
>gi|307203928|gb|EFN82835.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Harpegnathos saltator]
Length = 482
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 118/275 (42%), Positives = 167/275 (60%), Gaps = 18/275 (6%)
Query: 6 YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVIS 65
Y LP T+++I FHNEAWS LLRTV SV++RSP L++EIILVDD S+ + + +
Sbjct: 1 YSKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSD--MPHLKRQLE 58
Query: 66 DQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYITAK---- 115
D Y + +R + P++ + E + +
Sbjct: 59 DYMMNYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARD 118
Query: 116 --TVVCPIIDVISDQTFEYI--TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLR 171
TVVCP+IDVI D T EY + + GGF+W L F W+ VP RE +R + + P+
Sbjct: 119 PTTVVCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREK-KRHKNPAEPVW 177
Query: 172 TPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFR 231
+PTMAGGLF+ID+ +F +G+YD G DIWGGENLE+SF+ W CGG LEI+PCSHVGH+FR
Sbjct: 178 SPTMAGGLFSIDRVFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFR 237
Query: 232 DKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
+SPY + GV+ ++ N+ R++EVW+DE+ +YY
Sbjct: 238 KRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYY 271
>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 622
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS E + + +
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKERLEQYVQQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ G +L + + V+ P++ I++
Sbjct: 236 QIVRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TF++ + G F+W L F W +P E RR D + P
Sbjct: 290 KTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD+++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRN 447
>gi|126303658|ref|XP_001380711.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Monodelphis domestica]
Length = 552
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 124/282 (43%), Positives = 162/282 (57%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--- 57
C Y LP TSI+I FHNEA STLLRT+ SV NR+P L+ EIILVDD S+
Sbjct: 101 CATLHYGPDLPPTSIIITFHNEARSTLLRTIRSVSNRTPVHLVHEIILVDDFSDDPDDCQ 160
Query: 58 ----CPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT 108
P + + ++ E I +D+ L K + P++ I +
Sbjct: 161 LLSKLPKVKCLRNEQREGLIRSRIRGADLAQASILTFLDSHCEVNKDWLLPLLHRIKED- 219
Query: 109 FEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
VVCP+ID+I+ TF Y+++S GGF+W L+F+W + RE R D
Sbjct: 220 -----PTRVVCPVIDIINRDTFAYVSSSPDMRGGFDWTLHFKWEELTLREKALRV-DPIQ 273
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+ TP ++GGLF ++K +F LG YD MDIWGGEN E+SFRVW CGG LEI+PCS VGH
Sbjct: 274 PIETPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEILPCSRVGH 333
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VFR K PYTFP G + N R AEVWMDE++ ++YA P
Sbjct: 334 VFRKKHPYTFPEGNLNTYIKNTKRTAEVWMDEFKHYFYAARP 375
>gi|109096689|ref|XP_001083664.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
mulatta]
Length = 641
Score = 220 bits (561), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oreochromis niloticus]
Length = 600
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C++K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ E+ILVDD S E +
Sbjct: 130 CRQKLYAEKLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPSRLITEVILVDDFSDKEHLKV 189
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ + + I + G +L K V+ P++D I
Sbjct: 190 ALEEYMKRMPKVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANVNWLPPLLDRI 249
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ K +VCP+IDVI F Y T A D G F+W++ ++ +PP EM R
Sbjct: 250 AQ------NRKAIVCPMIDVIDHDNFGYDTQAGDAMRGAFDWEMYYKRIPIPP-EMQR-- 300
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF++W CGG +E IPC
Sbjct: 301 DDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKLWMCGGRMEDIPC 360
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY PGG+S + N RVAEVWMDE+ ++ Y P
Sbjct: 361 SRVGHIYRKYVPYKVPGGIS--LAKNLKRVAEVWMDEYAEYVYQRRP 405
>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
scrofa]
Length = 624
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 171/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 156 CNSKRYLEMLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 215
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 216 PLEDYMALFPNVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 275
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 276 AR------NRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 326
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 327 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 386
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 387 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEYRHLSAGDVAAQ 444
Query: 283 FRMLS 287
++ S
Sbjct: 445 KKLRS 449
>gi|345317797|ref|XP_001520970.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Ornithorhynchus anatinus]
Length = 467
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 126/288 (43%), Positives = 169/288 (58%), Gaps = 30/288 (10%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C +K Y LP TS+VI F+NEA STLLRTV SV+ SP LL E+ILVDD S++ +
Sbjct: 12 CNEKKYDYRRLPRTSVVIAFYNEARSTLLRTVHSVLETSPDVLLNEVILVDDYSDKGHLK 71
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ + ++ + + G +L + V+ P+++
Sbjct: 72 EPLENHLAGLPKVRLIRASKREGLVRARLLGASIATGQVLTFLDCHCECHEGWLEPLLER 131
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I ++ VVCP+IDVI TFEY+ A + GGF+W+L F W+ +P RE RR
Sbjct: 132 IREEE------SAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHPIPEREQKRR 185
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ +R+PTMAGGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LEI P
Sbjct: 186 RS-KVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHP 244
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CSHVGHVF ++PY+ L N+ R AEVWMD +++ YY NP
Sbjct: 245 CSHVGHVFPKQAPYS-----RSKALANSVRAAEVWMDGYKELYYHRNP 287
>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Otolemur garnettii]
Length = 603
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 126/303 (41%), Positives = 171/303 (56%), Gaps = 29/303 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEAYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + S AA
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAAQ 423
Query: 283 FRM 285
R+
Sbjct: 424 KRL 426
>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
taurus]
Length = 620
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 123/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 152 CKSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 211
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 212 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 271
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 272 AR------NRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 322
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 323 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 382
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P
Sbjct: 383 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRP 427
>gi|27370010|ref|NP_766281.1| polypeptide N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|51315979|sp|Q8BGT9.1|GLT12_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
AltName: Full=Polypeptide GalNAc transferase 12;
Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 12;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 12
gi|26329325|dbj|BAC28401.1| unnamed protein product [Mus musculus]
gi|26334957|dbj|BAC31179.1| unnamed protein product [Mus musculus]
gi|33991661|gb|AAH56425.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|52851351|dbj|BAD52068.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74140287|dbj|BAE33836.1| unnamed protein product [Mus musculus]
Length = 576
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 127/274 (46%), Positives = 167/274 (60%), Gaps = 21/274 (7%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVCPIIDVISDQ 67
LP TS+VI F+NEAWSTLLRTV+SV+ SP LL+E+ILVDD S+R + + + +S
Sbjct: 130 LPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSQL 189
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAK--TVVC 119
+ + G +L + + V+ C + + + I K VVC
Sbjct: 190 PKVRLIRASRREGLVRARLLGASAARGEVLTFLDCHCECHEGWLEPLLQRIHEKESAVVC 249
Query: 120 PIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRS--SPLRTPTMA 176
P+IDVI TFEY+ + + GGF+W+L F W+ VP RE R RS +R+PTMA
Sbjct: 250 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRE---RQSMRSPIDVIRSPTMA 306
Query: 177 GGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPY 236
GGLFA+ K YF LGSYD GM++WGGENLE SFR+WQCGG LE PCSHVGHVF ++PY
Sbjct: 307 GGLFAVSKRYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPKQAPY 366
Query: 237 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ L N+ R AEVWMDE+++ YY NP
Sbjct: 367 S-----RSKALANSVRAAEVWMDEFKELYYHRNP 395
>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
4 [Ciona intestinalis]
Length = 582
Score = 220 bits (560), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 123/275 (44%), Positives = 161/275 (58%), Gaps = 17/275 (6%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP--IIDVISDQ 67
LPTTS+VI F+NE WSTL+RTV+SV++ SP LL EIILVDD S++V + D +
Sbjct: 140 LPTTSVVIAFYNEGWSTLIRTVFSVLHNSPDALLTEIILVDDYSDKVYLKDKLADFLKAL 199
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITA--KTVVC 119
+ + G +L + K V+ C ++ + E I +V
Sbjct: 200 ARVRLVRTTKREGLVRARLLGASLAKGEVLTFLDCHCECVEGWLEPLLERIMEDESVIVV 259
Query: 120 PIIDVISDQTFEYI-TASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGG 178
P+ID I TFEY + GGF+W+L F+W+ +P E RR P+R+PTMAGG
Sbjct: 260 PVIDTIDWNTFEYYYGGHEPQIGGFDWRLTFQWHTIPDHERKRRKSP-VDPIRSPTMAGG 318
Query: 179 LFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTF 238
LFA+ K YF +G+YD GM+IWGGENLE+SFR W CGG LE IPCSHVGHVF +SPY
Sbjct: 319 LFAVSKRYFTRIGTYDAGMEIWGGENLELSFRTWMCGGKLETIPCSHVGHVFPKQSPYPR 378
Query: 239 PGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKS 273
P L N R AEVWMD+++ +Y NP S
Sbjct: 379 PK-----FLTNTLRAAEVWMDDYKRHFYIRNPPAS 408
>gi|313240484|emb|CBY32818.1| unnamed protein product [Oikopleura dioica]
Length = 635
Score = 220 bits (560), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 127/292 (43%), Positives = 165/292 (56%), Gaps = 30/292 (10%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------ 53
C YP LPTTSI+I FHNE STLLRT+ SVI ++P +LKEI+LVDDAS
Sbjct: 163 CPSVEYPKEGLPTTSIIITFHNELRSTLLRTIISVIRKTPANILKEIVLVDDASSDPDVG 222
Query: 54 -ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
E + + +I ++ + I A + G L K + P++ I
Sbjct: 223 MELIKIEKVKLIVNRERQGLIRARIRAVMVATGDTLTFLDSHVEVNKNWIQPLMQRIQQ- 281
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG--- 164
K VV PIIDVI+ TF YI A GG +W + FRW + + +G
Sbjct: 282 -----NPKIVVAPIIDVINKDTFSYIGADAFLTGGVSWAMVFRW------DWLTQGATST 330
Query: 165 -DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + LR+PT+AGGLF+I K +F+ELG YD+ MDIWGGEN+E SFRVWQCGG +EI+PC
Sbjct: 331 MDHTKGLRSPTIAGGLFSISKSWFHELGEYDDQMDIWGGENIEFSFRVWQCGGEMEILPC 390
Query: 224 SHVGHVFRDKSPYTF-PGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
S VGHVFRD+ PY F G + + + N R WMDE+ DFYY P +
Sbjct: 391 SRVGHVFRDEHPYDFGKKGSNNVFVKNNNRFVHTWMDEYTDFYYGTRPNAKS 442
>gi|426226648|ref|XP_004007451.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6 [Ovis aries]
Length = 792
Score = 220 bits (560), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 126/273 (46%), Positives = 164/273 (60%), Gaps = 35/273 (12%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS---------ERVVCPI 60
LP TS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS ER V +
Sbjct: 367 LPATSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERHVKQL 426
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTVVCP 120
V + ++ A + G+ L + + D+T VV P
Sbjct: 427 QVVQVVRVLTFLDAHCECFHGWLEPL-------------LARIAEDET-------VVVSP 466
Query: 121 IIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMA 176
I I TFE+ + G F+W L F W +P RE RR D + P+++PT A
Sbjct: 467 NIVTIDLNTFEFSKPVQRGRVQSRGNFDWSLTFGWEVLPAREKQRRK-DETYPIKSPTFA 525
Query: 177 GGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPY 236
GGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHVFR KSP+
Sbjct: 526 GGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKSPH 585
Query: 237 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
TFP G++ ++ N R+AEVWMD +++ +Y N
Sbjct: 586 TFPKGIN-VIARNQVRLAEVWMDGYKEIFYRRN 617
>gi|313226836|emb|CBY21981.1| unnamed protein product [Oikopleura dioica]
Length = 635
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 127/292 (43%), Positives = 165/292 (56%), Gaps = 30/292 (10%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------ 53
C YP LPTTSI+I FHNE STLLRT+ SVI ++P +LKEI+LVDDAS
Sbjct: 163 CPSVEYPKEGLPTTSIIITFHNELRSTLLRTIISVIRKTPANILKEIVLVDDASSDPDVG 222
Query: 54 -ERVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQ 107
E + + +I ++ + I A + G L K + P++ I
Sbjct: 223 MELIKIEKVKLIVNRERQGLIRARIRAVMVATGDTLTFLDSHVEVNKNWIQPLMQRIQQ- 281
Query: 108 TFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG--- 164
K VV PIIDVI+ TF YI A GG +W + FRW + + +G
Sbjct: 282 -----NPKIVVAPIIDVINKDTFSYIGADAFLTGGVSWAMVFRW------DWLTQGATST 330
Query: 165 -DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + LR+PT+AGGLF+I K +F+ELG YD+ MDIWGGEN+E SFRVWQCGG +EI+PC
Sbjct: 331 MDHTKGLRSPTIAGGLFSISKSWFHELGEYDDQMDIWGGENIEFSFRVWQCGGEMEILPC 390
Query: 224 SHVGHVFRDKSPYTF-PGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
S VGHVFRD+ PY F G + + + N R WMDE+ DFYY P +
Sbjct: 391 SRVGHVFRDEHPYDFGKKGSNNVFVKNNNRFVHTWMDEYTDFYYGTRPNAKS 442
>gi|355564239|gb|EHH20739.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
gi|355762987|gb|EHH62101.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
fascicularis]
gi|380809242|gb|AFE76496.1| polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
Length = 622
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|402886019|ref|XP_003906439.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Papio anubis]
gi|402886021|ref|XP_003906440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Papio anubis]
Length = 622
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
L TTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + + +
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +PP E RR D + P
Sbjct: 290 KTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRN 447
>gi|427789289|gb|JAA60096.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 526
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 131/294 (44%), Positives = 167/294 (56%), Gaps = 33/294 (11%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ Y LPT S+VI+F +E +S LLRTV+SVINR+P LL+EIILVDD S+
Sbjct: 149 CRNAEYDVENLPTASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQ----- 203
Query: 60 IIDVISDQTFEYITASDMTWG-----------GFNWKLREKNRHKKTVVCPIIDVISDQT 108
ID +++ E G G R V +D + T
Sbjct: 204 -IDEMANGRLERFIRRHFRPGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEAT 262
Query: 109 FEYITA---------KTVVCPIIDVISDQTFEYI-TASDM-TWGGFNWKLNFRWYRVPPR 157
++ TVVCPIIDVI D+T +Y+ T+SD GGFNWK F W P
Sbjct: 263 DHWLEPMVELIKKDRTTVVCPIIDVIDDKTLQYMGTSSDFYQIGGFNWKGEFIWINTP-- 320
Query: 158 EMMRRG-GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E R+ ++ P+R+PTMAGGLFAID+ YF+E GSYD M+ WGGENLEMSFR+W CGG
Sbjct: 321 EAWRKARKSKADPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGG 380
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
L I PCSHVGH+FRD PY FP + N AR+AEVWMD ++ ++Y P
Sbjct: 381 SLVIAPCSHVGHIFRDYHPYKFPSNKDTHGI-NTARLAEVWMDNYKYYFYQNRP 433
>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 5 [Macaca mulatta]
Length = 603
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 172/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + S AA
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAAQ 423
Query: 283 FRMLS 287
++ S
Sbjct: 424 KKLRS 428
>gi|390364218|ref|XP_793815.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like,
partial [Strongylocentrotus purpuratus]
Length = 531
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 174/294 (59%), Gaps = 37/294 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK SYP LP+TSI+I FHNEAWSTLLRT+ S+I+RSP L+KEIIL+DDAS E +
Sbjct: 85 CKNISYPHDLPSTSIIICFHNEAWSTLLRTLNSIIDRSPLRLIKEIILLDDASTMEHLQE 144
Query: 59 PIIDVIS------------DQTFEYITASDMTWGGFNWKLREK----NRHKKTVVC---P 99
PI D IS ++ I A M G + E + H + ++ P
Sbjct: 145 PIEDYISQIHSVRIRMVRAEKRLGLIKARMM---GVDASEGETFTFLDSHVEVMIGWLEP 201
Query: 100 II-DVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
++ + SD+T VV P++D I+ TF Y + + GGFNW+ +RW +P
Sbjct: 202 LLARLASDRTI-------VVMPVVDEINKDTFNYNVVPEPLQRGGFNWRFEYRWKPIPNY 254
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
+ + + +P+++P M GGL +D+ +F ELG +D GM++WGGENLE S ++W CGG
Sbjct: 255 D---KRPSKVAPIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGS 311
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVS-KIVLHNAARVAEVWMDEWRDFYYAMNP 270
+EIIPCS VGHV+RD SPY+F G IV HNA RV EVW DE + +Y P
Sbjct: 312 IEIIPCSRVGHVYRDTSPYSFLGQNPLDIVEHNAMRVVEVWTDEHKHHFYDRLP 365
>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 626
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 131/294 (44%), Positives = 167/294 (56%), Gaps = 33/294 (11%)
Query: 1 CKKKSYPT-FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
C+ Y LPT S+VI+F +E +S LLRTV+SVINR+P LL+EIILVDD S+
Sbjct: 149 CRNAEYDVENLPTASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQ----- 203
Query: 60 IIDVISDQTFEYITASDMTWG-----------GFNWKLREKNRHKKTVVCPIIDVISDQT 108
ID +++ E G G R V +D + T
Sbjct: 204 -IDEMANGRLERFIRRHFRPGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEAT 262
Query: 109 FEYITA---------KTVVCPIIDVISDQTFEYI-TASDM-TWGGFNWKLNFRWYRVPPR 157
++ TVVCPIIDVI D+T +Y+ T+SD GGFNWK F W P
Sbjct: 263 DHWLEPMVELIKKDRTTVVCPIIDVIDDKTLQYMGTSSDFYQIGGFNWKGEFIWINTP-- 320
Query: 158 EMMRRG-GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E R+ ++ P+R+PTMAGGLFAID+ YF+E GSYD M+ WGGENLEMSFR+W CGG
Sbjct: 321 EAWRKARKSKADPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGG 380
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
L I PCSHVGH+FRD PY FP + N AR+AEVWMD ++ ++Y P
Sbjct: 381 SLVIAPCSHVGHIFRDYHPYKFPSNKDTHGI-NTARLAEVWMDNYKYYFYQNRP 433
>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
Length = 652
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 123/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 184 CKSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 243
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 244 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 303
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 304 AR------NRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 354
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 355 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 414
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P
Sbjct: 415 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRP 459
>gi|148878418|gb|AAI46056.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Bos
taurus]
gi|296487792|tpg|DAA29905.1| TPA: polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 165/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LP TS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + P+ +
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ G +L + + V+ P++ I++
Sbjct: 236 QVVQVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +P RE RR D + P
Sbjct: 290 ETVVVSPNIVTIDLNTFEFSKPVQRGRIQSRGNFDWSLTFGWEVLPAREKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G++ ++ N R+AEVWMD +++ +Y N
Sbjct: 409 FRTKSPHTFPKGIN-VIARNQVRLAEVWMDGYKEIFYRRN 447
>gi|62751482|ref|NP_001015534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
gi|75057892|sp|Q5EA41.1|GALT6_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|59857821|gb|AAX08745.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 124/280 (44%), Positives = 165/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LP TS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + P+ +
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ G +L + + V+ P++ I++
Sbjct: 236 QVVQVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W +P RE RR D + P
Sbjct: 290 ETVVVSPNIVTIDLNTFEFSKPVQRGRVQSRGNFDWSLTFGWEVLPAREKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G++ ++ N R+AEVWMD +++ +Y N
Sbjct: 409 FRTKSPHTFPKGIN-VIARNQVRLAEVWMDGYKEIFYRRN 447
>gi|167536139|ref|XP_001749742.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771890|gb|EDQ85551.1| predicted protein [Monosiga brevicollis MX1]
Length = 1275
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 119/284 (41%), Positives = 167/284 (58%), Gaps = 20/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK +YP LP +++I F NEAWS L RTVWSV++R+P LL EIIL+DDAS+ +
Sbjct: 240 CKDVAYPPDLPAATVIICFVNEAWSALFRTVWSVLDRTPENLLHEIILLDDASDASWLQQ 299
Query: 59 PIIDVISDQTFEY-ITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + + + S G +L +H +D + +I
Sbjct: 300 PLEEELQRLPAKVKLVRSPRRLGLIRARLL-GAKHATADYMIFLDSHCEANVGWIQPLLA 358
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMT-WGGFNWKLNFRWYRVPPREMMRRGGDRS 167
VV P+ID I++ Y A + G F+W L+F W P E + + D
Sbjct: 359 WMAGDPSRVVTPVIDSINNNDMSYHGAGGASSRGTFHWTLDFSWEANP--EPVAQVTD-- 414
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+++PTMAGGLF I++ YFY++GSYD+GMD WGGENLEMSFRVWQCGG L I+PCSHVG
Sbjct: 415 -PVKSPTMAGGLFGINRQYFYDVGSYDQGMDGWGGENLEMSFRVWQCGGSLHILPCSHVG 473
Query: 228 HVFRDKSPYTFPGG-VSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
H+FRD PYT P ++ L N+ R+AE WMD++++ +Y + P
Sbjct: 474 HIFRDSHPYTIPNSTINDTFLRNSIRLAETWMDDYKEIFYQIRP 517
>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
Length = 551
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 172/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 83 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 142
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 143 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 202
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 203 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 253
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 254 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 313
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + S AA
Sbjct: 314 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAAQ 371
Query: 283 FRMLS 287
++ S
Sbjct: 372 KKLRS 376
>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cricetulus griseus]
Length = 715
Score = 219 bits (559), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 247 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 306
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 307 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRI 366
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 367 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 417
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 418 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 477
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 478 SRVGHIYRKSVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 522
>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oreochromis niloticus]
Length = 587
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 127/289 (43%), Positives = 172/289 (59%), Gaps = 32/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP T+I+I FHNE WS+LLRTV SV+NRSP L+ EIILVDD S++
Sbjct: 117 CKHKLYAEKLPNTTIIIPFHNEGWSSLLRTVHSVLNRSPPHLIAEIILVDDFSDKEHLKV 176
Query: 56 ------VVCPIIDVISDQTFEYITASDMTWGGFNWK---LREKNRHKKTVVC---PIIDV 103
V P + ++ + E + + + G K L + H + V P++D
Sbjct: 177 ALEEYMVRLPKVRILRTKKREGLIRTRLL-GAAAAKGEVLTFLDSHCEANVNWLPPLLDR 235
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMR 161
I+ KT+VCP+IDVI F Y T A D G F+W++ ++ R+P P E+ +
Sbjct: 236 IAQNR------KTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYK--RIPIPTELQK 287
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E I
Sbjct: 288 --DDPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDI 345
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCS VGH++R PY PGGVS + N RVAEVWMDE+ ++ Y P
Sbjct: 346 PCSRVGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYIYQRRP 392
>gi|109079473|ref|XP_001111560.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 4 [Macaca mulatta]
Length = 602
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 172/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 134 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 193
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 194 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 253
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 254 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 304
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 305 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 364
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + S AA
Sbjct: 365 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAAQ 422
Query: 283 FRMLS 287
++ S
Sbjct: 423 KKLRS 427
>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Callithrix jacchus]
Length = 667
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 199 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 258
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 259 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 318
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 319 AR------NRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 369
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 370 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 429
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 430 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 474
>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
scrofa]
Length = 506
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 171/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 38 CNSKRYLEMLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 97
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 98 PLEDYMALFPNVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 157
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 158 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 208
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 209 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 268
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 269 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEYRHLSAGDVAAQ 326
Query: 283 FRMLS 287
++ S
Sbjct: 327 KKLRS 331
>gi|345326650|ref|XP_003431069.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4-like
[Ornithorhynchus anatinus]
Length = 580
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 130/284 (45%), Positives = 167/284 (58%), Gaps = 41/284 (14%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS+VI F+NEAWSTLLRTV SV+ SP LLKE+ILVDD S+R + +
Sbjct: 136 LPTTSVVIAFYNEAWSTLLRTVHSVLETSPAVLLKEVILVDDLSDR------PYLKAELE 189
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII-------DVIS-------------DQTF 109
+Y++A +L NR + V +I +V++ +
Sbjct: 190 KYVSALQRV------RLVRTNRREGLVRARLIGATFATGEVLTFLDCHCECGPGWLEPLL 243
Query: 110 EYI--TAKTVVCPIIDVISDQTFE-YITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
E I VVCP+ID I TFE Y+ + GGF+W+L F+W VP RR R
Sbjct: 244 ERIGRNETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWQTVP-ERERRRRRSR 302
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+ +PTMAGGLFA+ K YF LG+YD GM++WGGENLE+SFRVWQCGG LEI+PCSHV
Sbjct: 303 IDPIPSPTMAGGLFAVGKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEILPCSHV 362
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GHVF ++PY P L N AR AEVWMD +++ +Y NP
Sbjct: 363 GHVFPKRAPYARPS-----FLRNTARAAEVWMDGYKEHFYNRNP 401
>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
[Saccoglossus kowalevskii]
Length = 634
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 174/316 (55%), Gaps = 42/316 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK YPT LPT S+V+VF NE WSTL+RTV SV N SP LL EI++VDD S++
Sbjct: 183 CKYWHYPTDLPTASVVLVFINEGWSTLMRTVHSVFNTSPSHLLAEIVMVDDFSDK----- 237
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI-------- 112
D + + EYI D G KL + + + I I+ + E +
Sbjct: 238 -DHLKSKLEEYIK-QDRFEGKI--KLVRNAKREGLIRARTIGAINAERGEVVVFLDAHCE 293
Query: 113 ---------------TAKTVVCPIIDVISDQTFEYITASD-MTWGGFNWKLNFRWYRVPP 156
K VVCP++D + F Y +D M G FNW ++ +PP
Sbjct: 294 CSPNWLPPLLSRIKQNRKAVVCPLVDAVDADNFGYAPQADGMARGVFNWDFFYKRIPIPP 353
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
+E RR + S P R+P MAGGLFA+ + +F+++G YD G+DIWGGE E+SF++W CGG
Sbjct: 354 KEANRRERN-SEPYRSPVMAGGLFALSRSFFFDIGGYDNGLDIWGGEQYEISFKIWMCGG 412
Query: 217 ILEIIPCSHVGHVFRDKS-PYTFP---GGVSKIVLHNAARVAEVWMDEWRDFYYAMNP-- 270
ILE +PCS VGH++R PY++P G+S IV N RVAEVWMDE+++++Y M P
Sbjct: 413 ILEFVPCSRVGHIYRRGGIPYSYPQSDDGIS-IVNKNYLRVAEVWMDEYKEYFYRMKPEL 471
Query: 271 -GKSASVSTCAAHFRM 285
GK T FR
Sbjct: 472 RGKPYGDITEQVQFRQ 487
>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 619
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 163/288 (56%), Gaps = 29/288 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+KK Y LPT S++I+F+NE WS LLRTV+SV+NRSP LLKEIILV+D S
Sbjct: 148 CRKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPHLLKEIILVNDHSTKPFLWK 207
Query: 54 ------ERVVCPIIDVI-----SDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID 102
E + P + +I S + + G L + P+++
Sbjct: 208 PLQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLE 267
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I+ +T VCP+IDVI TFEY + + G F+WK ++ + P ++
Sbjct: 268 PIAQDY------RTCVCPLIDVIVHDTFEYRSQDEGKRGAFDWKFYYKRLPLRPGDL--- 318
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P +P MAGGLFAI +F+ELG YDEG+DIWGGE E+SF++WQCGG + P
Sbjct: 319 -DDPTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAP 377
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGHV+R SP+ P GV+ V N RVAEVWMDE++ F Y NP
Sbjct: 378 CSRVGHVYRGYSPFPNPRGVN-FVTRNFKRVAEVWMDEYKQFLYERNP 424
>gi|345483668|ref|XP_001601037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 587
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 120/280 (42%), Positives = 168/280 (60%), Gaps = 16/280 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK +Y + LP+ SI+I+FHNEA+S LLRTV+SVI +P LLKEIILVDD S + +
Sbjct: 122 CKNVTYDSVLPSASIIIIFHNEAFSVLLRTVYSVIKETPPKLLKEIILVDDKSNEELLGL 181
Query: 61 IDVISDQTFEY---ITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEY 111
++ + D G +L+ V+ C + + +
Sbjct: 182 LEYYIQTRLPKKVKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLLQR 241
Query: 112 ITAK--TVVCPIIDVISDQTFEYITASDMTW---GGFNWKLNFRWYRVPPREMMRRGGDR 166
I K VV PIID IS++TFEY + + ++ GGF W +F W + ++ +
Sbjct: 242 IKEKKNAVVTPIIDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSKTS-A 300
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
SP+++PTMAGGLFAI++ YF+++GSYD+ M+ WGGENLEMSFR+WQCGG+LE IPCS V
Sbjct: 301 ISPVKSPTMAGGLFAINRKYFWDIGSYDDKMEGWGGENLEMSFRIWQCGGVLETIPCSRV 360
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
GHVFR+ PY FP + N AR+A VWMD+++ YY
Sbjct: 361 GHVFRNFLPYKFPMDKDTHGI-NTARLANVWMDDYKRLYY 399
>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Saimiri boliviensis boliviensis]
Length = 682
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 214 CNSKHYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 273
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 274 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 333
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 334 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 384
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 385 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 444
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 445 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 489
>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
lupus familiaris]
Length = 603
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 171/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPSELIAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEYRHLSAGDVAAQ 423
Query: 283 FRMLS 287
++ S
Sbjct: 424 KKLRS 428
>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 608
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/288 (42%), Positives = 163/288 (56%), Gaps = 29/288 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------- 53
C+KK Y LPT S++I+F+NE WS LLRTV+SV+NRSP LLKEIILV+D S
Sbjct: 137 CRKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPSHLLKEIILVNDHSTKPFLWK 196
Query: 54 ------ERVVCPIIDVI-----SDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID 102
E + P + +I S + + G L + P+++
Sbjct: 197 PLQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLE 256
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I+ +T VCP+IDVI TFEY + + G F+WK ++ + P ++
Sbjct: 257 PIAQDY------RTCVCPLIDVIVHDTFEYRSQDEGKRGAFDWKFYYKRLPLRPGDL--- 307
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P +P MAGGLFAI +F+ELG YDEG+DIWGGE E+SF++WQCGG + P
Sbjct: 308 -DDPTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAP 366
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGHV+R SP+ P GV+ V N RVAEVWMDE++ F Y NP
Sbjct: 367 CSRVGHVYRGYSPFPNPRGVN-FVTRNFKRVAEVWMDEYKQFLYERNP 413
>gi|431918071|gb|ELK17299.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Pteropus alecto]
Length = 582
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 172/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R +
Sbjct: 114 CNNKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIVLVDDFSDREHLKK 173
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ D ++ I + G ++ + V+ P++D I
Sbjct: 174 PLEDYMAHFPSVRILRTKKREGLIRTRMLGASAASGDVITFLDSHCEANVNWLPPLLDRI 233
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 234 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 284
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 285 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 344
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 345 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEFAEHIYQRRPEYRHLSAGDVAAQ 402
Query: 283 FRMLS 287
++ S
Sbjct: 403 KKLRS 407
>gi|268572569|ref|XP_002641355.1| C. briggsae CBR-GLY-9 protein [Caenorhabditis briggsae]
Length = 579
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/296 (42%), Positives = 171/296 (57%), Gaps = 47/296 (15%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK Y LP TS++I+F +EAW+ LLRTV SVINRSP LL+EIIL+DD S+R
Sbjct: 123 CKDIKYDYATLPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEIILLDDNSKR---- 178
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRH----------KKTV------------- 96
+ + E+I +GG +R+ RH ++ V
Sbjct: 179 --QELQEPLDEHIK----RFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEA 232
Query: 97 ----VCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWY 152
+ PI+ ISD+ +VCP+ID ISD T Y ++ GGF+W L+F W
Sbjct: 233 NHGWLEPIVQRISDER------TAIVCPMIDSISDSTLAYHGDWSLSVGGFSWALHFTWE 286
Query: 153 RVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVW 212
+P E+ RR + +R+PTMAGGL A +++YF+E+G YDE MDIWGGENLE+SFR W
Sbjct: 287 GLPDEELKRRT-KVTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNW 345
Query: 213 QCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLH--NAARVAEVWMDEWRDFYY 266
CGG +E IPCSHVGH+FR PY G + +H N+ R+AEVWMD+++ YY
Sbjct: 346 MCGGSIEFIPCSHVGHIFRAGHPYNMTGRNNNKDVHGTNSKRLAEVWMDDYKRLYY 401
>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
catus]
Length = 603
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 171/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEYRHLSAGDVAAQ 423
Query: 283 FRMLS 287
++ S
Sbjct: 424 KKLRS 428
>gi|281345023|gb|EFB20607.1| hypothetical protein PANDA_005411 [Ailuropoda melanoleuca]
Length = 551
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 171/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 83 CNGKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 142
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 143 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 202
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 203 AQNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 253
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 254 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 313
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 314 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEYRHLSAGDVAAQ 371
Query: 283 FRMLS 287
++ S
Sbjct: 372 KKLRS 376
>gi|38195091|ref|NP_938080.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|51315962|sp|Q86SR1.2|GLT10_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|25809274|emb|CAD44532.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|151556534|gb|AAI48616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|157169754|gb|AAI53182.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|193785288|dbj|BAG54441.1| unnamed protein product [Homo sapiens]
gi|261858046|dbj|BAI45545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [synthetic
construct]
Length = 603
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
Length = 637
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 115/282 (40%), Positives = 165/282 (58%), Gaps = 17/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV--C 58
C+ K+Y LPT S++++F+NE WS LLRTV+SV+NRSP +LLKE+ILV+D S +
Sbjct: 144 CRTKAYLRELPTVSVIVIFYNEHWSALLRTVYSVLNRSPASLLKEVILVNDHSTKPFLWA 203
Query: 59 PIIDVISDQTFEYITASDM-TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ + + + + D+ G R + V ++D ++ ++
Sbjct: 204 PLREFVESELAPKVRLIDLPERSGLILARMAGAREARGDVLIVLDSHTEVNNNWLPPLLE 263
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+T VCP IDVI+ TF+Y + G F+WK ++ + P ++ D +
Sbjct: 264 PIAEDYRTCVCPFIDVIAHDTFQYRAQDEGKRGAFDWKFYYKRLPLLPGDL----DDPTK 319
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P +P MAGGLFAI +F+ELG YDEG+DIWGGE E+SF++WQCGG L PCS VGH
Sbjct: 320 PFNSPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGH 379
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
V+R +P+ P GV+ V+ N RVAEVWMDE+ F Y NP
Sbjct: 380 VYRGYAPFGNPRGVN-FVVRNFKRVAEVWMDEYAKFLYERNP 420
>gi|28268676|dbj|BAC56890.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Homo sapiens]
Length = 603
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|410255362|gb|JAA15648.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410303020|gb|JAA30110.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410355291|gb|JAA44249.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
Length = 603
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 599
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 173/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C +K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R +
Sbjct: 131 CNRKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 190
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ D ++ I + G ++ + V+ P++D I
Sbjct: 191 PLEDYMAHFPSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRI 250
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 251 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 301
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 302 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 361
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 362 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEFAEHIYQRRPEYRHLSAGDVAAQ 419
Query: 283 FRMLS 287
++ S
Sbjct: 420 KKLRS 424
>gi|301763571|ref|XP_002917213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Ailuropoda melanoleuca]
Length = 598
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 171/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 130 CNGKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 189
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 190 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 249
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 250 AQNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 300
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 301 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 360
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 361 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEYRHLSAGDVAAQ 418
Query: 283 FRMLS 287
++ S
Sbjct: 419 KKLRS 423
>gi|119389148|pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And
Mn2+
gi|119389151|pdb|2D7R|A Chain A, Crystal Structure Of Pp-galnac-t10 Complexed With
Galnac-ser On Lectin Domain
Length = 570
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 102 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 161
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 162 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 221
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 222 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 272
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 273 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 332
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 333 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 377
>gi|380800197|gb|AFE71974.1| polypeptide N-acetylgalactosaminyltransferase 10, partial [Macaca
mulatta]
Length = 565
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 97 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 156
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 157 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 216
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 217 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 267
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 268 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 327
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 328 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 372
>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
Length = 634
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 166 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 225
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 226 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 285
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 286 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 336
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 337 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 396
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 397 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 441
>gi|351697576|gb|EHB00495.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Heterocephalus
glaber]
Length = 622
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 125/280 (44%), Positives = 168/280 (60%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS---------ERVV--C 58
LP TS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS ER V
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEYLKEKLERYVEQL 235
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWK--LREKNRHKKTV---VCPIIDVISDQTFEYIT 113
I+ V+ + + + + + L + H + + P++ I++
Sbjct: 236 QIVKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFYGWLEPLLARIAEDQV---- 291
Query: 114 AKTVVCPIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I+ TFE+ + G F+W L F W +P +E RR D + P
Sbjct: 292 --AVVSPDIVTINLDTFEFSKPIPGGRVHSRGNFDWSLTFGWETLPAQEKQRRE-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEI PCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGRLEIAPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR +SP+TFP G S ++ N R+AEVWMD+++ +Y N
Sbjct: 409 FRSRSPHTFPKGTS-VISRNQVRLAEVWMDDYKKIFYRRN 447
>gi|410039926|ref|XP_518048.4| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
troglodytes]
Length = 551
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 83 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 142
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 143 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 202
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 203 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 253
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 254 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 313
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 314 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 358
>gi|326427851|gb|EGD73421.1| GALNT4 protein [Salpingoeca sp. ATCC 50818]
Length = 537
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/281 (44%), Positives = 171/281 (60%), Gaps = 20/281 (7%)
Query: 3 KKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPII 61
KK YP + LPT S++++F+NEA STLLRTVWSV++RSPR+L+KEI+LVDD S P +
Sbjct: 196 KKYYPLSELPTVSVILIFYNEARSTLLRTVWSVLDRSPRSLIKEILLVDDHSS---MPHL 252
Query: 62 DVISDQTFEYITASDMTW-----GGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
DQ I + + G K+ + + V+ C + D + +
Sbjct: 253 GYPLDQEVAGIPKTRVIRLPERSGLIRAKVYGAQQARGDVLVYLDSHCEVNDGWLEPLLD 312
Query: 111 YI--TAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
I KTV PIID I +T+E+ T + G F+W L F+W ++ + R D +
Sbjct: 313 RIRRNRKTVAMPIIDAIDYETWEHRTGL-LERGIFDWSLVFKWKQLTADDKRGRPDD-TD 370
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P +P MAGGLFA+D+ YF+E+G+YD GM+ WGGEN+EMS RVW CGG +E +PCSHV H
Sbjct: 371 PFASPAMAGGLFAMDRKYFFEVGAYDMGMETWGGENIEMSMRVWACGGRIEALPCSHVAH 430
Query: 229 VFRDKSPYTFP-GGVSKIVLHNAARVAEVWMDEWRDFYYAM 268
VFR K+PY F + + N RVAEVWMDE++D YYA+
Sbjct: 431 VFRKKTPYEFKTKDPQETIARNLNRVAEVWMDEYKDVYYAV 471
>gi|441596034|ref|XP_003276624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Nomascus leucogenys]
gi|119582046|gb|EAW61642.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
gi|119582047|gb|EAW61643.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
Length = 506
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 38 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 97
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 98 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRI 157
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 158 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 208
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 209 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 268
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 269 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 313
>gi|195425502|ref|XP_002061040.1| GK10658 [Drosophila willistoni]
gi|194157125|gb|EDW72026.1| GK10658 [Drosophila willistoni]
Length = 489
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/307 (41%), Positives = 177/307 (57%), Gaps = 32/307 (10%)
Query: 1 CKKKS-YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK + Y T LP T ++I FHNEAWSTLLRTV SV+ RSP L+ ++ILVDD S+ P
Sbjct: 162 CKDTARYLTDLPKTDVIICFHNEAWSTLLRTVHSVLARSPEHLIGKVILVDDYSD---MP 218
Query: 60 IIDVISDQTFEYITASDMT-----WGGFNWKLREKNRHKKTVVC--------------PI 100
+ + + F + G +L VV P+
Sbjct: 219 HLKIQLKEYFSLYPKVQLVRVAKREGLVRARLFGMEYADSPVVTFLDSHCECTEGWLEPL 278
Query: 101 IDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREM 159
+D I+ TV P ID+I +TF+Y ++ G F+W L F W +P RE+
Sbjct: 279 LDRIARNR------NTVASPTIDMIDPKTFQYNYDGANDVLGVFDWNLEFYWIPIPLREL 332
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
RR + P++TPT+AGGLFAID ++F +G+YD G +IWGG+NLE+SF+ W CGGILE
Sbjct: 333 KRRNH-FAEPIQTPTIAGGLFAIDLEFFRSVGTYDPGFNIWGGDNLELSFKTWMCGGILE 391
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTC 279
IIPCSHVGH+FRD SPY +P + +V N AR+AEVW+D++ +YY + G + S++T
Sbjct: 392 IIPCSHVGHIFRDDSPYEWPSSRAMMVESNLARLAEVWLDDYAKYYYERS-GGNKSLATD 450
Query: 280 AAHFRML 286
+ + L
Sbjct: 451 VSDRKKL 457
>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Oryctolagus cuniculus]
Length = 622
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/280 (43%), Positives = 163/280 (58%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LP+TS++IVFHNEAWSTLLRTV+SV++ +P LL+EIILVDDAS E + + +
Sbjct: 176 LPSTSVIIVFHNEAWSTLLRTVYSVLHTAPAILLREIILVDDASTEEYLKEKLEQYVKQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ + G +L + + V+ P++ I++
Sbjct: 236 QVVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFTGWLEPLLARIAEDE----- 290
Query: 114 AKTVVCPIIDVISDQTFEYITASD----MTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TFE+ + G F+W L F W VP E RR D + P
Sbjct: 291 -TVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAVPAHENRRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG LEIIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G + ++ N R+AEVWMD ++ +Y N
Sbjct: 409 FRTKSPHTFPKG-TNVIARNQVRLAEVWMDNYKKIFYRRN 447
>gi|149726707|ref|XP_001501206.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Equus
caballus]
Length = 561
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 171/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 93 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 152
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 153 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 212
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 213 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 263
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 264 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 323
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ + Y P + S AA
Sbjct: 324 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEYRHLSAGDVAAQ 381
Query: 283 FRMLS 287
++ S
Sbjct: 382 KQLRS 386
>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
Length = 579
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 125/296 (42%), Positives = 171/296 (57%), Gaps = 47/296 (15%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK Y + LP TS++I+F +EAW+ LLRTV SVINRSP LL+E+IL+DD S+R
Sbjct: 123 CKDIKYDYSSLPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKR---- 178
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRH----------KKTV------------- 96
+ + E+I +GG +R+ RH ++ V
Sbjct: 179 --QELQEPLDEHIK----RFGGKVKLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEA 232
Query: 97 ----VCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWY 152
+ PI+ ISD+ +VCP+ID ISD T Y ++ GGF+W L+F W
Sbjct: 233 NHGWLEPIVQRISDER------TAIVCPMIDSISDSTLAYHGDWSLSVGGFSWALHFTWE 286
Query: 153 RVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVW 212
+P E RR + +R+PTMAGGL A +++YF+E+G YDE MDIWGGENLE+SFR W
Sbjct: 287 GIPEDEQKRRKKP-TDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNW 345
Query: 213 QCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLH--NAARVAEVWMDEWRDFYY 266
CGG +E IPCSHVGH+FR PY G + +H N+ R+AEVWMD+++ YY
Sbjct: 346 MCGGSIEFIPCSHVGHIFRAGHPYNMTGRNNNKDVHGTNSKRLAEVWMDDYKRLYY 401
>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 607
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 167/288 (57%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
C++K Y LP T+I+I FHNE WS+LLRTV SVINRSP L+ EIILVDD S++
Sbjct: 137 CRQKLYAEKLPNTTIIIPFHNEGWSSLLRTVHSVINRSPPRLVAEIILVDDFSDKEHLKV 196
Query: 59 ---------PIIDVISDQTFEYITASDMTWGGFNWK-----LREKNRHKKTVVCPIIDVI 104
P + ++ + E + + + G L + P++D I
Sbjct: 197 ALEEYMKRFPKVRILRTKKREGLIRTRLLGAGAAKGEVITFLDSHCEANVNWLPPLLDRI 256
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMRR 162
KT+VCP+IDVI F Y T A D G F+W++ ++ R+P P EM R
Sbjct: 257 VQNR------KTIVCPMIDVIDHDNFGYDTQAGDAMRGAFDWEMYYK--RIPIPAEM--R 306
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D + P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IP
Sbjct: 307 TDDPTEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIP 366
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH++R PY PGG+S + N RVAEVWMDE+ ++ Y P
Sbjct: 367 CSRVGHIYRKYVPYKVPGGIS--LAKNLKRVAEVWMDEYAEYVYQRRP 412
>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
Length = 603
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
norvegicus]
gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
norvegicus]
gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
Length = 603
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
Length = 520
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 129/291 (44%), Positives = 168/291 (57%), Gaps = 39/291 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+K SY LP S+VI+F +EAWS L+RTV SVINR+P LL+EIILVDD S+R
Sbjct: 63 CRKISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQR----- 117
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH---------KKTVVCPIIDVISDQT--- 108
D + + EYI +G +R R K V ++ +
Sbjct: 118 -DELKGKLEEYIK----RFGDKVRLVRAPERQGLIRAKLLGAKEAVGDVLVFLDSHCEVG 172
Query: 109 ---FEYITAK------TVVCPIIDVISDQTFEYITASDMTW--GGFNWKLNFRWYRVPPR 157
E + A+ V+CPII+ IS +T Y +A+D GGF W L+FRW +P
Sbjct: 173 EGWLEPLLARIKDKRSAVLCPIINHISPETLTY-SANDRPAHVGGFWWSLHFRWDPMPKE 231
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
D + P+R+PTMAGGL A+D+ YF+E+G YD MDIWGGENLEMSFRVW CGG
Sbjct: 232 ---YSDADPTEPIRSPTMAGGLLAVDRLYFFEVGGYDPEMDIWGGENLEMSFRVWMCGGS 288
Query: 218 LEIIPCSHVGHVFRDKSPYTF--PGGVSKIVLHNAARVAEVWMDEWRDFYY 266
+E IPCSHVGH+FR PY PG + N+ R+AEVWMD+++ FYY
Sbjct: 289 VEFIPCSHVGHIFRAGHPYNMIGPGNNKDVHGTNSKRLAEVWMDDYKKFYY 339
>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 756
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/289 (42%), Positives = 167/289 (57%), Gaps = 28/289 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C K Y + LP TS++IVFHNEAWS LLRTV SVINR+PR L EIILVDDAS I
Sbjct: 298 CLYKQYSSALPNTSVIIVFHNEAWSALLRTVHSVINRTPRQYLSEIILVDDAS------I 351
Query: 61 IDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQT------F 109
+ Q Y+ + G + +R + R ++ +
Sbjct: 352 HAHLGHQLDSYVAKLPVPVHVERMGVRSGLIRARMRGALVAQGQVLTFLDSHCEASHGWL 411
Query: 110 EYITAK------TVVCPIIDVISDQTFEYITASDMT--WGGFNWKLNFRWYRVPPREMMR 161
E + A+ VV P+IDVI+ Q Y A + T G F+W L FRW + R++
Sbjct: 412 EPLLARIAEDRSNVVTPVIDVINAQNLAY-EADNQTPAIGVFDWSLTFRWQSIQRRDLPL 470
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D + P+ +PTMAGGLFAID+ YF E G YD G +IWG ENLE+SF+ W CGG +EI+
Sbjct: 471 LKHDPTHPIPSPTMAGGLFAIDRSYFIETGMYDSGFEIWGAENLEISFKTWMCGGRIEIL 530
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCSHVGH+FR +PY+ ++ + +N R+AEVW+D +++F+Y M+P
Sbjct: 531 PCSHVGHIFRKHAPYS--NTLTDFISYNNKRLAEVWLDGYKEFFYFMSP 577
>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
Length = 603
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
Length = 579
Score = 218 bits (555), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 170/296 (57%), Gaps = 47/296 (15%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK Y LP TS++I+F +EAW+ LLRTV SVINRSP LL+E+IL+DD S+R
Sbjct: 123 CKDIKYDYAALPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKR---- 178
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRH----------KKTV------------- 96
+ + E+I +GG +R+ +RH ++ V
Sbjct: 179 --QELQEPLDEHIK----RFGGKVRLIRKHDRHGLIRAKLAGAREAVGDIIVFLDSHCEA 232
Query: 97 ----VCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWY 152
+ PI+ ISD+ +VCP+ID ISD T Y ++ GGF+W L+F W
Sbjct: 233 NHGWLEPIVQRISDER------TAIVCPMIDSISDNTLAYHGDWSLSTGGFSWALHFTWE 286
Query: 153 RVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVW 212
+ E RR + +R+PTMAGGL A +++YF+E+G YDE MDIWGGENLE+SFR W
Sbjct: 287 GLSEEEQKRRTKP-TDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAW 345
Query: 213 QCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLH--NAARVAEVWMDEWRDFYY 266
CGG +E IPCSHVGH+FR PY G + +H N+ R+AEVWMD+++ YY
Sbjct: 346 MCGGSIEFIPCSHVGHIFRAGHPYNMTGRNNNKDVHGTNSKRLAEVWMDDYKRLYY 401
>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
Length = 603
Score = 218 bits (555), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
Length = 528
Score = 218 bits (555), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 60 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 119
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 120 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRI 179
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 180 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 230
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 231 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 290
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 291 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRP 335
>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cavia porcellus]
Length = 602
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 121/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 134 CNSKRYLEVLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 193
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 194 PLEDYMALFPSVRILRTKRREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRI 253
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 254 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 304
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 305 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 364
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMD++ ++ Y P
Sbjct: 365 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDDYAEYIYQRRP 409
>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oryctolagus cuniculus]
Length = 603
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 126/305 (41%), Positives = 172/305 (56%), Gaps = 29/305 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 195 PLEDYMALFPSVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 255 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 305
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 306 VDPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 365
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + S AA
Sbjct: 366 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAAQ 423
Query: 283 FRMLS 287
++ S
Sbjct: 424 KKLRS 428
>gi|390345015|ref|XP_787987.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 2
[Strongylocentrotus purpuratus]
gi|390345017|ref|XP_003726244.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 1
[Strongylocentrotus purpuratus]
Length = 670
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 125/295 (42%), Positives = 170/295 (57%), Gaps = 33/295 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP- 59
CK YPT LP TS+VIVFH E WSTL+RT+ SV N SP+ LL E++LVDD S++V
Sbjct: 203 CKHWHYPTNLPNTSVVIVFHQEGWSTLIRTIHSVFNTSPKELLAEVLLVDDYSDKVHLKK 262
Query: 60 -IIDVISDQTFE---YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE----- 110
+ D I D F I + G +R + + + ++ + D E
Sbjct: 263 KLDDYIRDPRFSGKIRIVRNKKREG----LIRSRTIGARKAIGQVLTFL-DAHCECGPNW 317
Query: 111 --------YITAKTVVCPIIDVISDQTFEYITASD-MTWGGFNWKLNFRWYRVPPREMMR 161
+ T+VCP +D IS TF Y + D + G F+W +F + R+P +
Sbjct: 318 LPPLLAEIAVDRSTIVCPTVDAISSDTFAYTSQGDGLCRGAFDW--DFWYKRIPVKPYWH 375
Query: 162 RGG--DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
R G RS P +P MAGGL A+D+ YF+ELG YD G+ IWGGEN E+SF+VW CGG L+
Sbjct: 376 RLGLKQRSQPYPSPVMAGGLLALDRSYFFELGGYDPGLQIWGGENFEISFKVWMCGGSLK 435
Query: 220 IIPCSHVGHVFRDKSPYTFPG----GVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+PCS VGHV+R + PY++P GVS + L N RVAEVW+DE++D +YA P
Sbjct: 436 FVPCSRVGHVYRKQVPYSYPSSGVEGVSVVDL-NYMRVAEVWLDEYKDSFYATKP 489
>gi|198426119|ref|XP_002128247.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
6 [Ciona intestinalis]
Length = 627
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 122/294 (41%), Positives = 166/294 (56%), Gaps = 31/294 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK + + LP TS++I+FHNEAW LLRTV SV+ SP+ LLKEIILVDDAS +
Sbjct: 173 CKVRKWRKPLPDTSVIIIFHNEAWCALLRTVHSVLENSPKILLKEIILVDDASTLSNLGK 232
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ D ++ I G +L + +V+ P+++ I
Sbjct: 233 ELTDYVAKLQIVKIIRLPSRAGLIRARLAGAQEAQGSVLTFLDSHCECAPHWLEPMLERI 292
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTF--EYITASDMTWGGFNWKLNFRW--YRVPPREMM 160
++ VVCP+I+VI TF TA + G +W L F W ++ P + +
Sbjct: 293 AEDNTR------VVCPVIEVIDADTFAMSLTTARSVQTGILSWSLGFNWAPRKINPGQPI 346
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
+ L + TMAGGLFA+ + YFY LGSYD M +WGGEN+EMS R+W CGG LEI
Sbjct: 347 KN----DEALTSATMAGGLFAMSRKYFYHLGSYDNDMLVWGGENIEMSLRIWMCGGSLEI 402
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
PCSHVGHVFR ++PY+ PGG S ++ HN RVAEVW+DE+++ YY P A
Sbjct: 403 HPCSHVGHVFRKRAPYSHPGG-SDVITHNNKRVAEVWLDEYKEQYYKRVPRARA 455
>gi|345491789|ref|XP_001607575.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 566
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 125/282 (44%), Positives = 161/282 (57%), Gaps = 19/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-----ER 55
CK +Y T LPTTS+VI+FHNEAWS LLRTV+SV+ SP LKEIILVDD S E
Sbjct: 110 CKSVTYDTKLPTTSVVIIFHNEAWSVLLRTVYSVLQESPPKFLKEIILVDDNSNEEELED 169
Query: 56 VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTF 109
++ I+ + + + G +L + V+ C +
Sbjct: 170 ILAYYIETRLPKKVKLLRLPKRQ-GLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLL 228
Query: 110 EYITAK--TVVCPIIDVISDQTFEYITA---SDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
I A+ V+ P+IDVI +T EY A S M GGF W +F W + R
Sbjct: 229 HRIKARPNAVLIPVIDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSPK-RTTA 287
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
P+ TPTMAGGLFAID+ YF+ +GSYDE MD WGGENLEMSFR+WQCGG +EI+PCS
Sbjct: 288 SPIDPINTPTMAGGLFAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVPCS 347
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
VGH+FRD PY FP ++ N AR A VWMD+++ ++
Sbjct: 348 RVGHIFRDFFPYEFPSSRDTYLI-NTARAAHVWMDDYKRLFF 388
>gi|358336356|dbj|GAA28182.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 592
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 117/286 (40%), Positives = 172/286 (60%), Gaps = 29/286 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CK +S+ + LP T+++I FHNEAWS LLR+V SV++ SP+ LL+EIILVDD S R
Sbjct: 142 CKTQSFSSDLPKTAVIICFHNEAWSALLRSVHSVLDYSPKELLQEIILVDDFSSRDYLKE 201
Query: 59 ---------PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKT-VVC------PIID 102
P++ +I + E + + M G N E + + + C P+++
Sbjct: 202 PLEIYMQQFPVVKIIRTKRREGLIRARMV--GTNVSTAEVLTYLDSHIECTPGWLEPLLE 259
Query: 103 VISDQTFEYITAKTVVCPIIDVISDQ--TFEYITASDMTWGGFNWKLNFRWYRVPPREMM 160
I T VV P+I++I+DQ + + + + GGF+W L F W+ P R+ +
Sbjct: 260 RIKAST------SNVVVPVIEIINDQDLSMKATQEASVQVGGFDWSLTFTWHLPPKRDQI 313
Query: 161 RRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEI 220
R G SP+R+PTMAGGLFAI +D+F LG YDE M++WGGENLE+SF+ W CGG LE
Sbjct: 314 RLGAP-YSPIRSPTMAGGLFAIHRDFFAYLGYYDEEMEVWGGENLELSFKTWMCGGQLET 372
Query: 221 IPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
+ CSHVGH+FR +SPY++ + + N R+AE W+D+++ YY
Sbjct: 373 VVCSHVGHIFRSRSPYSWESKRTSPIKFNLVRLAETWLDDYKFLYY 418
>gi|47228720|emb|CAG07452.1| unnamed protein product [Tetraodon nigroviridis]
Length = 611
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 121/286 (42%), Positives = 166/286 (58%), Gaps = 40/286 (13%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LLKEIILVDDAS D + +Q
Sbjct: 170 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDAS------AADHLKEQLE 223
Query: 70 EYITASDMTWGGFNWKLREKNRHKKTVVCPII--DVISDQTFEYITAK------------ 115
++ + ++ + K + ++ V + ++ A
Sbjct: 224 VFVHQLKIV------RVVRQPERKGLITARLLGASVAQGEVLTFLDAHCECFHGWLEPLL 277
Query: 116 --------TVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
VV P I I +TF++ ++ G F+W L F W ++P R
Sbjct: 278 ARIVEEPTAVVSPEITTIDLETFQFNKPVASSHAYNRGNFDWGLTFGWEQIPEAARKLRK 337
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TPT AGGLF+I K YF +G+YD+ M+IWGGEN+EMSFRVWQCGG LEIIPC
Sbjct: 338 -DETYPVKTPTFAGGLFSILKSYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEIIPC 396
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
S VGHVFR KSP+TFP G + ++ N R+AEVWMD+++ +Y N
Sbjct: 397 SVVGHVFRTKSPHTFPKG-TDVITRNQVRLAEVWMDDYKKIFYRRN 441
>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 606
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 177/308 (57%), Gaps = 44/308 (14%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK YP LPTTS++IVFHNE WSTLLRTV SV NRSP LL EIILVDD S +
Sbjct: 149 CKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDDFSTK----- 203
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLR--EKNRHKKTVVCPII-------DVIS--DQTF 109
+ + ++ +Y+ + FN KL+ +R + + II DV+ D
Sbjct: 204 -EHLKERLEDYVQEAR-----FNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHC 257
Query: 110 EY-------------ITAKTVVCPIIDVISDQTFEYIT--ASDMTWGGFNWKLNFRWYRV 154
E + T VCPIIDVI + + D GGF+W L ++ V
Sbjct: 258 EVGVNWLPPLLTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQDRGGFDWSLYWKHLPV 317
Query: 155 PPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQC 214
P E RR S P R+P MAGGLFA+D+ YF+ELG+YDEG++IWGGEN E+SF++W C
Sbjct: 318 PQFEKSRRQ-HASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMC 376
Query: 215 GGILEIIPCSHVGHVFR--DKSPYTFPGGVSKIVL--HNAARVAEVWMDEWRDFYYAMNP 270
GG L +PCS VGHV+R K PY+ P G S ++L N RV EVW D++++++Y P
Sbjct: 377 GGSLLWVPCSRVGHVYRILGKVPYSAPNG-SMLILSERNLRRVVEVWFDDYKEYFYRSKP 435
Query: 271 GKSASVST 278
+S VST
Sbjct: 436 -ESLLVST 442
>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
Length = 442
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 121/283 (42%), Positives = 167/283 (59%), Gaps = 15/283 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK+K + LP TS++I+F+NEAWSTLLRTV SV+ SP LL+E+ILVDD S + +
Sbjct: 74 CKQKQFFRPLPQTSVIIIFYNEAWSTLLRTVHSVLEASPAELLREVILVDDCSTFDHLKA 133
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYI 112
P+ +S + S G +L + V+ C + + E I
Sbjct: 134 PLETYLSTLPQVRLVRSPKRQGLIRARLLGALHARGEVLTFLDSHCECMHGWLEPQLETI 193
Query: 113 TAKTVVCPI--IDVISDQTFEY--ITASDMTWGGFNWK-LNFRWYRVPPREMMRRGGDRS 167
PI +D I TF+Y + GG N+K L F W +P E RR
Sbjct: 194 ARNYTTVPISVLDNILHDTFQYTFMDLQSTQMGGINFKELTFIWEPIPEHE-RRRQKSPV 252
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+R+PTMAGG+F+I+K YF LG+YD GM++WGGEN+EMSFR+WQCGG + ++PCSHVG
Sbjct: 253 DPIRSPTMAGGIFSINKKYFEYLGAYDTGMEVWGGENIEMSFRIWQCGGTIVVLPCSHVG 312
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVFR SPY+ G K ++HN R+AEVWMD++++ YY +P
Sbjct: 313 HVFRPTSPYST-GDAWKKLVHNNRRMAEVWMDDYKEIYYRKHP 354
>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
Length = 579
Score = 216 bits (550), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 169/296 (57%), Gaps = 47/296 (15%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP 59
CK Y LP TS++I+F +EAW+ LLRTV SVINRSP LL+E+IL+DD S+R
Sbjct: 123 CKDIKYDYAALPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKR---- 178
Query: 60 IIDVISDQTFEYITASDMTWGGFNWKLREKNRH----------KKTV------------- 96
+ + E+I +GG +R+ RH ++ V
Sbjct: 179 --QELQEPLDEHIK----RFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEA 232
Query: 97 ----VCPIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWY 152
+ PI+ ISD+ +VCP+ID ISD T Y ++ GGF+W L+F W
Sbjct: 233 NHGWLEPIVQRISDER------TAIVCPMIDSISDNTLAYHGDWSLSTGGFSWALHFTWE 286
Query: 153 RVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVW 212
+ E RR + +R+PTMAGGL A +++YF+E+G YDE MDIWGGENLE+SFR W
Sbjct: 287 GLSEEEQKRRTKP-TDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAW 345
Query: 213 QCGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLH--NAARVAEVWMDEWRDFYY 266
CGG +E IPCSHVGH+FR PY G + +H N+ R+AEVWMD+++ YY
Sbjct: 346 MCGGSIEFIPCSHVGHIFRAGHPYNMTGRNNNKDVHGTNSKRLAEVWMDDYKRLYY 401
>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 612
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 113/281 (40%), Positives = 162/281 (57%), Gaps = 16/281 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
C+KK Y + LPT S+++ FHNE W+TLLRT SV+NRSP L+KEIIL DD S E++
Sbjct: 150 CQKKRYVSKLPTVSVIVPFHNEHWTTLLRTATSVLNRSPPELIKEIILADDYSNKEQLKK 209
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
P+ D I+ + G R V +D ++ ++
Sbjct: 210 PLEDYIAKHWNKVRVVRATRREGLIRARLLGARQATGDVLIFLDSHTEANVNWLPPLLEP 269
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
+TVVCP IDVI +TF Y + G F+W+L ++ + P ++ + + P
Sbjct: 270 IAKDYRTVVCPFIDVIDYETFAYRAQDEGARGSFDWELYYKRLPLLPEDL----ANPTEP 325
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
++P MAGGLFAI + YF+ELG YDEG+D+WGGE E+SF++WQCGG + PCS VGH+
Sbjct: 326 FKSPVMAGGLFAISRRYFWELGGYDEGLDVWGGEQYELSFKIWQCGGTMVDAPCSRVGHI 385
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+R +P+ P G+ V N RVAEVWMDE++++ Y P
Sbjct: 386 YRKFAPFPNP-GIGDFVGRNYRRVAEVWMDEYKEYLYMRRP 425
>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
griseus]
Length = 494
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/291 (41%), Positives = 165/291 (56%), Gaps = 30/291 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 20 CNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 79
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + ++ + E + + M G L + P++D I
Sbjct: 80 PLEDYMALFPSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRI 139
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 140 ARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 190
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 191 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPC 250
Query: 224 SHVGHVFRDKSPYTFPGGVSK----IVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G + + L N RVAEVWMDE+ ++ Y P
Sbjct: 251 SRVGHIYRKSVPYKVPAGPADPCNCLSLQNLKRVAEVWMDEYAEYIYQRRP 301
>gi|344265184|ref|XP_003404666.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 10-like [Loxodonta
africana]
Length = 602
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 176/306 (57%), Gaps = 32/306 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNSKRYLEMLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLHK 194
Query: 56 ----VVCPIIDV-ISDQTFEYITASDMTWGGFNWKLRE----KNRHKKTVVC---PIIDV 103
+ P V IS E + + M G + + + + H + V P++D
Sbjct: 195 PLXRLHGPFPSVRISVPETEGLIRTRML--GASAAIXDVITFLDSHCEANVNWLPPLLDR 252
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 253 IARNR------KTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK- 304
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IP
Sbjct: 305 -ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIP 363
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAA 281
CS VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + S AA
Sbjct: 364 CSRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAA 421
Query: 282 HFRMLS 287
++ S
Sbjct: 422 QKKLRS 427
>gi|340371807|ref|XP_003384436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 350
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 120/278 (43%), Positives = 163/278 (58%), Gaps = 28/278 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-------VCPIID 62
LPTTS++I FHNEA S LLRT++SV++ P L+KEIILVDD S+ + V P +
Sbjct: 45 LPTTSVIICFHNEARSALLRTIYSVLSHEPAKLIKEIILVDDFSDDINDGEILSVIPKVK 104
Query: 63 VISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITAKT--- 116
+I + + + +T R + V +D + T E + A+
Sbjct: 105 LIRLNERQGLIRARLTGA----------RAAQGEVLTFLDSHCEVTPGWLEPLLARIKED 154
Query: 117 ---VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTP 173
VV PIID+I +Y A+ GGF L F+W + +E+ RR D ++P+ TP
Sbjct: 155 RRHVVSPIIDIIRKDDMKYNQANANIKGGFGHNLLFKWDNLNWQELQRRRQDNTAPIPTP 214
Query: 174 TMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDK 233
+AGGLF+ID+ YF E+GSYDE M+IWGGEN+E S RVW CGG LEI+PCSHVGH+FR
Sbjct: 215 AIAGGLFSIDRGYFKEIGSYDEEMEIWGGENVEFSIRVWMCGGRLEIMPCSHVGHIFRSS 274
Query: 234 SPYTFPGGVS--KIVLHNAARVAEVWMDEWRDFYYAMN 269
PY+F G S V N R+AEVWMDE++ +Y N
Sbjct: 275 MPYSFGKGKSYHTTVTRNLRRIAEVWMDEYKYLFYNAN 312
>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
gallus]
Length = 611
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 140 CKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 199
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P++D I
Sbjct: 200 RLEDYMAQFPNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRI 259
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 260 AR------NRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPP-ELQKL- 311
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 312 -DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPC 370
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 371 SRVGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRP 415
>gi|157820305|ref|NP_001099666.1| polypeptide N-acetylgalactosaminyltransferase 2 [Rattus norvegicus]
gi|149043195|gb|EDL96727.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (predicted), isoform
CRA_b [Rattus norvegicus]
Length = 473
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/265 (44%), Positives = 158/265 (59%), Gaps = 18/265 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS----ERV 56
C++K + LP TS+VI FHNEA S LLRTV SV+ RSP L+KEIILVDD S +
Sbjct: 59 CQRKQWRVDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSNDPEDGA 118
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFE 110
+ I+ + + +D G ++R + + V+ C + + E
Sbjct: 119 LLGKIEKVR------VLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLE 172
Query: 111 YITAKT--VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ VV PIIDVI+ F+Y+ AS GGF+W L F+W + P + R G+ +
Sbjct: 173 RVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVA 232
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P++TP +AGGLF +DK YF ELG YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGH
Sbjct: 233 PIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGH 292
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARV 253
VFR + PYTFPGG + +R+
Sbjct: 293 VFRKQHPYTFPGGSGTVFARIQSRL 317
>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Taeniopygia guttata]
Length = 555
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ E++LVDD S+R
Sbjct: 84 CKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEVVLVDDFSDREHLKK 143
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P++D I
Sbjct: 144 RLEDYMAQFPSVRILRTKRREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRI 203
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 204 AR------NRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 254
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 255 PDPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPC 314
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ +F Y P
Sbjct: 315 SRVGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEFIYQRRP 359
>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
Length = 561
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 123/289 (42%), Positives = 164/289 (56%), Gaps = 31/289 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV---- 56
C K Y LP S+VI FHNE W+TLLRTV SV+NRSP L+ EIILVDD S+R
Sbjct: 89 CASKKYVRDLPDVSLVIPFHNEGWTTLLRTVHSVLNRSPEQLIHEIILVDDFSDRSHLGK 148
Query: 57 --------VCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDV 103
+ P + V+ + E + + + G L + P+++
Sbjct: 149 DLEDYVAKLSPKVRVVRTKQREGLIRTRLLGAQVAKGQVLIFLDSHCEANVNWLPPLLEP 208
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMR 161
I+ + KT+VCP IDVI F Y T A D G F+W++ ++ R+P P E+
Sbjct: 209 IA------LNKKTIVCPNIDVIDKDDFHYETQAGDAMRGAFDWEMYYK--RIPIPDEI-- 258
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
+ D S P +P MAGGLFA+D++YF ELG YD G+DIWGGE E+SF+VWQCGG +
Sbjct: 259 KNPDPSDPFESPVMAGGLFAVDREYFEELGGYDPGLDIWGGEQYELSFKVWQCGGRMVDA 318
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCS VGHV+R PY P GV+ + N RVAEVWMDE+++ Y P
Sbjct: 319 PCSRVGHVYRKFVPYKVPAGVN--LGKNLKRVAEVWMDEYKEHLYKRRP 365
>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Meleagris gallopavo]
Length = 562
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 90 CKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 149
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P++D I
Sbjct: 150 RLEDYMAQFPNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRI 209
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 210 AR------NRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPP-ELQKL- 261
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 262 -DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPC 320
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 321 SRVGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRP 365
>gi|307198758|gb|EFN79561.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Harpegnathos
saltator]
Length = 606
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 118/283 (41%), Positives = 172/283 (60%), Gaps = 27/283 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ + YPT LP SI+I F+NE + TLLR++ S+I+++P +LL EIILV+D S+
Sbjct: 127 CRARKYPTNLPNASIIICFYNEHYMTLLRSLHSIIDKTPTSLLHEIILVNDYSDS----- 181
Query: 61 IDVISDQTFEYITAS-DMTWGGFNWKLREK--------NRHKKTVVCPIIDV-------- 103
+++ ++ YIT + D F RE R V +D
Sbjct: 182 -NILHEKIKVYITNNFDAKVQFFKTDKREGLIRARVFGARKATGDVLIFLDSHIEVNEVW 240
Query: 104 ISDQTFEYITAKTVVC-PIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
I +KT+V P+ID+I+ TF+Y T S + GGFNW L+F+W +P +++
Sbjct: 241 IEPLLSRIAHSKTIVAMPVIDIINADTFQY-TGSPLVRGGFNWGLHFKWDNLPI-GTLKQ 298
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D P+++PTMAGGLFAID++YF ++G YD GMD+WGGENLE+SFR+W CGG +E+IP
Sbjct: 299 EDDFVKPIKSPTMAGGLFAIDREYFTKIGEYDTGMDVWGGENLEISFRIWMCGGNIELIP 358
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
CS VGHVFR + PY +L N+ RVA VW+DE++D++
Sbjct: 359 CSRVGHVFRRRRPYG-SDDPQDTMLKNSLRVAHVWLDEYKDYF 400
>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
livia]
Length = 560
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 165/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 78 CKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKK 137
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P++D I
Sbjct: 138 RLEDYMAQFPNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRI 197
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 198 AR------NRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPP-ELQKL- 249
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 250 -DPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPC 308
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 309 SRVGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRP 353
>gi|118404262|ref|NP_001072444.1| polypeptide N-acetylgalactosaminyltransferase 10 [Xenopus
(Silurana) tropicalis]
gi|113197915|gb|AAI21701.1| GalNAc transferase 10 [Xenopus (Silurana) tropicalis]
Length = 603
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 124/300 (41%), Positives = 173/300 (57%), Gaps = 29/300 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CK K Y + LP TS++I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S++
Sbjct: 132 CKNKFYFSKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDYSDKAHLKS 191
Query: 59 ---------PIIDVISDQTFEYITASDMTWG--GFNWKLREKNRHKKTVVC---PIIDVI 104
P + ++ + E + + M L + H + V P++D +
Sbjct: 192 RLEKYMANFPKVKIVRTKKREGLIRTRMLGATVASGEVLTFLDSHCEANVNWLPPLLDPL 251
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
KTVVCP+IDVI F Y+T A D G F+W++ ++ +PP E+ +
Sbjct: 252 VQ------NYKTVVCPMIDVIDSDNFGYVTQAGDAMRGAFDWEMFYKRIPIPP-ELQK-- 302
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
GD S P +P MAGGLFAI++++F++LG YD G++IWGGE E+SF+VW CGG + PC
Sbjct: 303 GDPSDPFDSPVMAGGLFAINREWFWQLGGYDPGLEIWGGEQYEISFKVWMCGGRMVDSPC 362
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSASVSTCAAH 282
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P + SV AA
Sbjct: 363 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPDYRHLSVGDVAAQ 420
>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
Length = 613
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 118/286 (41%), Positives = 163/286 (56%), Gaps = 29/286 (10%)
Query: 3 KKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV------ 56
KK Y LPT S++++F+NE WSTLLRTV+SV+NRSP LLKEI+LV+D S +
Sbjct: 144 KKRYLQELPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPL 203
Query: 57 -------VCPIIDVIS-----DQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI 104
+ P + +IS +T + G L + P+I+ I
Sbjct: 204 QDFVRTELAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEPI 263
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
++ +T VCP IDVI+ TF+Y + G F+WK ++ + ++M+
Sbjct: 264 AEDY------RTCVCPFIDVIAHDTFQYRAQDEGKRGAFDWKFLYKRLPLRAQDMV---- 313
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
D + P +P MAGGLFAI +F+ELG YDEG+DIWGGE E+SF+VWQCGG + PCS
Sbjct: 314 DPTEPFESPIMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCS 373
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGHV+R +P+ P G + V N RVAEVWMDE++ F Y NP
Sbjct: 374 RVGHVYRGYAPFPNPRG-TNFVTRNFKRVAEVWMDEYKQFLYERNP 418
>gi|326923175|ref|XP_003207815.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Meleagris gallopavo]
Length = 709
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/289 (37%), Positives = 167/289 (57%), Gaps = 34/289 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTT+I++ F +E WSTLLR+V SV++RSP LL+E+ILVDD S +
Sbjct: 258 CLEQQVHDDLPTTTIIMCFVDEVWSTLLRSVHSVLSRSPPHLLQELILVDDFSTK----- 312
Query: 61 IDVISDQTFEYITASDMT--------WGGFNWKLREKNRHKKTVVC-------------- 98
D + ++ Y++ G +L TV+
Sbjct: 313 -DYLKEKLDAYMSQFPKVKVLHLRERHGLIRARLAGAQMATGTVLTFLDSHVECNVGWLE 371
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
P+++ + + V CP+I+VISD+ Y+T + G F W +NF W ++P
Sbjct: 372 PLLERVR------LHRARVACPVIEVISDKDMSYMTVDNFQRGIFTWPMNFGWKQIPQEV 425
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ + + +R P MAGGLF+++K YF+ELG+YD G+D+WGGEN+E+SF+VW CGG +
Sbjct: 426 IEKNKLKETDIIRCPVMAGGLFSVEKKYFFELGTYDSGLDVWGGENMELSFKVWMCGGEI 485
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
EI+PCS VGH+FR+ +PY+FP + V N ARVAEVW+D +++ +Y
Sbjct: 486 EIVPCSRVGHIFRNDNPYSFPKDRVRTVERNLARVAEVWLDGYKELFYG 534
>gi|383862333|ref|XP_003706638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Megachile rotundata]
Length = 637
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 119/279 (42%), Positives = 173/279 (62%), Gaps = 20/279 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE-RVVCP 59
C+ + YP LP SIVI F+NE + TLLR++ S+I R+P+ LL EIILV+D S+ + +
Sbjct: 159 CQMQQYPNKLPNASIVICFYNEHYMTLLRSIHSIIERTPKHLLHEIILVNDWSDSKELHE 218
Query: 60 IIDVISDQTFEYITA---SDMTWGGFNWKLREKNRHKKTVVCPI---IDV----ISDQTF 109
I + F+ ++ G ++ + V+ + I+V I
Sbjct: 219 KIKAFINNNFDRKVKFFKTEKREGLIRARMFGARKATGEVLIFLDSHIEVNKMWIEPLLS 278
Query: 110 EYITAKTVVC-PIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+KT+V P+ID+I+ TF+Y TAS + GGFNW L+F+W ++P + + D
Sbjct: 279 RIAHSKTIVAMPVIDIINADTFQY-TASPLVRGGFNWGLHFKWEQLPTK--LVHDEDFIK 335
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+++PTMAGGLFA+D++YF ELG YD GMD+WGGENLE+SFR+W CGG +E+IPCS VGH
Sbjct: 336 PIKSPTMAGGLFAMDREYFVELGEYDAGMDVWGGENLEISFRIWMCGGSIELIPCSRVGH 395
Query: 229 VFRDKSPYTFPGGVSK--IVLHNAARVAEVWMDEWRDFY 265
VFR + PY G K +L N+ RVA VW+DE++ +Y
Sbjct: 396 VFRKRRPY---GADDKHDTMLKNSLRVAYVWLDEYKHYY 431
>gi|395834931|ref|XP_003790440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Otolemur garnettii]
gi|395834933|ref|XP_003790441.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Otolemur garnettii]
Length = 622
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 124/274 (45%), Positives = 160/274 (58%), Gaps = 16/274 (5%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-ERVVCPIIDVISDQT 68
LPTTS++IVFHNEAWSTLLRTV+SV++ +P LLKEIILVDDAS E + ++ Q
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEKLEQYVQQL 235
Query: 69 FEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---FEYITAK------TVVC 119
+ G + V +D + E + A+ VV
Sbjct: 236 QVVRVVRQVERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVVS 295
Query: 120 PIIDVISDQTFEYIT----ASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTM 175
P I I TFE+ + G F+W L F W +P E RR D + P+++PT
Sbjct: 296 PDIVTIDLNTFEFSKPIPRGRVHSRGNFDWSLTFGWETLPTHEKQRRK-DETYPIKSPTF 354
Query: 176 AGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSP 235
AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG +EIIPCS VGHVFR KSP
Sbjct: 355 AGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPCSVVGHVFRTKSP 414
Query: 236 YTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
+TFP G S ++ N R+AEVWMD ++ +Y N
Sbjct: 415 HTFPKGTS-VIARNQVRLAEVWMDSYKMIFYRRN 447
>gi|363736053|ref|XP_422169.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gallus
gallus]
Length = 811
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 110/283 (38%), Positives = 167/283 (59%), Gaps = 22/283 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LPTT+I++ F +E WSTLLR+V SV++RSP LL+E+ILVDD S +
Sbjct: 358 CLEQQVHNDLPTTTIIMCFVDEVWSTLLRSVHSVLSRSPPHLLQELILVDDFSTK----- 412
Query: 61 IDVISDQTFEYITASDMT--------WGGFNWKLREKNRHKKTVV--------CPIIDVI 104
D + ++ Y++ G +L + TV+ C + +
Sbjct: 413 -DYLKEKLDAYMSQFPKVKVLHLRERHGLIRARLAGAQVARGTVLTFLDSHVECNVGWLE 471
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGG 164
+ V CP+I+VISD+ Y+T + G F W +NF W ++P + +
Sbjct: 472 PLLERVRLRRARVACPVIEVISDKDMSYMTVDNFQRGIFTWPMNFGWKQIPQEVIEKNKL 531
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
+ +R P MAGGLF+I+K YF+ELG+YD G+D+WGGEN+E+SF+VW CGG +EI+PCS
Sbjct: 532 KETDIIRCPVMAGGLFSIEKKYFFELGTYDSGLDVWGGENMELSFKVWMCGGEIEIVPCS 591
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
VGH+FR+ +PY+FP + V N ARVAEVW+D++++ +Y
Sbjct: 592 RVGHIFRNDNPYSFPKDRVRTVERNLARVAEVWLDDYKELFYG 634
>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
Length = 622
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 121/280 (43%), Positives = 161/280 (57%), Gaps = 28/280 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LPTTS++IVFHNEAWSTLLRTV+SV++ SP LL EIIL+DDAS E + + +
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLNEIILMDDASTDEHLKERLEQYVQQL 235
Query: 68 TFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFEYIT 113
+ G +L + + V+ P++ I++
Sbjct: 236 QIVRVVRQRERGGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAED------ 289
Query: 114 AKTVVCPIIDVISDQTFEY----ITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
VV P I I TF++ + G F+W L F W +P E RR D + P
Sbjct: 290 KTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQRRK-DETYP 348
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+++PT AGGLF+I K YF +G+YD M+IWGGEN+EMSFRVWQCGG L IIPCS VGHV
Sbjct: 349 IKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLGIIPCSVVGHV 408
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMN 269
FR KSP+TFP G S ++ N R+AEVWMD+++ +Y N
Sbjct: 409 FRTKSPHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRN 447
>gi|449268007|gb|EMC78887.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Columba
livia]
Length = 514
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 161/288 (55%), Gaps = 24/288 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRT-----LLKEIILVDDASER 55
C Y LP TS++I FHNEA STLLRT+ S + + L+ EIILVDD S+
Sbjct: 57 CTTLHYRQDLPPTSVIITFHNEARSTLLRTIRSTVMHFLSSFFTVHLVHEIILVDDFSDD 116
Query: 56 VV-------CPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDV 103
P + + + E I +D+ G L K + P++
Sbjct: 117 PDDCRLLGKLPKVKCLRNGRREGLIRSRIRGADVAQAGVLTFLDSHCEVNKDWLLPLLQR 176
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I + VV P+ID+I+ TF Y+ AS GGF+W L+F+W ++ P + +R
Sbjct: 177 IKED------PTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQLSPEQKAKRL 230
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P++TP +AGGLF IDK +F LG YD MDIWGGEN E+SFRVW CGG LEIIPC
Sbjct: 231 -DPTEPIKTPIIAGGLFMIDKAWFNHLGKYDSAMDIWGGENFEISFRVWMCGGSLEIIPC 289
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
S VGHVFR K PY FP G + + N R AEVWMDE++ +YYA P
Sbjct: 290 SRVGHVFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKRYYYAARPA 337
>gi|351708673|gb|EHB11592.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Heterocephalus glaber]
Length = 570
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 170/326 (52%), Gaps = 67/326 (20%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVC- 58
C SY + LP TS++I FHNEA STLLRTV SV+NR+P +L++EIILVDD +S+ C
Sbjct: 85 CPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFSSDPEDCL 144
Query: 59 -----PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTV--VCPIID-VISDQTFE 110
P + + + E S ++ L + + + + P++ V D T
Sbjct: 145 LLTRIPKVKCLRNDKREGECRSALS---LTAPLLPSHHCEVNIEWLQPMLQRVKEDHT-- 199
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
VV PIIDVIS F Y+ AS GGF+W L+F+W ++P + M R D + P+
Sbjct: 200 -----RVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRT-DPTKPI 253
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENL------------------------- 205
RTP +AGG+F IDK +F LG YD MDIWGGEN
Sbjct: 254 RTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFGPVALALKQPAQLEGVGDNFISYWC 313
Query: 206 ---------------------EMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGVSK 244
E+SFRVW CGG LEI+PCS VGHVFR + PY FP G +
Sbjct: 314 LPVAKPIIQREGSPMAQPIRAELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYNFPEGNAL 373
Query: 245 IVLHNAARVAEVWMDEWRDFYYAMNP 270
+ N R AEVWMDE++ +YY P
Sbjct: 374 TYIRNTKRTAEVWMDEYKQYYYEARP 399
>gi|345307949|ref|XP_001508273.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Ornithorhynchus anatinus]
Length = 593
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 166/288 (57%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 135 CNNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKK 194
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P++D I
Sbjct: 195 RLEDYMARFPRVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRI 254
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMRR 162
+ KT+VCP+IDVI F Y T A D G F+W++ ++ R+P P+E+ +
Sbjct: 255 AR------NRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYK--RIPIPQELQK- 305
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D S P +P MAGGLFA+DK +F+ELG YD G++IWGGE E+SF+VW CGG +E IP
Sbjct: 306 -PDPSDPFESPVMAGGLFAVDKKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIP 364
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 365 CSRVGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRP 410
>gi|324507788|gb|ADY43296.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Ascaris suum]
Length = 580
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 120/268 (44%), Positives = 155/268 (57%), Gaps = 14/268 (5%)
Query: 13 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCPIIDVISDQTFEY 71
TSI+I +HNEA STLLRTV S RSP L+ EIILVDD +S+ + + I +
Sbjct: 143 TSIIITYHNEARSTLLRTVMSAFLRSPAKLITEIILVDDFSSDETIGKDLTSIE----KV 198
Query: 72 ITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISD---QTFEYITAK------TVVCPII 122
I + G + + K + +D + Q E + A+ VV PII
Sbjct: 199 IVIRNTKREGLIRSRVKGAQLAKASILTFLDSHCECNVQWLEPLLARVKENPHAVVAPII 258
Query: 123 DVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAI 182
DVI+ TF Y+ AS GGF W L F+W + + R + P++TP +AGGLF I
Sbjct: 259 DVINMDTFNYVAASADLRGGFEWNLVFKWEYLSGKLRDDRHSHPTLPIKTPVIAGGLFMI 318
Query: 183 DKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTFPGGV 242
KD+F LG+YD MD+WGGENLE+SFRVWQCGG LEIIPCS VGHVFR + PYTFPGG
Sbjct: 319 RKDWFETLGTYDPDMDVWGGENLELSFRVWQCGGSLEIIPCSRVGHVFRKQHPYTFPGGS 378
Query: 243 SKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ N R AEVW+D+++ Y P
Sbjct: 379 GNVFQKNTRRAAEVWLDDYKMLYLKQVP 406
>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Sarcophilus harrisii]
Length = 651
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 123/288 (42%), Positives = 166/288 (57%), Gaps = 30/288 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C K Y LP TSI+I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R
Sbjct: 180 CNSKLYLEKLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLVAEIVLVDDFSDREHLKK 239
Query: 56 ------VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P++D I
Sbjct: 240 RLEDYMAQFPNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRI 299
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVP-PREMMRR 162
+ KT+VCP+IDVI + F Y T A D G F+W++ ++ R+P P E+ +
Sbjct: 300 AS------NRKTIVCPMIDVIDNDHFGYKTQAGDAMRGAFDWEMYYK--RIPIPLELQK- 350
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IP
Sbjct: 351 -SDPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIP 409
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 410 CSRVGHIYRKYIPYKIPTGVS--LARNLKRVAEVWMDEYAEYIYQRLP 455
>gi|313246955|emb|CBY35801.1| unnamed protein product [Oikopleura dioica]
Length = 468
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 116/263 (44%), Positives = 158/263 (60%), Gaps = 13/263 (4%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS++I FHNE STLLRT+ SVI R+P +LKEI+LVDDAS ++ I++ +
Sbjct: 23 LPTTSVIITFHNELRSTLLRTIISVIRRTPSNILKEIVLVDDASIKL---ILNRKREGLI 79
Query: 70 EYITASDMTWGGFNWKLREKNRH-KKTVVCPIIDVISDQTFEYITAKTVVCPIIDVISDQ 128
+ M G + + + + + P++ I + + VV PIIDVI+
Sbjct: 80 RARIRAAMIATGDTFTFLDSHVEVNQDWIQPLMQRIKE------NPRMVVAPIIDVINKD 133
Query: 129 TFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFY 188
F+YI A GG +W + FRW + EM D + L++PT+AGGLF++ K +F+
Sbjct: 134 NFQYIGADAFLTGGVSWAMVFRWDWLSRHEM--ETMDHTVGLKSPTIAGGLFSVGKAWFH 191
Query: 189 ELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYTF-PGGVSKIVL 247
ELG YD+ MDIWGGEN+E SFRVWQCGG +EI+PCS VGHVFRD PY F G + + +
Sbjct: 192 ELGEYDDQMDIWGGENIEFSFRVWQCGGEMEIMPCSRVGHVFRDDHPYDFGKKGSNHVFV 251
Query: 248 HNAARVAEVWMDEWRDFYYAMNP 270
N R WMDE+ FYY P
Sbjct: 252 KNNNRFVHTWMDEYSTFYYGTRP 274
>gi|313233396|emb|CBY24511.1| unnamed protein product [Oikopleura dioica]
Length = 661
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 120/275 (43%), Positives = 160/275 (58%), Gaps = 21/275 (7%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS-------ERVVCPIID 62
LPTTS++I FHNE STLLRT+ SVI R+P +LKEI+LVDDAS E + +
Sbjct: 202 LPTTSVIITFHNELRSTLLRTIISVIRRTPSNILKEIVLVDDASSDPNVGRELIKINKVK 261
Query: 63 VISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITAKTV 117
+I ++ E I A+ + G L + + P++ I + + V
Sbjct: 262 LILNRKREGLIRARIRAAMIATGDTFTFLDSHVEVNQDWIQPLMQRIKE------NPRMV 315
Query: 118 VCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTPTMAG 177
V PIIDVI+ F+YI A GG +W + FRW + EM D + L++PT+AG
Sbjct: 316 VAPIIDVINKDNFQYIGADAFLTGGVSWAMVFRWDWLSRHEM--ETMDHTVGLKSPTIAG 373
Query: 178 GLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDKSPYT 237
GLF++ K +F+ELG YD+ MDIWGGEN+E SFRVWQCGG +EI+PCS VGHVFRD PY
Sbjct: 374 GLFSVGKAWFHELGEYDDQMDIWGGENIEFSFRVWQCGGEMEIMPCSRVGHVFRDDHPYD 433
Query: 238 F-PGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG 271
F G + + + N R WMDE+ FYY P
Sbjct: 434 FGKKGSNHVFVKNNNRFVHTWMDEYSTFYYGTRPN 468
>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
Length = 585
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 115/281 (40%), Positives = 162/281 (57%), Gaps = 17/281 (6%)
Query: 2 KKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC--P 59
+ Y LPT S++I+F+NE WS LLRTV+SV+NRSP LLKEIILV+D S + P
Sbjct: 114 NRSEYLKELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPALLKEIILVNDHSTKPFLWTP 173
Query: 60 IIDVISDQTFEYITASDM-TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
+ + + + + D+ G R + V ++D ++ ++
Sbjct: 174 LREFVESELAPKVRLVDLPERSGLIVARMAGAREARGDVLIVLDSHTEVNTNWLPPLLEP 233
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
+T VCP IDVI+ TF+Y + + G F+WK ++ + P ++ D + P
Sbjct: 234 IAEDYRTCVCPFIDVIAHDTFQYRSQDEGKRGAFDWKFYYKRLPLLPGDL----DDPTKP 289
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
+P MAGGLFAI +F+ELG YDEG+DIWGGE E+SF++WQCGG L PCS VGHV
Sbjct: 290 FNSPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHV 349
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+R +P+ P GV+ V+ N RVAEVWMDE+ F Y NP
Sbjct: 350 YRGYAPFGNPRGVN-FVVRNFKRVAEVWMDEYSQFLYERNP 389
>gi|390361781|ref|XP_790897.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Strongylocentrotus purpuratus]
Length = 521
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 121/283 (42%), Positives = 164/283 (57%), Gaps = 16/283 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CK+ +Y LP S++I FHNEA STL RTV S+ NRSP L+ EIILVDD S+R
Sbjct: 49 CKEITYLAKLPNVSVIIPFHNEALSTLKRTVHSIFNRSPPELIHEIILVDDFSDRAYLKG 108
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI------ 112
P+ D +S I + G +L VV +D + + ++
Sbjct: 109 PLDDYMSAFPKVKIIRLEKREGLIRTRLLGAGPATGDVVL-FLDSHCEANYNWLPPLLER 167
Query: 113 ---TAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+ +VCP+IDVIS++ F Y + A D+ G F+W+L ++ + E RR + S
Sbjct: 168 IALNRRRIVCPMIDVISNEDFHYESQAGDVMRGAFDWELYYKRIPISEAENKRRSHE-SD 226
Query: 169 PLRTPTMAGGLFAIDKDYFYE-LGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P RTP MAGGLFA+D+ YF E LG YDEG++IWGGE ++SF+VW CGG +E IPCS VG
Sbjct: 227 PFRTPIMAGGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVWMCGGEMEEIPCSRVG 286
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
H++R YT PGG ++ N RV EVWMDEW ++Y P
Sbjct: 287 HIYRKFMSYTVPGGAG-VINKNLLRVVEVWMDEWGKYFYERRP 328
>gi|307186272|gb|EFN71935.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Camponotus
floridanus]
Length = 667
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 119/286 (41%), Positives = 170/286 (59%), Gaps = 33/286 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK + Y + LP SI+I F+NE ++TLLR++ S++ R+P LL EIILV+D S+
Sbjct: 188 CKTQKYSSNLPNASIIICFYNEHYTTLLRSLHSILERTPAALLHEIILVNDFSDS----- 242
Query: 61 IDVISDQTFEYITASDMTWGG----FNWKLREK--------NRHKKTVVCPIIDV----- 103
D++ ++ YI + +G F K RE R V +D
Sbjct: 243 -DILHEKIHAYIKNN---FGAKVRLFKTKKREGLIRARVFGARKATGDVLIFLDSHIEVN 298
Query: 104 ---ISDQTFEYITAKTVV-CPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
I +KT+V P+ID+I+ TF+Y T S + GGFNW L+F+W +P
Sbjct: 299 EIWIEPLLSRIAYSKTIVPMPVIDIINADTFQY-TGSPLVRGGFNWGLHFKWDNLPI-GT 356
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
++ D P+++PTMAGGLFAID++YF ++G YD GMD+WGGENLE+SFR+W CGG +E
Sbjct: 357 LKHENDFVKPIKSPTMAGGLFAIDREYFIKIGEYDTGMDVWGGENLEISFRIWMCGGSIE 416
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
+IPCS VGHVFR + PY +L N+ RVA VWMDE++D++
Sbjct: 417 LIPCSRVGHVFRRRRPYG-SDDPHDTMLKNSLRVAHVWMDEYKDYF 461
>gi|357606408|gb|EHJ65055.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 389
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 122/297 (41%), Positives = 167/297 (56%), Gaps = 26/297 (8%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++ LP S+VI F NEAWSTLLRT+ SV+NRSP LL+E++L+DD S+ + I
Sbjct: 44 CLERYSSKLLPQASVVICFFNEAWSTLLRTLHSVLNRSPPHLLREVLLIDDFSD--MDHI 101
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA------ 114
+ + T ++ + +R + K P++ V D E
Sbjct: 102 KVRLENYTRKFPNVILIRTSQREGLIRARIVGAKKASAPVL-VFLDSHCECTEGWLEPLL 160
Query: 115 -------KTVVCPIIDVISDQTFEYITAS--DMTWGGFNWKLNFRWYRVPPREMMRRGGD 165
K V P+ID I TFEYI+ + D+ GGFNW L F W R + + +
Sbjct: 161 ERLVENPKIVASPVIDHIDPNTFEYISQNPKDIYIGGFNWNLKFIW-----RSIEYKREN 215
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
P++TPT+AGGLFAIDK++FY +G YDEG D+WGGENLE+SF+VW CGG LEI+PCSH
Sbjct: 216 FLLPIKTPTIAGGLFAIDKEFFYSIGYYDEGFDVWGGENLELSFKVWMCGGSLEIVPCSH 275
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASVSTCAAH 282
VGH+FR+ PY G K NAAR+AEVW+D++ +Y S+ A
Sbjct: 276 VGHIFRENFPYYTSGETFK---RNAARLAEVWLDDYAKIFYERIGNADVSLGDVTAQ 329
>gi|296195172|ref|XP_002745263.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Callithrix jacchus]
Length = 601
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|403295707|ref|XP_003938772.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Saimiri boliviensis boliviensis]
Length = 601
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|395840006|ref|XP_003792861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Otolemur garnettii]
Length = 601
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
D + D+ EY I + G +L + + V+
Sbjct: 185 -DHLKDKLEEYMARFSQVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE 297
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
+RR D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 298 --LRRA-DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Anolis carolinensis]
Length = 612
Score = 213 bits (542), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 121/287 (42%), Positives = 166/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
C K Y LP TS++I FHNE WS+LLRTV SV+NRSP L+ EI+LVDD S+R +
Sbjct: 141 CNSKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLRK 200
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ D ++ T I + G ++ + V+ P++D I
Sbjct: 201 RLEDYMAQFTKVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRI 260
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYIT-ASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y T A D G F+W++ ++ +PP E+ +
Sbjct: 261 AR------NHKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPP-ELQK-- 311
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG +E IPC
Sbjct: 312 PDPSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPC 371
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P GVS + N RVAEVWMDE+ ++ Y P
Sbjct: 372 SRVGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRP 416
>gi|395840008|ref|XP_003792862.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Otolemur garnettii]
Length = 600
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 129 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 183
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
D + D+ EY I + G +L + + V+
Sbjct: 184 -DHLKDKLEEYMARFSQVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 242
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 243 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
+RR D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 --LRRA-DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 353
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 354 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 404
>gi|109076173|ref|XP_001084905.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 2
[Macaca mulatta]
Length = 584
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 113 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSER----- 167
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 168 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 226
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 227 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 279
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 280 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 337
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 338 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 388
>gi|402870849|ref|XP_003899412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Papio anubis]
Length = 584
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 113 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSER----- 167
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 168 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 226
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 227 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 279
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 280 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 337
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 338 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 388
>gi|402870847|ref|XP_003899411.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Papio anubis]
Length = 601
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|390350617|ref|XP_784979.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 647
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 119/285 (41%), Positives = 157/285 (55%), Gaps = 18/285 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVC 58
CK YP LPTTS++I+FHNEA+S LLRTV SVINRSPR LLKEIILVDDAS E +
Sbjct: 282 CKSLVYPEVLPTTSVIIIFHNEAFSALLRTVHSVINRSPRHLLKEIILVDDASTQEHLKV 341
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT------FEYI 112
+ D IS + +R + + I+ + E +
Sbjct: 342 KLDDYISRHFHSSARVRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEPL 401
Query: 113 TA------KTVVCPIIDVISDQTFEYITASDMTW-GGFNWKLNFRWYRVPPREMMRRGGD 165
A + VV P IDVI D Y + + G F W + FRW + ++ +
Sbjct: 402 LAVIDKDRRNVVTPTIDVIDDNDLAYKGSDQLPQVGSFGWTMAFRWTAIQTMDLEEAKRN 461
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSH 225
+ P+R+PTMAGGLF+IDK YF ELG YD G IWG EN+E+SF+ W CGG L + CSH
Sbjct: 462 PTLPIRSPTMAGGLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYTMACSH 521
Query: 226 VGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
VGH+FR +PY+ G+ N R+ EVW+ + R FYY ++P
Sbjct: 522 VGHIFRKFAPYS---GMGSYFHRNNKRLIEVWLGDARAFYYKLHP 563
>gi|109076171|ref|XP_001084788.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1
[Macaca mulatta]
gi|355687723|gb|EHH26307.1| hypothetical protein EGK_16237 [Macaca mulatta]
Length = 601
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
Length = 534
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 119/283 (42%), Positives = 162/283 (57%), Gaps = 26/283 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CK + YP+ LP S++I FHNE WSTLLRTV VI R+P LL E+ILVDD S + C
Sbjct: 63 CKDRLYPSRLPNVSVIIPFHNEHWSTLLRTVHGVIGRTPPHLLGEVILVDDFSSKENCGR 122
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
P+ + ++ I G +LR + V+ P+++ I
Sbjct: 123 PLNEYMATFPQVRILRMKQREGLIRARLRGVEVARGNVLVFMDAHCEVNVNWLPPLLEPI 182
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGG-FNWKLNFRWYRVPPREMMRRG 163
S ++ TV P IDVI TFEY G F+W+LN++ R+P + R
Sbjct: 183 S------VSMTTVTIPTIDVIDHATFEYKEQQGGPMRGVFDWQLNYK--RIPVLDGRGRK 234
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
+ P TP M GG+FAIDK++F+ LG YD G++IWGGE E+SF++WQCGG+L+ +PC
Sbjct: 235 VRPTLPFSTPVMPGGVFAIDKEFFHHLGGYDSGLEIWGGEQFELSFKIWQCGGVLQEVPC 294
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
S VGHVFR SPY V +I L N RVAEVWMD+++ +YY
Sbjct: 295 SRVGHVFRKFSPYATDNDVLQI-LKNYMRVAEVWMDDYKQYYY 336
>gi|328712307|ref|XP_001942933.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
10-like [Acyrthosiphon pisum]
Length = 592
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 117/284 (41%), Positives = 166/284 (58%), Gaps = 21/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
C+ K Y + LPT S+VI FHNE +STLLRTV+SV+NRSP+ LLKEIILVDD+S +
Sbjct: 132 CRFKKYNSKLPTVSVVIPFHNEHFSTLLRTVYSVLNRSPKILLKEIILVDDSSTKTSLKR 191
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQT---------- 108
P+ + +S+ + T + +R + + I+ + T
Sbjct: 192 PLDNFLSNNLAD--TVQIIHLKKRQGLIRARLAGARKATSEILIFLDSHTEANANWLPPL 249
Query: 109 FEYITA--KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
E IT +T VCP IDVI+ +TFEY + G F+W+ ++ + P +++
Sbjct: 250 LEPITEDYRTCVCPFIDVIAFETFEYRAQDEGARGAFDWEFFYKRLPLLPEDLLYP---- 305
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ P R+P MAGGLFAI +F+ELG YD G+DIWGGE E+SF++WQCGG + PCS V
Sbjct: 306 TKPFRSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSFKIWQCGGTILDAPCSRV 365
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GH++R +P+ P G+ V N RVAEVWMDE+ ++ Y P
Sbjct: 366 GHIYRKFAPFPNP-GIGDFVGKNYRRVAEVWMDEYAEYLYLRRP 408
>gi|332217746|ref|XP_003258022.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Nomascus leucogenys]
Length = 601
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MLDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|332030162|gb|EGI69956.1| N-acetylgalactosaminyltransferase 6 [Acromyrmex echinatior]
Length = 603
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 115/282 (40%), Positives = 166/282 (58%), Gaps = 18/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV-VCP 59
C+KK Y L S+++ FHNE +STLLRT WSV+NRSP +LL+EIILVDDAS ++ +
Sbjct: 132 CRKKKYLKNLDPISVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKK 191
Query: 60 IIDVISDQTFEYITASDMTW--GGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
+D Q ++ ++ G +L + + V+ +D S+ ++
Sbjct: 192 KLDDYVAQHLPKVSIVRLSKRSGLIRGRLAGAKKARAKVLV-FLDSHSEANVNWLPPLLE 250
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
KT VCP IDVI+ +TFEYI + + G F+W+L ++ + P ++ R +
Sbjct: 251 PIAQNYKTCVCPFIDVIAYETFEYIAQDEGSRGAFDWELYYKRLPLLPEDLKRP----TE 306
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P ++P MAGGLFAI +F+ELG YD G+DIWGGE E+SF++WQCGG + PCS VGH
Sbjct: 307 PFKSPIMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGH 366
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
V+R P+ PG + N RVAEVWMDE+ ++ Y P
Sbjct: 367 VYRKFPPFPNPGR-GDFLGKNFKRVAEVWMDEYAEYLYKRRP 407
>gi|307207692|gb|EFN85329.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Harpegnathos
saltator]
Length = 598
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/286 (44%), Positives = 164/286 (57%), Gaps = 29/286 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C +Y T LP+ S++I+F+NE WS LLRTV SV+ S LLKEIILVDD SE
Sbjct: 133 CANLTYDTLLPSVSVIIIFYNEPWSVLLRTVHSVLKGSLPHLLKEIILVDDHSEE----- 187
Query: 61 IDVISDQTFEYITASDMT----------WGGFNWKLR-EKNRHKKTVV-----CPIIDVI 104
+ + Q Y++ T G +L KN +V C +I
Sbjct: 188 -EELQGQLDYYLSTRLPTKVKLLRLPYRQGLIRARLHGAKNATGDVLVFLDAHCEVIKDW 246
Query: 105 SDQTFEYITAK--TVVCPIIDVISDQTFEYITASDMTW---GGFNWKLNFRWYRVPPREM 159
+ I K V+ PIID IS++T EY ++ ++ GGF W +F W + E+
Sbjct: 247 LQPLLQRIKEKRNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHEL 306
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
R SP R+PTMAGGLFAID+ YF+E+GSYD+ MD WGGENLEMSFR+WQCGG LE
Sbjct: 307 KSRLS-LISPTRSPTMAGGLFAIDRKYFWEVGSYDDKMDGWGGENLEMSFRIWQCGGTLE 365
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
IIPCS VGH+FR+ PY FP + N AR+A VWMDE++ +
Sbjct: 366 IIPCSRVGHIFRNFHPYKFPNDKDTHGI-NTARLAFVWMDEYKRLF 410
>gi|157107408|ref|XP_001649763.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884049|gb|EAT48274.1| AAEL000646-PA [Aedes aegypti]
Length = 582
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 116/282 (41%), Positives = 163/282 (57%), Gaps = 17/282 (6%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
C+KK Y LPT S++++F+NE WSTLLRTV S++NRSP LLKEI+LV+D S +
Sbjct: 111 CRKKRYLQELPTVSVIVIFYNEHWSTLLRTVHSILNRSPSKLLKEIVLVNDHSTKEFLWE 170
Query: 59 PIIDVISDQTFEYITASDM-TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
P+ D + + + ++ G + V ++D ++ ++
Sbjct: 171 PLQDYVRSKLPSKVKLFNLPVRSGLIAARLAGAKAATGDVLIVLDSHTEVNVNWLPPLIE 230
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
+T VCP ID I+ TFEY S+ G F+WK F + R+P R + D +
Sbjct: 231 PIAENYRTCVCPYIDGIAHDTFEYKPQSEGRRGAFDWK--FLYKRLPLRPQDQ--TDPTE 286
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P +P MAGGLFAI +F+ELG YDE +DIWGGE E+SF++WQCGG + PCSHVGH
Sbjct: 287 PFDSPIMAGGLFAISAKFFWELGGYDEELDIWGGEQYELSFKIWQCGGRMVDAPCSHVGH 346
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
V+R +P+ P G + V N RVAEVWMDE++ F + NP
Sbjct: 347 VYRGLAPFPNPRG-TNFVTRNFKRVAEVWMDEYKQFLFERNP 387
>gi|241557818|ref|XP_002400302.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215501764|gb|EEC11258.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 464
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 120/277 (43%), Positives = 161/277 (58%), Gaps = 18/277 (6%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS--ERVVCPIIDVISDQ 67
LP+ S++I+F +E +S LLRTV+SV+NR+P LL+E+ILVDDAS E + +D +
Sbjct: 155 LPSASVIIIFTDEIFSALLRTVYSVVNRTPAKLLREVILVDDASSIEELANQRLDKYLRR 214
Query: 68 TFE----YITASDMTWGGFNWKLREKNRHKKTVV------CPIIDVISDQTFEYITAK-- 115
F + G +L V+ C D + + I
Sbjct: 215 HFRPGLVKLIRLPQRQGLIRARLTGAQAASGDVLVFLDSHCEATDYWLEPLLQPIREDRT 274
Query: 116 TVVCPIIDVISDQTFEYIT--ASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPLRTP 173
TVVCPIIDV+ D++ +Y+ A GGFNWK F W +P + R ++ P+ TP
Sbjct: 275 TVVCPIIDVVDDKSLQYMGNGADYFQIGGFNWKGEFVWINLPSGWKVARK-TKADPVNTP 333
Query: 174 TMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVFRDK 233
TMAGGLFAID+ YF+E GSYD M+ WGGENLEMSFR+W CGG L I PCSHVGH+FRD
Sbjct: 334 TMAGGLFAIDRKYFWESGSYDNEMEGWGGENLEMSFRIWMCGGKLVIAPCSHVGHIFRDY 393
Query: 234 SPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PY FP + N R+AEVWMD +++++Y P
Sbjct: 394 HPYKFPNNKDTHGI-NTVRLAEVWMDGYKNYFYQNRP 429
>gi|209364560|ref|NP_001129228.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Rattus
norvegicus]
Length = 601
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 164/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKD 189
Query: 56 ------VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI 104
PI+ ++ + E + + + G L + P+++ I
Sbjct: 190 KLEDYMARFPIVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQI 249
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP E+ R
Sbjct: 250 A------LNHKTIVCPMIDVIDHSHFGYEAQAGDAMRGAFDWEMYYKRIPIPP-ELQR-- 300
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG + +PC
Sbjct: 301 ADPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPC 360
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 361 SRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|194018457|ref|NP_001030017.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
gi|296434516|sp|Q49A17.2|GLTL6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase-like 6;
AltName: Full=Polypeptide GalNAc transferase 17;
Short=GalNAc-T17; Short=pp-GaNTase 17; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 17;
AltName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 17; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 17
gi|311103007|gb|ADP69004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Homo sapiens]
Length = 601
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|86475571|emb|CAF25036.1| pp-GalNAc-transferase 17 [Homo sapiens]
Length = 584
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 113 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSER----- 167
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 168 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 226
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 227 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 279
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 280 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 337
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 338 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 388
>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6-like [Hydra
magnipapillata]
Length = 591
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/305 (40%), Positives = 179/305 (58%), Gaps = 33/305 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
C YP L TTS++++FHNEAWS LLRTV +V+ RSP +LKEIILVDDAS +
Sbjct: 122 CFNVDYPVKLSTTSVIVIFHNEAWSVLLRTVHTVLARSPPHMLKEIILVDDASVKEKYGH 181
Query: 56 VVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYI--- 112
+ + + ++ + + S + G +L + V+ +D + +F ++
Sbjct: 182 LGEKLENYVNTLSKVKLIRSPVRVGLTQARLIGADNAVGEVLV-FLDSHCEASFGWLEPL 240
Query: 113 ------TAKTVVCPIIDVISDQTFEYITAS-DMTWGGFNWKLNFRWYRVPPREMMRRGGD 165
K V P I+VIS + FEY + G F+W+L F W +PPRE MRR +
Sbjct: 241 LARLQENPKLAVVPDIEVISFKNFEYSSEKGSYNRGIFSWELMFNWGPLPPREKMRRKYE 300
Query: 166 RSSPLRTPTMAGGLFAIDKDYFYELGSYDEG---------MDIWGGENLEMSFRVWQCGG 216
S P+++PTMAGGLFA+++ YF+E G+YD + WGGEN+EMSFR+W CG
Sbjct: 301 -SDPIKSPTMAGGLFAMNRKYFFESGAYDRQNILGRXXXXLTYWGGENVEMSFRLWMCGE 359
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA----MNPGK 272
+EIIPCS VGHVFR+++PY P G + HN+ RVAEVWMDE+++ +Y+ + P +
Sbjct: 360 GIEIIPCSRVGHVFRERAPYKSPDGSTD---HNSIRVAEVWMDEFKEIFYSFRANLKPEQ 416
Query: 273 SASVS 277
VS
Sbjct: 417 GGDVS 421
>gi|350409603|ref|XP_003488790.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Bombus
impatiens]
Length = 610
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/314 (38%), Positives = 169/314 (53%), Gaps = 24/314 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CKKK Y L + S+++ FHNE +STL+RT WSVINRSP LLKEIILVDDAS +V
Sbjct: 139 CKKKKYLRNLDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKVELKK 198
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
P+ D I++ + G + K V +D S+ ++
Sbjct: 199 PLEDYITEHLTKVKIVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANVNWLPPLLEP 258
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KT VCP IDVI+ +TFEY + G F+W+L ++ + P ++ + + P
Sbjct: 259 IAQDYKTCVCPFIDVIAYETFEYRAQDEGARGAFDWELYYKRLPLLPEDLQ----NPTEP 314
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
++P MAGGLFAI +F+ELG YD +DIWGGE E+SF++WQCGG + PCS VGH+
Sbjct: 315 FKSPVMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHI 374
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY-------AMNPGKSASVSTCAAH 282
+R P+ PG + N RVAEVWMDE+ ++ Y ++NPG A
Sbjct: 375 YRKFPPFPNPGK-GDFLGKNYKRVAEVWMDEYAEYIYTRRPHLRSLNPGNLKEQRDLRAR 433
Query: 283 FRMLSYSSWFSGSI 296
+ WF +I
Sbjct: 434 LHCKPF-KWFMENI 446
>gi|410956556|ref|XP_003984908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Felis catus]
Length = 601
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 118/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|149698080|ref|XP_001498934.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Equus
caballus]
Length = 601
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 118/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKRREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
Length = 615
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 171/321 (53%), Gaps = 38/321 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y L S+V+ FHNE +STL+RT WSVINRSP +LL EIILVDDAS +V
Sbjct: 137 CKNKRYLKDLDPVSVVVSFHNEHFSTLMRTCWSVINRSPPSLLHEIILVDDASTKVE--- 193
Query: 61 IDVISDQTFEYITAS---------DMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEY 111
+ D+ EY+ + G +L + ++ +D S+ +
Sbjct: 194 ---LKDKLDEYVKKNLPKVKIVRLPRRSGLIRGRLAGARKATAKILV-FLDSHSEANVNW 249
Query: 112 ITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ KT VCP IDVI+ +TFEY + G F+W+L ++ + P ++
Sbjct: 250 LPPLLEPIAKDYKTCVCPFIDVIAYETFEYRAQDEGARGAFDWELYYKRLPLLPEDLK-- 307
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
+ S P ++P MAGGLFAI +F+ELG YD G+DIWGGE E+SF++WQCGG + P
Sbjct: 308 --NPSEPFKSPVMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAP 365
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY-------AMNPGKSAS 275
CS VGH++R P+ PG + N RVAEVWMDE+ DF Y AM+PG
Sbjct: 366 CSRVGHIYRKFPPFPNPGR-GDFLGKNYKRVAEVWMDEYADFIYRRRPHLRAMDPGDLTE 424
Query: 276 VSTCAAHFRMLSYSSWFSGSI 296
+ S+ WF +I
Sbjct: 425 QKALRDKLKCKSF-KWFMENI 444
>gi|71297071|gb|AAH47551.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
Length = 601
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 165/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIP------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 1 [Xenopus (Silurana) tropicalis]
Length = 600
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 164/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TSI+I FHNE W++LLRT+ SVINR+P +L++E+ILVDD S+R
Sbjct: 129 CKHKLYLERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLRE 188
Query: 56 ------VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P+++ I
Sbjct: 189 KLEEYMAYYPKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQI 248
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP E+ R
Sbjct: 249 A------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP-ELQR-- 299
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG + +PC
Sbjct: 300 TDPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPC 359
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 360 SRVGHIYRKYVPYKVPTGTS--LARNLKRVAETWMDEYAEYIYQRRP 404
>gi|351699379|gb|EHB02298.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Heterocephalus glaber]
Length = 522
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 118/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 48 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 102
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 103 -EHLKDKLEEYMARFSKVRILRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 161
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 162 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 214
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 215 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 272
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 273 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 323
>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 2 [Xenopus (Silurana) tropicalis]
Length = 601
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 164/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER----- 55
CK K Y LP TSI+I FHNE W++LLRT+ SVINR+P +L++E+ILVDD S+R
Sbjct: 130 CKHKLYLERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLRE 189
Query: 56 ------VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVI 104
P + ++ + E + + M G L + P+++ I
Sbjct: 190 KLEEYMAYYPKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQI 249
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP E+ R
Sbjct: 250 A------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP-ELQR-- 300
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG + +PC
Sbjct: 301 TDPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPC 360
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 361 SRVGHIYRKYVPYKVPTGTS--LARNLKRVAETWMDEYAEYIYQRRP 405
>gi|344288241|ref|XP_003415859.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Loxodonta africana]
Length = 601
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 166/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R +
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKD 189
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ D ++ + I + G +L + + V+ P+++ I
Sbjct: 190 KLEDYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQI 249
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP E+ R
Sbjct: 250 A------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP-ELQR-- 300
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG + +PC
Sbjct: 301 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPC 360
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 361 SRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|340713833|ref|XP_003395440.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like [Bombus
terrestris]
Length = 610
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 120/314 (38%), Positives = 169/314 (53%), Gaps = 24/314 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CKKK Y L + S+++ FHNE +STL+RT WSVINRSP LLKEIILVDDAS +
Sbjct: 139 CKKKKYLKNLDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKAELKK 198
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
P+ D I+++ + G + K V +D S+ ++
Sbjct: 199 PLEDYITERFTKVKLVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEP 258
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KT VCP IDVI+ +TFEY + G F+W+L ++ + P ++ + + P
Sbjct: 259 IAQDYKTCVCPFIDVIAYETFEYRAQDEGARGAFDWELYYKRLPLLPEDLQ----NPTEP 314
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
++P MAGGLFAI +F+ELG YD +DIWGGE E+SF++WQCGG + PCS VGH+
Sbjct: 315 FKSPVMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHI 374
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY-------AMNPGKSASVSTCAAH 282
+R P+ PG + N RVAEVWMDE+ ++ Y ++NPG A
Sbjct: 375 YRKFPPFPNPGK-GDFLGKNYKRVAEVWMDEYAEYIYTRRPHLRSLNPGNLKEQRDLRAR 433
Query: 283 FRMLSYSSWFSGSI 296
+ WF +I
Sbjct: 434 LHCKPF-KWFMENI 446
>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase 10-like [Metaseiulus
occidentalis]
Length = 590
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 123/315 (39%), Positives = 170/315 (53%), Gaps = 57/315 (18%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ Y LPT SIVI FHNE S LLRT+ SV+ RSP++L+KEIILVDD S +
Sbjct: 126 CQNIRYAARLPTASIVIPFHNEHLSVLLRTITSVLRRSPKSLIKEIILVDDFSSKK---- 181
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNR------------------------HKKTV 96
+S + Y+++ +G LR R H +
Sbjct: 182 -SXVSTELENYLSSH---FGSQVKLLRATKREGLIRARLLGARAAEGDVLIFLDSHTEAN 237
Query: 97 VC---PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYR 153
V P++D I+ +TVVCP IDVI +TF Y + + G F+W+L ++
Sbjct: 238 VNWLPPLLDPIARNR------RTVVCPFIDVIHYETFAYRSQDEGARGAFDWELYYKRLP 291
Query: 154 VPPREMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQ 213
+ ++ R + P R+P MAGGLFAID+ YF+ELG YDEG+D+WGGE E+SF++WQ
Sbjct: 292 LLSEDLKRP----TEPFRSPVMAGGLFAIDRSYFWELGGYDEGLDVWGGEQYELSFKIWQ 347
Query: 214 CGGILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKS 273
CGG + PCS VGH++R +P+ P G+ V N RVAEVWMDE+++F Y P
Sbjct: 348 CGGQMFDAPCSRVGHIYRKFAPFPNP-GIGDFVGRNYRRVAEVWMDEYKEFLYNRRP--- 403
Query: 274 ASVSTCAAHFRMLSY 288
H+R L Y
Sbjct: 404 --------HYRTLGY 410
>gi|291385920|ref|XP_002709516.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Oryctolagus cuniculus]
Length = 601
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 117/293 (39%), Positives = 163/293 (55%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEYIT----------------------ASDMTWGGFNWKLREKNRHKKTVVC 98
D + D+ EY+ + M G L +
Sbjct: 185 -DHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMAGGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLF++D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFSVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
Length = 613
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 113/281 (40%), Positives = 156/281 (55%), Gaps = 16/281 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CK K Y LPT S+V+ FHNE W+TLLRT SV+NRSP LLKE+ILVDD S +
Sbjct: 144 CKSKKYLKDLPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKK 203
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
P+ D ++ + G R V +D ++ ++
Sbjct: 204 PLDDYLAANLTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEP 263
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KT VCP IDVI +TFEY + G F+W+ ++ + P ++ + P
Sbjct: 264 IAQDYKTCVCPFIDVIQYETFEYRAQDEGARGAFDWEFFYKRLPLLPEDLEHP----TEP 319
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
++P MAGGLFAI + +F+ELG YDEG+DIWGGE E+SF++WQCGG++ PCS VGH+
Sbjct: 320 FKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHI 379
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+R +P+ PG V N RVAEVWMDE+ ++ Y P
Sbjct: 380 YRKYAPFPNPGK-GDFVGRNYRRVAEVWMDEYAEYLYKRRP 419
>gi|261244898|ref|NP_778197.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Mus musculus]
gi|311103009|gb|ADP69005.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Mus musculus]
Length = 601
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 166/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R +
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKD 189
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ D ++ + I + G +L + + V+ P+++ I
Sbjct: 190 KLEDYMARFSKVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQI 249
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP E+ R
Sbjct: 250 A------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP-ELQR-- 300
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG + +PC
Sbjct: 301 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPC 360
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 361 SRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|334331052|ref|XP_001372346.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Monodelphis domestica]
Length = 573
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 119/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 102 CKHKMYLEKLPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDR----- 156
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + K V+
Sbjct: 157 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLP 215
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 216 PLLNQIA------LNRKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 268
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 269 ELQR--ADPSDPFESPVMAGGLFAVDRRWFWELGGYDPGLEIWGGEQYEISFKVWMCGGG 326
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 327 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 377
>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 573
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 164/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDD-ASERVVCP 59
CK+K Y L T S+V+ FHNE WSTLLRTV+SV+NRSP LLKEIILVDD +S+ +
Sbjct: 113 CKEKKYRKNLNTVSVVVPFHNEHWSTLLRTVYSVLNRSPSHLLKEIILVDDYSSKPFLKK 172
Query: 60 IIDVISDQTFEYITASDMT--WGGFNWKLREKNRHKKTVVC--------------PIIDV 103
+D+ D+ + + G +L + K V+ P+++
Sbjct: 173 KLDIYVDRHLPKVKIIRLPERMGLIRARLAGAKKAKAQVLLFLDSHTEANVNWLPPLLEP 232
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I++ KT VCP IDVI+ TFEY + G F+W+ ++ + P ++
Sbjct: 233 IAE------NYKTCVCPFIDVIAHDTFEYRAQDEGRRGAFDWEFFYKRLPLLPEDLKHP- 285
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
+ P ++P MAGGLFAI +F+ELG YDEG+ IWGGE E+SF++WQCGG + PC
Sbjct: 286 ---TEPFQSPVMAGGLFAISAKFFWELGGYDEGLAIWGGEQYELSFKIWQCGGKMVDAPC 342
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R +P+ P G+ V N RVAEVWMDE+ ++ Y P
Sbjct: 343 SRVGHIYRKFAPFPNP-GIGDFVGKNYRRVAEVWMDEYAEYLYKRRP 388
>gi|114596861|ref|XP_001155128.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1 [Pan
troglodytes]
Length = 601
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 118/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSER----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+++ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|167536399|ref|XP_001749871.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771586|gb|EDQ85250.1| predicted protein [Monosiga brevicollis MX1]
Length = 521
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 119/292 (40%), Positives = 163/292 (55%), Gaps = 29/292 (9%)
Query: 1 CKKKSYPT---FLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV 57
C+K+ YP LP+T+I+ FHNEA S L R+V SV++RSP L+ EIILVDD S+
Sbjct: 211 CQKRRYPDDYDSLPSTTIIFTFHNEARSVLYRSVRSVLDRSPPELIDEIILVDDFSDDPE 270
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA--- 114
I + ++ I + G ++R N + V+ +D + ++
Sbjct: 271 QGKIVLGMEKV--RILRNTRREGLIRSRIRAANAAQSPVLT-FLDSHIEANVGWLPPLLA 327
Query: 115 ------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRW----YRVPPREMMRRGG 164
KTV PIIDVI+ + FEY+ ++ G F+W + F+W Y V P E
Sbjct: 328 HVAKDYKTVASPIIDVINMENFEYLFSTPTLRGVFSWSMQFQWGLTPYHVDPDE------ 381
Query: 165 DRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCS 224
P++TP +AGGLF+I K +F E G YD GMD+WGGEN E+SFR WQCGG + I PCS
Sbjct: 382 ----PIKTPMIAGGLFSISKRWFDESGQYDTGMDVWGGENFEISFRTWQCGGAMYIDPCS 437
Query: 225 HVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
HVGH+FR PYTFP G N AR AEVWMD+++ +Y P +V
Sbjct: 438 HVGHIFRKSHPYTFPKGNGNTYDTNTARTAEVWMDDYKQHFYHARPSAQKAV 489
>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
Length = 591
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 113/281 (40%), Positives = 156/281 (55%), Gaps = 16/281 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC-- 58
CK K Y LPT S+V+ FHNE W+TLLRT SV+NRSP LLKE+ILVDD S +
Sbjct: 122 CKSKKYLKDLPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKK 181
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
P+ D ++ + G R V +D ++ ++
Sbjct: 182 PLDDYLAANLTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEP 241
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KT VCP IDVI +TFEY + G F+W+ ++ + P ++ + P
Sbjct: 242 IAQDYKTCVCPFIDVIQYETFEYRAQDEGARGAFDWEFFYKRLPLLPEDLEHP----TEP 297
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
++P MAGGLFAI + +F+ELG YDEG+DIWGGE E+SF++WQCGG++ PCS VGH+
Sbjct: 298 FKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHI 357
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+R +P+ PG V N RVAEVWMDE+ ++ Y P
Sbjct: 358 YRKYAPFPNPGK-GDFVGRNYRRVAEVWMDEYAEYLYKRRP 397
>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
Length = 540
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 112/281 (39%), Positives = 159/281 (56%), Gaps = 16/281 (5%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCP- 59
CKKK Y L S+++ FHNE +STLLRT WSV+NRSP +LL+EIILVDDAS ++
Sbjct: 71 CKKKQYLKNLDPVSVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKK 130
Query: 60 -IIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA---- 114
+ D ++ + + G + + V +D S+ ++
Sbjct: 131 KLDDYVAQHLPKVLIVRLPKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPPLLEP 190
Query: 115 -----KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSP 169
KT VCP IDVI+ +TFEY + G F+W+L ++ + P ++ R + P
Sbjct: 191 IARDYKTCVCPFIDVIAYETFEYRAQDEGARGAFDWELYYKRLPLLPEDLKRP----AEP 246
Query: 170 LRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHV 229
++P MAGGLFAI +F+ELG YD G+DIWGGE E+SF++WQCGG + PCS VGH+
Sbjct: 247 FKSPIMAGGLFAISTKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRVGHI 306
Query: 230 FRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+R P+ P G + N RVAEVWMDE+ ++ Y P
Sbjct: 307 YRKFPPFPNP-GRGDFLGKNYKRVAEVWMDEYAEYIYKRRP 346
>gi|397506054|ref|XP_003823551.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Pan paniscus]
Length = 518
Score = 210 bits (534), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 118/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 47 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSER----- 101
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 102 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 160
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 161 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 213
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+++ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 214 ELQR--ADPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 271
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 272 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 322
>gi|426346013|ref|XP_004040685.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Gorilla gorilla gorilla]
Length = 555
Score = 210 bits (534), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 118/293 (40%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD SER
Sbjct: 84 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSER----- 138
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 139 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 197
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 198 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 250
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+++ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 251 ELQR--ADPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 308
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 309 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 359
>gi|321469963|gb|EFX80941.1| hypothetical protein DAPPUDRAFT_224457 [Daphnia pulex]
Length = 498
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 115/292 (39%), Positives = 171/292 (58%), Gaps = 32/292 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C+ + YP TS++I F+NE STL RTV SV++++P LL E++LVDD+S+
Sbjct: 12 CQTQEYPKLYLNTSVIICFYNEDPSTLFRTVHSVLDQTPAELLHEVLLVDDSSDGELLGN 71
Query: 55 ----------RVVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCP 99
+ P + ++ E I + G L + V P
Sbjct: 72 IHNEVEEFVGKNFPPKVQLLKTMKREGLIRARIFGAKKATGQVLIFLDSHCEVNREWVQP 131
Query: 100 IIDVISD-QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPRE 158
+I I + +TF VV PIID+I+ TF+Y T+S + GGFNW L+F+W +P +
Sbjct: 132 LIARIQENRTF-------VVTPIIDIINSDTFQY-TSSPLVRGGFNWGLHFKWDSLPD-D 182
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
++ D P+ +PTMAGGLFAI+++YF+++G YD GM++WGGENLE+SFR+W CGG L
Sbjct: 183 TLKTNEDFVKPILSPTMAGGLFAIEREYFFDIGEYDAGMNVWGGENLEISFRIWMCGGRL 242
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
EIIPCS VGHVFR + PY P G + +N+ R A VW+D++ + ++ + P
Sbjct: 243 EIIPCSRVGHVFRRRRPYGSPNG-EDTMTYNSLRAAHVWLDDYIEHFFHVRP 293
>gi|300796651|ref|NP_001178227.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
Length = 601
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 117/293 (39%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + ++ EY I + G +L + + V+
Sbjct: 185 -EHLKEKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|354484373|ref|XP_003504363.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Cricetulus griseus]
Length = 555
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 166/287 (57%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R +
Sbjct: 84 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIVEIILVDDFSDREHLKD 143
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ D ++ + I + G +L + + V+ P+++ I
Sbjct: 144 KLEDYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQI 203
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP E+ R
Sbjct: 204 A------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKKIPIPP-ELQR-- 254
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG + +PC
Sbjct: 255 ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPC 314
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G I+ N RVAE WMDE+ ++ Y P
Sbjct: 315 SRVGHIYRKYVPYKVPSGT--ILARNLKRVAETWMDEFAEYIYQRRP 359
>gi|170065987|ref|XP_001868085.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167862691|gb|EDS26074.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 639
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 116/289 (40%), Positives = 169/289 (58%), Gaps = 29/289 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C ++SY LP+ SI++ F+NE TLLR+V SV+ R+P LL EIILVDD S+
Sbjct: 164 CPEQSYDKVLPSASIIMCFYNEHLQTLLRSVNSVLGRTPAYLLHEIILVDDCSDFDDLGD 223
Query: 55 -------RVVCPIIDVISDQTFEYITASDMTWGGFNWK---LREKNRHKKTVVC---PII 101
+ I +I ++ E + S + +G N L + H + V P++
Sbjct: 224 DLEVGLKKFNNSKIRLIRNRDREGLMRSRV-YGARNATGDVLVFLDSHIEVNVDWIEPLL 282
Query: 102 DVISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMR 161
I + + P+ID+I+ TF Y T+S + GGFNW L+F+W +P + +
Sbjct: 283 QRIK------VNRTILAMPVIDIINSDTFAY-TSSPLVRGGFNWGLHFKWDNLPKGSLAK 335
Query: 162 RGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEII 221
D P ++PTMAGGLFA+D+ YF ELG YD GMD+WGGENLE+SFR WQCGG +E++
Sbjct: 336 ET-DFVGPFQSPTMAGGLFAMDRKYFKELGEYDMGMDVWGGENLEISFRAWQCGGSIELL 394
Query: 222 PCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
PCS +GHVFR + PY P G ++ N+ R+A VWMD++ +++ P
Sbjct: 395 PCSRIGHVFRKRRPYGSPDGTDTMI-RNSLRLARVWMDDYIKYFFENQP 442
>gi|345790655|ref|XP_543189.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Canis lupus
familiaris]
Length = 601
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 117/293 (39%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + D+ EY I + G +L + + V+
Sbjct: 185 -EHLKDKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+++ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|426220611|ref|XP_004004508.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Ovis aries]
Length = 601
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 117/293 (39%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + ++ EY I + G +L + + V+
Sbjct: 185 -EHLKEKLEEYMARFSKVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|443687046|gb|ELT90152.1| hypothetical protein CAPTEDRAFT_141956, partial [Capitella teleta]
Length = 351
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 112/293 (38%), Positives = 166/293 (56%), Gaps = 24/293 (8%)
Query: 1 CKKKSYP-TFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VV 57
C KK+Y T L TSI+I+FHNEA STLLRT+ +++ R+P LL EI++VDDAS +
Sbjct: 32 CHKKTYDLTTLGKTSIIIIFHNEARSTLLRTIHALLERTPILLLVEILIVDDASTHAWLK 91
Query: 58 CPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDV 103
P+ + I G + R K ++ P++
Sbjct: 92 EPLDKYLQHLPRIRIIRLKQRQGLIRARTRGAEEAKGDILYFADAHTEVGEGWLPPLLQR 151
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I + K +V P +D I Q+FEY A D G F W + F+ Y+ P+E++ R
Sbjct: 152 IKENR------KVLVFPEMDPIQHQSFEYWRAGDEYHGAFYWHMEFK-YKFAPKEILNRR 204
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D + P+ +P M G AI+++YF+E G+YD M+IWGGEN+E +FR+W CGG +E+IPC
Sbjct: 205 SDPTQPVPSPVMVGCAHAIEREYFFETGAYDTDMEIWGGENIEHAFRLWMCGGRVEVIPC 264
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSASV 276
S VGHVF+ + PY+F G + I+ N R+AE WMD+++ F+YA P ++V
Sbjct: 265 SRVGHVFKPRLPYSFTGDSASIIQRNLIRIAETWMDDYKKFFYATQPSTISAV 317
>gi|157126612|ref|XP_001654672.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108873195|gb|EAT37420.1| AAEL010596-PA [Aedes aegypti]
Length = 569
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 111/288 (38%), Positives = 161/288 (55%), Gaps = 20/288 (6%)
Query: 1 CKKKSYPTFLP-TTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVV-- 57
CK + + T TTS++I FHNEA STLLRT+ SV+ ++P LL+EII++DD S +
Sbjct: 106 CKTREFLTSPGRTTSVIITFHNEATSTLLRTIGSVLKQTPPELLQEIIVIDDCSTSLEHN 165
Query: 58 ------CPIIDVISDQTFEYITASD-----MTWGGFNWKLREKNRHKKTVVCPIIDVISD 106
P++ + E + S G F L + + P++D ++
Sbjct: 166 LDFLGRIPLVRFHRNFVREGLIRSRNIGVAYASGDFVLFLDSHCEVNRGWLEPLVDRLT- 224
Query: 107 QTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
+ + V+ PIID+I +FEY S GGF+W L FRW V E+ R D
Sbjct: 225 -----VDSTAVLSPIIDIIDADSFEYRPNSARLRGGFDWSLRFRWLPVAEEELEHRNHDE 279
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
S P +P ++GG+F + K F +LG +D G++IWGGE+LE S + W CG +E++PCS +
Sbjct: 280 SQPFYSPAISGGVFIVSKTLFQQLGGFDGGLEIWGGESLEFSLKAWLCGAHVEVVPCSRI 339
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGKSA 274
GHVFR K PY FP G + L N R+A VWMDE+++F+Y P SA
Sbjct: 340 GHVFRRKHPYGFPQGSAATYLRNTKRIASVWMDEFQNFFYKTRPEASA 387
>gi|340727930|ref|XP_003402286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus terrestris]
Length = 643
Score = 209 bits (533), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 121/280 (43%), Positives = 173/280 (61%), Gaps = 21/280 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDAS------E 54
C+ + Y + LP SIVI F+NE + TLLR++ S+I+R+P +LL EIILV+D S E
Sbjct: 164 CEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLHEIILVNDWSDSKALHE 223
Query: 55 RVVCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPI---IDV----ISDQ 107
++ I++ + + Y T + G ++ + V+ + I+V I
Sbjct: 224 KIKTYIVNNFNGKVKFYKT--EKREGLIRARMFGARKATGEVLIFLDSHIEVNKRWIEPL 281
Query: 108 TFEYITAKTVVC-PIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
+ +KT+V PIID+I+ TF+Y T S + GGFNW L+F+W VP D
Sbjct: 282 LSQIAQSKTIVAMPIIDIINPDTFQY-TGSPLVRGGFNWGLHFKWDNVPVGTFAH-DEDF 339
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
P+++PTMAGGLFA+D+ YF +LG YD GMDIWGGENLE+SFR+W CGG +E+IPCS V
Sbjct: 340 IKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPCSRV 399
Query: 227 GHVFRDKSPY-TFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
GHVFR + PY TF +L N+ RVA VW+DE++D++
Sbjct: 400 GHVFRRRRPYGTFDQ--HDTMLKNSLRVAHVWLDEYKDYF 437
>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
Length = 602
Score = 209 bits (533), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 114/284 (40%), Positives = 162/284 (57%), Gaps = 22/284 (7%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+KK Y L S+++ FHNE +STL+RT WSVINRSP +LL+EIILVDDAS +V +
Sbjct: 131 CRKKKYSKNLDPVSVIVSFHNEHFSTLMRTCWSVINRSPPSLLEEIILVDDASTKV--EL 188
Query: 61 IDVISDQTFEYITASDMTW-----GGFNWKLREKNRHKKTVVCPIIDVISDQTFEYITA- 114
+ D +Y+ + G +L + V+ +D S+ ++
Sbjct: 189 KKKLDDYIAQYLPKVSIVRLAKRSGLIRGRLAGAKAARAKVLV-FLDSHSEANVNWLPPL 247
Query: 115 --------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDR 166
KT VCP IDVI+ +TFEY + G F+W+L ++ + P ++ R
Sbjct: 248 LEPIAQNYKTCVCPFIDVIAYETFEYRAQDEGARGAFDWELYYKRLPLLPEDLKRP---- 303
Query: 167 SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHV 226
+ P ++P MAGGLFAI +F+ELG YD G+DIWGGE E+SF++WQCGG + PCS V
Sbjct: 304 AEPFKSPIMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRV 363
Query: 227 GHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
GH++R P+ PG + N RVAEVWMDE+ ++ Y P
Sbjct: 364 GHIYRKFPPFPNPGR-GDFLGKNYKRVAEVWMDEYAEYIYKRRP 406
>gi|307173963|gb|EFN64693.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus
floridanus]
Length = 597
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 122/286 (42%), Positives = 160/286 (55%), Gaps = 29/286 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C +Y LP+ S+VI+F+NE WS LLRTV SV+ SP LLKEIILVDD SE
Sbjct: 133 CMNITYDKLLPSASVVIIFYNEPWSVLLRTVHSVLKGSPPHLLKEIILVDDHSEE----- 187
Query: 61 IDVISDQTFEYITAS----------DMTWGGFNWKLREKNRHKKTVV------CPIIDVI 104
+ + Q Y++ G +L K V+ C +I
Sbjct: 188 -EELQGQLDYYLSTRLPAKVKLLRLSHRQGLIRARLHGARNAKGDVLVFLDAHCEVIKDW 246
Query: 105 SDQTFEYI--TAKTVVCPIIDVISDQTFEYITASDMTW---GGFNWKLNFRWYRVPPREM 159
+ I V+ PIID IS++T EY ++ ++ GGF W +F W + E+
Sbjct: 247 LQPLLQRIKDNKNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHEV 306
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
R SP R+PTMAGGLFAI++ YF+E+GSYD+ MD WGGENLEMSFR+WQCGG LE
Sbjct: 307 ESRPSP-ISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEMSFRIWQCGGTLE 365
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
IIPCS VGH+FR+ PY FP + N AR+A VWMD ++ +
Sbjct: 366 IIPCSRVGHIFRNFHPYKFPNDKDTHGI-NTARLAFVWMDGYKRLF 410
>gi|348566779|ref|XP_003469179.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Cavia porcellus]
Length = 601
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 163/293 (55%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKMYLETLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEYIT----------------------ASDMTWGGFNWKLREKNRHKKTVVC 98
+ + ++ EY+ + M G L +
Sbjct: 185 -EHLKEKLEEYVARFSKVRILRTRKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+D+ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGE 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MFDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|350593501|ref|XP_003359567.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Sus
scrofa]
Length = 927
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 117/279 (41%), Positives = 163/279 (58%), Gaps = 58/279 (20%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPIIDVISDQTF 69
LPTTS+++ F +E WSTLLR+V SV+NRSP L+KEI+LVDD S + D + D
Sbjct: 508 LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTK------DYLKDNLD 561
Query: 70 EYITASDMTWGGFNWKLREKNRH---KKTVVCPII---DVIS--DQTFE----------- 110
+Y++ LR K RH + + I DV++ D E
Sbjct: 562 KYMSQFPKVR-----ILRLKERHGLIRARLAGAQIATGDVLTFLDSHVECNVGWLEPLLE 616
Query: 111 --YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSS 168
Y++ K V CP+I+VI+D+ DM R+ R+ G+ +
Sbjct: 617 RVYLSRKKVACPVIEVINDK--------DM------------------RKKERKKGNVDT 650
Query: 169 PLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGH 228
P+R P MAGGLF+IDK YF+ELG+YD G+D+WGGEN+E+SF+VW CGG +EIIPCS VGH
Sbjct: 651 PIRCPVMAGGLFSIDKTYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGH 710
Query: 229 VFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYA 267
+FR+ +PY+FP K V N RVAEVW+DE+++ +Y
Sbjct: 711 IFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 749
>gi|347971870|ref|XP_313714.5| AGAP004429-PA [Anopheles gambiae str. PEST]
gi|333469065|gb|EAA09257.5| AGAP004429-PA [Anopheles gambiae str. PEST]
Length = 663
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 115/293 (39%), Positives = 165/293 (56%), Gaps = 35/293 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ + Y LP S+V+ F+NE TL+R++ +V+ R+P LLKE+ILVDD S+
Sbjct: 184 CQAQVYDKVLPVASVVMCFYNEHLETLVRSIHTVLKRTPAYLLKELILVDDCSD------ 237
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKN--------------RHKKTVVCPIIDVISD 106
D T ++ G N +N R+ V +D +
Sbjct: 238 ---FEDLTVGGQLEKELAQLGTNKVRLLRNTDREGLIRSRVYGARNATGQVLIFLDSHIE 294
Query: 107 QTFEYITA--------KTVVC-PIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPR 157
++I +T++ P+ID+I+ TF Y TAS + GGFNW L+F+W +P +
Sbjct: 295 VNVDWIEPLLARIKHDRTILAMPVIDIINSDTFVY-TASPLVRGGFNWGLHFKWDNLP-K 352
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
+ R D P +PTMAGGLFAID+ YF ELG YD GMD+WGGENLE+SFR WQCGG
Sbjct: 353 GSLERDTDFVGPFNSPTMAGGLFAIDRAYFKELGEYDMGMDVWGGENLEISFRAWQCGGS 412
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+E++PCS +GHVFR + PY P G ++ N+ R+A VWMD++ ++Y P
Sbjct: 413 IELLPCSRIGHVFRKRRPYGSPDGQDTMI-RNSLRLAHVWMDDYIRYFYEQQP 464
>gi|157117587|ref|XP_001658839.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108875983|gb|EAT40208.1| AAEL008037-PA [Aedes aegypti]
Length = 662
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 112/288 (38%), Positives = 167/288 (57%), Gaps = 27/288 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C ++SY LP+ SI++ F+NE TL+R+V S+I R+P LL EIILVDD C
Sbjct: 187 CHEQSYDKVLPSASIIMCFYNEHLETLVRSVTSIIRRTPSYLLHEIILVDD------CSD 240
Query: 61 IDVISDQTFEYITA---------SDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEY 111
+D + D + A + G R+ V +D + ++
Sbjct: 241 LDDLRDNLEHELNALKNSKVRLIRNAEREGLMRSRVYGARNATGDVLIFLDSHIEVNVDW 300
Query: 112 I--------TAKTVVC-PIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
+ T KT++ P+ID+I+ TF Y ++S + GGFNW L+F+W +P + + +
Sbjct: 301 VEPLLQRIKTNKTILAMPVIDIINSDTFIY-SSSPLVRGGFNWGLHFKWDNLP-KGTLAK 358
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D P ++PTMAGGLFA+D+ YF +LG YD GMD+WGGENLE+SFR WQCGG +E++P
Sbjct: 359 ESDFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMGMDVWGGENLEISFRTWQCGGSIELVP 418
Query: 223 CSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
CS +GHVFR + PY P G S ++ N+ R++ VWMD++ ++ P
Sbjct: 419 CSRIGHVFRKRRPYGSPDG-SDTMIRNSLRLSRVWMDDYIKYFLENQP 465
>gi|405975887|gb|EKC40420.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 653
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 117/290 (40%), Positives = 164/290 (56%), Gaps = 37/290 (12%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK+++YP LP+TSI+I FHNEAWS LLRTV S++NRSP L+ EI+LVDD S+
Sbjct: 230 CKQETYPENLPSTSIIICFHNEAWSVLLRTVHSILNRSPLHLIHEILLVDDFSD------ 283
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFEYIT------- 113
++ + + EY+ + G ++ + + + ++ + T E +T
Sbjct: 284 LEFLKTELDEYLAEN----GKGKVRVVRTPKREGLIRARLLGY-GEATGEILTFLDSHIE 338
Query: 114 ----------------AKTVVCPIIDVISDQTFEYITASDMTWGGFN-WKLNFRWYRVPP 156
VV P+ID+ISD++ G F + F W +
Sbjct: 339 CFPGWLEPLLARVAENHTRVVAPVIDMISDRSLACGGNEIGNLGTFEIANMGFNWLTLNK 398
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E + G +S P +TPT+AGGLF+I++ YF ++G+YD GMDIWGGENLE+SFRVW CGG
Sbjct: 399 TEKAKHG--QSEPWKTPTIAGGLFSINRAYFTKMGTYDHGMDIWGGENLEISFRVWMCGG 456
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYY 266
LEI PCSHV H+FR SPY + I+ NA R AEVWMDE++ YY
Sbjct: 457 SLEIHPCSHVAHLFRSMSPYKWGKSFRDILRKNAVRTAEVWMDEYKHIYY 506
>gi|405951291|gb|EKC19216.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Crassostrea
gigas]
Length = 613
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 118/294 (40%), Positives = 167/294 (56%), Gaps = 40/294 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ ++P TSI++ F NE S LLR V S+ +++P+ L+KEIILVDD+S
Sbjct: 136 CQDVTFPAINLDTSIIVCFFNEQPSALLRLVHSINDQTPQELVKEIILVDDSS------T 189
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPII---DVISDQTFEYITAK-- 115
+D +S Q Y+ FN + ++ ++ + ++ S Q ++ +
Sbjct: 190 LDDLSCQIENYVNQH------FNNVRLVRTPEREGLIRARVFGANLASGQVLVFLDSHCE 243
Query: 116 ------------------TVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPR 157
TVV P+ID+I+ T EY S + GGFNW L+F W R+P
Sbjct: 244 VNTDWLEPLLLRISHDPTTVVVPVIDIINHDTMEY-QQSPLVRGGFNWGLHFSWDRLPDN 302
Query: 158 EMMRRGGDR-SSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
E + D S P+ +PTMAGGLFA+ +DYF+ LG YD GMDIWGGENLE+SFR+W CGG
Sbjct: 303 E--KNDPDLGSKPILSPTMAGGLFAMKRDYFHHLGEYDLGMDIWGGENLEISFRIWMCGG 360
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
LEIIPCS VGH+FR + PY P G L N+ RVA VWMD++++++ P
Sbjct: 361 KLEIIPCSRVGHIFRKRRPYGNPKG-RDTFLKNSLRVANVWMDKYKEYFLKQRP 413
>gi|326437001|gb|EGD82571.1| polypeptide N-acetylgalactosaminyltransferase 14 [Salpingoeca sp.
ATCC 50818]
Length = 621
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/294 (39%), Positives = 162/294 (55%), Gaps = 30/294 (10%)
Query: 1 CKKKSYPTF----LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERV 56
C+K+ YP LP+T+I+ FHNEA STL R+V SV++RSP L+ E+IL+DDAS+
Sbjct: 180 CRKRKYPEDYPPDLPSTTIIFTFHNEARSTLYRSVRSVLDRSPPELIDEVILIDDASDD- 238
Query: 57 VCPIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE------ 110
P I + + + G +R + R P++ + D E
Sbjct: 239 --PEQGKIVEGMEKVRILRNHEREGL---IRSRVRGANAAQSPVLTFL-DSHIECNTDWL 292
Query: 111 -------YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ + V P IDVI+ F Y+ ++ GGF W + F+W P +G
Sbjct: 293 PPLLEQLHKNYRAVASPTIDVINMDNFRYLGSTSGLRGGFTWSMQFQWETAP-----LKG 347
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
P+ TP +AGGLF+I K +F E G YD MD+WGGEN E+SFR W CGG ++I+PC
Sbjct: 348 KSAIDPIPTPMIAGGLFSITKRWFDESGQYDLDMDVWGGENFEISFRTWMCGGRMDIVPC 407
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRD-FYYAMNPGKSASV 276
S VGHVFR + PY+FPGG + N AR AEVWMDE++ FY+A K A+V
Sbjct: 408 SRVGHVFRKQHPYSFPGGNGRTYDKNTARTAEVWMDEYKQHFYHARPSAKFAAV 461
>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
Length = 600
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 115/287 (40%), Positives = 163/287 (56%), Gaps = 28/287 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASER--VVC 58
CK+K Y LP TSI+I FHNE WS+LLRT+ S+ NR+P L+ EIILVDD S+R +
Sbjct: 129 CKQKLYLENLPNTSIIIPFHNEGWSSLLRTLHSISNRTPDHLIAEIILVDDYSDREHLKA 188
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVI 104
+ + +S I + G +L + + V+ P++D I
Sbjct: 189 HLAEYMSRFPKVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEANINWLPPLLDQI 248
Query: 105 SDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
+ KT+VCP+IDVI F Y A D G F+W++ ++ +PP +G
Sbjct: 249 AQ------NPKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQG 299
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
D S P ++P MAGGLFA+++ +F+ELG YD G++IWGGE E+SF+VW CGG + +PC
Sbjct: 300 PDPSDPYQSPVMAGGLFAVNRQWFWELGGYDTGLEIWGGEQFEISFKVWMCGGSMYDVPC 359
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
S VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 360 SRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEYTEYIYQRRP 404
>gi|195397828|ref|XP_002057530.1| GJ18184 [Drosophila virilis]
gi|194141184|gb|EDW57603.1| GJ18184 [Drosophila virilis]
Length = 625
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 113/289 (39%), Positives = 162/289 (56%), Gaps = 36/289 (12%)
Query: 1 CKKKS--YPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVC 58
CK++ P LP S+++ F+NE TL+R++ +V+ R+P LLKEIILVDD S+
Sbjct: 132 CKREEPLEPEQLPQASVIMCFYNEHKMTLMRSIKTVLERTPGHLLKEIILVDDNSD---- 187
Query: 59 PIIDVISDQTFEYITASDMTWGGFNWKLREKNRHKKTVVCPIID---------VISDQTF 109
+ + F + N + R + + +I V D
Sbjct: 188 -----LPELEFHLLADLHARLKYDNLRYVRNERREGLIRSRVIGARDANGDVLVFLDSHI 242
Query: 110 E-------------YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPP 156
E + T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P
Sbjct: 243 EVNRQWLEPLLRLVHAENATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP- 300
Query: 157 REMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGG 216
++ D P R+PTMAGGLFA+ + YF +G YD MDIWGGEN+E+SFRVWQCGG
Sbjct: 301 EGTLKVPEDFKGPFRSPTMAGGLFAVSRLYFQHIGEYDMAMDIWGGENIEISFRVWQCGG 360
Query: 217 ILEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
++I+PCS VGH+FR + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 361 AIKIVPCSRVGHIFRKRRPYTAPDG-ANTMLKNSLRLAHVWMDKYKDYY 408
>gi|156544564|ref|XP_001602677.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Nasonia vitripennis]
Length = 637
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 114/285 (40%), Positives = 174/285 (61%), Gaps = 31/285 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK ++Y LP SIVI F+NE ++TLLR+++S+++R+P+ LL EIIL++D S+
Sbjct: 162 CKNQTYDQKLPNASIVICFYNEHYNTLLRSLYSILDRTPKHLLHEIILINDFSDS----- 216
Query: 61 IDVISDQTFEYITAS-DMTWGGFNWKLREK--------NRHKKTVVCPIIDV---ISDQT 108
+ +Q +Y+ + D + + RE + V +D ++
Sbjct: 217 -KSLHEQVRDYVKQNFDNKVKYYRTERREGLIRARMFGAKKATGEVLVFLDSHIEVNKMW 275
Query: 109 FEYITAKT------VVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRR 162
E + A+ V P+ID+I+ TF+Y ++S + GGFNW L+F+W +P +
Sbjct: 276 LEPLLARISHSRTIVPMPVIDIINADTFQY-SSSPLVRGGFNWGLHFKWDSLPIGTLSLE 334
Query: 163 GGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIP 222
D P+++PTMAGGLFA+D+ YF+ELG YD GMD+WGGENLE+SFR+W CGG +E+IP
Sbjct: 335 Q-DFVKPIKSPTMAGGLFAMDRKYFFELGEYDAGMDVWGGENLEISFRIWMCGGSIELIP 393
Query: 223 CSHVGHVFRDKSPYTFPGGVSK--IVLHNAARVAEVWMDEWRDFY 265
CS VGHVFR + PY GG + +L N+ RVA VWMD+++ ++
Sbjct: 394 CSRVGHVFRRRRPY---GGNDQQDTMLKNSLRVAYVWMDQYKKYF 435
>gi|224049734|ref|XP_002187605.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
1 [Taeniopygia guttata]
gi|449500484|ref|XP_004176221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
2 [Taeniopygia guttata]
Length = 601
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 166/293 (56%), Gaps = 40/293 (13%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
CK K Y LP TSI+I FHNE W++LLRT+ S+INR+P +L+ EIILVDD S+R
Sbjct: 130 CKHKVYLEALPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDR----- 184
Query: 61 IDVISDQTFEY--------ITASDMTWGGFNWKLREKNRHKKTVVC-------------- 98
+ + ++ EY I + G +L + + V+
Sbjct: 185 -EHLKEKLEEYMLRFAKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLP 243
Query: 99 PIIDVISDQTFEYITAKTVVCPIIDVISDQTFEY-ITASDMTWGGFNWKLNFRWYRVPPR 157
P+++ I+ + KT+VCP+IDVI F Y A D G F+W++ ++ +PP
Sbjct: 244 PLLNQIA------LNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPP- 296
Query: 158 EMMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGI 217
E+ R D S P +P MAGGLFA+++ +F+ELG YD G++IWGGE E+SF+VW CGG
Sbjct: 297 ELQR--ADPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGG 354
Query: 218 LEIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
+ +PCS VGH++R PY P G S + N RVAE WMDE+ ++ Y P
Sbjct: 355 MYDVPCSRVGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRP 405
>gi|21552973|gb|AAM62406.1|AF478698_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase mutant
SF32 [Drosophila melanogaster]
Length = 632
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 147 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 206
Query: 56 VVCPIIDVISDQTFE-----YITASDMTWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E ++ + G L + + P++ +I +
Sbjct: 207 LKYDNLRYIKNEQREGLIRSWVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSEN-- 264
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 265 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 318
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 319 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 378
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 379 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 412
>gi|241682071|ref|XP_002411622.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215504373|gb|EEC13867.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 473
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/291 (39%), Positives = 161/291 (55%), Gaps = 33/291 (11%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C+ +P LP+ S+VI F+NE S LLRTV+SV+NRSPR +L+E+ILVDD S+
Sbjct: 153 CQALRFPKDLPSVSVVITFYNEILSALLRTVYSVVNRSPRRILREVILVDDFSD------ 206
Query: 61 IDVISDQTFEYITASDMTWGGFNWKLREKNRH----------KKTV--VCPIIDVISDQT 108
+ + Q + ++ GF LR R K+ V +D + T
Sbjct: 207 LPEVKGQLYRFLKRHFRP--GFVKLLRLPRREGLIRARLVGAKEAAGHVLVFLDSHCEAT 264
Query: 109 FEYITA---------KTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREM 159
+++ TV PII +I TF + + G F W +F W PP
Sbjct: 265 RQWLEPLVTAVNDDPTTVASPIITIIDGNTFAHEDMGFLPLGSFEWNGDFTWIHPPPGW- 323
Query: 160 MRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILE 219
R D+++P+R+PT+AGGLFA+D+ YF+++G YD GM+ WGGENLE+SFR+W CGG L
Sbjct: 324 --RSPDQTAPVRSPTIAGGLFAVDRTYFFQMGGYDPGMNGWGGENLELSFRIWMCGGRLV 381
Query: 220 IIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
++PCS VGHVFR PYT P N R AEVWMDE+++ +Y P
Sbjct: 382 VVPCSQVGHVFRTDRPYTIPNETDSHA-RNTKRAAEVWMDEYKEIFYKEKP 431
>gi|14549429|gb|AAK66862.1|AF158747_1 polypeptide N-acetylgalactosaminyltransferase [Drosophila
melanogaster]
Length = 631
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 112/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 146 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 205
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E + S + G L + + P++ +I +
Sbjct: 206 LKYDNLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSEN-- 263
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 264 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 317
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 318 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 377
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 378 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 411
>gi|24584318|ref|NP_652069.2| polypeptide N-acetylgalactosaminyltransferase 35A [Drosophila
melanogaster]
gi|51316020|sp|Q8MVS5.2|GLT35_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 35A;
AltName: Full=Protein l(2)35Aa; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 35A;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 35A; Short=pp-GaNTase
35A; AltName: Full=dGalNAc-T1
gi|7298154|gb|AAF53391.1| polypeptide N-acetylgalactosaminyltransferase 35A [Drosophila
melanogaster]
gi|334191718|gb|AEG66944.1| LD24449p [Drosophila melanogaster]
Length = 632
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 147 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 206
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E + S + G L + + P++ +I +
Sbjct: 207 LKYDNLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSEN-- 264
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 265 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 318
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 319 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 378
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 379 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 412
>gi|332027983|gb|EGI68034.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Acromyrmex
echinatior]
Length = 597
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 122/287 (42%), Positives = 164/287 (57%), Gaps = 31/287 (10%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPI 60
C +Y LP+ SI+I+F+NE WS LLRTV SV+ SP LLKEIILVDD SE
Sbjct: 132 CANVTYDKLLPSASIIIIFYNEPWSVLLRTVHSVLKGSPPNLLKEIILVDDHSEE----- 186
Query: 61 IDVISDQTFEYITA---SDMTWGGFNWK---LREKNRHKKTVVCPIIDVISDQTFEYI-- 112
+ + Q Y++ + + ++ +R + K V ++ V D E I
Sbjct: 187 -EELQGQLDYYLSTRLPAKVKLLRLPYRQGLIRARLHGAKNAVGDVL-VFLDAHCEVIKD 244
Query: 113 -----------TAKTVVCPIIDVISDQTFEYITASDMTW---GGFNWKLNFRWYRVPPRE 158
V+ PIID IS++T EY ++ + GGF W +F W + E
Sbjct: 245 WLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHDNEAFFFQVGGFTWSGHFTWITIQKHE 304
Query: 159 MMRRGGDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGIL 218
+ R SP R+PTMAGGLFAI++ YF+E+GSYD+ MD WGGENLE+SFR+WQCGG L
Sbjct: 305 VESRFSP-ISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEISFRIWQCGGTL 363
Query: 219 EIIPCSHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
EIIPCS VGH+FR+ PY FP + N AR+A VWMDE++ +
Sbjct: 364 EIIPCSRVGHIFRNFHPYKFPNDKDTHGI-NTARLAFVWMDEYKRLF 409
>gi|195338421|ref|XP_002035823.1| GM15572 [Drosophila sechellia]
gi|194129703|gb|EDW51746.1| GM15572 [Drosophila sechellia]
Length = 604
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 149 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 208
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E + S + G L + + P++ +I +
Sbjct: 209 LKYDNLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSEN-- 266
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 267 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 320
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 321 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 380
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 381 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 414
>gi|194758571|ref|XP_001961535.1| GF14884 [Drosophila ananassae]
gi|190615232|gb|EDV30756.1| GF14884 [Drosophila ananassae]
Length = 627
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 111/275 (40%), Positives = 163/275 (59%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 146 LPHASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDNSDLPELEFHLHADLRAR 205
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ P + I ++ E + S + G L + + P++ +I +
Sbjct: 206 LKYPNLRYIKNEEREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKAEN-- 263
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 264 ----STLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 317
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PC+ VGH+F
Sbjct: 318 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCARVGHIF 377
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PY+ P G + +L N+ R+A VWMD+++D+Y
Sbjct: 378 RKRRPYSSPDGAN-TMLKNSLRLAHVWMDQYKDYY 411
>gi|194857037|ref|XP_001968882.1| GG25115 [Drosophila erecta]
gi|190660749|gb|EDV57941.1| GG25115 [Drosophila erecta]
Length = 634
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 149 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 208
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E + S + G L + + P++ +I +
Sbjct: 209 LKYDNLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIQSEN-- 266
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 267 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 320
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 321 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 380
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 381 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 414
>gi|17946358|gb|AAL49213.1| RE64279p [Drosophila melanogaster]
Length = 632
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 147 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 206
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E + S + G L + + P++ +I +
Sbjct: 207 LKYDNLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSEN-- 264
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 265 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 318
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 319 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 378
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 379 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 412
>gi|21552971|gb|AAM62405.1|AF478697_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 632
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 147 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 206
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E + S + G L + + P++ +I +
Sbjct: 207 LKYDNLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSEN-- 264
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 265 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 318
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 319 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 378
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 379 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 412
>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 113/283 (39%), Positives = 161/283 (56%), Gaps = 34/283 (12%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASERVVCPII-----DVI 64
LPTTS+++ F+NE W+TLLRT++S+++ SP LLKEIIL+DD S++V P + D++
Sbjct: 149 LPTTSVIVTFYNEGWTTLLRTIYSILHTSPEVLLKEIILIDDDSDKVEFPRLGKELEDIV 208
Query: 65 SDQTFEYITASDMTWGGFNWKLREKNRHKKTVVC--------------PIIDVISDQTFE 110
+ + + G +L V+ P++ I++
Sbjct: 209 ATMPRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIECNNGWLEPLLQRIAEDD-- 266
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDM---TWGGFNWKLNFRWYRVPPREMMRRGGDRS 167
V PII I+ Q F + +S+ GGF+W+L F+W+ +P +R D +
Sbjct: 267 ----SVVAVPIISTIAWQDFAFHHSSNSIEPQIGGFDWRLTFQWHSIPDEIKAKRKAD-T 321
Query: 168 SPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVG 227
P+ TPTMAGGLFA+ + YF +GSYD GM++WGGENLEMSFRVW CGG LEIIPCS VG
Sbjct: 322 DPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRVWMCGGSLEIIPCSIVG 381
Query: 228 HVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNP 270
HVF +PY K N R EVW+D+++ +YA NP
Sbjct: 382 HVFPKTAPYE-----RKSFTPNTVRAVEVWLDDYKRHFYARNP 419
>gi|195579200|ref|XP_002079450.1| GD23962 [Drosophila simulans]
gi|194191459|gb|EDX05035.1| GD23962 [Drosophila simulans]
Length = 635
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 112/275 (40%), Positives = 162/275 (58%), Gaps = 28/275 (10%)
Query: 10 LPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE--------------R 55
LP SIV+ F+NE TL+R++ +V+ R+P LL+EIILVDD S+ R
Sbjct: 150 LPQASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRAR 209
Query: 56 VVCPIIDVISDQTFEYITASDM-----TWGGFNWKLREKNRHKKTVVCPIIDVISDQTFE 110
+ + I ++ E + S + G L + + P++ +I +
Sbjct: 210 LKYDNLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSEN-- 267
Query: 111 YITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRGGDRSSPL 170
T+ P+ID+I+ TFEY T S + GGFNW L+FRW +P ++ D P
Sbjct: 268 ----ATLAVPVIDLINADTFEY-TPSPLVRGGFNWGLHFRWENLP-EGTLKVPEDFRGPF 321
Query: 171 RTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPCSHVGHVF 230
R+PTMAGGLFA+++ YF LG YD MDIWGGEN+E+SFR WQCGG ++I+PCS VGH+F
Sbjct: 322 RSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIF 381
Query: 231 RDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFY 265
R + PYT P G + +L N+ R+A VWMD+++D+Y
Sbjct: 382 RKRRPYTSPDGAN-TMLKNSLRLAHVWMDQYKDYY 415
>gi|426228255|ref|XP_004008229.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Ovis
aries]
Length = 448
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 116/293 (39%), Positives = 171/293 (58%), Gaps = 28/293 (9%)
Query: 1 CKKKSYPTFLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDASE------ 54
C+KK+YP LPT SI+I FHNE +S L RT+ S++ +P+ +L+EIILVDD S+
Sbjct: 129 CRKKTYPARLPTASIIICFHNEEFSALFRTLSSIMALTPQYILEEIILVDDTSDFDDLKE 188
Query: 55 ------RVVCPIIDVISDQTFEYITASDMTW-----GGFNWKLREKNRHKKTVVCPIIDV 103
+ I +I ++ E + + MT G L K + P+++
Sbjct: 189 KLDYHLEIFRGKIKLIRNKKREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEPLLNA 248
Query: 104 ISDQTFEYITAKTVVCPIIDVISDQTFEYITASDMTWGGFNWKLNFRWYRVPPREMMRRG 163
I+ K VVCP+IDVI T EY S + G FNW L F+W V E+
Sbjct: 249 IAKD------PKMVVCPLIDVIDYMTLEY-QPSPIVRGAFNWHLEFKWDHVLSYEIEGPE 301
Query: 164 GDRSSPLRTPTMAGGLFAIDKDYFYELGSYDEGMDIWGGENLEMSFRVWQCGGILEIIPC 223
G ++P+R+P MAGG+FAI ++YF E+G YD+GM++WGGENLE+S R+W CGG L +IPC
Sbjct: 302 GP-TTPIRSPAMAGGIFAISRNYFNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPC 360
Query: 224 SHVGHVFRDKSPYTFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPG-KSAS 275
S VGH+ R T + K+V +N+ R+A +W+DE+++ ++ P KSA+
Sbjct: 361 SRVGHINRQH--MTNDSEIMKVVEYNSLRLAHIWLDEYKEEFFLRRPALKSAA 411
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.323 0.137 0.451
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,199,579,864
Number of Sequences: 23463169
Number of extensions: 218599831
Number of successful extensions: 407464
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2009
Number of HSP's successfully gapped in prelim test: 38
Number of HSP's that attempted gapping in prelim test: 397548
Number of HSP's gapped (non-prelim): 5705
length of query: 304
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 162
effective length of database: 9,027,425,369
effective search space: 1462442909778
effective search space used: 1462442909778
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 76 (33.9 bits)