BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8674
(439 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|157106440|ref|XP_001649323.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108879843|gb|EAT44068.1| AAEL004538-PA [Aedes aegypti]
Length = 596
Score = 347 bits (890), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 166/241 (68%), Positives = 195/241 (80%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN EP ++GPGEGGKAY LPE + + EYGMN+ S+ IS DRTI D R+
Sbjct: 75 LGNFEPKEVDRRDGPGEGGKAYILPEDQQNRASDAEMEYGMNIVVSDTISLDRTIRDTRL 134
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
EECK+WDYP +LP SVI+VFHNEGFS LMRTVHS++ R+P L EIILVDDFS K DL
Sbjct: 135 EECKHWDYPHNLPTTSVIIVFHNEGFSVLMRTVHSVLNRSPKHVLHEIILVDDFSDKEDL 194
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
+KLE+YI+RF+GKV+LIRN EREGLIRTRSRGAKE+ GEVIV+LDAHCEV NWLPPLL
Sbjct: 195 KEKLENYIERFDGKVKLIRNVEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPPLL 254
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
APIY DR +MTVPVIDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 255 APIYRDRTVMTVPVIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 314
Query: 437 N 437
+
Sbjct: 315 D 315
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 153/200 (76%), Positives = 174/200 (87%), Gaps = 1/200 (0%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KLE+YIERF+GKV+LIRN EREGLIRTRSRGAKE+ GEVIV+LDAHCEV NWLPP
Sbjct: 193 DLKEKLENYIERFDGKVKLIRNVEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPP 252
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVPVIDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+R
Sbjct: 253 LLAPIYRDRTVMTVPVIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRR 312
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K++SEPYKSPTHAGGLFA++R FFLE+G YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 313 KHDSEPYKSPTHAGGLFAINREFFLEIGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 372
Query: 181 SRIVSLIRPVFKAD-GKLGN 199
SR+ + R + GKL N
Sbjct: 373 SRVGHVYRGFMPYNFGKLAN 392
>gi|193683588|ref|XP_001951150.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Acyrthosiphon
pisum]
Length = 588
Score = 345 bits (885), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 167/248 (67%), Positives = 197/248 (79%), Gaps = 1/248 (0%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
P+++ D GN E K GPGE GKA+H+P SL EYGMNM S+ IS +
Sbjct: 58 PIYR-DQIFGNFEYSTSTNKPGPGEKGKAHHVPSDRENEALQSLSEYGMNMACSDDISLN 116
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R+IPD R EECKYW YP LP+ SVI+VFHNEG+SSL+RTVHSI+ RTP Q+LEEI+LVD
Sbjct: 117 RSIPDHREEECKYWTYPEQLPRTSVIIVFHNEGWSSLLRTVHSILNRTPPQFLEEILLVD 176
Query: 309 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
DFSSK +L +KLE YI++FNGKVRLIRN+EREGLIRTRS+GA +RGEVI+FLDAHCEVG
Sbjct: 177 DFSSKENLKKKLEYYIEKFNGKVRLIRNSEREGLIRTRSKGASNARGEVILFLDAHCEVG 236
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
NWLPPL+API DRKIMTVPVIDGID+ TWE+R VYE DH +RGIFEWGMLYKE E+P
Sbjct: 237 YNWLPPLIAPIARDRKIMTVPVIDGIDHNTWEYRPVYEKDHLFRGIFEWGMLYKEIEIPA 296
Query: 429 REAKKRKY 436
+E +KR Y
Sbjct: 297 QEERKRIY 304
Score = 320 bits (820), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 155/209 (74%), Positives = 176/209 (84%), Gaps = 2/209 (0%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KKLE YIE+FNGKVRLIRN+EREGLIRTRS+GA +RGEVI+FLDAHCEVG NWLPP
Sbjct: 183 NLKKKLEYYIEKFNGKVRLIRNSEREGLIRTRSKGASNARGEVILFLDAHCEVGYNWLPP 242
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L+API DRKIMTVPVIDGID+ TWE+R VYE DH +RGIFEWGMLYKE E+P +E +KR
Sbjct: 243 LIAPIARDRKIMTVPVIDGIDHNTWEYRPVYEKDHLFRGIFEWGMLYKEIEIPAQEERKR 302
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
Y SEPYKSPTHAGGLFA+DR +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 303 IYKSEPYKSPTHAGGLFAIDRNYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 362
Query: 181 SRIVSLIRPVFKAD-GKLG-NLEPPLEPY 207
SR+ + R + G+LG ++ PL Y
Sbjct: 363 SRVGHVYRGFMPYNFGELGKKVKGPLITY 391
>gi|158289457|ref|XP_311182.4| AGAP000656-PA [Anopheles gambiae str. PEST]
gi|157018524|gb|EAA06901.4| AGAP000656-PA [Anopheles gambiae str. PEST]
Length = 598
Score = 343 bits (881), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 163/241 (67%), Positives = 194/241 (80%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN EP +P +GPGEGGKAY LPE + + EYGMN+ S+ IS DRTI D R+
Sbjct: 77 LGNFEPADKPMVDGPGEGGKAYVLPEDQQNRATDAEMEYGMNIVVSDAISLDRTIKDTRL 136
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
EECK+WDYP LP+ SV++VFHNEGFS LMRTVHS++ R+P L EIILVDD+S K DL
Sbjct: 137 EECKHWDYPYHLPRTSVVIVFHNEGFSVLMRTVHSVLNRSPKHLLHEIILVDDYSDKEDL 196
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
KLE YI+RF+G VRLIRN+EREGLIRTRSRGAKE+ GEVIV+LDAHCEV NWLPPLL
Sbjct: 197 KGKLERYIERFDGMVRLIRNSEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPPLL 256
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
API+ DR +MTVP+IDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 257 APIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 316
Query: 437 N 437
+
Sbjct: 317 D 317
Score = 318 bits (814), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 150/200 (75%), Positives = 171/200 (85%), Gaps = 1/200 (0%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KLE YIERF+G VRLIRN+EREGLIRTRSRGAKE+ GEVIV+LDAHCEV NWLPP
Sbjct: 195 DLKGKLERYIERFDGMVRLIRNSEREGLIRTRSRGAKEATGEVIVYLDAHCEVNTNWLPP 254
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI+ DR +MTVP+IDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+R
Sbjct: 255 LLAPIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRR 314
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K++SEPY+SPTHAGGLFA++R FFLELG YD GLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 315 KHDSEPYRSPTHAGGLFAINRKFFLELGAYDSGLLVWGGENFELSFKIWQCGGSIEWVPC 374
Query: 181 SRIVSLIRPVFKAD-GKLGN 199
SR+ + R + GKL N
Sbjct: 375 SRVGHVYRGFMPYNFGKLAN 394
>gi|321473823|gb|EFX84789.1| hypothetical protein DAPPUDRAFT_209135 [Daphnia pulex]
Length = 521
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 156/241 (64%), Positives = 197/241 (81%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
+GN EPP+E + GPGEGGK + L R S+ E+GMNM S+ IS RTI D R
Sbjct: 1 MGNFEPPIEAPRSGPGEGGKPHTLLPDQRNEASQSISEFGMNMVVSDEISLSRTISDTRT 60
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
EC++W YP DLPKASV++VFHNEG+S+L+RTV S+I R+P Q+LEE++LVDDFS KA L
Sbjct: 61 PECQHWSYPEDLPKASVVIVFHNEGWSTLLRTVQSVIDRSPPQFLEEVLLVDDFSEKAHL 120
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
+KLED+I+R++GKVRLIRN EREGLIRTR+RGA+E+RGEV++FLDAHCEVGLNWLPPLL
Sbjct: 121 KRKLEDFIERYDGKVRLIRNKEREGLIRTRTRGAEEARGEVVLFLDAHCEVGLNWLPPLL 180
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
PIY DR MTVP+IDGID++ +E+R VY+ + ++RG+FEWGMLYKENE+PEREA+ R Y
Sbjct: 181 YPIYLDRTTMTVPLIDGIDHENFEYRPVYQGETNFRGVFEWGMLYKENEVPEREAQSRTY 240
Query: 437 N 437
N
Sbjct: 241 N 241
Score = 311 bits (797), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 140/187 (74%), Positives = 168/187 (89%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLED+IER++GKVRLIRN EREGLIRTR+RGA+E+RGEV++FLDAHCEVGLNWLPPL
Sbjct: 120 LKRKLEDFIERYDGKVRLIRNKEREGLIRTRTRGAEEARGEVVLFLDAHCEVGLNWLPPL 179
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PIY DR MTVP+IDGID++ +E+R VY+ + ++RG+FEWGMLYKENE+PEREA+ R
Sbjct: 180 LYPIYLDRTTMTVPLIDGIDHENFEYRPVYQGETNFRGVFEWGMLYKENEVPEREAQSRT 239
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
YNSEPYK+PTHAGGLFA++RA+FLE+G YDPGLLVWGGENFELSFKIW CGG I WVPCS
Sbjct: 240 YNSEPYKAPTHAGGLFAINRAYFLEIGAYDPGLLVWGGENFELSFKIWQCGGKILWVPCS 299
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 300 RVGHVYR 306
>gi|91081797|ref|XP_973938.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270006291|gb|EFA02739.1| hypothetical protein TcasGA2_TC008465 [Tribolium castaneum]
Length = 583
Score = 338 bits (867), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 165/266 (62%), Positives = 202/266 (75%), Gaps = 2/266 (0%)
Query: 172 GGSIEWVPCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDAS 231
G +E +R+ RP +D LGN EP EGPGEGGK +HL + + D S
Sbjct: 39 GFDLESPIAARLSHKERPKLVSD--LGNFEPRDSQEHEGPGEGGKPHHLRQDQQNDADQS 96
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
EYGMN+ S+ IS DRTI D R+ ECK+W+YP +LP SVI+VFHNEG+S L+RTVHS
Sbjct: 97 ESEYGMNVACSDEISLDRTILDTRLSECKHWNYPENLPSTSVIIVFHNEGWSVLLRTVHS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
+I R+P + L+E++LVDDFS K +L +LE YI+RFNGKVRLIRN +REGLIRTRSRGAK
Sbjct: 157 VINRSPPKILKEVLLVDDFSDKENLKTRLETYIERFNGKVRLIRNAQREGLIRTRSRGAK 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
E+ GEVIVFLDAHCEV NWLPPLLAPIY DR +MTVPVIDGID++T+E+R VY D H+
Sbjct: 217 EATGEVIVFLDAHCEVNTNWLPPLLAPIYRDRSVMTVPVIDGIDHKTFEYRPVYGEDRHF 276
Query: 412 RGIFEWGMLYKENELPEREAKKRKYN 437
RGIFEWGMLYKENE+P++E RK+N
Sbjct: 277 RGIFEWGMLYKENEVPQKELNTRKHN 302
Score = 320 bits (819), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 150/200 (75%), Positives = 170/200 (85%), Gaps = 1/200 (0%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +LE YIERFNGKVRLIRN +REGLIRTRSRGAKE+ GEVIVFLDAHCEV NWLPP
Sbjct: 180 NLKTRLETYIERFNGKVRLIRNAQREGLIRTRSRGAKEATGEVIVFLDAHCEVNTNWLPP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVPVIDGID++T+E+R VY D H+RGIFEWGMLYKENE+P++E R
Sbjct: 240 LLAPIYRDRSVMTVPVIDGIDHKTFEYRPVYGEDRHFRGIFEWGMLYKENEVPQKELNTR 299
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+NSEPYKSPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 300 KHNSEPYKSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 359
Query: 181 SRIVSLIRPVFKAD-GKLGN 199
SR+ + R + GKL
Sbjct: 360 SRVGHVYRSFMPYNFGKLAQ 379
>gi|312383497|gb|EFR28562.1| hypothetical protein AND_03374 [Anopheles darlingi]
Length = 874
Score = 337 bits (865), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 161/241 (66%), Positives = 190/241 (78%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN EP +EGPGEGG+AY LPE + + EYGMN+ S+ IS DRTI D R+
Sbjct: 75 LGNFEPHEPTVREGPGEGGRAYVLPEDQQNQATDAEMEYGMNIVVSDAISLDRTIRDTRL 134
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
EECK+WDYP LPK SVI+VFHNEGFS LMRTVHS++ R+P L EIILVDD+S K DL
Sbjct: 135 EECKHWDYPYHLPKTSVIIVFHNEGFSVLMRTVHSVLNRSPKHLLHEIILVDDYSDKEDL 194
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
KLE YI+RF V+LIRN+EREGLIRTRSRGA E+ GEVIV+LDAHCEV NWLPPLL
Sbjct: 195 RGKLERYIERFGSLVKLIRNSEREGLIRTRSRGAHEATGEVIVYLDAHCEVNTNWLPPLL 254
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
API+ DR +MTVP+IDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+RK+
Sbjct: 255 APIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRRKH 314
Query: 437 N 437
+
Sbjct: 315 D 315
Score = 310 bits (794), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 142/188 (75%), Positives = 163/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KLE YIERF V+LIRN+EREGLIRTRSRGA E+ GEVIV+LDAHCEV NWLPP
Sbjct: 193 DLRGKLERYIERFGSLVKLIRNSEREGLIRTRSRGAHEATGEVIVYLDAHCEVNTNWLPP 252
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI+ DR +MTVP+IDGID++T+E+R VY HHYRGIFEWGMLYKENE+P RE K+R
Sbjct: 253 LLAPIHRDRTVMTVPIIDGIDHKTFEYRPVYADGHHYRGIFEWGMLYKENEVPRREQKRR 312
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K++SEPY+SPTHAGGLFA++R FFL+LG YD GLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 313 KHDSEPYRSPTHAGGLFAINRKFFLDLGAYDSGLLVWGGENFELSFKIWQCGGSIEWVPC 372
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 373 SRVGHVYR 380
>gi|195039904|ref|XP_001990971.1| GH12336 [Drosophila grimshawi]
gi|193900729|gb|EDV99595.1| GH12336 [Drosophila grimshawi]
Length = 591
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 163/257 (63%), Positives = 196/257 (76%)
Query: 181 SRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNME 240
R+ + R V K LGN EP + GPGE G AY LP + DAS EYGMN+
Sbjct: 54 QRLENRPREVPKLVEGLGNFEPKDLKPRTGPGENGDAYTLPPEKKNVADASEMEYGMNIA 113
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S+ IS R++ + R+EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P
Sbjct: 114 CSDDISMHRSVRETRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHM 173
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
L EIILVDDFS K +L KL+DY+Q+FNG V++IRN EREGLIRTRSRGA E+ GEVIVF
Sbjct: 174 LHEIILVDDFSDKENLRSKLDDYVQQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVF 233
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LDAHCEV LNWLPPLLAPIY DR +MTVP+IDGID++T+E+R VY D+H+RGIFEWGML
Sbjct: 234 LDAHCEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGML 293
Query: 421 YKENELPEREAKKRKYN 437
YKENE+P RE ++R +N
Sbjct: 294 YKENEVPRREQRRRAHN 310
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 141/188 (75%), Positives = 167/188 (88%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KL+DY+++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPP
Sbjct: 188 NLRSKLDDYVQQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++T+E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|383860243|ref|XP_003705600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Megachile
rotundata]
Length = 581
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 161/249 (64%), Positives = 194/249 (77%), Gaps = 2/249 (0%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
PV D LGN E P + GPGEGGK + L + + S EYGMNM S+ IS D
Sbjct: 54 PVLVKD--LGNFELQHVPIRTGPGEGGKPHILRDDQQNDVQQSESEYGMNMVCSDEISLD 111
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R++PD RM ECK+W+YP LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVD
Sbjct: 112 RSVPDTRMTECKHWNYPEVLPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVD 171
Query: 309 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
D+S K +L +LE YI+++ GKV+LIRN EREGLIRTRSRGA+E++GEVIVFLDAHCEV
Sbjct: 172 DYSDKDNLKGELESYIEQWEGKVKLIRNYEREGLIRTRSRGAREAKGEVIVFLDAHCEVN 231
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
+NWLPPLLAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP
Sbjct: 232 VNWLPPLLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPA 291
Query: 429 REAKKRKYN 437
RE K R YN
Sbjct: 292 REQKTRPYN 300
Score = 300 bits (769), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 140/188 (74%), Positives = 161/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +LE YIE++ GKV+LIRN EREGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 178 NLKGELESYIEQWEGKVKLIRNYEREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPP 237
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 238 LLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREQKTR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 298 PYNSMPYKSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPC 357
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 358 SHVGHVYR 365
>gi|195400935|ref|XP_002059071.1| GJ15190 [Drosophila virilis]
gi|194141723|gb|EDW58140.1| GJ15190 [Drosophila virilis]
Length = 591
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 161/257 (62%), Positives = 193/257 (75%)
Query: 181 SRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNME 240
R+ + R V K LGN EP + GPGE G A+ L + DAS EYGMN+
Sbjct: 54 QRLENRPREVPKLVEGLGNFEPKDLKPRNGPGENGDAHTLSPDKKNVADASEMEYGMNIA 113
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S+ IS R++ D R+EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P
Sbjct: 114 CSDEISMHRSVRDTRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHM 173
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
L EIILVDDFS K +L KL+DY+ +F G VR+IRNTEREGLIRTRSRGA E+ GEVIVF
Sbjct: 174 LHEIILVDDFSDKENLRTKLDDYVLQFKGLVRIIRNTEREGLIRTRSRGAMEATGEVIVF 233
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LDAHCEV LNWLPPLLAPIY DR +MTVP+IDGID++++E+R VY D H+RGIFEWGML
Sbjct: 234 LDAHCEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKSFEYRPVYGSDTHFRGIFEWGML 293
Query: 421 YKENELPEREAKKRKYN 437
YKENE+P RE ++R +N
Sbjct: 294 YKENEVPRREQRRRAHN 310
Score = 310 bits (793), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 141/188 (75%), Positives = 165/188 (87%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KL+DY+ +F G VR+IRNTEREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPP
Sbjct: 188 NLRTKLDDYVLQFKGLVRIIRNTEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++++E+R VY D H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKSFEYRPVYGSDTHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|195447414|ref|XP_002071203.1| GK25256 [Drosophila willistoni]
gi|194167288|gb|EDW82189.1| GK25256 [Drosophila willistoni]
Length = 587
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 161/257 (62%), Positives = 195/257 (75%)
Query: 181 SRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNME 240
R+ + R V K LGN EP + GPGE G A+ L + A DAS EYGMN+
Sbjct: 50 QRLENRPREVPKLIEGLGNFEPKDLKPRSGPGENGDAHVLNANKKNAADASEMEYGMNIA 109
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S+ IS R++ D R+EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P
Sbjct: 110 CSDDISMHRSVRDTRLEECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHM 169
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
L EIILVDDFS K +L KL++YI +F+G V++IRN EREGLIRTRSRGAKE+ GEVIVF
Sbjct: 170 LHEIILVDDFSDKENLKAKLDEYILQFDGLVKIIRNKEREGLIRTRSRGAKEATGEVIVF 229
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LDAHCEV LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGML
Sbjct: 230 LDAHCEVNLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGML 289
Query: 421 YKENELPEREAKKRKYN 437
YKENE+P RE ++R +N
Sbjct: 290 YKENEVPRREQRRRAHN 306
Score = 310 bits (795), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 144/200 (72%), Positives = 171/200 (85%), Gaps = 1/200 (0%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KL++YI +F+G V++IRN EREGLIRTRSRGAKE+ GEVIVFLDAHCEV LNWLPP
Sbjct: 184 NLKAKLDEYILQFDGLVKIIRNKEREGLIRTRSRGAKEATGEVIVFLDAHCEVNLNWLPP 243
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 244 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 303
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 304 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 363
Query: 181 SRIVSLIRPVFKAD-GKLGN 199
SR+ + R + GKL N
Sbjct: 364 SRVGHVYRGFMPYNFGKLAN 383
>gi|427797631|gb|JAA64267.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 327 bits (838), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 155/251 (61%), Positives = 197/251 (78%), Gaps = 2/251 (0%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
PVF+ D LGN EP ++GPGEGG AYH+PE R + S +YGMN+ S+HIS +
Sbjct: 112 PVFRKDRTLGNFEPKSHETRKGPGEGGVAYHVPERDRNSAADSNMQYGMNVVASDHISPN 171
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R++PD+R+EECKYWDYP DLP SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVD
Sbjct: 172 RSVPDMRLEECKYWDYPEDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVD 231
Query: 309 DFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 366
DFS K +L +LE YI G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCE
Sbjct: 232 DFSDKENLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCE 291
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 426
VG+NWLPPLLAPI ++R+ MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+
Sbjct: 292 VGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEI 351
Query: 427 PEREAKKRKYN 437
P+ E K+RKY+
Sbjct: 352 PDEEIKRRKYH 362
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 133/190 (70%), Positives = 159/190 (83%), Gaps = 2/190 (1%)
Query: 1 DLDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
+L +LE YI G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCEVG+NWL
Sbjct: 238 NLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCEVGINWL 297
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PPLLAPI ++R+ MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+P+ E K
Sbjct: 298 PPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEIPDEEIK 357
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+RKY+SEPYKSPTHAGGLFA++R +FLELGGYDPGLLVWGGENFELSFKIW CGG I WV
Sbjct: 358 RRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSFKIWQCGGMIYWV 417
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 418 PCSRVGHVYR 427
>gi|427797629|gb|JAA64266.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 641
Score = 327 bits (838), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 155/251 (61%), Positives = 197/251 (78%), Gaps = 2/251 (0%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
PVF+ D LGN EP ++GPGEGG AYH+PE R + S +YGMN+ S+HIS +
Sbjct: 112 PVFRKDRTLGNFEPKSHETRKGPGEGGVAYHVPERDRNSAADSNMQYGMNVVASDHISPN 171
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R++PD+R+EECKYWDYP DLP SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVD
Sbjct: 172 RSVPDMRLEECKYWDYPEDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVD 231
Query: 309 DFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 366
DFS K +L +LE YI G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCE
Sbjct: 232 DFSDKENLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCE 291
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 426
VG+NWLPPLLAPI ++R+ MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+
Sbjct: 292 VGINWLPPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEI 351
Query: 427 PEREAKKRKYN 437
P+ E K+RKY+
Sbjct: 352 PDEEIKRRKYH 362
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 133/190 (70%), Positives = 159/190 (83%), Gaps = 2/190 (1%)
Query: 1 DLDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
+L +LE YI G VRL+RN+EREGLIR+RS GA++S G+V++FLDAHCEVG+NWL
Sbjct: 238 NLKGELETYIAHNFPRGLVRLLRNSEREGLIRSRSYGAEQSHGDVVLFLDAHCEVGINWL 297
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PPLLAPI ++R+ MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+P+ E K
Sbjct: 298 PPLLAPIRANRRAMTVPVIDGIDKDTFEYRPVYHGRQHFRGIFEWGMLYKEIEIPDEEIK 357
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+RKY+SEPYKSPTHAGGLFA++R +FLELGGYDPGLLVWGGENFELSFKIW CGG I WV
Sbjct: 358 RRKYHSEPYKSPTHAGGLFAINRKYFLELGGYDPGLLVWGGENFELSFKIWQCGGMIYWV 417
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 418 PCSRVGHVYR 427
>gi|242016390|ref|XP_002428804.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212513501|gb|EEB16066.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 579
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 160/242 (66%), Positives = 192/242 (79%), Gaps = 3/242 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
GN EP E +GPGEGG+ + L E + SL +YGMN+ S+ IS DR+IPD R+
Sbjct: 58 GNFEPREEEISDGPGEGGRPHKLREDQQNDASQSLADYGMNIACSDEISLDRSIPDTRLP 117
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
ECK W YP DLPKASVI+VFHNEG+S+L+RTVHS+I RTP Q+LEE+++VDDFS K +L
Sbjct: 118 ECKRWMYPEDLPKASVIIVFHNEGWSTLLRTVHSVINRTPPQFLEEVLMVDDFSDKENL- 176
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
++L+DYI RFNGKVRLIRN+ER+GLIRTRSRGA E+RGEVIVFLDAHCEV NWLPPLLA
Sbjct: 177 KELDDYILRFNGKVRLIRNSERQGLIRTRSRGAVEARGEVIVFLDAHCEVNKNWLPPLLA 236
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER--EAKKRK 435
PIY DR +TVPVIDGID+ T+E++ VY HHYRGIFEWGMLYKE EL ++ A RK
Sbjct: 237 PIYYDRTTLTVPVIDGIDHDTFEYKPVYVDGHHYRGIFEWGMLYKEIELTDQFANADNRK 296
Query: 436 YN 437
YN
Sbjct: 297 YN 298
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 139/187 (74%), Positives = 159/187 (85%), Gaps = 2/187 (1%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
K+L+DYI RFNGKVRLIRN+ER+GLIRTRSRGA E+RGEVIVFLDAHCEV NWLPPLLA
Sbjct: 177 KELDDYILRFNGKVRLIRNSERQGLIRTRSRGAVEARGEVIVFLDAHCEVNKNWLPPLLA 236
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER--EAKKRK 121
PIY DR +TVPVIDGID+ T+E++ VY HHYRGIFEWGMLYKE EL ++ A RK
Sbjct: 237 PIYYDRTTLTVPVIDGIDHDTFEYKPVYVDGHHYRGIFEWGMLYKEIELTDQFANADNRK 296
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
YNSEPY+SPTHAGGLFA+DR +FL++G YD GLLVWGGENFELSFK+W CGG I WVPCS
Sbjct: 297 YNSEPYRSPTHAGGLFAIDRNYFLDIGAYDDGLLVWGGENFELSFKVWQCGGRILWVPCS 356
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 357 RVGHVYR 363
>gi|195481361|ref|XP_002101619.1| GE15519 [Drosophila yakuba]
gi|194189143|gb|EDX02727.1| GE15519 [Drosophila yakuba]
Length = 591
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 155/250 (62%), Positives = 191/250 (76%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPEKKHMSDASEMEYGMNIACSDEISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DDFS K +L +L++Y+Q+F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVQQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 428 EREAKKRKYN 437
RE ++R +N
Sbjct: 301 RREQRRRAHN 310
Score = 303 bits (777), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 164/188 (87%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +L++Y+++F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV NWLPP
Sbjct: 188 NLRSQLDEYVQQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|195130803|ref|XP_002009840.1| GI15586 [Drosophila mojavensis]
gi|193908290|gb|EDW07157.1| GI15586 [Drosophila mojavensis]
Length = 595
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 155/241 (64%), Positives = 188/241 (78%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN EP + GPGE G+ + L + DAS EYGMN+ S+ IS R++ D R+
Sbjct: 70 LGNFEPRDLKPRTGPGENGEGHILSPDKKNVADASEMEYGMNIACSDEISMHRSVRDTRL 129
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
EECK+WDYP DLP SVI+VFHNEGFS LMRTVHS+I R+P L EIILVDDFS K +L
Sbjct: 130 EECKHWDYPYDLPPTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILVDDFSDKENL 189
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
KL++Y+ +F G V++IRNTEREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPPLL
Sbjct: 190 RSKLDEYVLQFKGLVKIIRNTEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPPLL 249
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
APIY DR +MTVP+IDGID++T+E+R VY D+H+RGIFEWGMLYKENE+P RE ++R +
Sbjct: 250 APIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGMLYKENEVPRREQRRRAH 309
Query: 437 N 437
N
Sbjct: 310 N 310
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 133/176 (75%), Positives = 157/176 (89%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KL++Y+ +F G V++IRNTEREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPP
Sbjct: 188 NLRSKLDEYVLQFKGLVKIIRNTEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++T+E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKTFEYRPVYGSDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIE
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIE 363
>gi|195172039|ref|XP_002026809.1| GL27027 [Drosophila persimilis]
gi|194111748|gb|EDW33791.1| GL27027 [Drosophila persimilis]
Length = 567
Score = 325 bits (833), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 157/250 (62%), Positives = 190/250 (76%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + GPGE G+A+ L + D S EYGMN+ SN IS
Sbjct: 61 REVPKLIEGLGNFEPKDLKPRSGPGENGEAHSLSPDKKNVADDSEMEYGMNIACSNDISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EECK+WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DD+S K DL L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDYSDKEDLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 428 EREAKKRKYN 437
RE ++R +N
Sbjct: 301 RREQRRRTHN 310
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 131/176 (74%), Positives = 155/176 (88%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPP
Sbjct: 188 DLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSI+
Sbjct: 308 THNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSID 363
>gi|125980684|ref|XP_001354365.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
gi|54642673|gb|EAL31418.1| GA19561 [Drosophila pseudoobscura pseudoobscura]
Length = 591
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 157/250 (62%), Positives = 190/250 (76%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + GPGE G+A+ L + D S EYGMN+ SN IS
Sbjct: 61 REVPKLVEGLGNFEPKDLKPRSGPGENGEAHTLSPDKKNVADDSEMEYGMNIACSNDISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EECK+WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECKHWDYPYDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPKHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DD+S K DL L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDYSDKEDLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
LNWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NLNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 428 EREAKKRKYN 437
RE ++R +N
Sbjct: 301 RREQRRRAHN 310
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 139/188 (73%), Positives = 164/188 (87%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL L++Y ++FNG V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWLPP
Sbjct: 188 DLRSHLDEYSKQFNGLVKIIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNLNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|194766810|ref|XP_001965517.1| GF22410 [Drosophila ananassae]
gi|190619508|gb|EDV35032.1| GF22410 [Drosophila ananassae]
Length = 591
Score = 324 bits (830), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 152/241 (63%), Positives = 189/241 (78%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN EP + GPGE G+A++L + + DAS EYGMN+ S+ IS RT+ D R+
Sbjct: 70 LGNFEPKDLKPRTGPGENGEAHNLSKDKKNKADASEMEYGMNIACSDEISMHRTVKDTRL 129
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
EEC++WDYP DLPK SVI+VFHNEGFS LMRTVHS+I R+P+ L EIILVDDFS K +L
Sbjct: 130 EECRHWDYPYDLPKTSVIIVFHNEGFSVLMRTVHSVIDRSPSHILHEIILVDDFSDKENL 189
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
+L+ Y+++F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWL PLL
Sbjct: 190 GNQLDKYVEQFKGLVKVIRNKEREGLIRTRSRGATEATGEVIVFLDAHCEVNLNWLAPLL 249
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
APIY DR +MTVP+IDGID++ +E+R VY + H+RGIFEWGMLYKENE+P RE ++R +
Sbjct: 250 APIYRDRTVMTVPIIDGIDHKNFEYRPVYGTETHFRGIFEWGMLYKENEVPRREQRRRSH 309
Query: 437 N 437
N
Sbjct: 310 N 310
Score = 301 bits (771), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 162/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +L+ Y+E+F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV LNWL P
Sbjct: 188 NLGNQLDKYVEQFKGLVKVIRNKEREGLIRTRSRGATEATGEVIVFLDAHCEVNLNWLAP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY + H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTETHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 SHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|194892500|ref|XP_001977673.1| GG18114 [Drosophila erecta]
gi|190649322|gb|EDV46600.1| GG18114 [Drosophila erecta]
Length = 591
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 154/250 (61%), Positives = 190/250 (76%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 428 EREAKKRKYN 437
RE ++R +N
Sbjct: 301 RREQRRRAHN 310
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 163/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV NWLPP
Sbjct: 188 NLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|24643052|ref|NP_573301.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|24643054|ref|NP_728178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|51316019|sp|Q8MV48.2|GALT7_DROME RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 7;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7;
AltName: Full=dGalNAc-T2
gi|7293476|gb|AAF48851.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform A
[Drosophila melanogaster]
gi|22832507|gb|AAN09470.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2, isoform B
[Drosophila melanogaster]
gi|34043004|gb|AAQ56704.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
gi|54650858|gb|AAV37008.1| LD01328p [Drosophila melanogaster]
gi|220950352|gb|ACL87719.1| GalNAc-T2-PA [synthetic construct]
Length = 591
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 154/250 (61%), Positives = 190/250 (76%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 428 EREAKKRKYN 437
RE ++R +N
Sbjct: 301 RREQRRRAHN 310
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 163/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV NWLPP
Sbjct: 188 NLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|307212076|gb|EFN87959.1| N-acetylgalactosaminyltransferase 7 [Harpegnathos saltator]
Length = 563
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 152/241 (63%), Positives = 192/241 (79%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN EP +K GPGEGGK + L E + S +YGMNM S+ IS R+IPD R
Sbjct: 42 LGNFEPRDTSFKAGPGEGGKPHILREDQQNDVQQSESDYGMNMVCSDEISMSRSIPDTRP 101
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+HS+I RTP+++LEE++LVDDFS K +L
Sbjct: 102 AECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIHSVINRTPSKFLEEVLLVDDFSDKENL 161
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPLL
Sbjct: 162 KDDLDSYIEQWGGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPLL 221
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
API +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R +
Sbjct: 222 APIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTRSH 281
Query: 437 N 437
+
Sbjct: 282 D 282
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 161/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L L+ YIE++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 160 NLKDDLDSYIEQWGGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPP 219
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R
Sbjct: 220 LLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTR 279
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++S PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGG+I WVPC
Sbjct: 280 SHDSMPYKSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGTILWVPC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVYR 347
>gi|380013105|ref|XP_003690610.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
7-like [Apis florea]
Length = 581
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 157/242 (64%), Positives = 190/242 (78%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+LGN EP + GPGE GK + L + + S +YGMNM S+ IS DR IPD R
Sbjct: 59 ELGNFEPKRISMRNGPGEKGKPHILRDDQQNDVQQSEIDYGMNMVCSDEISLDRLIPDTR 118
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
M ECK+W+YP LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEMLPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVDDFSDKDN 178
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L +LE YI+R+ KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGELESYIERWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPPL 238
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LAPI +DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 239 LAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKIRP 298
Query: 436 YN 437
YN
Sbjct: 299 YN 300
Score = 297 bits (761), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 139/188 (73%), Positives = 161/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +LE YIER+ KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 178 NLKGELESYIERWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPP 237
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI +DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 238 LLAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKIR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 298 PYNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPC 357
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 358 SHVGHVYR 365
>gi|195345467|ref|XP_002039290.1| GM22807 [Drosophila sechellia]
gi|194134516|gb|EDW56032.1| GM22807 [Drosophila sechellia]
Length = 591
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 153/250 (61%), Positives = 190/250 (76%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DDFS K +L +L++Y+ +F G V++IRN +REGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKQREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 428 EREAKKRKYN 437
RE ++R +N
Sbjct: 301 RREQRRRAHN 310
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 135/188 (71%), Positives = 163/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +L++Y+ +F G V++IRN +REGLIRTRSRGA E+ GEVIVFLDAHCEV NWLPP
Sbjct: 188 NLRSQLDEYVLQFKGLVKVIRNKQREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|156537099|ref|XP_001602659.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Nasonia
vitripennis]
Length = 583
Score = 321 bits (823), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 154/241 (63%), Positives = 194/241 (80%), Gaps = 1/241 (0%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN EP ++ ++ GPGE GK + L + + S YGMN+ S+ IS DR++PD R
Sbjct: 63 LGNFEPEIQ-HRTGPGEEGKPHILRDDQQNDVQESETAYGMNIVCSDEISLDRSVPDTRP 121
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
+ECK+W+Y +LPK SVI+VFHNEG+S LMRTVHS++ RTP QYLEEI+LVDDFS K +L
Sbjct: 122 DECKHWNYSKNLPKTSVIIVFHNEGWSVLMRTVHSVLNRTPPQYLEEILLVDDFSDKENL 181
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
+LE YI+++ KVRL+RN EREGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPLL
Sbjct: 182 KGELESYIEQWGPKVRLLRNKEREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPLL 241
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
+PI D K+MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP+REAK RK+
Sbjct: 242 SPIAEDNKVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPQREAKTRKH 301
Query: 437 N 437
N
Sbjct: 302 N 302
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 139/188 (73%), Positives = 164/188 (87%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +LE YIE++ KVRL+RN EREGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 180 NLKGELESYIEQWGPKVRLLRNKEREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+PI D K+MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP+REAK R
Sbjct: 240 LLSPIAEDNKVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPQREAKTR 299
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+NSEPY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 300 KHNSEPYRSPTHAGGLFAINREYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPC 359
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 360 SHVGHVYR 367
>gi|328781461|ref|XP_395266.4| PREDICTED: n-acetylgalactosaminyltransferase 7 [Apis mellifera]
Length = 581
Score = 321 bits (822), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 155/242 (64%), Positives = 190/242 (78%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+LGN EP + GPGE GK + L + + S +YGMN+ S+ IS DR IPD R
Sbjct: 59 ELGNFEPKHISMRNGPGEKGKPHILRDDQQNDVQQSEIDYGMNIVCSDEISLDRLIPDTR 118
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
M ECK+W+YP LP+ SVI+VFHNEG+S LMRTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEILPRTSVIIVFHNEGWSVLMRTVHSVINRTPPQFLEEILLVDDFSDKDN 178
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L +LE YI+++ KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGELESYIEQWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPPL 238
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LAPI +DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 239 LAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRS 298
Query: 436 YN 437
YN
Sbjct: 299 YN 300
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 138/188 (73%), Positives = 161/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +LE YIE++ KV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 178 NLKGELESYIEQWGDKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNINWLPP 237
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI +DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 238 LLAPIAADRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 298 SYNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPC 357
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 358 SHVGHVYR 365
>gi|307169192|gb|EFN62008.1| N-acetylgalactosaminyltransferase 7 [Camponotus floridanus]
Length = 580
Score = 321 bits (822), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 156/249 (62%), Positives = 196/249 (78%), Gaps = 2/249 (0%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
PVF +G LGN EP P + GPGE GK + L + S +YGMNM S+ IS
Sbjct: 53 PVF-VEG-LGNYEPRDVPVRSGPGENGKPHILRDDQLNDVQQSESDYGMNMVCSDEISLS 110
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R+IPD R ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+HS+I RTP+++LEEI+LVD
Sbjct: 111 RSIPDTRPAECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIHSVINRTPSKFLEEILLVD 170
Query: 309 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
DFS K +L L+ YI+++NGKVRL+RN ER+GLIRTRSRGA++++GEVIVFLDAHCEV
Sbjct: 171 DFSDKENLKGDLDSYIEQWNGKVRLLRNYERQGLIRTRSRGARDAKGEVIVFLDAHCEVN 230
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
+NWLPPLLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP
Sbjct: 231 VNWLPPLLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPR 290
Query: 429 REAKKRKYN 437
REAK R Y+
Sbjct: 291 REAKTRAYD 299
Score = 298 bits (763), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 137/188 (72%), Positives = 162/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L L+ YIE++NGKVRL+RN ER+GLIRTRSRGA++++GEVIVFLDAHCEV +NWLPP
Sbjct: 177 NLKGDLDSYIEQWNGKVRLLRNYERQGLIRTRSRGARDAKGEVIVFLDAHCEVNVNWLPP 236
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R
Sbjct: 237 LLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTR 296
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
Y+S PY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 297 AYDSMPYRSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPC 356
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 357 SHVGHVYR 364
>gi|21552985|gb|AAM62412.1|AF493067_1 UDP-N-acetylgalactosamine: polypeptide
N-acetylgalactosaminyltransferase 2 [Drosophila
melanogaster]
Length = 591
Score = 320 bits (820), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 153/250 (61%), Positives = 189/250 (75%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + G GE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGSGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEV 240
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P
Sbjct: 241 NTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVP 300
Query: 428 EREAKKRKYN 437
RE ++R +N
Sbjct: 301 RREQRRRAHN 310
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 163/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV NWLPP
Sbjct: 188 NLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 248 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 308 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHVYR 375
>gi|350400167|ref|XP_003485756.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Bombus
impatiens]
Length = 582
Score = 320 bits (820), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 156/242 (64%), Positives = 189/242 (78%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+LGN E + GPGEGGK Y L + + S +YGMNM S+ IS DR+I D R
Sbjct: 60 ELGNFELKHVSIRSGPGEGGKPYILRDDQQNDVQQSEIDYGMNMVCSDEISLDRSILDTR 119
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
M ECK+W+YP LP+ SVI+VFHNEG+S L+RTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 120 MPECKHWNYPEVLPRTSVIIVFHNEGWSVLLRTVHSVINRTPPQFLEEILLVDDFSDKDN 179
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L L+ YI+R+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 180 LKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPL 239
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 240 LAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRP 299
Query: 436 YN 437
YN
Sbjct: 300 YN 301
Score = 297 bits (760), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 138/188 (73%), Positives = 160/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L L+ YIER+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 179 NLKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPP 238
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 239 LLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSR 298
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGG+I WVPC
Sbjct: 299 PYNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGNILWVPC 358
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 359 SHVGHVYR 366
>gi|340718182|ref|XP_003397550.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Bombus
terrestris]
Length = 581
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 156/242 (64%), Positives = 189/242 (78%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+LGN E + GPGEGGK Y L + + S +YGMNM S+ IS DR+I D R
Sbjct: 59 ELGNFELKHVSIRSGPGEGGKPYILRDDQQNDVQQSEIDYGMNMVCSDEISLDRSILDTR 118
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
M ECK+W+YP LP+ SVI+VFHNEG+S L+RTVHS+I RTP Q+LEEI+LVDDFS K +
Sbjct: 119 MPECKHWNYPEVLPRTSVIIVFHNEGWSVLLRTVHSVINRTPPQFLEEILLVDDFSDKDN 178
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L L+ YI+R+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPPL
Sbjct: 179 LKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPPL 238
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 239 LAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSRP 298
Query: 436 YN 437
YN
Sbjct: 299 YN 300
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 139/188 (73%), Positives = 160/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L L+ YIER+ GKV+LIRN +REGLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 178 NLKGDLDSYIERWEGKVKLIRNDKREGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPP 237
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI DR +MTVP+IDGID++T+E+R VY+ H YRGIFEWGMLYKENELP RE K R
Sbjct: 238 LLAPIAVDRTVMTVPIIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPAREKKSR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
YNS PYKSPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 298 PYNSMPYKSPTHAGGLFAINREYFLSLGGYDDGLLVWGGENFELSFKIWQCGGSILWVPC 357
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 358 SHVGHVYR 365
>gi|332023194|gb|EGI63450.1| N-acetylgalactosaminyltransferase 7 [Acromyrmex echinatior]
Length = 614
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 150/241 (62%), Positives = 189/241 (78%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
LGN E P + GPGEGGK + L + S +YGMNM S+ IS R+IPD R+
Sbjct: 93 LGNYELRDVPVRSGPGEGGKPHILKDDQLNDVQQSESDYGMNMVCSDEISLSRSIPDTRL 152
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
+CK+W+YP +LP+ SVI+VFHNEG+S L+RT+ S+I RTP++ LEEI+LVDDFS K +L
Sbjct: 153 AQCKHWNYPEELPRTSVIIVFHNEGWSVLLRTIQSVIDRTPSKLLEEILLVDDFSDKENL 212
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E+RGEVIVFLDAHCEV +NWLPPLL
Sbjct: 213 KSDLDSYIEQWGGKVRLLRNHERQGLIRTRSRGAREARGEVIVFLDAHCEVNVNWLPPLL 272
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
API +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R +
Sbjct: 273 APIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTRAH 332
Query: 437 N 437
+
Sbjct: 333 D 333
Score = 297 bits (761), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 137/188 (72%), Positives = 160/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L L+ YIE++ GKVRL+RN ER+GLIRTRSRGA+E+RGEVIVFLDAHCEV +NWLPP
Sbjct: 211 NLKSDLDSYIEQWGGKVRLLRNHERQGLIRTRSRGAREARGEVIVFLDAHCEVNVNWLPP 270
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R
Sbjct: 271 LLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTR 330
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++S PY+SPTHAGGLFA+ R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 331 AHDSMPYRSPTHAGGLFAISRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPC 390
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 391 SHVGHVYR 398
>gi|322798640|gb|EFZ20244.1| hypothetical protein SINV_10970 [Solenopsis invicta]
Length = 580
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 155/249 (62%), Positives = 194/249 (77%), Gaps = 2/249 (0%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
PVF +G LGN EP P + GPGEGGK + L + S +YGMNM S+ IS
Sbjct: 53 PVF-VEG-LGNYEPRDIPVRSGPGEGGKPHILRDDQLNDVQQSESDYGMNMVCSDEISLS 110
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R IPD R ECK+W+YP +LP+ SVI+VFHNEG+S L+RT+ S+I RTP+++LEEI+LVD
Sbjct: 111 RAIPDTRPAECKHWNYPEELPRTSVIIVFHNEGWSVLLRTIQSVIDRTPSKFLEEILLVD 170
Query: 309 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
DFS K +L L+ YI+++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV
Sbjct: 171 DFSDKENLKGDLDSYIEQWEGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVN 230
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
+NWLPPLLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP
Sbjct: 231 VNWLPPLLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPR 290
Query: 429 REAKKRKYN 437
REAK R ++
Sbjct: 291 REAKTRAHD 299
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 161/188 (85%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L L+ YIE++ GKVRL+RN ER+GLIRTRSRGA+E++GEVIVFLDAHCEV +NWLPP
Sbjct: 177 NLKGDLDSYIEQWEGKVRLLRNYERQGLIRTRSRGAREAKGEVIVFLDAHCEVNVNWLPP 236
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI +R +MTVPVIDGID++T+E+R VY+ H YRGIFEWGMLYKENELP REAK R
Sbjct: 237 LLAPIAENRNVMTVPVIDGIDHKTFEYRPVYQEGHLYRGIFEWGMLYKENELPRREAKTR 296
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++S PY+SPTHAGGLFA++R +FL LGGYD GLLVWGGENFELSFKIW CGGSI WVPC
Sbjct: 297 AHDSMPYRSPTHAGGLFAINRQYFLSLGGYDEGLLVWGGENFELSFKIWQCGGSILWVPC 356
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 357 SHVGHVYR 364
>gi|443298648|gb|AGC81884.1| N-acetylgalactosaminyltransferase, partial [Bombyx mori]
Length = 499
Score = 310 bits (795), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 142/203 (69%), Positives = 173/203 (85%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
+YGMN+ SN I+ +R+IPD R++ECKYW YP DL K SVI+VFHNEGFS LMRTVHS+I
Sbjct: 15 KYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEDLAKTSVIIVFHNEGFSVLMRTVHSVI 74
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
RTPAQ+L E++LVDDFS K DL + L++YI+R+NGKVRL+RN +REGLIRTRSRGA+E+
Sbjct: 75 NRTPAQFLHEVVLVDDFSDKDDLKENLDNYIKRWNGKVRLVRNVQREGLIRTRSRGAQEA 134
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
G+VIVFLDAHCEV +NWLPPLLAPIY D + MTVPVIDGIDY T+E+R VY+ +YRG
Sbjct: 135 TGDVIVFLDAHCEVNVNWLPPLLAPIYRDYRTMTVPVIDGIDYNTFEYRPVYQHGTNYRG 194
Query: 414 IFEWGMLYKENELPEREAKKRKY 436
IFEWGMLYKENE+P+REA K+
Sbjct: 195 IFEWGMLYKENEVPDREAHLHKH 217
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 140/188 (74%), Positives = 165/188 (87%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL + L++YI+R+NGKVRL+RN +REGLIRTRSRGA+E+ G+VIVFLDAHCEV +NWLPP
Sbjct: 96 DLKENLDNYIKRWNGKVRLVRNVQREGLIRTRSRGAQEATGDVIVFLDAHCEVNVNWLPP 155
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY D + MTVPVIDGIDY T+E+R VY+ +YRGIFEWGMLYKENE+P+REA
Sbjct: 156 LLAPIYRDYRTMTVPVIDGIDYNTFEYRPVYQHGTNYRGIFEWGMLYKENEVPDREAHLH 215
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ SEPYKSPTHAGGLFA++R +FLE+G YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 216 KHKSEPYKSPTHAGGLFAINRRYFLEIGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 275
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 276 SRVGHVYR 283
>gi|357602062|gb|EHJ63261.1| putative n-acetylgalactosaminyltransferase [Danaus plexippus]
Length = 499
Score = 304 bits (779), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 140/188 (74%), Positives = 165/188 (87%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL + L++Y++R+ GKVR+IRN+ER+GLIRTRSRGA E+ GEVIVFLDAHCEV +NWLPP
Sbjct: 96 DLKENLDNYVKRWKGKVRIIRNSERQGLIRTRSRGAMEATGEVIVFLDAHCEVNVNWLPP 155
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY D KIMTVPVIDGID++T+E+R VY +YRGIFEWGMLYKENE+P+REA
Sbjct: 156 LLAPIYRDYKIMTVPVIDGIDHKTFEYRPVYSHGINYRGIFEWGMLYKENEVPDREASLH 215
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ SEPYKSPTHAGGLFA++R +FLE+G YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 216 KHKSEPYKSPTHAGGLFAINRNYFLEIGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 275
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 276 SRVGHVYR 283
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 138/206 (66%), Positives = 172/206 (83%)
Query: 231 SLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVH 290
S EYGMN+ SN I+ +R+IPD R++ECKYW YP +LP SVI+VFHNEGFS LMRTVH
Sbjct: 12 SESEYGMNIAASNDIAMNRSIPDTRLDECKYWHYPEELPSTSVIIVFHNEGFSVLMRTVH 71
Query: 291 SIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGA 350
++I R+P L+E+++VDDFS K DL + L++Y++R+ GKVR+IRN+ER+GLIRTRSRGA
Sbjct: 72 TVIDRSPPNILKEVVMVDDFSDKDDLKENLDNYVKRWKGKVRIIRNSERQGLIRTRSRGA 131
Query: 351 KESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH 410
E+ GEVIVFLDAHCEV +NWLPPLLAPIY D KIMTVPVIDGID++T+E+R VY +
Sbjct: 132 MEATGEVIVFLDAHCEVNVNWLPPLLAPIYRDYKIMTVPVIDGIDHKTFEYRPVYSHGIN 191
Query: 411 YRGIFEWGMLYKENELPEREAKKRKY 436
YRGIFEWGMLYKENE+P+REA K+
Sbjct: 192 YRGIFEWGMLYKENEVPDREASLHKH 217
>gi|16198165|gb|AAL13889.1| LD36616p [Drosophila melanogaster]
Length = 486
Score = 301 bits (771), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 136/188 (72%), Positives = 163/188 (86%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+ GEVIVFLDAHCEV NWLPP
Sbjct: 83 NLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPP 142
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RGIFEWGMLYKENE+P RE ++R
Sbjct: 143 LLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRR 202
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+NSEPY+SPTHAGGLFA++R +FLELG YDPGLLVWGGENFELSFKIW CGGSIEWVPC
Sbjct: 203 AHNSEPYRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPC 262
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 263 SRVGHVYR 270
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 136/204 (66%), Positives = 168/204 (82%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYGMN+ S+ IS R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I
Sbjct: 2 EYGMNIACSDEISMHRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVI 61
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
R+P L EIILVDDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+
Sbjct: 62 DRSPTHMLHEIILVDDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEA 121
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
GEVIVFLDAHCEV NWLPPLLAPIY DR +MTVP+IDGID++ +E+R VY D+H+RG
Sbjct: 122 TGEVIVFLDAHCEVNTNWLPPLLAPIYRDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRG 181
Query: 414 IFEWGMLYKENELPEREAKKRKYN 437
IFEWGMLYKENE+P RE ++R +N
Sbjct: 182 IFEWGMLYKENEVPRREQRRRAHN 205
>gi|391336074|ref|XP_003742408.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Metaseiulus
occidentalis]
Length = 593
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 154/249 (61%), Positives = 188/249 (75%), Gaps = 7/249 (2%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISF 247
PVF+ D LGN E + P K GPGEGG AY + + R+ L +YGMNM SN IS
Sbjct: 70 PVFR-DDVLGNFEMSM-PKKVGPGEGGAAYVI--SGRSVEQQKLKNQYGMNMVVSNEISP 125
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
+RTIPDLR++ECKYW YP DLP SVI+VFHNEG S LMRTVHS+I R+P Q+L E++LV
Sbjct: 126 NRTIPDLRLDECKYWHYPEDLPGTSVIVVFHNEGLSVLMRTVHSVINRSPRQFLHEVVLV 185
Query: 308 DDFSSKADLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 365
DDFS K +L ++LE+YI R G VRL+RN R+GLIR+RS GA+ + GEVI+FLDAHC
Sbjct: 186 DDFSDKLNLREELENYIARNFPKGLVRLVRNKSRQGLIRSRSYGAEVATGEVILFLDAHC 245
Query: 366 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 425
EVG NWLPPLLAPI ++ K MTVPVIDGID++ +E+R VY H+RGIFEWGMLYKE E
Sbjct: 246 EVGANWLPPLLAPIKANPKTMTVPVIDGIDHENFEYRPVYHGKQHFRGIFEWGMLYKEIE 305
Query: 426 LPEREAKKR 434
+PE E K+R
Sbjct: 306 IPEEEVKRR 314
Score = 277 bits (708), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 131/190 (68%), Positives = 156/190 (82%), Gaps = 2/190 (1%)
Query: 1 DLDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
+L ++LE+YI R G VRL+RN R+GLIR+RS GA+ + GEVI+FLDAHCEVG NWL
Sbjct: 193 NLREELENYIARNFPKGLVRLVRNKSRQGLIRSRSYGAEVATGEVILFLDAHCEVGANWL 252
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PPLLAPI ++ K MTVPVIDGID++ +E+R VY H+RGIFEWGMLYKE E+PE E K
Sbjct: 253 PPLLAPIKANPKTMTVPVIDGIDHENFEYRPVYHGKQHFRGIFEWGMLYKEIEIPEEEVK 312
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+R +SEPYKSPTHAGGLFAM+R +FLELGGYDPGLLVWGGENFELSFK+W CGG I WV
Sbjct: 313 RRTKHSEPYKSPTHAGGLFAMNREYFLELGGYDPGLLVWGGENFELSFKLWQCGGQILWV 372
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 373 PCSRVGHVYR 382
>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
Length = 640
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/254 (58%), Positives = 191/254 (75%), Gaps = 12/254 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNH 244
VFK G+LGN EP + + G GE G+ ++ PE RA + E+G N S+
Sbjct: 115 VFKK-GELGNFEPKEKQSRPGKHGEMGEPVNVDLNQPEVQRA-----MNEFGFNTFVSDM 168
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS +R++PD+RM+ECKYW YP DLP ASV++VFHNEG+S L+RTVHS+I R+P L+EI
Sbjct: 169 ISLNRSVPDVRMDECKYWHYPEDLPTASVVIVFHNEGWSPLLRTVHSVILRSPPNLLKEI 228
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
+LVDDFS K L +L+ YI++FNGKVRL+RN EREGLIRTRS GA+ + G+V++FLDAH
Sbjct: 229 VLVDDFSDKEHLKDRLDRYIEQFNGKVRLVRNNEREGLIRTRSIGAQHAVGDVVIFLDAH 288
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKE 423
CEV +NWLPPLLAPI +RK+MTVPVIDGID TW +R VY D H+RGIFEWG+LYKE
Sbjct: 289 CEVNINWLPPLLAPIRRNRKVMTVPVIDGIDMHTWSYRRVYGSADRHFRGIFEWGLLYKE 348
Query: 424 NELPEREAKKRKYN 437
E+ + EA++RKYN
Sbjct: 349 TEITKEEARRRKYN 362
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 126/188 (67%), Positives = 155/188 (82%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ YIE+FNGKVRL+RN EREGLIRTRS GA+ + G+V++FLDAHCEV +NWLPPL
Sbjct: 240 LKDRLDRYIEQFNGKVRLVRNNEREGLIRTRSIGAQHAVGDVVIFLDAHCEVNINWLPPL 299
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID TW +R VY D H+RGIFEWG+LYKE E+ + EA++R
Sbjct: 300 LAPIRRNRKVMTVPVIDGIDMHTWSYRRVYGSADRHFRGIFEWGLLYKETEITKEEARRR 359
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
KYNSEP++SPTHAGGLFA+D+ +F ELG YDPGL +WGGE +ELSFKIW CGG I +VPC
Sbjct: 360 KYNSEPFRSPTHAGGLFAIDKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFVPC 419
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 420 SHVGHVYR 427
>gi|241651003|ref|XP_002411252.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215503882|gb|EEC13376.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 478
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 150/245 (61%), Positives = 186/245 (75%), Gaps = 4/245 (1%)
Query: 197 LGNLEPPLEPY--KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 254
LGN EP + K PGEGG YH P + S EYGMN+ S+HIS +RTIPD+
Sbjct: 1 LGNFEPAVADVVDKRKPGEGGFPYHTPPKLKNNVAHSNMEYGMNVVASDHISPNRTIPDM 60
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R++ECKYWDYP DLP SV++VFHNEG S LMRTVHS+I R+P Q+L+E++LVDD+S K
Sbjct: 61 RLQECKYWDYPTDLPTTSVVVVFHNEGLSVLMRTVHSVINRSPRQFLKEVVLVDDYSDKE 120
Query: 315 DLDQKLEDYIQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
+L +LE YI R G VRL+RN ER+GLIR+RS GA++S G+V++FLDAHCEVG+NWL
Sbjct: 121 NLKGELETYIARNFPVGLVRLLRNEERQGLIRSRSYGAEQSVGDVVLFLDAHCEVGINWL 180
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
PPLLAPI ++R MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+PE E K
Sbjct: 181 PPLLAPIRANRYTMTVPVIDGIDKDTFEYRPVYHGGQHFRGIFEWGMLYKEIEIPEEEIK 240
Query: 433 KRKYN 437
+RKY+
Sbjct: 241 RRKYH 245
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 135/190 (71%), Positives = 159/190 (83%), Gaps = 2/190 (1%)
Query: 1 DLDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
+L +LE YI R G VRL+RN ER+GLIR+RS GA++S G+V++FLDAHCEVG+NWL
Sbjct: 121 NLKGELETYIARNFPVGLVRLLRNEERQGLIRSRSYGAEQSVGDVVLFLDAHCEVGINWL 180
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PPLLAPI ++R MTVPVIDGID T+E+R VY H+RGIFEWGMLYKE E+PE E K
Sbjct: 181 PPLLAPIRANRYTMTVPVIDGIDKDTFEYRPVYHGGQHFRGIFEWGMLYKEIEIPEEEIK 240
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+RKY+SEPYKSPTHAGGLFA+DR +FL+LGGYDPGLLVWGGENFELSFKIW CGGSI WV
Sbjct: 241 RRKYHSEPYKSPTHAGGLFAIDRKYFLKLGGYDPGLLVWGGENFELSFKIWQCGGSIYWV 300
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 301 PCSRVGHVYR 310
>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
Length = 562
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 147/254 (57%), Positives = 190/254 (74%), Gaps = 12/254 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGP-GEGGKAY----HLPEAYRAAGDASLGEYGMNMETSNH 244
+FK D ++GN EP ++ G GE GK +LPE +A + EYG N S+
Sbjct: 39 IFKMD-EIGNFEPKEIQWQPGNYGEMGKPVFVDKNLPEVKKA-----MREYGFNTYVSDM 92
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS +R+IPD+R++ECKYW YP DLP ASV++ FHNEG++ L+RTVHS++ R+P+Q ++EI
Sbjct: 93 ISLNRSIPDVRLDECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEI 152
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
ILVDDFS K L +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAH
Sbjct: 153 ILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAH 212
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKE 423
CEV +NWLPPLLAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE
Sbjct: 213 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKE 272
Query: 424 NELPEREAKKRKYN 437
E+P +E +RK++
Sbjct: 273 TEIPAQELLRRKHS 286
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 123/188 (65%), Positives = 152/188 (80%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAHCEV +NWLPPL
Sbjct: 164 LKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAHCEVNINWLPPL 223
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE E+P +E +R
Sbjct: 224 LAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKETEIPAQELLRR 283
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K++SEP++SPTHAGGLFA+ + +F ELG YDPGL +WGGE +ELSFKIW CGG I ++PC
Sbjct: 284 KHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFIPC 343
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 344 SHVGHVYR 351
>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
Length = 560
Score = 291 bits (745), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 147/254 (57%), Positives = 190/254 (74%), Gaps = 12/254 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGP-GEGGKAY----HLPEAYRAAGDASLGEYGMNMETSNH 244
+FK D ++GN EP ++ G GE GK +LPE +A + EYG N S+
Sbjct: 37 IFKMD-EIGNFEPKEIQWQPGNYGEMGKPVFVDKNLPEVKKA-----MREYGFNTYVSDM 90
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS +R+IPD+R++ECKYW YP DLP ASV++ FHNEG++ L+RTVHS++ R+P+Q ++EI
Sbjct: 91 ISLNRSIPDVRLDECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEI 150
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
ILVDDFS K L +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAH
Sbjct: 151 ILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAH 210
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKE 423
CEV +NWLPPLLAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE
Sbjct: 211 CEVNINWLPPLLAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKE 270
Query: 424 NELPEREAKKRKYN 437
E+P +E +RK++
Sbjct: 271 TEIPAQELLRRKHS 284
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 123/188 (65%), Positives = 152/188 (80%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+++F GKV+LIRN EREGLIRTRS GAKE+ G+V+VFLDAHCEV +NWLPPL
Sbjct: 162 LKDRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAHCEVNINWLPPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP-DHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE E+P +E +R
Sbjct: 222 LAPIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYKETEIPAQELLRR 281
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K++SEP++SPTHAGGLFA+ + +F ELG YDPGL +WGGE +ELSFKIW CGG I ++PC
Sbjct: 282 KHSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFIPC 341
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 342 SHVGHVYR 349
>gi|308506779|ref|XP_003115572.1| CRE-GLY-7 protein [Caenorhabditis remanei]
gi|308256107|gb|EFP00060.1| CRE-GLY-7 protein [Caenorhabditis remanei]
Length = 601
Score = 286 bits (733), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 141/248 (56%), Positives = 184/248 (74%), Gaps = 5/248 (2%)
Query: 194 DGKLGNLEP--PLEPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRT 250
DG+LGN EP P P + PGE G+ + E AAG A+ E+G N S+ IS +RT
Sbjct: 79 DGELGNYEPKTPEIPSNQ-PGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDLISMNRT 137
Query: 251 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
IPD+R +ECK+WDYP +LP SV++VFHNEG++ L+RTVHS++ R+P + +E I++VDD
Sbjct: 138 IPDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIESIVMVDDD 197
Query: 311 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 370
S K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV N
Sbjct: 198 SDKPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTN 257
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPER 429
WLPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ER
Sbjct: 258 WLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITER 317
Query: 430 EAKKRKYN 437
E+ RK+N
Sbjct: 318 ESAHRKHN 325
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 120/188 (63%), Positives = 150/188 (79%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NWLPPL
Sbjct: 203 LKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWLPPL 262
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE+ R
Sbjct: 263 LAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERESAHR 322
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+NS+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I +VPC
Sbjct: 323 KHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVFVPC 382
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 383 SHVGHVYR 390
>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 645
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 139/250 (55%), Positives = 186/250 (74%), Gaps = 4/250 (1%)
Query: 190 VFKADGKLGNLEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
+FK D ++GN EP + G GE G+ + + +A + EYG N S+ IS +
Sbjct: 122 IFKLD-EIGNFEPKETQLQPGDYGEMGEPVLIDKTLTEVKEA-MREYGFNTYVSDMISLN 179
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R+IPD+RM+ECKYW YP DLP AS+++ FHNEG++ L+RTVHS++ R+P ++EII+VD
Sbjct: 180 RSIPDVRMDECKYWHYPEDLPTASIVIAFHNEGWTPLLRTVHSVLLRSPPHLIKEIIMVD 239
Query: 309 DFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
DFS K L +L+ Y+++F+GKV+L+RN+EREGLIRTRS GAKE+ G+V++FLDAHCEV
Sbjct: 240 DFSDKEHLKDRLDVYLKQFDGKVKLVRNSEREGLIRTRSIGAKEAVGDVVIFLDAHCEVN 299
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELP 427
+NWLPPLLAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE EL
Sbjct: 300 VNWLPPLLAPIRQNRKVMTVPVIDGIDKNDWSYRIVYGSADKHYRGIFEWGLLYKETELS 359
Query: 428 EREAKKRKYN 437
+E +RK+N
Sbjct: 360 SQELLRRKHN 369
Score = 266 bits (681), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 122/188 (64%), Positives = 154/188 (81%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+++F+GKV+L+RN+EREGLIRTRS GAKE+ G+V++FLDAHCEV +NWLPPL
Sbjct: 247 LKDRLDVYLKQFDGKVKLVRNSEREGLIRTRSIGAKEAVGDVVIFLDAHCEVNVNWLPPL 306
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID W +R VY D HYRGIFEWG+LYKE EL +E +R
Sbjct: 307 LAPIRQNRKVMTVPVIDGIDKNDWSYRIVYGSADKHYRGIFEWGLLYKETELSSQELLRR 366
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+NSEP++SPTHAGGLFA+++ +F ELG YDPGL +WGGE +ELSFKIW CGG I +VPC
Sbjct: 367 KHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFVPC 426
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 427 SHVGHVYR 434
>gi|268555252|ref|XP_002635614.1| C. briggsae CBR-GLY-7 protein [Caenorhabditis briggsae]
Length = 601
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 139/247 (56%), Positives = 182/247 (73%), Gaps = 3/247 (1%)
Query: 194 DGKLGNLEP-PLEPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTI 251
DG+LGN EP E PGE G+ + E AAG A+ E+G N S+ IS +RTI
Sbjct: 79 DGELGNYEPKTAEIPSNQPGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTI 138
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
PD+R +ECK+WDYP +LP SV++VFHNEG++ L+RTVHS++ R+P + +E+I++VDD S
Sbjct: 139 PDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIEQIVMVDDDS 198
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NW
Sbjct: 199 DKPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNW 258
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPERE 430
LPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 259 LPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERE 318
Query: 431 AKKRKYN 437
+ RK+
Sbjct: 319 SGHRKHT 325
Score = 261 bits (666), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 119/188 (63%), Positives = 149/188 (79%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NWLPPL
Sbjct: 203 LKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWLPPL 262
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE+ R
Sbjct: 263 LAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERESGHR 322
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ S+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I +VPC
Sbjct: 323 KHTSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVFVPC 382
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 383 SHVGHVYR 390
>gi|341881851|gb|EGT37786.1| hypothetical protein CAEBREN_30257 [Caenorhabditis brenneri]
gi|341887866|gb|EGT43801.1| CBN-GLY-7 protein [Caenorhabditis brenneri]
Length = 601
Score = 284 bits (726), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 138/247 (55%), Positives = 183/247 (74%), Gaps = 3/247 (1%)
Query: 194 DGKLGNLEPPL-EPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTI 251
+G+LGN EP + E PGE G+ + E AAG A+ E+G N S+ IS +RTI
Sbjct: 79 EGELGNYEPKIPEVPSNQPGEHGRPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTI 138
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
PD+R +ECK+WDYP +LP SV++VFHNEG++ L+RTVHS++ R+P + +E+I++VDD S
Sbjct: 139 PDIRPKECKHWDYPENLPTVSVVIVFHNEGWTPLLRTVHSVLLRSPPELIEQIVMVDDDS 198
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NW
Sbjct: 199 DKQHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNW 258
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPERE 430
LPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 259 LPPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERE 318
Query: 431 AKKRKYN 437
RK++
Sbjct: 319 TAHRKHS 325
Score = 260 bits (665), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 119/188 (63%), Positives = 149/188 (79%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NWLPPL
Sbjct: 203 LKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWLPPL 262
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE R
Sbjct: 263 LAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERETAHR 322
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K++S+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I +VPC
Sbjct: 323 KHSSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVFVPC 382
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 383 SHVGHVYR 390
>gi|17561826|ref|NP_503512.1| Protein GLY-7 [Caenorhabditis elegans]
gi|51315810|sp|O61397.1|GALT7_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 7;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7; Short=pp-GaNTase 7
gi|3047203|gb|AAC13677.1| GLY7 [Caenorhabditis elegans]
gi|373219860|emb|CCD70652.1| Protein GLY-7 [Caenorhabditis elegans]
Length = 601
Score = 284 bits (726), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 140/246 (56%), Positives = 181/246 (73%), Gaps = 5/246 (2%)
Query: 196 KLGNLEP--PLEPYKEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
+LGN EP P P + PGE GK + E AAG A+ E+G N S+ IS +RTIP
Sbjct: 81 ELGNYEPKEPEIPSNQ-PGEHGKPVPVTDEEGMAAGRAAEKEFGFNTYVSDMISMNRTIP 139
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
D+R EECK+WDYP LP SV++VFHNEG++ L+RTVHS++ R+P + +E++++VDD S
Sbjct: 140 DIRPEECKHWDYPEKLPTVSVVVVFHNEGWTPLLRTVHSVLLRSPPELIEQVVMVDDDSD 199
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
K L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NWL
Sbjct: 200 KPHLKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWL 259
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREA 431
PPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE
Sbjct: 260 PPLLAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERET 319
Query: 432 KKRKYN 437
RK+N
Sbjct: 320 AHRKHN 325
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 120/188 (63%), Positives = 149/188 (79%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+ Y+ RFNGKV ++R +REGLI RS GAK S GEV++FLDAHCEV NWLPPL
Sbjct: 203 LKEKLDKYVTRFNGKVIVVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWLPPL 262
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RK+MTVPVIDGID +WE+RSVY P+ H+ GIFEWG+LYKE ++ ERE R
Sbjct: 263 LAPIKRNRKVMTVPVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERETAHR 322
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+NS+P++SPTHAGGLFA++R +F ELG YD GL +WGGE +ELSFKIW CGG I +VPC
Sbjct: 323 KHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVFVPC 382
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 383 SHVGHVYR 390
>gi|449664489|ref|XP_002168298.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Hydra
magnipapillata]
Length = 599
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 137/251 (54%), Positives = 177/251 (70%), Gaps = 3/251 (1%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
PVF +D KLGN E E K GPGEGGK + L + + G YG N S+ IS D
Sbjct: 51 PVFLSDNKLGNFEK-YEDVKSGPGEGGKPHRLKPEQKEEEERLKGVYGFNQLVSDEISLD 109
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R +PD+R EECK+W YP DLP +SVI +FHNEG+S+L+R+VHS+I RTPA L EI+LVD
Sbjct: 110 RVVPDMREEECKHWSYPNDLPSSSVIFIFHNEGWSTLLRSVHSVINRTPAHLLHEIVLVD 169
Query: 309 DFSSKADLDQKLEDYIQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 366
D S L ++L++ I++ + KV+L+RN +REGLIR R+ GA + GEV+VFLDAHCE
Sbjct: 170 DKSELEHLHERLDEEIKKPYYQSKVKLVRNKQREGLIRARNIGAIAATGEVLVFLDAHCE 229
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 426
VG NWLPPL+API D +T P+IDGI++ + VY+ H RGIFEWGMLYKE +L
Sbjct: 230 VGGNWLPPLIAPIQEDPTTLTAPIIDGINWDDFSINPVYQKGSHSRGIFEWGMLYKETDL 289
Query: 427 PEREAKKRKYN 437
PE+EA+KR Y+
Sbjct: 290 PEKEARKRLYH 300
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 114/189 (60%), Positives = 146/189 (77%), Gaps = 2/189 (1%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L ++L++ I++ + KV+L+RN +REGLIR R+ GA + GEV+VFLDAHCEVG NWLP
Sbjct: 177 LHERLDEEIKKPYYQSKVKLVRNKQREGLIRARNIGAIAATGEVLVFLDAHCEVGGNWLP 236
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+API D +T P+IDGI++ + VY+ H RGIFEWGMLYKE +LPE+EA+K
Sbjct: 237 PLIAPIQEDPTTLTAPIIDGINWDDFSINPVYQKGSHSRGIFEWGMLYKETDLPEKEARK 296
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R Y+SEPY SPTHAGGLFA+ R++F ELG YDPGLL+WGGEN+ELSFK+W CGG WVP
Sbjct: 297 RLYHSEPYNSPTHAGGLFAIKRSWFKELGWYDPGLLIWGGENYELSFKLWQCGGRSLWVP 356
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 357 CSHVSHVYR 365
>gi|402586829|gb|EJW80766.1| glycosyltransferase [Wuchereria bancrofti]
Length = 409
Score = 265 bits (678), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 123/188 (65%), Positives = 153/188 (81%), Gaps = 1/188 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+++F+GKV+L+RN EREGLIRTRS GAKE+ G+V++FLDAHCEV +NWLPPL
Sbjct: 11 LKDRLDVYLKQFDGKVKLVRNAEREGLIRTRSIGAKEAVGDVVIFLDAHCEVNVNWLPPL 70
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKENELPEREAKKR 120
LAPI +RKIMTVPVIDGID W +R VY D HYRGIFEWG+LYKE EL +E +R
Sbjct: 71 LAPIRQNRKIMTVPVIDGIDKNDWSYRIVYGSVDKHYRGIFEWGLLYKETELSSQELLRR 130
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+NSEP++SPTHAGGLFA+++ +F ELG YDPGL +WGGE +ELSFKIW CGG I +VPC
Sbjct: 131 KHNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFVPC 190
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 191 SHVGHVYR 198
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 87/133 (65%), Positives = 107/133 (80%), Gaps = 1/133 (0%)
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 365
+VDDFS K L +L+ Y+++F+GKV+L+RN EREGLIRTRS GAKE+ G+V++FLDAHC
Sbjct: 1 MVDDFSDKEHLKDRLDVYLKQFDGKVKLVRNAEREGLIRTRSIGAKEAVGDVVIFLDAHC 60
Query: 366 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHHYRGIFEWGMLYKEN 424
EV +NWLPPLLAPI +RKIMTVPVIDGID W +R VY D HYRGIFEWG+LYKE
Sbjct: 61 EVNVNWLPPLLAPIRQNRKIMTVPVIDGIDKNDWSYRIVYGSVDKHYRGIFEWGLLYKET 120
Query: 425 ELPEREAKKRKYN 437
EL +E +RK+N
Sbjct: 121 ELSSQELLRRKHN 133
>gi|443700020|gb|ELT99205.1| hypothetical protein CAPTEDRAFT_172619 [Capitella teleta]
Length = 336
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 121/213 (56%), Positives = 157/213 (73%)
Query: 225 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 284
+ A D S+ E+G NM S+ IS +RTIPD RMEECKYW YP LP ASVILVFHNEG+S+
Sbjct: 4 KEAADRSIREFGFNMVASDKISMNRTIPDTRMEECKYWHYPKTLPSASVILVFHNEGWST 63
Query: 285 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 344
L+RTVHS+I +P + L EI++VDDFS K L +LEDY+++F+GKV+L RN ER GLI
Sbjct: 64 LVRTVHSVIDMSPPELLHEIVMVDDFSDKEHLKTRLEDYLKQFHGKVKLYRNKERLGLIG 123
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
TR+ GA+ + G+ IVFLDAHCE NWLPPLLA I DR I+ +PVIDGID+ + + V
Sbjct: 124 TRTLGAQYATGDAIVFLDAHCECNRNWLPPLLARIAYDRTILAIPVIDGIDFDNFRYNPV 183
Query: 405 YEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
Y +RGIFEWG LYKE+++P + +R++
Sbjct: 184 YSGRELFRGIFEWGFLYKESKVPGKTLLERQHQ 216
Score = 251 bits (640), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 114/187 (60%), Positives = 143/187 (76%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LEDY+++F+GKV+L RN ER GLI TR+ GA+ + G+ IVFLDAHCE NWLPPL
Sbjct: 95 LKTRLEDYLKQFHGKVKLYRNKERLGLIGTRTLGAQYATGDAIVFLDAHCECNRNWLPPL 154
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR I+ +PVIDGID+ + + VY +RGIFEWG LYKE+++P + +R+
Sbjct: 155 LARIAYDRTILAIPVIDGIDFDNFRYNPVYSGRELFRGIFEWGFLYKESKVPGKTLLERQ 214
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ SE YKSPTHAGGLFA+DR +F ELG YDPGL +WGGENFELSFKIW CGGS+EWVPCS
Sbjct: 215 HQSEAYKSPTHAGGLFAIDRKYFFELGAYDPGLQIWGGENFELSFKIWQCGGSVEWVPCS 274
Query: 182 RIVSLIR 188
+ + R
Sbjct: 275 HVGHVYR 281
>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 606
Score = 253 bits (647), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 172/254 (67%), Gaps = 4/254 (1%)
Query: 185 SLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 244
+++ VF+ DG G+ EP P +EGPGEGG A + +A D + EYG N S+
Sbjct: 76 AVVGAVFR-DGVRGDYEPVNLPVREGPGEGGAAVRTQPSEKAKVDRLIQEYGFNQYVSDQ 134
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DR I DLR ++CK+W YP LP SVI+VFHNEG+S+L+RTVHS+ R+P+Q L EI
Sbjct: 135 ISLDRNIADLRSQQCKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEI 194
Query: 305 ILVDDFSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
ILVDDFS+K L ++LEDY+Q RFNGK++L+RN+ REGLIRTR GA+ S G+V+++LD
Sbjct: 195 ILVDDFSTKEHLKERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLD 254
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
AHCEVG+NWLPPLL PI +R P+ID ID + D RG F+W + +K
Sbjct: 255 AHCEVGVNWLPPLLTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWK 313
Query: 423 ENELPEREAKKRKY 436
+P+ E +R++
Sbjct: 314 HLPVPQFEKSRRQH 327
Score = 224 bits (570), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 107/194 (55%), Positives = 139/194 (71%), Gaps = 3/194 (1%)
Query: 2 LDKKLEDYIE--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L ++LEDY++ RFNGK++L+RN+ REGLIRTR GA+ S G+V+++LDAHCEVG+NWLP
Sbjct: 206 LKERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLP 265
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI +R P+ID ID + D RG F+W + +K +P+ E +
Sbjct: 266 PLLTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWKHLPVPQFEKSR 324
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R++ SEPY+SP AGGLFAMDR +F ELG YD GL +WGGENFELSFKIWMCGGS+ WVP
Sbjct: 325 RQHASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVP 384
Query: 180 CSRIVSLIRPVFKA 193
CSR+ + R + K
Sbjct: 385 CSRVGHVYRILGKV 398
>gi|66472462|ref|NP_001018477.1| N-acetylgalactosaminyltransferase 7 [Danio rerio]
gi|63100869|gb|AAH95642.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7 [Danio rerio]
Length = 652
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 132/248 (53%), Positives = 178/248 (71%), Gaps = 6/248 (2%)
Query: 195 GKLGNLEPP-LEPY--KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 251
G LGN EP EP+ + GPGEG K + L Y+ A AS+ E+G NM S+ IS DRT+
Sbjct: 125 GTLGNFEPKEPEPHGVQGGPGEGSKPFVLGPEYKDAVQASIKEFGFNMVASDMISLDRTV 184
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
DLR EECKYW+Y +L +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+++DDFS
Sbjct: 185 GDLRHEECKYWNYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVMIDDFS 244
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGA-KESRGEVIVFLDAHCEVGLN 370
+KA L ++LE+YI+++NG V++ RN +REGLI+ RS GA K + G+V+++LDAHCEVG+N
Sbjct: 245 NKAHLKERLEEYIKQWNGLVKVFRNEKREGLIQARSIGARKATLGKVLIYLDAHCEVGVN 304
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
W PL+API DR + TVP+ID ID T E + + D RG ++W +L+K L
Sbjct: 305 WYAPLVAPISKDRTVCTVPLIDYIDGNDYTIEPQQGGDEDGLARGAWDWSLLWKRVPLSS 364
Query: 429 REAKKRKY 436
RE KRK+
Sbjct: 365 REKAKRKH 372
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 105/204 (51%), Positives = 144/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGA-KESRGEVIVFLDAHCEVGLNWLPP 60
L ++LE+YI+++NG V++ RN +REGLI+ RS GA K + G+V+++LDAHCEVG+NW P
Sbjct: 249 LKERLEEYIKQWNGLVKVFRNEKREGLIQARSIGARKATLGKVLIYLDAHCEVGVNWYAP 308
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR + TVP+ID ID T E + + D RG ++W +L+K L RE
Sbjct: 309 LVAPISKDRTVCTVPLIDYIDGNDYTIEPQQGGDEDGLARGAWDWSLLWKRVPLSSREKA 368
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 369 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGQLLFV 428
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 429 PCSRVGH----IYRLQGWQGNPPP 448
>gi|198419403|ref|XP_002128971.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
intestinalis]
Length = 631
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 136/270 (50%), Positives = 178/270 (65%), Gaps = 15/270 (5%)
Query: 181 SRIVSLIRPVFKAD---------GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDAS 231
+ +V+ I+PV K D KLGN E L + GPGE G A H + A AS
Sbjct: 84 NEVVAPIKPVVKEDFSNYPQLNWRKLGNYEESLA-RRNGPGEYGVAVHATNDEKEAVAAS 142
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
+ E+G NM S+ IS DR DLR +EC++WDYP DLP SVI+VFHNEG+S+L+RTVHS
Sbjct: 143 IKEFGFNMVNSDKISLDRLPKDLRHDECRHWDYPSDLPDVSVIIVFHNEGWSTLVRTVHS 202
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
+I TP + L EI+++DD S+K L QKL +YIQRFNG V+L RN REGLIR RS GA+
Sbjct: 203 VINLTPKKLLYEIVMIDDHSNKEHLGQKLTEYIQRFNGLVKLYRNERREGLIRARSIGAQ 262
Query: 352 ESR---GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 408
+S G V+V+LDAHCEVG NWLPPL+ PI ++RK+ TVP+ID I+ Q + F S D
Sbjct: 263 KSTPADGRVLVYLDAHCEVGYNWLPPLIMPIVNNRKVTTVPLIDVINGQDYTFTSQAGGD 322
Query: 409 HH--YRGIFEWGMLYKENELPEREAKKRKY 436
+ RG ++W ML+K L + E +RK+
Sbjct: 323 ANGFARGAWDWSMLWKRVPLTKEEHNRRKH 352
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 109/210 (51%), Positives = 146/210 (69%), Gaps = 9/210 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR---GEVIVFLDAHCEVGLNWL 58
L +KL +YI+RFNG V+L RN REGLIR RS GA++S G V+V+LDAHCEVG NWL
Sbjct: 227 LGQKLTEYIQRFNGLVKLYRNERREGLIRARSIGAQKSTPADGRVLVYLDAHCEVGYNWL 286
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENELPERE 116
PPL+ PI ++RK+ TVP+ID I+ Q + F S D + RG ++W ML+K L + E
Sbjct: 287 PPLIMPIVNNRKVTTVPLIDVINGQDYTFTSQAGGDANGFARGAWDWSMLWKRVPLTKEE 346
Query: 117 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
+RK+ ++PY+SP AGGLFA++R +F ++G YDPGL +WGGENFE+SFKIWMC G +
Sbjct: 347 HNRRKHTTDPYRSPAMAGGLFAIERQYFFDIGLYDPGLEIWGGENFEMSFKIWMCEGEVL 406
Query: 177 WVPCSRIVSLIRPVFKADGKLGNLEPPLEP 206
+VPCSR+ V++ G GN P P
Sbjct: 407 FVPCSRVGH----VYRLPGWSGNPPPEYVP 432
>gi|390370478|ref|XP_793045.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like, partial [Strongylocentrotus purpuratus]
Length = 658
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 123/241 (51%), Positives = 164/241 (68%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G+ EP P +EGPGEGG A + +A D + EYG N S+ IS DR I DLR +
Sbjct: 333 GDYEPVNLPVREGPGEGGAAVRTQPSEKAKVDRLIQEYGFNQYVSDQISLDRNIADLRSQ 392
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
+CK+W YP LP SVI+VFHNEG+S+L+RTVHS+ R+P+Q L EIILVDDFS+K L
Sbjct: 393 QCKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLK 452
Query: 318 QKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
++LEDY+Q RFNGK++L+RN+ REGLIRTR GA+ S G+V+++LDAHCEVG+NWLPPL
Sbjct: 453 ERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLPPL 512
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
L PI +R P+ID ID + D RG F+W + +K +P+ E +R+
Sbjct: 513 LTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWKHLPVPQFEKSRRQ 571
Query: 436 Y 436
+
Sbjct: 572 H 572
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 106/189 (56%), Positives = 137/189 (72%), Gaps = 3/189 (1%)
Query: 2 LDKKLEDYIE--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L ++LEDY++ RFNGK++L+RN+ REGLIRTR GA+ S G+V+++LDAHCEVG+NWLP
Sbjct: 451 LKERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLP 510
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI +R P+ID ID + D RG F+W + +K +P+ E +
Sbjct: 511 PLLTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQD-RGGFDWSLYWKHLPVPQFEKSR 569
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R++ SEPY+SP AGGLFAMDR +F ELG YD GL +WGGENFELSFKIWMCGGS+ WVP
Sbjct: 570 RQHASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVP 629
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 630 CSRVGHVYR 638
>gi|432847870|ref|XP_004066191.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oryzias
latipes]
Length = 653
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/248 (52%), Positives = 175/248 (70%), Gaps = 8/248 (3%)
Query: 195 GKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 250
G LGN EP EP +G PGEG K L Y+ + AS+ E+G NM S+ IS DRT
Sbjct: 127 GTLGNFEPK-EPEPQGILNGPGEGAKPLILGSEYKDSVQASIKEFGFNMVASDMISMDRT 185
Query: 251 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
I DLR +ECKYW Y L +SV++VFHNEG+S+LMRTVHS+IKRTP QYL EI+++DDF
Sbjct: 186 ISDLRNDECKYWHYDDRLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRQYLAEIVMIDDF 245
Query: 311 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGL 369
S+K L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDAHCEVG+
Sbjct: 246 SNKVHLKERLEEYIKQWNGLVKLFRNDKREGLIQARSIGAKKATKGQVLIYLDAHCEVGI 305
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
NW PL+API DR + TVP+ID I + T E + + D RG ++W ML+K L
Sbjct: 306 NWYAPLIAPISKDRTVCTVPLIDSIHGERFTIEPQGGGDEDGFARGAWDWSMLWKRVPLG 365
Query: 428 EREAKKRK 435
+RE K RK
Sbjct: 366 DREKKLRK 373
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/204 (51%), Positives = 146/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGLNWLPP 60
L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDAHCEVG+NW P
Sbjct: 251 LKERLEEYIKQWNGLVKLFRNDKREGLIQARSIGAKKATKGQVLIYLDAHCEVGINWYAP 310
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQ--TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR + TVP+ID I + T E + + D RG ++W ML+K L +RE K
Sbjct: 311 LIAPISKDRTVCTVPLIDSIHGERFTIEPQGGGDEDGFARGAWDWSMLWKRVPLGDREKK 370
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R +F ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 371 LRKTQTEPYRSPAMAGGLFAIERDYFFELGLYDPGLQIWGGENFEISYKIWQCGGQLLFV 430
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 431 PCSRVGH----IYRLQGWQGNPPP 450
>gi|148237032|ref|NP_001084848.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Xenopus
laevis]
gi|47124654|gb|AAH70527.1| MGC78803 protein [Xenopus laevis]
Length = 653
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 129/254 (50%), Positives = 183/254 (72%), Gaps = 9/254 (3%)
Query: 189 PVFKADGKLGNLEPPLEP----YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 244
PV + G LGN+EP EP +GPGEGGK + L Y+ A A++ E+G NM S+
Sbjct: 121 PVLRP-GILGNMEPK-EPEPQGVVDGPGEGGKHFMLGPDYKDAIKATIKEFGFNMVASDM 178
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DRTI DLR EECK+W+Y +L +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 179 ISLDRTINDLRHEECKFWNYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEI 238
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 363
+++DDFS+K L ++L++YI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 239 VMIDDFSNKEHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDA 298
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLY 421
HCEVG+NW PL+API DR TVP+ID I+ T+E ++ + D RG ++W ML+
Sbjct: 299 HCEVGINWYAPLIAPIAKDRTTCTVPLIDVIEGNTYELIPQAGGDEDGFARGAWDWSMLW 358
Query: 422 KENELPEREAKKRK 435
K L +E ++RK
Sbjct: 359 KRVPLTSKEKEQRK 372
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 99/204 (48%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L++YI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEVG+NW P
Sbjct: 250 LKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDAHCEVGINWYAP 309
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID I+ T+E ++ + D RG ++W ML+K L +E +
Sbjct: 310 LIAPIAKDRTTCTVPLIDVIEGNTYELIPQAGGDEDGFARGAWDWSMLWKRVPLTSKEKE 369
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+RK +EPY+SP AGGLFA++R +F ELG YDPGL +WGGENFE+S+KIW CGG + +
Sbjct: 370 QRKTKTEPYRSPAMAGGLFAIEREYFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFT 429
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 430 PCSRVGH----IYRLHGWQGNPTP 449
>gi|326918604|ref|XP_003205578.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Meleagris
gallopavo]
Length = 665
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 180/253 (71%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 133 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 191
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 192 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 251
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+KA L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 252 LIDDFSNKAHLKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 311
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEVG+NW PL+API DR TVP+ID ID T++ + + D RG ++W ML+K
Sbjct: 312 CEVGINWYAPLIAPISKDRTTCTVPLIDVIDGNTFKIVPQGGGDEDGFARGAWDWSMLWK 371
Query: 423 ENELPEREAKKRK 435
L +RE +KR+
Sbjct: 372 RVPLSKREKEKRE 384
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 101/190 (53%), Positives = 141/190 (74%), Gaps = 3/190 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAHCEVG+NW P
Sbjct: 262 LKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAHCEVGINWYAP 321
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID T++ + + D RG ++W ML+K L +RE +
Sbjct: 322 LIAPISKDRTTCTVPLIDVIDGNTFKIVPQGGGDEDGFARGAWDWSMLWKRVPLSKREKE 381
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KR+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 382 KRETKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 441
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 442 PCSRVGHIYR 451
>gi|410927898|ref|XP_003977377.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase
7-like [Takifugu rubripes]
Length = 664
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 178/253 (70%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV K G LGNLEP P P G GEG K + L Y+ A AS+ E+G NM S+ I
Sbjct: 132 PVLKK-GILGNLEPKEPEPPGVPGGLGEGAKPFVLNAEYKDAIQASIKEFGFNMVASDMI 190
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR+I D+R +ECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 191 SLDRSISDIRHDECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEIV 250
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAH 364
++DDFS+K L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDAH
Sbjct: 251 MIDDFSNKVHLKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLIYLDAH 310
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYK 422
CEVG+NW PL+API DR + TVP+ID ID Q + D + RG ++W ML+K
Sbjct: 311 CEVGINWYAPLVAPISKDRTVCTVPLIDSIDGQKYTVDPQGGGDQNGFARGAWDWSMLWK 370
Query: 423 ENELPEREAKKRK 435
L +RE + RK
Sbjct: 371 RVPLGDREKQLRK 383
Score = 214 bits (544), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 103/190 (54%), Positives = 140/190 (73%), Gaps = 3/190 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGLNWLPP 60
L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+++LDAHCEVG+NW P
Sbjct: 261 LKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLIYLDAHCEVGINWYAP 320
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENELPEREAK 118
L+API DR + TVP+ID ID Q + D + RG ++W ML+K L +RE +
Sbjct: 321 LVAPISKDRTVCTVPLIDSIDGQKYTVDPQGGGDQNGFARGAWDWSMLWKRVPLGDREKQ 380
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 381 LRKTETEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGQLLFV 440
Query: 179 PCSRIVSLIR 188
PCSRI + R
Sbjct: 441 PCSRIGHIYR 450
>gi|345307492|ref|XP_001507110.2| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Ornithorhynchus anatinus]
Length = 873
Score = 247 bits (631), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 130/254 (51%), Positives = 177/254 (69%), Gaps = 9/254 (3%)
Query: 189 PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 244
PV + GKLGN EP EP +G PGE K Y L Y+ + AS+ E+G NM S+
Sbjct: 98 PVLQP-GKLGNFEPK-EPEPQGVMGGPGEEAKPYVLGPEYKDSIQASIKEFGFNMVASDM 155
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DR+I DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 156 ISLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEI 215
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 363
+L+DDFS+KA L +L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 216 VLIDDFSNKAHLKDRLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDA 275
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLY 421
HCEV +NW PL+API DR + TVP+ID I T+ + + D + RG ++W ML+
Sbjct: 276 HCEVAVNWYAPLVAPISKDRTVCTVPLIDVISGNTFNIVPQGGGDEDGYARGAWDWSMLW 335
Query: 422 KENELPEREAKKRK 435
K L +RE RK
Sbjct: 336 KRVPLTQREKTLRK 349
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 98/190 (51%), Positives = 137/190 (72%), Gaps = 3/190 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 227 LKDRLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 286
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR + TVP+ID I T+ + + D + RG ++W ML+K L +RE
Sbjct: 287 LVAPISKDRTVCTVPLIDVISGNTFNIVPQGGGDEDGYARGAWDWSMLWKRVPLTQREKT 346
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 347 LRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 406
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 407 PCSRVGHIYR 416
>gi|344288243|ref|XP_003415860.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Loxodonta
africana]
Length = 657
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 177/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKPAVQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L QKL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKQKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR + TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRAVCTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L ERE + RK
Sbjct: 364 RVPLTEREKRMRK 376
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 106/206 (51%), Positives = 148/206 (71%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKQKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR + TVP+ID I+ T+E + + D + RG ++W ML+K L ERE +
Sbjct: 314 LVAPISKDRAVCTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTEREKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 MRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 434 PCSRVGH----IYRLEGWQGN-PPPL 454
>gi|327268630|ref|XP_003219099.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Anolis
carolinensis]
Length = 654
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 178/253 (70%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 122 PVLRP-GILGNFEPKEPEPHGVVNGPGEEAKPYVLGAEYKESVQASIKEFGFNMVASDMI 180
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR+I D+R EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 181 SLDRSINDIRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 240
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+KA L ++LE+YI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 241 LIDDFSNKAHLKERLEEYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 300
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID T+ + + D + RG ++W ML+K
Sbjct: 301 CEVAVNWYAPLIAPISKDRTTCTVPLIDVIDGNTYNIVPQGGGDDDGYARGAWDWSMLWK 360
Query: 423 ENELPEREAKKRK 435
L +RE + RK
Sbjct: 361 RVPLTKREKEMRK 373
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++LE+YI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 251 LKERLEEYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 310
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID T+ + + D + RG ++W ML+K L +RE +
Sbjct: 311 LIAPISKDRTTCTVPLIDVIDGNTYNIVPQGGGDDDGYARGAWDWSMLWKRVPLTKREKE 370
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +
Sbjct: 371 MRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFT 430
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 431 PCSRVGH----IYRLQGWQGNPPP 450
>gi|344235654|gb|EGV91757.1| N-acetylgalactosaminyltransferase 7 [Cricetulus griseus]
Length = 607
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 176/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 75 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 133
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 134 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 193
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFSSK L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 194 LIDDFSSKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 253
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 254 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 313
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 314 RVPLTSREKRLRK 326
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/206 (51%), Positives = 147/206 (71%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 204 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 263
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 264 LVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTSREKR 323
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 324 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 383
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 384 PCSRVGH----IYRLEGWQGN-PPPL 404
>gi|449500526|ref|XP_002187477.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Taeniopygia
guttata]
Length = 828
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 130/254 (51%), Positives = 178/254 (70%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 296 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 354
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 355 SLDRSVNDLRQEECKYWHYDDNLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 414
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+KA L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 415 LIDDFSNKAHLQERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 474
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEVG+NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 475 CEVGINWYAPLIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 534
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 535 RIPLSHKEKSKRKH 548
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 144/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAHCEVG+NW P
Sbjct: 425 LQERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAHCEVGINWYAP 484
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 485 LIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKS 544
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 545 KRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 604
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 605 PCSRVGH----IYRLQGWQGNPPP 624
>gi|355689622|gb|AER98894.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 [Mustela putorius
furo]
Length = 351
Score = 245 bits (625), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 131/253 (51%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKGKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV LNW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVALNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321
Query: 423 ENELPEREAKKRK 435
L RE K RK
Sbjct: 322 RVPLTPREKKMRK 334
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/140 (50%), Positives = 98/140 (70%), Gaps = 3/140 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV LNW P
Sbjct: 212 LKGKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVALNWYAP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE K
Sbjct: 272 LVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTPREKK 331
Query: 119 KRKYNSEPYKSPTHAGGLFA 138
RK +EPY+SP AGGLF+
Sbjct: 332 MRKTKTEPYRSPAMAGGLFS 351
>gi|363733313|ref|XP_420521.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gallus gallus]
Length = 636
Score = 245 bits (625), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 179/253 (70%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 104 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESVQASIKEFGFNMVASDMI 162
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECK+W Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 163 SLDRSVNDLRQEECKHWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 222
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 223 LIDDFSNKVHLKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 282
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEVG+NW PL+API DR TVP+ID ID T++ + + D RG ++W ML+K
Sbjct: 283 CEVGINWYAPLIAPISKDRTTCTVPLIDVIDGDTFKIVPQGGGDEDGFARGAWDWSMLWK 342
Query: 423 ENELPEREAKKRK 435
L +RE +KR+
Sbjct: 343 RVPLSKREKEKRE 355
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 146/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L+DYI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAHCEVG+NW P
Sbjct: 233 LKERLDDYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAHCEVGINWYAP 292
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID T++ + + D RG ++W ML+K L +RE +
Sbjct: 293 LIAPISKDRTTCTVPLIDVIDGDTFKIVPQGGGDEDGFARGAWDWSMLWKRVPLSKREKE 352
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KR+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 353 KRETKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 412
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 413 PCSRVGH----IYRLQGWQGNPPP 432
>gi|12621080|ref|NP_075215.1| N-acetylgalactosaminyltransferase 7 [Rattus norvegicus]
gi|51315737|sp|Q9R0C5.1|GALT7_RAT RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|4092503|gb|AAC99426.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T6 [Rattus
norvegicus]
gi|149032267|gb|EDL87173.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_a
[Rattus norvegicus]
Length = 657
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGENAKPLVLGPEYKQAAQASIKEFGFNMAASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL +YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLTEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 364 RVPLTPREKRLRK 376
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 105/206 (50%), Positives = 146/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL +YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLTEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 314 LVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 434 PCSRVGH----IYRLEGWQGN-PPPL 454
>gi|449270894|gb|EMC81540.1| N-acetylgalactosaminyltransferase 7, partial [Columba livia]
Length = 613
Score = 244 bits (623), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 129/254 (50%), Positives = 179/254 (70%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 81 PVLRP-GVLGNFEPKEPEPHGVVGGPGEEAKPYVLGPDYKESIQASIKEFGFNMVASDMI 139
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 140 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 199
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+KA L ++L++YI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 200 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 259
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEVG+NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 260 CEVGINWYAPLIAPIAKDRTTCTVPLIDYIDGSDYSIEPQQGGDEDGFARGAWDWSLLWK 319
Query: 423 ENELPEREAKKRKY 436
L ++E KRK+
Sbjct: 320 RIPLSQKEKSKRKH 333
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L++YI+++NG V++ RN REGLI+ RS GA++++ G+V+V+LDAHCEVG+NW P
Sbjct: 210 LKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAHCEVGINWYAP 269
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L ++E
Sbjct: 270 LIAPIAKDRTTCTVPLIDYIDGSDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSQKEKS 329
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 330 KRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 389
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 390 PCSRVGH----IYRLQGWQGNPPP 409
>gi|345790686|ref|XP_543898.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Canis lupus
familiaris]
Length = 721
Score = 244 bits (622), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 176/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 189 PVLRP-GILGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 247
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 248 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 307
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 308 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 367
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 368 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 427
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 428 RVPLTPREKRMRK 440
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 318 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 377
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 378 LVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 437
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 438 MRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 497
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 498 PCSRVGH----IYRLEGWQGNPPP 517
>gi|338722468|ref|XP_001915592.2| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Equus
caballus]
Length = 621
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 177/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 89 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 147
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 148 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 207
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 208 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 267
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ +T+E + + D + RG ++W ML+K
Sbjct: 268 CEVAVNWYAPLIAPISKDRTICTVPIIDVINGKTYEIIPQGGGDEDGYARGAWDWSMLWK 327
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 328 RVPLTPREKRMRK 340
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 106/206 (51%), Positives = 148/206 (71%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 218 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 277
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ +T+E + + D + RG ++W ML+K L RE +
Sbjct: 278 LIAPISKDRTICTVPIIDVINGKTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 337
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 338 MRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 397
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 398 PCSRVGH----IYRLEGWQGN-PPPL 418
>gi|74139820|dbj|BAE31754.1| unnamed protein product [Mus musculus]
gi|74191634|dbj|BAE30388.1| unnamed protein product [Mus musculus]
gi|74198878|dbj|BAE30662.1| unnamed protein product [Mus musculus]
Length = 546
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 14 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 72
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 73 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 132
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 133 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 192
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 193 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 252
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 253 RVPLTSREKRLRK 265
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/206 (50%), Positives = 146/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 143 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 202
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I T+E + + D + RG ++W ML+K L RE +
Sbjct: 203 LVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTSREKR 262
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 263 LRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 322
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 323 PCSRVGH----IYRLEGWQGN-PPPL 343
>gi|269784707|ref|NP_653332.3| N-acetylgalactosaminyltransferase 7 isoform 1 [Mus musculus]
gi|51315950|sp|Q80VA0.2|GALT7_MOUSE RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|13650041|gb|AAK37549.1|AF349573_1 UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 [Mus musculus]
gi|30851602|gb|AAH52461.1| Galnt7 protein [Mus musculus]
Length = 657
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 364 RVPLTSREKRLRK 376
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/206 (50%), Positives = 146/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I T+E + + D + RG ++W ML+K L RE +
Sbjct: 314 LVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTSREKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 434 PCSRVGH----IYRLEGWQGN-PPPL 454
>gi|387019377|gb|AFJ51806.1| n-acetylgalactosaminyltransferase 7-like [Crotalus adamanteus]
Length = 658
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 128/248 (51%), Positives = 171/248 (68%), Gaps = 8/248 (3%)
Query: 195 GKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 250
G LGN EP EP G PGE K + L Y+ + AS+ E+G NM S+ IS DR+
Sbjct: 131 GILGNFEPK-EPESHGVVGGPGEEAKPFVLGPEYKESIQASIKEFGFNMVASDMISLDRS 189
Query: 251 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
I DLR EECKYW Y +L +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDF
Sbjct: 190 INDLRQEECKYWHYDENLLTSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDF 249
Query: 311 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGL 369
S+K L ++LEDYI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +
Sbjct: 250 SNKEHLKERLEDYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGKVLIYLDAHCEVAV 309
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELP 427
NW PL+API DR TVP+ID ID T+ + + D RG ++W ML+K L
Sbjct: 310 NWYAPLIAPISKDRTACTVPLIDVIDGNTYNIVPQGGGDEDGFARGAWDWSMLWKRVPLT 369
Query: 428 EREAKKRK 435
+RE RK
Sbjct: 370 KREKAMRK 377
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 142/204 (69%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++LEDYI+++NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 255 LKERLEDYIKQWNGLVKIFRNERREGLIQARSIGAQKAKLGKVLIYLDAHCEVAVNWYAP 314
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID T+ + + D RG ++W ML+K L +RE
Sbjct: 315 LIAPISKDRTACTVPLIDVIDGNTYNIVPQGGGDEDGFARGAWDWSMLWKRVPLTKREKA 374
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 375 MRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGQLLFV 434
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 435 PCSRVGH----IYRLQGWQGNPPP 454
>gi|148696676|gb|EDL28623.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_a [Mus
musculus]
Length = 615
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 141
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 322 RVPLTSREKRLRK 334
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/206 (50%), Positives = 146/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 212 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I T+E + + D + RG ++W ML+K L RE +
Sbjct: 272 LVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTSREKR 331
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 332 LRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 391
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 392 PCSRVGH----IYRLEGWQGN-PPPL 412
>gi|296484976|tpg|DAA27091.1| TPA: N-acetylgalactosaminyltransferase 7-like [Bos taurus]
Length = 781
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 249 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 307
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 308 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 367
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 368 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 427
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 428 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 487
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 488 RVPLTLREKRLRK 500
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 378 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 437
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 438 LVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTLREKR 497
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 498 LRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 557
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 558 PCSRVGH----IYRLEGWQGNPPP 577
>gi|440908503|gb|ELR58512.1| N-acetylgalactosaminyltransferase 7, partial [Bos grunniens mutus]
Length = 615
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 322 RVPLTPREKRLRK 334
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 212 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 272 LVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 331
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 332 LRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 391
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 392 PCSRVGH----IYRLEGWQGNPPP 411
>gi|119896052|ref|XP_602855.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
Length = 772
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 240 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 298
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 299 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 358
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 359 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 418
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 419 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 478
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 479 RVPLTLREKRLRK 491
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 369 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 428
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 429 LVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTLREKR 488
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 489 LRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 548
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 549 PCSRVGH----IYRLEGWQGNPPP 568
>gi|359067894|ref|XP_002689501.2| PREDICTED: N-acetylgalactosaminyltransferase 7 [Bos taurus]
Length = 617
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 130/253 (51%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 85 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 143
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 144 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 203
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 204 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 263
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 264 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 323
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 324 RVPLTLREKRLRK 336
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 214 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 273
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 274 LVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTLREKR 333
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 334 LRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 393
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 394 PCSRVGH----IYRLEGWQGNPPP 413
>gi|354484375|ref|XP_003504364.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cricetulus
griseus]
Length = 784
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 130/254 (51%), Positives = 174/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 252 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 310
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 311 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 370
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFSSK L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 371 LIDDFSSKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 430
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W ML+K
Sbjct: 431 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 490
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 491 RIPLSHKEKAKRKH 504
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 105/206 (50%), Positives = 145/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 381 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 440
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W ML+K L +E
Sbjct: 441 LVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWKRIPLSHKEKA 500
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 501 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 560
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 561 PCSRVGH----IYRLEGWQGN-PPPL 581
>gi|291241093|ref|XP_002740445.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7-like [Saccoglossus
kowalevskii]
Length = 594
Score = 242 bits (617), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 128/262 (48%), Positives = 172/262 (65%), Gaps = 13/262 (4%)
Query: 184 VSLIRP-------VFKADGKLGNLE--PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGE 234
+ L++P VFK+ LGN E PP + + GPGE KA + D S+ E
Sbjct: 54 IKLVKPGSVSLGNVFKSR-VLGNYENLPPSQEGRTGPGEYAKAVKTTPDEQKQVDRSINE 112
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
YG N S+ IS DRTI DLR E+CKYW YP LP VI+VFHNEG+S+L+RTVHS+
Sbjct: 113 YGFNQYVSDKISLDRTIKDLREEQCKYWHYPESLPAVGVIIVFHNEGWSTLLRTVHSLFN 172
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKE 352
RTP L E++LVDDFS+K L ++LE+Y++ RF GK++L+RN +REGLIRTR+ GA
Sbjct: 173 RTPPTLLHEVVLVDDFSNKEHLRERLEEYVKEPRFLGKIKLVRNAKREGLIRTRTVGAIH 232
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
S +V+V+LDAHCEVG+NWLPPLL+PI +R +TVP+ID ID + RS + R
Sbjct: 233 STADVLVWLDAHCEVGINWLPPLLSPIAQNRTTVTVPIIDVIDNMDYTMRSQGSGELS-R 291
Query: 413 GIFEWGMLYKENELPEREAKKR 434
G F+W + +K + + E +KR
Sbjct: 292 GGFDWSLYWKHLPMSKEETRKR 313
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 107/194 (55%), Positives = 140/194 (72%), Gaps = 3/194 (1%)
Query: 2 LDKKLEDYIE--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L ++LE+Y++ RF GK++L+RN +REGLIRTR+ GA S +V+V+LDAHCEVG+NWLP
Sbjct: 194 LRERLEEYVKEPRFLGKIKLVRNAKREGLIRTRTVGAIHSTADVLVWLDAHCEVGINWLP 253
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL+PI +R +TVP+ID ID + RS + RG F+W + +K + + E +K
Sbjct: 254 PLLSPIAQNRTTVTVPIIDVIDNMDYTMRSQGSGELS-RGGFDWSLYWKHLPMSKEETRK 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R +SEPY+SP AGGLFAM R +F ELG YDPGL VWGGENFELSFKIW CGGS+ WVP
Sbjct: 313 RSLSSEPYRSPAMAGGLFAMARDYFFELGAYDPGLEVWGGENFELSFKIWQCGGSMLWVP 372
Query: 180 CSRIVSLIRPVFKA 193
CS + + R + K
Sbjct: 373 CSHVGHVYRILGKV 386
>gi|335301041|ref|XP_001926518.3| PREDICTED: N-acetylgalactosaminyltransferase 7 [Sus scrofa]
Length = 712
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ I
Sbjct: 180 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKHAVQASIKEFGFNMVASDMI 238
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L AS+++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 239 SLDRSVNDLRQEECKYWHYDENLLTASIVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 298
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 299 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 358
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 359 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 418
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 419 RVPLTPREKRMRK 431
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 309 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 368
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 369 LVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 428
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 429 MRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 488
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 489 PCSRVGH----IYRLEGWQGNPPP 508
>gi|301753757|ref|XP_002912714.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Ailuropoda
melanoleuca]
gi|281338294|gb|EFB13878.1| hypothetical protein PANDA_000463 [Ailuropoda melanoleuca]
Length = 657
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGENAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L KL+DY++ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKGKLDDYLKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 364 RVPLTPREKRMRK 376
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 144/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L KL+DY++ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKGKLDDYLKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 314 LVAPISKDRTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 MRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|417411949|gb|JAA52393.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 615
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 141
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWK 321
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 322 RVPLTPQEKRMRK 334
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 212 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 272 LVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 331
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 332 MRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 391
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 392 PCSRVGH----IYRLEGWQGNPPP 411
>gi|395840002|ref|XP_003792859.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 1 [Otolemur
garnettii]
Length = 657
Score = 241 bits (614), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 129/247 (52%), Positives = 171/247 (69%), Gaps = 6/247 (2%)
Query: 195 GKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 251
G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ IS DR+I
Sbjct: 130 GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKQAAQASIKEFGFNMVASDMISLDRSI 189
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
DLR EECKYW Y +L ASVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS
Sbjct: 190 NDLRQEECKYWHYDENLLTASVIVVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFS 249
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLN 370
+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +N
Sbjct: 250 NKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVN 309
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 428
W PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L
Sbjct: 310 WYAPLVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTL 369
Query: 429 REAKKRK 435
RE RK
Sbjct: 370 REKSLRK 376
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 144/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTLREKS 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|26329091|dbj|BAC28284.1| unnamed protein product [Mus musculus]
Length = 657
Score = 241 bits (614), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 174/253 (68%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+ PI DR I TVP+ID I T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVPPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L RE + RK
Sbjct: 364 RVPLTSREKRLRK 376
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 103/206 (50%), Positives = 145/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+ PI DR I TVP+ID I T+E + + D + RG ++W ML+K L RE +
Sbjct: 314 LVPPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTSREKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 434 PCSRVGH----IYRLEGWQGN-PPPL 454
>gi|126331345|ref|XP_001372222.1| PREDICTED: n-acetylgalactosaminyltransferase 7-like [Monodelphis
domestica]
Length = 585
Score = 241 bits (614), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 177/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G +GN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GIIGNFEPKEPEPHGVLGGPGEEAKPYVLGPDYKESIHASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR+I DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+KA L ++L++YI+++NG V++ RN REGLI+ RS GA +++ G+V+++LDAH
Sbjct: 244 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR + TVP+ID ID ++ + + D + RG ++W +L+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTVCTVPIIDIIDGNNFKIMPQGGGDEDGYARGAWDWSLLWK 363
Query: 423 ENELPEREAKKRK 435
L +RE RK
Sbjct: 364 RVPLTQREKTMRK 376
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 99/204 (48%), Positives = 144/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L++YI+++NG V++ RN REGLI+ RS GA +++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR + TVP+ID ID ++ + + D + RG ++W +L+K L +RE
Sbjct: 314 LVAPISKDRTVCTVPIIDIIDGNNFKIMPQGGGDEDGYARGAWDWSLLWKRVPLTQREKT 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 MRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|26334735|dbj|BAC31068.1| unnamed protein product [Mus musculus]
Length = 383
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 174/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 364 RIPLSHKEKAKRKH 377
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 89/130 (68%), Gaps = 3/130 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W ML+K L +E
Sbjct: 314 LVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWKRIPLSHKEKA 373
Query: 119 KRKYNSEPYK 128
KRK+ +EPY+
Sbjct: 374 KRKHKTEPYR 383
>gi|332820787|ref|XP_003310650.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan troglodytes]
gi|410227832|gb|JAA11135.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410262380|gb|JAA19156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410297750|gb|JAA27475.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
gi|410332293|gb|JAA35093.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Pan
troglodytes]
Length = 657
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 176/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN +REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNEKREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTTQEKRLRK 376
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 146/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN +REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNEKREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTTQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|269784709|ref|NP_001161453.1| N-acetylgalactosaminyltransferase 7 isoform 2 [Mus musculus]
gi|26331462|dbj|BAC29461.1| unnamed protein product [Mus musculus]
Length = 657
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 174/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 364 RIPLSHKEKAKRKH 377
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 103/206 (50%), Positives = 145/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W ML+K L +E
Sbjct: 314 LVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWKRIPLSHKEKA 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 KRKHKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 434 PCSRVGH----IYRLEGWQGN-PPPL 454
>gi|395542397|ref|XP_003773119.1| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
[Sarcophilus harrisii]
Length = 797
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 176/254 (69%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G +GN EP EP+ GPGE K Y L Y+ + AS+ E+G NM S+ I
Sbjct: 265 PVLRP-GIIGNFEPKEPEPHGVLGGPGEEAKPYVLGPDYKESIHASIKEFGFNMVASDMI 323
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR+I DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 324 SLDRSINDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 383
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+KA L ++L++YI+++NG V++ RN REGLI+ RS GA +++ G+V+++LDAH
Sbjct: 384 LIDDFSNKAHLKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAH 443
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 444 CEVAVNWYAPLIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 503
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 504 RIPLSHKEKAKRKH 517
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 97/190 (51%), Positives = 137/190 (72%), Gaps = 3/190 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L++YI+++NG V++ RN REGLI+ RS GA +++ G+V+++LDAHCEV +NW P
Sbjct: 394 LKERLDEYIKQWNGLVKVFRNERREGLIQARSIGAHKAKLGQVLIYLDAHCEVAVNWYAP 453
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 454 LIAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 513
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 514 KRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 573
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 574 PCSRVGHIYR 583
>gi|296195170|ref|XP_002745262.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Callithrix jacchus]
Length = 657
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 128/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTPQEKRLRK 376
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+V+LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|148696677|gb|EDL28624.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_b [Mus
musculus]
Length = 615
Score = 239 bits (609), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 174/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPLVLGPEYKQAVQASIKEFGFNMVASDMI 141
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWK 321
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 322 RIPLSHKEKAKRKH 335
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 103/206 (50%), Positives = 145/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 212 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W ML+K L +E
Sbjct: 272 LVAPISKDRATCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSMLWKRIPLSHKEKA 331
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 332 KRKHKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 391
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 392 PCSRVGH----IYRLEGWQGN-PPPL 412
>gi|157502212|ref|NP_059119.2| N-acetylgalactosaminyltransferase 7 [Homo sapiens]
gi|51315961|sp|Q86SF2.1|GALT7_HUMAN RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|28279289|gb|AAH46129.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|28704077|gb|AAH47468.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|119625166|gb|EAX04761.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7) [Homo
sapiens]
gi|193786832|dbj|BAG52155.1| unnamed protein product [Homo sapiens]
gi|325464563|gb|ADZ16052.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7 (GalNAc-T7)
[synthetic construct]
Length = 657
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTPQEKRLRK 376
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|397505872|ref|XP_003823466.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Pan paniscus]
Length = 657
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTPQEKRLRK 376
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|197101721|ref|NP_001124628.1| N-acetylgalactosaminyltransferase 7 [Pongo abelii]
gi|75042656|sp|Q5RFJ6.1|GALT7_PONAB RecName: Full=N-acetylgalactosaminyltransferase 7; AltName:
Full=Polypeptide GalNAc transferase 7; Short=GalNAc-T7;
Short=pp-GaNTase 7; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 7; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 7
gi|55725190|emb|CAH89461.1| hypothetical protein [Pongo abelii]
Length = 657
Score = 238 bits (608), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTPQEKRLRK 376
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|6318186|emb|CAB60270.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 7 [Homo
sapiens]
Length = 657
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 174/253 (68%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR + DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRNVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTPQEKRLRK 376
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 100/204 (49%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGL A++R FF ELG YDP L +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLCAIEREFFFELGLYDPSLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|402870854|ref|XP_003899414.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Papio anubis]
Length = 657
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTPQEKRLRK 376
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|383412007|gb|AFH29217.1| N-acetylgalactosaminyltransferase 7 [Macaca mulatta]
Length = 657
Score = 238 bits (607), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 304 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 363
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 364 RVPLTPQEKRLRK 376
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 314 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|109076193|ref|XP_001085532.1| PREDICTED: n-acetylgalactosaminyltransferase 7 [Macaca mulatta]
Length = 630
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 98 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 156
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 157 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 216
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 217 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 276
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 277 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 336
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 337 RVPLTPQEKRLRK 349
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 227 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 286
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 287 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 346
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 347 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 406
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 407 PCSRVGH----IYRLEGWQGNPPP 426
>gi|410956565|ref|XP_003984911.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Felis catus]
Length = 772
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 174/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 240 PVLRP-GILGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 298
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 299 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 358
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 359 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 418
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 419 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 478
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 479 RIPLSHKEKAKRKH 492
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 369 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 428
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 429 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 488
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 489 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 548
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 549 PCSRVGH----IYRLEGWQGNPPP 568
>gi|355778494|gb|EHH63530.1| hypothetical protein EGM_16517, partial [Macaca fascicularis]
Length = 615
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 141
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 322 RVPLTPQEKRLRK 334
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 212 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 272 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 331
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 332 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 391
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 392 PCSRVGH----IYRLEGWQGNPPP 411
>gi|426222421|ref|XP_004005390.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Ovis aries]
Length = 865
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 173/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE + L ++ A AS+ E+G NM S+ I
Sbjct: 333 PVLRP-GVLGNFEPKEPEPPGVVGGPGEKAQPLVLGPEFKHAVQASIKEFGFNMVASDMI 391
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L AS+I+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 392 SLDRSVNDLRQEECKYWHYDENLLTASIIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 451
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 452 LIDDFSNKEHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 511
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 512 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 571
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 572 RIPLSHKEKAKRKH 585
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 462 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 521
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 522 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 581
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 582 KRKHKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 641
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 642 PCSRVGH----IYRLEGWQGNPPP 661
>gi|355687724|gb|EHH26308.1| hypothetical protein EGK_16238, partial [Macaca mulatta]
Length = 615
Score = 238 bits (606), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 127/253 (50%), Positives = 175/253 (69%), Gaps = 7/253 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 83 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 141
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 142 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 201
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 202 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K
Sbjct: 262 CEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWK 321
Query: 423 ENELPEREAKKRK 435
L +E + RK
Sbjct: 322 RVPLTPQEKRLRK 334
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 212 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 272 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 331
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW GG +V
Sbjct: 332 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQGGGKFLFV 391
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 392 PCSRVGH----IYRLEGWQGNPPP 411
>gi|395840004|ref|XP_003792860.1| PREDICTED: N-acetylgalactosaminyltransferase 7 isoform 2 [Otolemur
garnettii]
Length = 657
Score = 237 bits (605), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 127/248 (51%), Positives = 170/248 (68%), Gaps = 6/248 (2%)
Query: 195 GKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 251
G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ IS DR+I
Sbjct: 130 GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKQAAQASIKEFGFNMVASDMISLDRSI 189
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
DLR EECKYW Y +L ASVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS
Sbjct: 190 NDLRQEECKYWHYDENLLTASVIVVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFS 249
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLN 370
+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +N
Sbjct: 250 NKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVN 309
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPE 428
W PL+API DR TVP+ID ID + E + + D RG ++W +L+K L
Sbjct: 310 WYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSH 369
Query: 429 REAKKRKY 436
+E KRK+
Sbjct: 370 KEKAKRKH 377
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 314 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 373
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 374 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 433
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 434 PCSRVGH----IYRLEGWQGNPPP 453
>gi|291244621|ref|XP_002742193.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 7-like
[Saccoglossus kowalevskii]
Length = 634
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 123/251 (49%), Positives = 160/251 (63%), Gaps = 6/251 (2%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
VFK G +GN EPP + G GEG L A + E+G NM S+ IS DR
Sbjct: 115 VFKP-GIVGNFEPPKSERRTGLGEGAIPVQLNPADENKYVKAKREFGFNMVISDQISLDR 173
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
T+ D+R ECKYW YP DLP ASV+LVF NEG+S+LMRTVHS+ +P+ L EI++VDD
Sbjct: 174 TVKDIRDPECKYWHYPTDLPTASVVLVFINEGWSTLMRTVHSVFNTSPSHLLAEIVMVDD 233
Query: 310 FSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGA-KESRGEVIVFLDAHCE 366
FS K L KLE+YI+ RF GK++L+RN +REGLIR R+ GA RGEV+VFLDAHCE
Sbjct: 234 FSDKDHLKSKLEEYIKQDRFEGKIKLVRNAKREGLIRARTIGAINAERGEVVVFLDAHCE 293
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 426
NWLPPLL+ I +RK + P++D +D + + + D RG+F W YK +
Sbjct: 294 CSPNWLPPLLSRIKQNRKAVVCPLVDAVDADNFGYAP--QADGMARGVFNWDFFYKRIPI 351
Query: 427 PEREAKKRKYN 437
P +EA +R+ N
Sbjct: 352 PPKEANRRERN 362
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 101/190 (53%), Positives = 136/190 (71%), Gaps = 5/190 (2%)
Query: 2 LDKKLEDYI--ERFNGKVRLIRNTEREGLIRTRSRGAKES-RGEVIVFLDAHCEVGLNWL 58
L KLE+YI +RF GK++L+RN +REGLIR R+ GA + RGEV+VFLDAHCE NWL
Sbjct: 240 LKSKLEEYIKQDRFEGKIKLVRNAKREGLIRARTIGAINAERGEVVVFLDAHCECSPNWL 299
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PPLL+ I +RK + P++D +D + + + D RG+F W YK +P +EA
Sbjct: 300 PPLLSRIKQNRKAVVCPLVDAVDADNFGYAP--QADGMARGVFNWDFFYKRIPIPPKEAN 357
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+R+ NSEPY+SP AGGLFA+ R+FF ++GGYD GL +WGGE +E+SFKIWMCGG +E+V
Sbjct: 358 RRERNSEPYRSPVMAGGLFALSRSFFFDIGGYDNGLDIWGGEQYEISFKIWMCGGILEFV 417
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 418 PCSRVGHIYR 427
>gi|390345015|ref|XP_787987.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 2
[Strongylocentrotus purpuratus]
gi|390345017|ref|XP_003726244.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 1
[Strongylocentrotus purpuratus]
Length = 670
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 115/236 (48%), Positives = 157/236 (66%), Gaps = 6/236 (2%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
PV K GN EPP +PY+ GPGE G L + D + EYG NM S+ IS D
Sbjct: 135 PVLKE--TTGNYEPPRQPYRTGPGEYGLGVLLDHNEKHLYDKAFEEYGFNMVVSDRISLD 192
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R + DLR +ECK+W YP +LP SV++VFH EG+S+L+RT+HS+ +P + L E++LVD
Sbjct: 193 RIVADLRDKECKHWHYPTNLPNTSVVIVFHQEGWSTLIRTIHSVFNTSPKELLAEVLLVD 252
Query: 309 DFSSKADLDQKLEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 366
D+S K L +KL+DYI+ RF+GK+R++RN +REGLIR+R+ GA+++ G+V+ FLDAHCE
Sbjct: 253 DYSDKVHLKKKLDDYIRDPRFSGKIRIVRNKKREGLIRSRTIGARKAIGQVLTFLDAHCE 312
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
G NWLPPLLA I DR + P +D I T+ + S + D RG F+W YK
Sbjct: 313 CGPNWLPPLLAEIAVDRSTIVCPTVDAISSDTFAYTS--QGDGLCRGAFDWDFWYK 366
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 103/192 (53%), Positives = 136/192 (70%), Gaps = 9/192 (4%)
Query: 2 LDKKLEDYIE--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L KKL+DYI RF+GK+R++RN +REGLIR+R+ GA+++ G+V+ FLDAHCE G NWLP
Sbjct: 260 LKKKLDDYIRDPRFSGKIRIVRNKKREGLIRSRTIGARKAIGQVLTFLDAHCECGPNWLP 319
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I DR + P +D I T+ + S + D RG F+W YK +P +
Sbjct: 320 PLLAEIAVDRSTIVCPTVDAISSDTFAYTS--QGDGLCRGAFDWDFWYKR--IPVKPYWH 375
Query: 120 R---KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
R K S+PY SP AGGL A+DR++F ELGGYDPGL +WGGENFE+SFK+WMCGGS++
Sbjct: 376 RLGLKQRSQPYPSPVMAGGLLALDRSYFFELGGYDPGLQIWGGENFEISFKVWMCGGSLK 435
Query: 177 WVPCSRIVSLIR 188
+VPCSR+ + R
Sbjct: 436 FVPCSRVGHVYR 447
>gi|313226887|emb|CBY22032.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 126/251 (50%), Positives = 166/251 (66%), Gaps = 5/251 (1%)
Query: 188 RPVFKADGKLGNLEPPLEPYKE---GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 244
+ +++ GKLGN EP KE G G+ GK + + A S+ E+G NM S+
Sbjct: 82 KEIYRDSGKLGNYEPDQATIKEMETGTGDYGKQVNWGKDEEDAVKKSIKEFGFNMVMSDK 141
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DR D+R +CKY DYP LP+ SV++VFHNEG+S+LMRTVHS+IK+TP + L E+
Sbjct: 142 ISLDRVPKDIRDPKCKYVDYPEKLPEVSVVIVFHNEGWSTLMRTVHSVIKQTPKELLGEV 201
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
++VDD S+K L L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAH
Sbjct: 202 VMVDDASTKEHLKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYK 422
CE NWLPPLLAPI + +I TVP+IDGID + F S D R G ++W L+K
Sbjct: 262 CEAEFNWLPPLLAPIARNDRISTVPMIDGIDGNHYHFTSQGGGDRWGRATGAWDWSFLWK 321
Query: 423 ENELPEREAKK 433
LPE E KK
Sbjct: 322 RIALPESEDKK 332
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 108/212 (50%), Positives = 142/212 (66%), Gaps = 8/212 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAHCE NWLPPL
Sbjct: 213 LKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAHCEAEFNWLPPL 272
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYKENELPEREAKK 119
LAPI + +I TVP+IDGID + F S D R G ++W L+K LPE E KK
Sbjct: 273 LAPIARNDRISTVPMIDGIDGNHYHFTSQGGGDRWGRATGAWDWSFLWKRIALPESEDKK 332
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+P+ SP AGGLFA++R +F ++ YDPGL +WGGENFELS+K+WMCGG + +VP
Sbjct: 333 LPSKIQPFPSPAMAGGLFAINRQYFKDIMYYDPGLEIWGGENFELSYKLWMCGGGMLFVP 392
Query: 180 CSRIVSLIRPVFKADGKLGNLEPPLEPYKEGP 211
CSR+ +++ +G GN PP + K P
Sbjct: 393 CSRVGH----IYRLEGWEGN--PPPKTVKSNP 418
>gi|432092277|gb|ELK24900.1| N-acetylgalactosaminyltransferase 7 [Myotis davidii]
Length = 802
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/254 (50%), Positives = 173/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ I
Sbjct: 75 PVLRP-GILGNFEPKEPEPHGVVGGPGENAKPLVLGPELKHAVQASIKEFGFNMVASDMI 133
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SVI+VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 134 SLDRSVNDLRQEECKYWHYDENLLVSSVIIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 193
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 194 LIDDFSNKEHLKKKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 253
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 254 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 313
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 314 RIPLSHKEKAKRKH 327
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/136 (48%), Positives = 91/136 (66%), Gaps = 6/136 (4%)
Query: 69 RKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 126
R I TVP+ID I+ T+E + + D + RG ++W ML+K L +E + RK +EP
Sbjct: 333 RTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRIRKTKTEP 392
Query: 127 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSL 186
Y+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +VPCSR+
Sbjct: 393 YRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGH- 451
Query: 187 IRPVFKADGKLGNLEP 202
+++ +G GN P
Sbjct: 452 ---IYRLEGWQGNPPP 464
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/131 (46%), Positives = 90/131 (68%), Gaps = 3/131 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L KKL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 204 LKKKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 263
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 264 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 323
Query: 119 KRKYNSEPYKS 129
KRK+ +EPY++
Sbjct: 324 KRKHKTEPYRT 334
>gi|348566877|ref|XP_003469228.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Cavia
porcellus]
Length = 637
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 126/255 (49%), Positives = 174/255 (68%), Gaps = 9/255 (3%)
Query: 189 PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 244
PV + G LGN EP EP +G PGE K L ++ A AS+ E+G NM S+
Sbjct: 105 PVLRP-GILGNFEPK-EPEPQGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDM 162
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI
Sbjct: 163 ISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEI 222
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 363
+L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 223 VLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDA 282
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLY 421
HCEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+
Sbjct: 283 HCEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLW 342
Query: 422 KENELPEREAKKRKY 436
K L +E KRK+
Sbjct: 343 KRIPLSHKEKAKRKH 357
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 103/206 (50%), Positives = 145/206 (70%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 234 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 293
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 294 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 353
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 354 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 413
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 414 PCSRVGH----IYRLEGWQGN-PPPL 434
>gi|431918328|gb|ELK17555.1| N-acetylgalactosaminyltransferase 7 [Pteropus alecto]
Length = 717
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 124/246 (50%), Positives = 170/246 (69%), Gaps = 6/246 (2%)
Query: 197 LGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 253
LGN EP EP+ GPGE K L ++ A AS+ E+G NM S+ IS DR++ D
Sbjct: 131 LGNFEPKEPEPHGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMISLDRSVND 190
Query: 254 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
LR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS+K
Sbjct: 191 LRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNK 250
Query: 314 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWL 372
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW
Sbjct: 251 EHLKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWY 310
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
PL+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 311 APLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKE 370
Query: 431 AKKRKY 436
KRK+
Sbjct: 371 KAKRKH 376
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/136 (48%), Positives = 91/136 (66%), Gaps = 6/136 (4%)
Query: 69 RKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEP 126
R I TVP+ID I+ T+E + + D + RG ++W ML+K L +E + RK +EP
Sbjct: 382 RTICTVPIIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRMRKTKTEP 441
Query: 127 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSL 186
Y+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +VPCSR+
Sbjct: 442 YRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGH- 500
Query: 187 IRPVFKADGKLGNLEP 202
+++ +G GN P
Sbjct: 501 ---IYRLEGWQGNPPP 513
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/131 (45%), Positives = 90/131 (68%), Gaps = 3/131 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL+DYI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 253 LKEKLDDYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 312
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 313 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 372
Query: 119 KRKYNSEPYKS 129
KRK+ +EPY++
Sbjct: 373 KRKHKTEPYRT 383
>gi|403295730|ref|XP_003938783.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Saimiri boliviensis
boliviensis]
Length = 659
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 127/254 (50%), Positives = 173/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 127 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAVQASIKEFGFNMVASDMI 185
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 186 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 245
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+V+LDAH
Sbjct: 246 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAH 305
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 306 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 365
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 366 RIPLSHKEKAKRKH 379
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 102/204 (50%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+V+LDAHCEV +NW P
Sbjct: 256 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLVYLDAHCEVAVNWYAP 315
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 316 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 375
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 376 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 435
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 436 PCSRVGH----IYRLEGWQGNPPP 455
>gi|426346015|ref|XP_004040686.1| PREDICTED: N-acetylgalactosaminyltransferase 7 [Gorilla gorilla
gorilla]
Length = 650
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 126/254 (49%), Positives = 173/254 (68%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 118 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 176
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 177 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 236
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 237 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 296
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 297 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 356
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 357 RIPLSHKEKAKRKH 370
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 247 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 306
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 307 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 366
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 367 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 426
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 427 PCSRVGH----IYRLEGWQGNPPP 446
>gi|348538240|ref|XP_003456600.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like [Oreochromis
niloticus]
Length = 649
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 173/255 (67%), Gaps = 7/255 (2%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEG---GKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV K G LGN EP PG K + L Y+ + AS+ E+G NM S+ I
Sbjct: 117 PVLKK-GILGNFEPKEPEPPGVPGGPGEGAKPFVLGPEYKDSVQASIKEFGFNMVASDMI 175
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DRTI D+R EECKYW Y L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 176 SLDRTINDIRHEECKYWHYDDRLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRRYLAEIV 235
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAH 364
L+DDFS+K L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+V+LDAH
Sbjct: 236 LIDDFSNKVHLKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLVYLDAH 295
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEVG+NW PL+API DR + TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 296 CEVGINWYAPLIAPISKDRTVCTVPLIDYIDGNEYSMEPQQGGDEDGLARGAWDWSLLWK 355
Query: 423 ENELPEREAKKRKYN 437
L +RE KR +
Sbjct: 356 RVPLSQREKAKRTHT 370
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 104/204 (50%), Positives = 146/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGLNWLPP 60
L ++LE+YI+++NG V+L RN +REGLI+ RS GAK+ ++G+V+V+LDAHCEVG+NW P
Sbjct: 246 LKERLEEYIKQWNGLVKLFRNEKREGLIQARSIGAKKATKGQVLVYLDAHCEVGINWYAP 305
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR + TVP+ID ID + E + + D RG ++W +L+K L +RE
Sbjct: 306 LIAPISKDRTVCTVPLIDYIDGNEYSMEPQQGGDEDGLARGAWDWSLLWKRVPLSQREKA 365
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KR + ++PY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 366 KRTHTTQPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGQLLFV 425
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 426 PCSRVGH----IYRLQGWQGNPPP 445
>gi|351701091|gb|EHB04010.1| N-acetylgalactosaminyltransferase 7, partial [Heterocephalus
glaber]
Length = 616
Score = 234 bits (598), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 127/255 (49%), Positives = 176/255 (69%), Gaps = 10/255 (3%)
Query: 189 PVFKADGKLGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSN 243
PV + G LGN EP EP +G PGE K L ++ A AS+ E+G NM S+
Sbjct: 83 PVLRP-GILGNFEPK-EPEPQGVVGGPGEEAKPLILGPEFKHAVQASIIKEFGFNMVASD 140
Query: 244 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 303
IS DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL E
Sbjct: 141 MISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAE 200
Query: 304 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLD 362
I+L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LD
Sbjct: 201 IVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLD 260
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGML 420
AHCEV +NW PL+API DR I TVP+ID I+ T++ + + D + RG ++W ML
Sbjct: 261 AHCEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYQIVPQGGGDEDGYARGAWDWSML 320
Query: 421 YKENELPEREAKKRK 435
+K L RE + RK
Sbjct: 321 WKRVPLTPREKRLRK 335
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 104/206 (50%), Positives = 147/206 (71%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 213 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 272
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T++ + + D + RG ++W ML+K L RE +
Sbjct: 273 LVAPISKDRTICTVPLIDVINGNTYQIVPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 332
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 333 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 392
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PPL
Sbjct: 393 PCSRVGH----IYRLEGWQGN-PPPL 413
>gi|313220437|emb|CBY31290.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 125/251 (49%), Positives = 166/251 (66%), Gaps = 5/251 (1%)
Query: 188 RPVFKADGKLGNLEPPLEPYKE---GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 244
+ +++ GKLGN EP KE G G+ GK + + A S+ E+G NM S+
Sbjct: 82 KEIYRDSGKLGNYEPDQATIKEMETGTGDYGKQVNWGKDEEDAVKKSIKEFGFNMVMSDT 141
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DR D+R +CKY DYP LP+ SV++VFHNEG+S+LMRTVHS+IK+TP + L E+
Sbjct: 142 ISLDRVPKDIRDPKCKYVDYPEKLPEVSVVIVFHNEGWSTLMRTVHSVIKQTPKELLGEV 201
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
++VDD S+K L L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAH
Sbjct: 202 VMVDDASTKEHLKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAH 261
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYK 422
CE NWLPPLLAPI + +I TVP+IDGID + F + D R G ++W L+K
Sbjct: 262 CEAEFNWLPPLLAPIARNDRISTVPMIDGIDGNHYHFTTQGGGDRWGRATGAWDWSFLWK 321
Query: 423 ENELPEREAKK 433
LPE E KK
Sbjct: 322 RIALPEPEDKK 332
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/212 (50%), Positives = 142/212 (66%), Gaps = 8/212 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L++Y++R+NG VR+ RN +REGLIR RS GA ES+ EV+VFLDAHCE NWLPPL
Sbjct: 213 LKDNLDEYVKRWNGLVRVHRNEQREGLIRARSIGAFESKKEVLVFLDAHCEAEFNWLPPL 272
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYKENELPEREAKK 119
LAPI + +I TVP+IDGID + F + D R G ++W L+K LPE E KK
Sbjct: 273 LAPIARNDRISTVPMIDGIDGNHYHFTTQGGGDRWGRATGAWDWSFLWKRIALPEPEDKK 332
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+P+ SP AGGLFA++R +F ++ YDPGL +WGGENFELS+K+WMCGG + +VP
Sbjct: 333 LPSKIQPFPSPAMAGGLFAINRQYFKDIMYYDPGLEIWGGENFELSYKLWMCGGGMLFVP 392
Query: 180 CSRIVSLIRPVFKADGKLGNLEPPLEPYKEGP 211
CSR+ +++ +G GN PP + K P
Sbjct: 393 CSRVGH----IYRLEGWEGN--PPPKTVKSNP 418
>gi|260789880|ref|XP_002589972.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
gi|229275159|gb|EEN45983.1| hypothetical protein BRAFLDRAFT_114654 [Branchiostoma floridae]
Length = 522
Score = 234 bits (596), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 118/244 (48%), Positives = 163/244 (66%), Gaps = 3/244 (1%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
+GN EP E + PGEG Y L Y+ D S+ E+G N+ S+ IS DRTI D+R
Sbjct: 1 MGNWEPEPERISDAPGEGAIPYKLGPEYKDDIDKSIKEFGFNIVASDKISLDRTIKDIRD 60
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
ECKYW Y LP SVI+VF+NE +S +MRTVHS+IKRTP + L EI+LVDDFS+K
Sbjct: 61 PECKYWHYDTKLPNMSVIIVFYNEAWSVVMRTVHSVIKRTPPELLAEIVLVDDFSTKEHW 120
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGLNWLPPL 375
Q+L+DYI +F G V+L+RN +REGLI+ RS GA+E ++G+++V+LD+HCEVG+NW P L
Sbjct: 121 KQRLDDYIVQFKGLVKLVRNKQREGLIQARSIGAREATKGKILVYLDSHCEVGINWAPAL 180
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH--HYRGIFEWGMLYKENELPEREAKK 433
++PI +R TVP+ID ID + + D H RG ++W +L+K+ RE +
Sbjct: 181 ISPIAVNRTTCTVPLIDVIDGNNYNIYAQGGGDEYGHARGAWDWSLLWKKVPNTPRERAR 240
Query: 434 RKYN 437
KY+
Sbjct: 241 HKYH 244
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 141/202 (69%), Gaps = 7/202 (3%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKE-SRGEVIVFLDAHCEVGLNWLPPLL 62
++L+DYI +F G V+L+RN +REGLI+ RS GA+E ++G+++V+LD+HCEVG+NW P L+
Sbjct: 122 QRLDDYIVQFKGLVKLVRNKQREGLIQARSIGAREATKGKILVYLDSHCEVGINWAPALI 181
Query: 63 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH--HYRGIFEWGMLYKENELPEREAKKR 120
+PI +R TVP+ID ID + + D H RG ++W +L+K+ RE +
Sbjct: 182 SPIAVNRTTCTVPLIDVIDGNNYNIYAQGGGDEYGHARGAWDWSLLWKKVPNTPRERARH 241
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
KY++EPY+SP AGGLFA+DR +F ELG YDPGL +WGGENFE+S+K+W CGG + + PC
Sbjct: 242 KYHTEPYRSPAMAGGLFAIDREYFFELGLYDPGLKIWGGENFEISYKVWQCGGEVLFTPC 301
Query: 181 SRIVSLIRPVFKADGKLGNLEP 202
SR+ +++ G GN P
Sbjct: 302 SRVGH----IYRLKGWAGNPPP 319
>gi|441620192|ref|XP_003258074.2| PREDICTED: LOW QUALITY PROTEIN: N-acetylgalactosaminyltransferase 7
[Nomascus leucogenys]
Length = 636
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 125/254 (49%), Positives = 172/254 (67%), Gaps = 7/254 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G L N EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 104 PVLRP-GILSNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKQAIQASIKEFGFNMVASDMI 162
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 163 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 222
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 223 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 282
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYK 422
CEV +NW PL+API DR TVP+ID ID + E + + D RG ++W +L+K
Sbjct: 283 CEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWK 342
Query: 423 ENELPEREAKKRKY 436
L +E KRK+
Sbjct: 343 RIPLSHKEKAKRKH 356
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 233 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 292
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 293 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 352
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 353 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 412
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 413 PCSRVGH----IYRLEGWQGNPPP 432
>gi|47575716|ref|NP_001001200.1| polypeptide N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
tropicalis]
gi|45501097|gb|AAH67317.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 7 [Xenopus (Silurana)
tropicalis]
Length = 653
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/255 (50%), Positives = 176/255 (69%), Gaps = 9/255 (3%)
Query: 189 PVFKADGKLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNH 244
PV + G LGNLEP EP +G G K + L Y+ A AS+ E+G NM S+
Sbjct: 121 PVLRP-GILGNLEPK-EPEPQGVVGGPGEGGKPFELGPDYKDAVKASIKEFGFNMVASDM 178
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DRTI DLR EECKYW+Y +L +SV++VFHNEG+S+L+RT+HS+IKRTP QYL EI
Sbjct: 179 ISMDRTINDLRHEECKYWNYDENLLTSSVVIVFHNEGWSTLVRTIHSVIKRTPRQYLAEI 238
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDA 363
+++DDFS+K L +L++Y++++NG V++ RN REGLI+ RS GA++++ G+V+++LDA
Sbjct: 239 VMIDDFSNKEHLKGRLDEYLKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDA 298
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID--YQTWEFRSVYEPDHHYRGIFEWGMLY 421
HCEVG+NW PL+API DR TVP+ID ID T E + + D RG ++W ML+
Sbjct: 299 HCEVGINWYAPLIAPIAKDRTACTVPLIDYIDGNLYTIEPQQGGDEDGFARGAWDWSMLW 358
Query: 422 KENELPEREAKKRKY 436
K L RE KRK+
Sbjct: 359 KRIPLTVREKAKRKH 373
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 101/204 (49%), Positives = 141/204 (69%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEVG+NW P
Sbjct: 250 LKGRLDEYLKQWNGLVKVFRNERREGLIQARSIGAEKAKLGQVLIYLDAHCEVGINWYAP 309
Query: 61 LLAPIYSDRKIMTVPVIDGID--YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID T E + + D RG ++W ML+K L RE
Sbjct: 310 LIAPIAKDRTACTVPLIDYIDGNLYTIEPQQGGDEDGFARGAWDWSMLWKRIPLTVREKA 369
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY SP AGGLFA++R +F ELG YDPGL +WGGENFE+S+KIW CGG + +
Sbjct: 370 KRKHKTEPYWSPAMAGGLFAIERDYFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFT 429
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ G GN P
Sbjct: 430 PCSRVGH----IYRLHGWQGNPTP 449
>gi|32425405|gb|AAH35303.1| GALNT7 protein, partial [Homo sapiens]
Length = 495
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 111/214 (51%), Positives = 157/214 (73%), Gaps = 3/214 (1%)
Query: 225 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 284
+ A AS+ E+G NM S+ IS DR++ DLR EECKYW Y +L +SV++VFHNEG+S+
Sbjct: 1 KQAIQASIKEFGFNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWST 60
Query: 285 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 344
LMRTVHS+IKRTP +YL EI+L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+
Sbjct: 61 LMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQ 120
Query: 345 TRSRGAKESR-GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-- 401
RS GA++++ G+V+++LDAHCEV +NW PL+API DR I TVP+ID I+ T+E
Sbjct: 121 ARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPLIDVINGNTYEIIP 180
Query: 402 RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ + D + RG ++W ML+K L +E + RK
Sbjct: 181 QGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRK 214
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 103/206 (50%), Positives = 147/206 (71%), Gaps = 8/206 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 92 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 151
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L +E +
Sbjct: 152 LVAPISKDRTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKR 211
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 212 LRKTKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 271
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPL 204
PCSR+ +++ +G GN PP+
Sbjct: 272 PCSRVGH----IYRLEGWQGN-PPPI 292
>gi|308506597|ref|XP_003115481.1| hypothetical protein CRE_18949 [Caenorhabditis remanei]
gi|308256016|gb|EFO99968.1| hypothetical protein CRE_18949 [Caenorhabditis remanei]
Length = 331
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 99/152 (65%), Positives = 123/152 (80%), Gaps = 1/152 (0%)
Query: 38 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHH 96
S GEV++FLDAHCEV NWLPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H
Sbjct: 7 NSTGEVVLFLDAHCEVNTNWLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAH 66
Query: 97 YRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLV 156
+ GIFEWG+LYKE ++ ERE+ RK+NS+P++SPTHAGGLFA++R +F ELG YD GL +
Sbjct: 67 HSGIFEWGLLYKETQITERESAHRKHNSQPFRSPTHAGGLFAINRLWFKELGYYDEGLQI 126
Query: 157 WGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
WGGE +ELSFKIW CGG I +VPCS + + R
Sbjct: 127 WGGEQYELSFKIWQCGGGIVFVPCSHVGHVYR 158
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 58/87 (66%), Positives = 71/87 (81%), Gaps = 1/87 (1%)
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY-EPDHH 410
S GEV++FLDAHCEV NWLPPLLAPI +RK+MTVPVIDGID +WE+RSVY P+ H
Sbjct: 7 NSTGEVVLFLDAHCEVNTNWLPPLLAPIKQNRKVMTVPVIDGIDSNSWEYRSVYGSPNAH 66
Query: 411 YRGIFEWGMLYKENELPEREAKKRKYN 437
+ GIFEWG+LYKE ++ ERE+ RK+N
Sbjct: 67 HSGIFEWGLLYKETQITERESAHRKHN 93
>gi|198437817|ref|XP_002130165.1| PREDICTED: similar to UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N-acetylgalactosaminyltransferase 7 [Ciona
intestinalis]
Length = 647
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/250 (46%), Positives = 156/250 (62%), Gaps = 4/250 (1%)
Query: 189 PVFKADGKLGNLE-PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
P F D LGN E + + G GE G+A L + + + +GE+G N S+ IS
Sbjct: 90 PKFVND-DLGNYELKAPDQKRAGAGEYGEAVQLDSSLDSQVKSVIGEFGFNTVASDRISL 148
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
DR DLR EECK+ DYP LP SVI+VFHNE +S LMRTVH++I TP QYL EI+++
Sbjct: 149 DRAPKDLRHEECKHIDYPSHLPSVSVIIVFHNEAWSPLMRTVHNVINNTPRQYLHEIVMI 208
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DD S K L KL++Y+ +FNG V++ RN REGLIR RS GAK+S GE++V+LDAHCE
Sbjct: 209 DDGSHKDHLGSKLDEYVTKFNGIVKVYRNDRREGLIRARSIGAKKSSGEILVYLDAHCEA 268
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENE 425
NWLPPL+ PI +D + TVP+ID ID + F D + RG ++W +K
Sbjct: 269 EPNWLPPLITPILNDHRACTVPLIDVIDGNKYTFTEQAGGDENGLARGAWDWSFQWKRIP 328
Query: 426 LPEREAKKRK 435
+ ++E +R
Sbjct: 329 ITKKEKARRN 338
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 104/207 (50%), Positives = 137/207 (66%), Gaps = 6/207 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL++Y+ +FNG V++ RN REGLIR RS GAK+S GE++V+LDAHCE NWLPPL
Sbjct: 217 LGSKLDEYVTKFNGIVKVYRNDRREGLIRARSIGAKKSSGEILVYLDAHCEAEPNWLPPL 276
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENELPEREAKK 119
+ PI +D + TVP+ID ID + F D + RG ++W +K + ++E +
Sbjct: 277 ITPILNDHRACTVPLIDVIDGNKYTFTEQAGGDENGLARGAWDWSFQWKRIPITKKEKAR 336
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R SEPY+SP AGGLFA+DR FF ELG YD GL +WGGENFELS+K+WMCGG + +VP
Sbjct: 337 RNRMSEPYRSPAMAGGLFAIDRNFFFELGLYDDGLEIWGGENFELSYKVWMCGGQLLFVP 396
Query: 180 CSRIVSLIRPVFKADGKLGNLEPPLEP 206
CSR+ V++ G GN P P
Sbjct: 397 CSRVGH----VYRLPGWRGNPPPAYVP 419
>gi|313226886|emb|CBY22031.1| unnamed protein product [Oikopleura dioica]
Length = 685
Score = 218 bits (555), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 124/254 (48%), Positives = 164/254 (64%), Gaps = 10/254 (3%)
Query: 189 PVFKADGKLGNLE--PPL-EPYKEGPGEGGKAYH-LPEAYRAAGDASLGEYGMNMETSNH 244
P +++DGK GN E P + E +GPGE G A H LPE + + +G N+ S+
Sbjct: 144 PFYRSDGKPGNWEDRPHVDESGHDGPGEHGAAVHTLPEEEEQVKEI-IKTFGFNLVNSDK 202
Query: 245 ISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
IS DR DLR +EC DYP LP SV++VFHNEG+ L+RT HS++ RTP + L EI
Sbjct: 203 ISMDRLPKDLRDKECINIDYPEKLPMVSVVVVFHNEGWGPLVRTFHSVVNRTPPELLGEI 262
Query: 305 ILVDDFS---SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
+++DD S K L LE+YI+R++GKV+L RN REGLIR RS GA+ + EV+VFL
Sbjct: 263 VIIDDGSVIKDKPHLGDPLEEYIKRWDGKVKLYRNARREGLIRARSIGAQHAIFEVLVFL 322
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGM 419
DAHCE G NWLPPL+API + +I TVP+ID ID Q + F DH+ R G +EW
Sbjct: 323 DAHCEAGYNWLPPLIAPIARNDRISTVPLIDSIDGQRYTFSGQAGGDHNGRAQGGWEWNF 382
Query: 420 LYKENELPEREAKK 433
L+K LP++EA+K
Sbjct: 383 LWKRYPLPKKEAEK 396
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 106/207 (51%), Positives = 141/207 (68%), Gaps = 6/207 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+YI+R++GKV+L RN REGLIR RS GA+ + EV+VFLDAHCE G NWLPPL
Sbjct: 277 LGDPLEEYIKRWDGKVKLYRNARREGLIRARSIGAQHAIFEVLVFLDAHCEAGYNWLPPL 336
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYKENELPEREAKK 119
+API + +I TVP+ID ID Q + F DH+ R G +EW L+K LP++EA+K
Sbjct: 337 IAPIARNDRISTVPLIDSIDGQRYTFSGQAGGDHNGRAQGGWEWNFLWKRYPLPKKEAEK 396
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ +E Y SP AGGLFA++R F +G YDPGL +WGGE +E+S+K+WMCGG + +VP
Sbjct: 397 LSHGTEMYPSPAMAGGLFAINREHFNNVGMYDPGLEIWGGEQYEISYKLWMCGGGVYFVP 456
Query: 180 CSRIVSLIRPVFKADGKLGNLEPPLEP 206
CSR+ V++ +G GN P P
Sbjct: 457 CSRVGH----VYRLEGWGGNPPPEYVP 479
>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
Length = 454
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 106/226 (46%), Positives = 150/226 (66%), Gaps = 5/226 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
GPGE G+ + DA+ E+G N S+ IS +RTI D R + CK YP++LP
Sbjct: 1 GPGENGEPVETKAEDESKKDAAYSEFGFNQFVSDQISLERTISDTRHQACKQRSYPINLP 60
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
KASV++VFHNEG+S+LMRTVH+++ R+P L+EI++VDDFS+K L QKL+DY ++ G
Sbjct: 61 KASVVIVFHNEGWSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNKDFLKQKLDDYTKKL-G 119
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
K++++R ER GLI+ R GA + GEV++FLDAHCE WLPPLL I +R+ P
Sbjct: 120 KIKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPLLERIALNRRTAVCP 179
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 434
ID ID++T++++ + D + RG F W YKE + PE AK+R
Sbjct: 180 TIDFIDHKTFQYKPM---DPYIRGTFNWRFDYKERAVRPEEMAKRR 222
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 125/187 (66%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+DY ++ GK++++R ER GLI+ R GA + GEV++FLDAHCE WLPPL
Sbjct: 107 LKQKLDDYTKKL-GKIKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPL 165
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R+ P ID ID++T++++ + D + RG F W YKE + E KR+
Sbjct: 166 LERIALNRRTAVCPTIDFIDHKTFQYKPM---DPYIRGTFNWRFDYKERAVRPEEMAKRR 222
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++ KSP AGGLFA++R FF ELG YDPG+ +WGGE +E+SFK+W CGG +E +PCS
Sbjct: 223 DPTQEVKSPVMAGGLFAINREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQLENIPCS 282
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 283 RVGHVYR 289
>gi|350597048|ref|XP_003361960.2| PREDICTED: hypothetical protein LOC100155807 [Sus scrofa]
Length = 1191
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 145/204 (71%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 807 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 866
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 867 LVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 926
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 927 MRKTKTEPYRSPAMAGGLFAIERDFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 986
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 987 PCSRVGH----IYRLEGWQGNPPP 1006
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 58/123 (47%), Positives = 86/123 (69%), Gaps = 3/123 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 374
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 807 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 866
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
L+API DR I TVP+ID I+ T+E + + D + RG ++W ML+K L RE +
Sbjct: 867 LVAPISKDRTICTVPIIDVINGNTYEIVPQGGGDEDGYARGAWDWSMLWKRVPLTPREKR 926
Query: 433 KRK 435
RK
Sbjct: 927 MRK 929
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 41/73 (56%), Gaps = 4/73 (5%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L + A AS+ E+G NM S+ I
Sbjct: 271 PVLRP-GVLGNFEPKEPEPHGVVGGPGEKAKPVVLGPELKHAVQASIKEFGFNMVASDMI 329
Query: 246 SFDRTIPDLRMEE 258
S DR++ DLR E+
Sbjct: 330 SLDRSVNDLRQED 342
>gi|313242250|emb|CBY34413.1| unnamed protein product [Oikopleura dioica]
Length = 644
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/248 (44%), Positives = 165/248 (66%), Gaps = 8/248 (3%)
Query: 196 KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 251
++GN EP GPGEGG K E + DA + EYG NM S+ IS DR
Sbjct: 81 EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
DLR EECK++ YP LP +SVI VFHNEG+S+L+R++HS+I TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199
Query: 312 SKADLD-QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 370
+K + +LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 428
W+ PL+ P+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319
Query: 429 REAKKRKY 436
RE ++++
Sbjct: 320 RERARQEH 327
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 140/200 (70%), Gaps = 6/200 (3%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
+LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV NW+ PL+ P
Sbjct: 208 RLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPNWIVPLVEP 267
Query: 65 IYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 122
+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +RE ++++
Sbjct: 268 MVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLNDRERARQEH 327
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+ PY+SP AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G + +VPCSR
Sbjct: 328 MTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQMLFVPCSR 387
Query: 183 IVSLIRPVFKADGKLGNLEP 202
+ +++ G GN P
Sbjct: 388 VGH----IYRMKGWRGNGTP 403
>gi|380792153|gb|AFE67952.1| N-acetylgalactosaminyltransferase 7, partial [Macaca mulatta]
Length = 323
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/199 (53%), Positives = 144/199 (72%), Gaps = 5/199 (2%)
Query: 189 PVFKADGKLGNLEP--PLEP-YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP P P GPGE K L ++ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GILGNFEPKEPEPPGVVGGPGEKAKPLVLGPEFKHAIQASIKEFGFNMVASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Query: 365 CEVGLNWLPPLLAPIYSDR 383
CEV +NW PL+API DR
Sbjct: 304 CEVAVNWYAPLVAPISKDR 322
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 37/69 (53%), Positives = 55/69 (79%), Gaps = 1/69 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 254 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 313
Query: 61 LLAPIYSDR 69
L+API DR
Sbjct: 314 LVAPISKDR 322
>gi|313230492|emb|CBY18708.1| unnamed protein product [Oikopleura dioica]
Length = 644
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 111/248 (44%), Positives = 165/248 (66%), Gaps = 8/248 (3%)
Query: 196 KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 251
++GN EP GPGEGG K E + DA + EYG NM S+ IS DR
Sbjct: 81 EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
DLR EECK++ YP LP +SVI VFHNEG+S+L+R++HS+I TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199
Query: 312 SKADLD-QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 370
+K + +LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 428
W+ PL+ P+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319
Query: 429 REAKKRKY 436
RE ++++
Sbjct: 320 RERARQEH 327
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 140/200 (70%), Gaps = 6/200 (3%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
+LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV NW+ PL+ P
Sbjct: 208 RLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPNWIVPLVEP 267
Query: 65 IYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 122
+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +RE ++++
Sbjct: 268 MVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLNDRERARQEH 327
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+ PY+SP AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G + +VPCSR
Sbjct: 328 MTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQMLFVPCSR 387
Query: 183 IVSLIRPVFKADGKLGNLEP 202
+ +++ G GN P
Sbjct: 388 VGH----IYRMKGWRGNGTP 403
>gi|313230491|emb|CBY18707.1| unnamed protein product [Oikopleura dioica]
Length = 510
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 111/248 (44%), Positives = 165/248 (66%), Gaps = 8/248 (3%)
Query: 196 KLGNLEPPLEPYKEGPGEGG----KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 251
++GN EP GPGEGG K E + DA + EYG NM S+ IS DR
Sbjct: 81 EIGNYEPKDWKVPAGPGEGGVEPLKLDDSTEMQKKQKDA-INEYGFNMVASDAISLDRYP 139
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
DLR EECK++ YP LP +SVI VFHNEG+S+L+R++HS+I TP + LEE++L+DD S
Sbjct: 140 ADLRHEECKHYQYPESLPASSVIFVFHNEGWSTLVRSIHSVINYTPPELLEEVVLIDDGS 199
Query: 312 SKADLD-QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 370
+K + +LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV N
Sbjct: 200 NKEHITGGRLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPN 259
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPE 428
W+ PL+ P+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +
Sbjct: 260 WIVPLVEPMVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLND 319
Query: 429 REAKKRKY 436
RE ++++
Sbjct: 320 RERARQEH 327
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 140/200 (70%), Gaps = 6/200 (3%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
+LE++I+++NG V+L RN REGLIR RS GA+++ G V+++LDAHCEV NW+ PL+ P
Sbjct: 208 RLEEHIKQWNGLVKLYRNDRREGLIRARSIGARKAVGSVLIYLDAHCEVEPNWIVPLVEP 267
Query: 65 IYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 122
+ D +I TVP++D ID T+ F ++ + ++ RG ++W +L+K L +RE ++++
Sbjct: 268 MVHDYRICTVPMVDAIDGATYVFTPQAGGDENNFARGAWDWDLLWKRIPLNDRERARQEH 327
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+ PY+SP AGGLFA+ R FF ELG YD GL +WGGENFE+S+KIWMC G + +VPCSR
Sbjct: 328 MTSPYRSPAMAGGLFAISRKFFFELGLYDEGLDIWGGENFEISYKIWMCHGQMLFVPCSR 387
Query: 183 IVSLIRPVFKADGKLGNLEP 202
+ +++ G GN P
Sbjct: 388 VGH----IYRMKGWRGNGTP 403
>gi|312087698|ref|XP_003145574.1| glycosyl transferase [Loa loa]
gi|307759263|gb|EFO18497.1| glycosyl transferase [Loa loa]
Length = 520
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 104/227 (45%), Positives = 149/227 (65%), Gaps = 2/227 (0%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + GPGE G +L + G+A + ++ MN+ S+ IS DR++PD R E+C+
Sbjct: 6 PDYSKKRTGPGEDGSGVYLTGKQKVRGEADMKKWFMNLVASDMISLDRSLPDHRHEQCRK 65
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
+YP +LP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + DL +LE
Sbjct: 66 INYPDNLPVASVVIIFTDEAWSPLMRTVHSVINRTPFKLLQEIILVDDFSQRDDLKGRLE 125
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+YI+RF KVRLIR ER+GLIR + GAKE+ G+V++FLD+HCEV WL PLLA I
Sbjct: 126 EYIKRFGNKVRLIRARERQGLIRAKLLGAKEAIGDVLIFLDSHCEVSEGWLEPLLARIKE 185
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
+R ++ P+ID I +T + + G F W + ++ + LPE
Sbjct: 186 NRSVVLCPIIDHISAETLAYSGSDRLAN--VGGFWWSLHFRWDPLPE 230
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 126/188 (67%), Gaps = 4/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +LE+YI+RF KVRLIR ER+GLIR + GAKE+ G+V++FLD+HCEV WL P
Sbjct: 119 DLKGRLEEYIKRFGNKVRLIRARERQGLIRAKLLGAKEAIGDVLIFLDSHCEVSEGWLEP 178
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I +R ++ P+ID I +T + + G F W + ++ + LPE
Sbjct: 179 LLARIKENRSVVLCPIIDHISAETLAYSGSDRLAN--VGGFWWSLHFRWDPLPEEYYGID 236
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++P +SPT AGGLFA+DR +F E+GGYDP + +WGGEN E+SF++WMCGG IE++PC
Sbjct: 237 P--TKPIRSPTMAGGLFAVDRLYFFEVGGYDPKMDIWGGENLEISFRVWMCGGGIEFIPC 294
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 295 SHVGHIFR 302
>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
Length = 561
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 144/221 (65%), Gaps = 4/221 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ GPGE GK L R G + E G N++ SN IS DR IPD+R C Y D
Sbjct: 40 RTGPGEQGKPADLTAEER--GPHAYEECGFNIKASNKISLDRAIPDIRHPNCASKKYVRD 97
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+++ FHNEG+++L+RTVHS++ R+P Q + EIILVDDFS ++ L + LEDY+ +
Sbjct: 98 LPDVSLVIPFHNEGWTTLLRTVHSVLNRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKL 157
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ KVR++R +REGLIRTR GA+ ++G+V++FLD+HCE +NWLPPLL PI ++K +
Sbjct: 158 SPKVRVVRTKQREGLIRTRLLGAQVAKGQVLIFLDSHCEANVNWLPPLLEPIALNKKTIV 217
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
P ID ID + + + + RG F+W M YK +P+
Sbjct: 218 CPNIDVIDKDDFHYET--QAGDAMRGAFDWEMYYKRIPIPD 256
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 92/187 (49%), Positives = 127/187 (67%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ + + KVR++R +REGLIRTR GA+ ++G+V++FLD+HCE +NWLPPL
Sbjct: 146 LGKDLEDYVAKLSPKVRVVRTKQREGLIRTRLLGAQVAKGQVLIFLDSHCEANVNWLPPL 205
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI ++K + P ID ID + + + + RG F+W M YK +P+ K
Sbjct: 206 LEPIALNKKTIVCPNIDVIDKDDFHYET--QAGDAMRGAFDWEMYYKRIPIPDE--IKNP 261
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+W CGG + PCS
Sbjct: 262 DPSDPFESPVMAGGLFAVDREYFEELGGYDPGLDIWGGEQYELSFKVWQCGGRMVDAPCS 321
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 322 RVGHVYR 328
>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
Length = 520
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 104/227 (45%), Positives = 149/227 (65%), Gaps = 2/227 (0%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + GPGEGG +L + G+A + ++ MN+ S+ IS DR++PD R ++C+
Sbjct: 6 PDYSKKRIGPGEGGTGVYLTGKQKVQGEADMKKWFMNVVASDLISLDRSLPDRRHKQCRK 65
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y DLP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + +L KLE
Sbjct: 66 ISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDELKGKLE 125
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+YI+RF KVRL+R ER+GLIR + GAKE+ G+V+VFLD+HCEVG WL PLLA I
Sbjct: 126 EYIKRFGDKVRLVRAPERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPLLARIKD 185
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
R + P+I+ I +T + + P H G F W + ++ + +P+
Sbjct: 186 KRSAVLCPIINHISPETLTYSANDRPAH--VGGFWWSLHFRWDPMPK 230
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 128/188 (68%), Gaps = 4/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KLE+YI+RF KVRL+R ER+GLIR + GAKE+ G+V+VFLD+HCEVG WL P
Sbjct: 119 ELKGKLEEYIKRFGDKVRLVRAPERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEP 178
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I R + P+I+ I +T + + P H G F W + ++ + +P+ +
Sbjct: 179 LLARIKDKRSAVLCPIINHISPETLTYSANDRPAH--VGGFWWSLHFRWDPMPKEYSDAD 236
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP +SPT AGGL A+DR +F E+GGYDP + +WGGEN E+SF++WMCGGS+E++PC
Sbjct: 237 P--TEPIRSPTMAGGLLAVDRLYFFEVGGYDPEMDIWGGENLEMSFRVWMCGGSVEFIPC 294
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 295 SHVGHIFR 302
>gi|261244898|ref|NP_778197.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Mus musculus]
gi|311103009|gb|ADP69005.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Mus musculus]
Length = 601
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 107/220 (48%), Positives = 145/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARF 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 S-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 194 bits (493), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 97/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLEDY+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|348566779|ref|XP_003469179.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Cavia porcellus]
Length = 601
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 147/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E + D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDSDDSAYRENGFNIFVSNNIALERSLPDIRHTNCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+
Sbjct: 136 LETLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYV 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRILRTRKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKEKLEEYVARFS-KVRILRTRKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|313227738|emb|CBY22887.1| unnamed protein product [Oikopleura dioica]
Length = 1030
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 105/227 (46%), Positives = 147/227 (64%), Gaps = 4/227 (1%)
Query: 212 GEGG-KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
GEGG L + A+LG +G NM S+ ++ DR DLRMEECK WDYP LP
Sbjct: 493 GEGGLSPIRLTSEDQTKVTAALGLWGFNMVASDKVNMDRVPADLRMEECKRWDYPDKLPA 552
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL-DQKLEDYIQRFNG 329
SVILVFHNEGFS+L+RTVHSI+ +P + L E++++DD S++ + + ++ YI+R++G
Sbjct: 553 VSVILVFHNEGFSTLLRTVHSIVNYSPPEMLHEVVMLDDGSTREYITNGTIDRYIERWDG 612
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
V++ N +REGLIR R+ G K S G V+VFLDAHCEV NWLPPL+ PI + K+ ++P
Sbjct: 613 LVKIFHNEKREGLIRARTIGGKHSTGSVLVFLDAHCEVEPNWLPPLITPIAKNYKVSSLP 672
Query: 390 VIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENELPEREAKKR 434
+ID ID T+ F D + RG ++W +K L +RE +R
Sbjct: 673 MIDAIDGNTYVFEPQQGGDENNLARGAWDWNFDWKRIPLNQREKARR 719
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 91/185 (49%), Positives = 125/185 (67%), Gaps = 2/185 (1%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
++ YIER++G V++ N +REGLIR R+ G K S G V+VFLDAHCEV NWLPPL+ PI
Sbjct: 603 IDRYIERWDGLVKIFHNEKREGLIRARTIGGKHSTGSVLVFLDAHCEVEPNWLPPLITPI 662
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH--YRGIFEWGMLYKENELPEREAKKRKYN 123
+ K+ ++P+ID ID T+ F D + RG ++W +K L +RE +R
Sbjct: 663 AKNYKVSSLPMIDAIDGNTYVFEPQQGGDENNLARGAWDWNFDWKRIPLNQREKARRATI 722
Query: 124 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
+EP++SP AGGLFA+ R +F ELG YD L +WGGENFELS+K+W CGG + +VPCSR+
Sbjct: 723 TEPFRSPAMAGGLFAISRKWFTELGWYDDKLEIWGGENFELSYKLWQCGGELLFVPCSRV 782
Query: 184 VSLIR 188
+ R
Sbjct: 783 GHIYR 787
>gi|344288241|ref|XP_003415859.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Loxodonta africana]
Length = 601
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 107/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EALRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 97/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLEDY+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|334331052|ref|XP_001372346.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Monodelphis domestica]
Length = 573
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 106/220 (48%), Positives = 146/220 (66%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 53 RSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLEK 110
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF
Sbjct: 111 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKDKLEEYMARF 170
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I +RK +
Sbjct: 171 S-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNRKTIV 229
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 230 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 267
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPL
Sbjct: 159 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPL 217
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 218 LNQIALNRKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 273
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 274 DPSDPFESPVMAGGLFAVDRRWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCS 333
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 334 RVGHIYR 340
>gi|345790655|ref|XP_543189.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Canis lupus
familiaris]
Length = 601
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 107/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E R D++ E G N+ SN I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|444727227|gb|ELW67729.1| N-acetylgalactosaminyltransferase 7 [Tupaia chinensis]
Length = 606
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 100/204 (49%), Positives = 143/204 (70%), Gaps = 7/204 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L ++L++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 62 LKERLDEYIKMWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 121
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 122 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLNHKEKA 181
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGGENFE+S+KIW CGG + +V
Sbjct: 182 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFV 241
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP 202
PCSR+ +++ +G GN P
Sbjct: 242 PCSRVGH----IYRLEGWQGNPPP 261
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 92/181 (50%), Positives = 132/181 (72%), Gaps = 3/181 (1%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
CKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+L+DDFS+K L +
Sbjct: 5 CKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKE 64
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPPLLA 377
+L++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW PL+A
Sbjct: 65 RLDEYIKMWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVA 124
Query: 378 PIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PI DR TVP+ID ID + E + + D RG ++W +L+K L +E KRK
Sbjct: 125 PISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLNHKEKAKRK 184
Query: 436 Y 436
+
Sbjct: 185 H 185
>gi|114596861|ref|XP_001155128.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1 [Pan
troglodytes]
Length = 601
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|300796651|ref|NP_001178227.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
Length = 601
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 146/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|332217746|ref|XP_003258022.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Nomascus leucogenys]
Length = 601
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMLDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|395840008|ref|XP_003792862.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Otolemur garnettii]
Length = 600
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 146/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 77 EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 134
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 135 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 194
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ +VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 195 ARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 253
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 254 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 294
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ +VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 186 LKDKLEEYMARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 245 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LRRA 300
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 301 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 360
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 361 RVGHIYR 367
>gi|426220611|ref|XP_004004508.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Ovis aries]
Length = 601
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 146/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKEKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|170582702|ref|XP_001896248.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158596593|gb|EDP34915.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 520
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 102/227 (44%), Positives = 147/227 (64%), Gaps = 2/227 (0%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + GPGE G +L + G+A + ++ MN+ S+ IS DR++PD R ++C
Sbjct: 6 PDYSKKRIGPGEDGTGVYLTGKQKVQGEADMKKWFMNLVASDLISLDRSLPDHRHKQCHK 65
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y DLP ASV+++F +E +S LMRTVHS+I RTP + L+EIILVDDFS + +L +KLE
Sbjct: 66 ISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDELKEKLE 125
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+YI+RF KVRL+R ER+GLIR + GAKE+ G+V+VFLD+HCEVG WL PLLA I
Sbjct: 126 EYIKRFGNKVRLVRALERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPLLARIKD 185
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
R + P+I+ I +T + + P + G F W + + + +P+
Sbjct: 186 KRSAVLCPIINHISAETLTYSANDRPTN--VGGFSWSLHFLWDPMPK 230
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 128/188 (68%), Gaps = 4/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +KLE+YI+RF KVRL+R ER+GLIR + GAKE+ G+V+VFLD+HCEVG WL P
Sbjct: 119 ELKEKLEEYIKRFGNKVRLVRALERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEP 178
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I R + P+I+ I +T + + P + G F W + + + +P+
Sbjct: 179 LLARIKDKRSAVLCPIINHISAETLTYSANDRPTN--VGGFSWSLHFLWDPMPKEYFDAD 236
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP +SPT AGGL A+DR++F E+GGYDP + +WGGEN E+SF++WMCGGSIE++PC
Sbjct: 237 P--TEPIRSPTMAGGLLAVDRSYFFEVGGYDPKMDIWGGENLEMSFRVWMCGGSIEFIPC 294
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 295 SHVGHIFR 302
>gi|395840006|ref|XP_003792861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Otolemur garnettii]
Length = 601
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 146/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E R D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTEEDR--DDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ +VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ +VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LRRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|194018457|ref|NP_001030017.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
gi|296434516|sp|Q49A17.2|GLTL6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase-like 6;
AltName: Full=Polypeptide GalNAc transferase 17;
Short=GalNAc-T17; Short=pp-GaNTase 17; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 17;
AltName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 17; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 17
gi|311103007|gb|ADP69004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Homo sapiens]
Length = 601
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|86475571|emb|CAF25036.1| pp-GalNAc-transferase 17 [Homo sapiens]
Length = 584
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 61 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 118
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 178
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 278
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 170 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 228
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 229 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 284
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 285 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 344
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 345 RVGHIYR 351
>gi|354484373|ref|XP_003504363.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Cricetulus griseus]
Length = 555
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/220 (48%), Positives = 145/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 35 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 92
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+ RF
Sbjct: 93 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIVEIILVDDFSDREHLKDKLEDYMARF 152
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 153 S-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 211
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK+ +P
Sbjct: 212 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKKIPIP 249
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 97/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLEDY+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 141 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 199
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK+ +P +R
Sbjct: 200 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKKIPIPPE--LQRA 255
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 256 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 315
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 316 RVGHIYR 322
>gi|426346013|ref|XP_004040685.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Gorilla gorilla gorilla]
Length = 555
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 32 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 89
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 90 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 149
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 150 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 208
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 209 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 249
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 141 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 199
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 200 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 255
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 256 DPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 315
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 316 RVGHIYR 322
>gi|410956556|ref|XP_003984908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Felis catus]
Length = 601
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|296195172|ref|XP_002745263.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Callithrix jacchus]
Length = 601
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|109076171|ref|XP_001084788.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1
[Macaca mulatta]
gi|355687723|gb|EHH26307.1| hypothetical protein EGK_16237 [Macaca mulatta]
Length = 601
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|403295707|ref|XP_003938772.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Saimiri boliviensis boliviensis]
Length = 601
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|109076173|ref|XP_001084905.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 2
[Macaca mulatta]
Length = 584
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 145/223 (65%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 61 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 118
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 178
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 278
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 170 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 228
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 229 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 284
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 285 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 344
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 345 RVGHIYR 351
>gi|149698080|ref|XP_001498934.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Equus
caballus]
Length = 601
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 144/223 (64%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKRREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 128/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKRREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|402870847|ref|XP_003899411.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Papio anubis]
Length = 601
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 144/223 (64%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|402870849|ref|XP_003899412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Papio anubis]
Length = 584
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 144/223 (64%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN I+ +R++PD+R CK+ Y
Sbjct: 61 EAMRSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMY 118
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 119 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYM 178
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 179 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 237
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 238 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 278
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 170 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 228
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 229 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 284
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 285 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 344
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 345 RVGHIYR 351
>gi|449679600|ref|XP_004209371.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Hydra magnipapillata]
Length = 565
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 105/236 (44%), Positives = 149/236 (63%), Gaps = 6/236 (2%)
Query: 201 EPPLEPYKEGPGEGGKAY-HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
EP + PGE G A PE Y A YG N TS+ ISF+R++PD R +EC
Sbjct: 37 EPTGISNQSSPGEQGIAVVTSPEDYGKRNQA-YTLYGFNQFTSDKISFNRSLPDPRPQEC 95
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K Y LP SV+++FHNEG+S+L+RTVHS++ R+P++ L EIIL DD+S K L ++
Sbjct: 96 KITKYQSRLPTVSVVIIFHNEGWSTLLRTVHSVLNRSPSKLLHEIILCDDYSQKEHLKKQ 155
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LEDYI + K++L+R +EREGLIR R GA + G++I+FLD+HCE + WLPPL++ I
Sbjct: 156 LEDYIIPY-PKIKLVRTSEREGLIRARVHGANHANGDIIIFLDSHCEANVGWLPPLVSEI 214
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ + +T P +D ID+ ++ +R V D + RG F W YKE + E + RK
Sbjct: 215 EKNYRCVTCPTVDFIDHDSFYYRGV---DPYIRGTFNWRFDYKERGITEHQKAARK 267
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 126/187 (67%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDYI + K++L+R +EREGLIR R GA + G++I+FLD+HCE + WLPPL
Sbjct: 152 LKKQLEDYIIPY-PKIKLVRTSEREGLIRARVHGANHANGDIIIFLDSHCEANVGWLPPL 210
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
++ I + + +T P +D ID+ ++ +R V D + RG F W YKE + E + RK
Sbjct: 211 VSEIEKNYRCVTCPTVDFIDHDSFYYRGV---DPYIRGTFNWRFDYKERGITEHQKAARK 267
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+E +SP AGGLFA+ + F+ ELG YDPG+ VWGGE +E+SFK+WMCGG + +PCS
Sbjct: 268 SVTEGVRSPVMAGGLFAISKKFWEELGKYDPGMYVWGGEQYEISFKLWMCGGEMLNMPCS 327
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 328 RVGHVYR 334
>gi|118090108|ref|XP_420520.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Gallus gallus]
Length = 601
Score = 201 bits (512), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 144/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMVRF 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKEKLEEYMVRF-AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
Length = 675
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 142/220 (64%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK + + +A R D + E G N+ S+ IS +R+IPD+R CK Y
Sbjct: 158 RSGNGEQGKPFPMTDADRV--DQAYRENGFNIYVSDRISLNRSIPDIRHPNCKQKLYAEK 215
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+ FHNEG+SSL+RTVHS++ R+P Q + EIILVDDFS + L Q LE+Y+ R
Sbjct: 216 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDREHLKQPLEEYMVRL 275
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 276 -PKVRILRTKKREGLIRTRLLGATAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKTIV 334
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 335 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIP 372
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 94/187 (50%), Positives = 127/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+ R KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPL
Sbjct: 264 LKQPLEEYMVRLP-KVRILRTKKREGLIRTRLLGATAAKGEVITFLDSHCEANVNWLPPL 322
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K+
Sbjct: 323 LDRIAQNRKTIVCPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKEDP 380
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 381 --SEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGCMEDIPCS 438
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 439 RVGHIYR 445
>gi|224049734|ref|XP_002187605.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
1 [Taeniopygia guttata]
gi|449500484|ref|XP_004176221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
2 [Taeniopygia guttata]
Length = 601
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 144/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEA 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMLRF 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKEKLEEYMLRF-AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMYDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|351699379|gb|EHB02298.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Heterocephalus glaber]
Length = 522
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 143/218 (65%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y LP
Sbjct: 1 GKGEHGKPYPLTE--EDGDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLERLP 58
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF+
Sbjct: 59 NTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEEYMARFS- 117
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P
Sbjct: 118 KVRILRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCP 177
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ + + + + RG F+W M YK +P
Sbjct: 178 MIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 213
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 105 LKDKLEEYMARFS-KVRILRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 163
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 164 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 219
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 220 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 279
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 280 RVGHIYR 286
>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 607
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 143/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK + + E R D + E G N+ SN IS +R++PD+R E C+ Y
Sbjct: 88 RAGNGEQGKPFPVTETDRV--DQAYRENGFNIYVSNRISLNRSLPDIRHENCRQKLYAEK 145
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP ++I+ FHNEG+SSL+RTVHS+I R+P + + EIILVDDFS K L LE+Y++RF
Sbjct: 146 LPNTTIIIPFHNEGWSSLLRTVHSVINRSPPRLVAEIILVDDFSDKEHLKVALEEYMKRF 205
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 206 P-KVRILRTKKREGLIRTRLLGAGAAKGEVITFLDSHCEANVNWLPPLLDRIVQNRKTIV 264
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 265 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIP 302
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 95/188 (50%), Positives = 130/188 (69%), Gaps = 7/188 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++RF KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPL
Sbjct: 194 LKVALEEYMKRFP-KVRILRTKKREGLIRTRLLGAGAAKGEVITFLDSHCEANVNWLPPL 252
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P A+ R
Sbjct: 253 LDRIVQNRKTIVCPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIP---AEMRT 307
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +EP++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PC
Sbjct: 308 DDPTEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHIYR 375
>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
Length = 577
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 100/240 (41%), Positives = 149/240 (62%), Gaps = 2/240 (0%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
K+ + P + GPGE G A HL + G+A + ++ MN+ S+ +S DR+IPD R
Sbjct: 59 KMRHKRPDYSKQRSGPGENGAAVHLSGKEKEKGEADMKKWFMNVVASDKLSMDRSIPDTR 118
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
EC+ Y DLP ASV+++F +E ++ L+RTVHS++ R+P L E+IL+DDFS + +
Sbjct: 119 HAECRSVHYDDDLPSASVVIIFTDEAWTPLLRTVHSVVNRSPLHLLHEVILLDDFSQREE 178
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L KL++YI+RF G V+LIR ER GLIR + GA E+ GEVIVFLD+HCE WL PL
Sbjct: 179 LKGKLDEYIKRFGGIVKLIRKKERHGLIRAKLAGAHEATGEVIVFLDSHCEANEGWLEPL 238
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LA I R + P+ID I +T ++ + + + G F W + ++ + + + E +RK
Sbjct: 239 LARIKEKRTAVLCPIIDYISAETMQYSG--DANVNAVGGFWWSLHFRWDSIGKAERDRRK 296
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 95/203 (46%), Positives = 133/203 (65%), Gaps = 4/203 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KL++YI+RF G V+LIR ER GLIR + GA E+ GEVIVFLD+HCE WL P
Sbjct: 178 ELKGKLDEYIKRFGGIVKLIRKKERHGLIRAKLAGAHEATGEVIVFLDSHCEANEGWLEP 237
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I R + P+ID I +T ++ + + + G F W + ++ + + + E +R
Sbjct: 238 LLARIKEKRTAVLCPIIDYISAETMQYSG--DANVNAVGGFWWSLHFRWDSIGKAERDRR 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K EP +SPT AGGL A +R +FLE+GGYDPG+ +WGGEN E+SF++WMCGGSIE++PC
Sbjct: 296 KSAIEPVRSPTMAGGLLAANREYFLEVGGYDPGMDIWGGENLEISFRVWMCGGSIEFIPC 355
Query: 181 SRIVSLIRP--VFKADGKLGNLE 201
S + + R + G GNL+
Sbjct: 356 SHVGHIFRAGHPYNMTGPGGNLD 378
>gi|291385920|ref|XP_002709516.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Oryctolagus cuniculus]
Length = 601
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 105/223 (47%), Positives = 144/223 (64%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSGKGEHGKPYPLTE--EDHDDSAYKENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA + GEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMAGGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 94/187 (50%), Positives = 128/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA + GEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMAGGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLF++DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFSVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Ornithorhynchus anatinus]
Length = 558
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 148/239 (61%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L PP++ EGPGE GK +P+ + N+ S I+F+R++PD+R+E C
Sbjct: 46 LRPPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASERIAFNRSLPDVRLEGC 105
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 106 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 165
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G VI FLDAHCE + WL PLLA I
Sbjct: 166 LESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLARI 225
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 226 KFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ + V +IR +R GLIR R +GA S+G VI FLDAHCE + WL PL
Sbjct: 162 LKRPLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|327278031|ref|XP_003223766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Anolis carolinensis]
Length = 602
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 144/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEQGKPYPLTE--EDNDDSAYRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L +KLE+Y+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPNSLIAEIILVDDFSDREHLKEKLEEYMARF 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -VKVRIVRTKKREGLIRTRLLGASIAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 258 CPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKEKLEEYMARF-VKVRIVRTKKREGLIRTRLLGASIAKGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRT 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F +LGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVNRKWFWDLGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|397506054|ref|XP_003823551.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Pan paniscus]
Length = 518
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 104/216 (48%), Positives = 142/216 (65%), Gaps = 5/216 (2%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y LP
Sbjct: 2 GEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLERLPNT 59
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF+ KV
Sbjct: 60 SIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KV 118
Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 391
R++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P+I
Sbjct: 119 RIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPMI 178
Query: 392 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
D ID+ + + + + RG F+W M YK +P
Sbjct: 179 DVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 212
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 104 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 162
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 163 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 218
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 219 DPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 278
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 279 RVGHIYR 285
>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Nomascus leucogenys]
Length = 557
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/241 (43%), Positives = 150/241 (62%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G++ P+ +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E
Sbjct: 45 GDILKPITKNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 105 GCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 163 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 677
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 142/220 (64%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GKA+ L +A R D + E G N+ S+ IS +R++PD+R CK Y
Sbjct: 158 RSGNGEQGKAFPLTDADRV--DQAYRENGFNIFVSDRISLNRSVPDIRHPNCKQKLYAER 215
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG+SSL+RTVHS++ R+P Q + EIILVDDFS K L LE+Y+ R
Sbjct: 216 LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDKDHLKGALEEYMVRL 275
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 276 P-KVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANINWLPPLLDRIALNRKTIV 334
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 335 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIP 372
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 94/187 (50%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y+ R KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPL
Sbjct: 264 LKGALEEYMVRLP-KVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANINWLPPL 322
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K
Sbjct: 323 LDRIALNRKTIVCPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPAELQKNDP 380
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 381 --SEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCS 438
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 439 RVGHIYR 445
>gi|209364560|ref|NP_001129228.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Rattus
norvegicus]
Length = 601
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/220 (47%), Positives = 142/220 (64%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 81 RSGKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLEDY+ RF
Sbjct: 139 LPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARF 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K +
Sbjct: 199 -PIVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIV 257
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 258 CPMIDVIDHSHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 97/187 (51%), Positives = 127/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLEDY+ RF VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEDYMARF-PIVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHSHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|260789758|ref|XP_002589912.1| hypothetical protein BRAFLDRAFT_156854 [Branchiostoma floridae]
gi|229275097|gb|EEN45923.1| hypothetical protein BRAFLDRAFT_156854 [Branchiostoma floridae]
Length = 292
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 93/201 (46%), Positives = 135/201 (67%), Gaps = 2/201 (0%)
Query: 228 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 287
G + E G N++ SN IS DR IPD+R C Y DLP S+++ FHNEG+++L+R
Sbjct: 7 GPHAYEECGFNIKASNKISLDRAIPDIRHPNCASKKYVRDLPDVSLVIPFHNEGWTTLLR 66
Query: 288 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 347
TVHS++ R+P Q + EIILVDDFS ++ L + LEDY+ + + KVR++R +REGLIRTR
Sbjct: 67 TVHSVLNRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKLSPKVRVVRTKQREGLIRTRL 126
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA+ ++G+V++FLD+HCE +NWLPPLL PI ++K + P ID ID + + + +
Sbjct: 127 LGAQVAKGQVLIFLDSHCEANVNWLPPLLEPIALNKKTIVCPNIDVIDKDDFHYET--QA 184
Query: 408 DHHYRGIFEWGMLYKENELPE 428
RG F+W M YK +P+
Sbjct: 185 GDAMRGAFDWEMYYKRIPIPD 205
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 92/187 (49%), Positives = 127/187 (67%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ + + KVR++R +REGLIRTR GA+ ++G+V++FLD+HCE +NWLPPL
Sbjct: 95 LGKDLEDYVAKLSPKVRVVRTKQREGLIRTRLLGAQVAKGQVLIFLDSHCEANVNWLPPL 154
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI ++K + P ID ID + + + + RG F+W M YK +P+ K
Sbjct: 155 LEPIALNKKTIVCPNIDVIDKDDFHYET--QAGDAMRGAFDWEMYYKRIPIPDE--IKNP 210
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+W CGG + PCS
Sbjct: 211 DPSDPFESPVMAGGLFAVDREYFEELGGYDPGLDIWGGEQYELSFKVWQCGGRMVDAPCS 270
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 271 RVGHVYR 277
>gi|71297071|gb|AAH47551.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
Length = 601
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/223 (47%), Positives = 144/223 (64%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + GE GK Y L E D++ E G N+ SN+I+ +R++PD+R CK+ Y
Sbjct: 78 EAMRSRKGEHGKPYPLTE--EDHDDSAYRENGFNIFVSNNIALERSLPDIRHANCKHKMY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREHLKDKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 ARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIPLNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIPLNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 302 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|390348396|ref|XP_787966.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 403
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 97/182 (53%), Positives = 126/182 (69%), Gaps = 7/182 (3%)
Query: 10 IERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 69
+ + GKV+L RN++REGLIR+R GA++SRG+V+ FLDAHCE NWL PLL I +R
Sbjct: 3 VPQLQGKVKLYRNSKREGLIRSRIFGAEQSRGQVLTFLDAHCECSPNWLVPLLTEIALNR 62
Query: 70 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR---KYNSEP 126
+ P +D I +E+RS + D RG +W YK +P +++R KY SEP
Sbjct: 63 TTVVCPTVDSISADNFEYRS--QGDGLCRGAMDWDFWYKR--IPVDLSRQRLGLKYQSEP 118
Query: 127 YKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSL 186
Y SP AGGLFA+DR FF ELGGYDPGL +WGGENFE+SFK WMCGGS+++VPCSR+ +
Sbjct: 119 YDSPMMAGGLFALDREFFFELGGYDPGLQIWGGENFEISFKAWMCGGSLKFVPCSRVGHV 178
Query: 187 IR 188
R
Sbjct: 179 YR 180
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 46/99 (46%), Positives = 63/99 (63%), Gaps = 2/99 (2%)
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+ + GKV+L RN++REGLIR+R GA++SRG+V+ FLDAHCE NWL PLL I +R
Sbjct: 3 VPQLQGKVKLYRNSKREGLIRSRIFGAEQSRGQVLTFLDAHCECSPNWLVPLLTEIALNR 62
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+ P +D I +E+RS + D RG +W YK
Sbjct: 63 TTVVCPTVDSISADNFEYRS--QGDGLCRGAMDWDFWYK 99
>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
Length = 553
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 102/241 (42%), Positives = 151/241 (62%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEIIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+ LE+Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLENYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDKTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 125/188 (66%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLENYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKADRRTVVCPIIDVISDDTFEYMA--GSDKTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Takifugu rubripes]
Length = 600
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 104/220 (47%), Positives = 144/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GKA+ L ++ R D + E G N+ S+ IS +R++PD+R +CK Y
Sbjct: 81 RTGNGEQGKAFPLTDSDRV--DQAYRENGFNIYISDRISLNRSLPDIRHADCKQKLYAEK 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+ FHNEG+SSL+RTVHS++ R+P Q + E+ILVDDFS K L LE+Y++R
Sbjct: 139 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAELILVDDFSDKEHLKVPLEEYMKRM 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 199 -PKVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKSIV 257
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 258 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 127/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++R KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPL
Sbjct: 187 LKVPLEEYMKRMP-KVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LDRIAQNRKSIVCPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIPAE--MQRD 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 302 DPSQPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oryzias latipes]
Length = 557
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 104/244 (42%), Positives = 151/244 (61%), Gaps = 7/244 (2%)
Query: 195 GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 254
G+ +L P ++GPGEGGK +P+ + N+ S I+ +R++PD+
Sbjct: 44 GRADSLSRP----RDGPGEGGKPVVIPKENQEKMKEMFKINQFNLMASEMIALNRSLPDV 99
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R+E CK YP DLP+ SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S +
Sbjct: 100 RLEGCKNKLYPDDLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPRSLLEEIVLVDDASERD 159
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LE Y++R VR++R +R GLIR R +GA S G+VI FLDAHCE L WL P
Sbjct: 160 FLKRQLEQYVRRLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTLGWLEP 219
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LL I D++ + P+ID I T+E+ + D Y G F W + ++ +P+RE +R
Sbjct: 220 LLTRIKQDKRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 276
Query: 435 KYNR 438
K +R
Sbjct: 277 KGDR 280
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+ R VR++R +R GLIR R +GA S G+VI FLDAHCE L WL PL
Sbjct: 161 LKRQLEQYVRRLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTLGWLEPL 220
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D++ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 221 LTRIKQDKRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 277
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 278 GDRTIPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 337
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 338 SHVGHVFR 345
>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Anolis carolinensis]
Length = 557
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 105/243 (43%), Positives = 150/243 (61%), Gaps = 3/243 (1%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+ G PL+ +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R
Sbjct: 43 QAGQTMIPLQRNQEGPGEMGKAVIIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVR 102
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
+E CK YP +LP SV++VFHNE +S+L+RT++S+I R P L EIILVDD S +
Sbjct: 103 LEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDF 162
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L LE+Y++ V+++R +R GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 163 LKVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 222
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LA I DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 436 YNR 438
+R
Sbjct: 280 GDR 282
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/184 (47%), Positives = 123/184 (66%), Gaps = 4/184 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y++ V+++R +R GLIR R RGA S+G+VI FLDAHCE L WL PLLA I
Sbjct: 167 LENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 226
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 124
DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 227 KEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 283
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +
Sbjct: 284 LPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 343
Query: 185 SLIR 188
+ R
Sbjct: 344 HVFR 347
>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Takifugu rubripes]
Length = 552
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 103/220 (46%), Positives = 142/220 (64%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GKA+ + +A R D + E G N+ S+ IS +R++PD+R CK Y
Sbjct: 33 RSGNGEQGKAFPMTDADRV--DQAYRENGFNIYVSDRISLNRSVPDIRHPNCKQKLYAEK 90
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+ FHNEG+SSL+RTVHS++ R+P Q + E+ILVDDFS K L L++Y+ R
Sbjct: 91 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEVILVDDFSDKEHLKVPLDEYMVRL 150
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 151 -PKVRILRTKKREGLIRTRLLGAARAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKTIV 209
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 210 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIP 247
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L++Y+ R KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPL
Sbjct: 139 LKVPLDEYMVRL-PKVRILRTKKREGLIRTRLLGAARAKGEVITFLDSHCEANVNWLPPL 197
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K+
Sbjct: 198 LDRIAQNRKTIVCPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPLELQKEDP 255
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E PCS
Sbjct: 256 --SEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDTPCS 313
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 314 RVGHIYR 320
>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Ailuropoda melanoleuca]
Length = 557
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 106/241 (43%), Positives = 149/241 (61%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G + PL+ EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E
Sbjct: 45 GFIHIPLQDPHEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 105 GCKTKIYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 163 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Takifugu rubripes]
Length = 556
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 105/249 (42%), Positives = 153/249 (61%), Gaps = 7/249 (2%)
Query: 194 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
D K G+L P L EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKDGSLLPALRAVISRRHEGPGEMGKAVVIPKDEQEKMKELFKINQFNLMASDMIALNR 95
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
++PD+R++ CK YP D+P S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD
Sbjct: 96 SLPDVRLDGCKTKVYPDDVPNTSIVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDD 155
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L +KLE+Y++ VR++R +R GLIR R RGA ++G+VI FLDAHCE +
Sbjct: 156 ASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTV 215
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLLA I DR + P+ID I +T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 430 EAKKRKYNR 438
E +RK +R
Sbjct: 273 EMDRRKGDR 281
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/188 (46%), Positives = 125/188 (66%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE+Y+ VR++R +R GLIR R RGA ++G+VI FLDAHCE + WL PL
Sbjct: 162 LKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
Length = 398
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 97/187 (51%), Positives = 132/187 (70%), Gaps = 3/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE YI +F KV+LIR +REGLIR R GAK+++GEV+VFLD+HCE L WL PL
Sbjct: 97 LGSKLESYISQFT-KVQLIRAPKREGLIRARLIGAKQAKGEVLVFLDSHCEANLGWLEPL 155
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R I+ P I+ ID +T F +E + RGIF W + +K +PE E ++RK
Sbjct: 156 LARIGENRSIVVTPDIEVIDLRT--FGYTHEHGANNRGIFNWELTFKWRGIPEYERRRRK 213
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+S+P +SPT AGGLFA+D+++F E+G YD + WGGEN E+SF+IWMCGGS+E +PCS
Sbjct: 214 SDSDPIRSPTMAGGLFAIDKSYFYEIGSYDTEMSFWGGENVEISFRIWMCGGSLEIIPCS 273
Query: 182 RIVSLIR 188
++ + R
Sbjct: 274 KVGHVFR 280
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 99/204 (48%), Positives = 133/204 (65%), Gaps = 6/204 (2%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
Y N S+ ++ DR+IPD R + C YP LP ASVI++FHNE +S+L+RTVHS++
Sbjct: 13 YQFNELASSKVALDRSIPDNRPQSCLSLSYPTKLPTASVIIIFHNEAWSTLLRTVHSVLA 72
Query: 295 RTPAQYLEEIILVDDFS---SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
R+P L EI+LVDD S + L KLE YI +F KV+LIR +REGLIR R GAK
Sbjct: 73 RSPPYLLREIVLVDDHSRLDTYGHLGSKLESYISQFT-KVQLIRAPKREGLIRARLIGAK 131
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+++GEV+VFLD+HCE L WL PLLA I +R I+ P I+ ID +T F +E +
Sbjct: 132 QAKGEVLVFLDSHCEANLGWLEPLLARIGENRSIVVTPDIEVIDLRT--FGYTHEHGANN 189
Query: 412 RGIFEWGMLYKENELPEREAKKRK 435
RGIF W + +K +PE E ++RK
Sbjct: 190 RGIFNWELTFKWRGIPEYERRRRK 213
>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Ornithorhynchus anatinus]
Length = 559
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 103/241 (42%), Positives = 149/241 (61%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G++ P++ EGPGE GK +P+ + N+ S I+F+R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASERIAFNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+ LE Y+++ V +IR +R GLIR R +GA S+G VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ + V +IR +R GLIR R +GA S+G VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKFDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|47225457|emb|CAG11940.1| unnamed protein product [Tetraodon nigroviridis]
Length = 534
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
++GPGEGGK +P+ + N+ S I+ +R++PD+R+E CK YP +
Sbjct: 79 RDGPGEGGKPVVIPKENQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDN 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP+ SV++VFHNE +S+L+RTVHS+I R+P LEEIILVDD S + L + LE Y++R
Sbjct: 139 LPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPLEQYVRRL 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
VR++R +R GLIR R +GA S G+VI FLDAHCE WL PLLA I DRK +
Sbjct: 199 EVPVRVVRMEQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLARIKKDRKTVV 258
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 259 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 306
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/170 (45%), Positives = 109/170 (64%), Gaps = 4/170 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ R VR++R +R GLIR R +GA S G+VI FLDAHCE WL PL
Sbjct: 187 LKRPLEQYVRRLEVPVRVVRMEQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPL 246
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 247 LARIKKDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 303
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 170
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++ M
Sbjct: 304 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRLQM 353
>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Takifugu rubripes]
Length = 557
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 105/238 (44%), Positives = 147/238 (61%), Gaps = 3/238 (1%)
Query: 201 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 260
E L ++GPGEGGK +P+ + N+ S I+ +R++PD+R+E CK
Sbjct: 46 EDTLTRPRDGPGEGGKPVVIPKENQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCK 105
Query: 261 YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKL 320
YP +LP+ SV++VFHNE +S+L+RTVHS+I R+P LEEIILVDD S + L + L
Sbjct: 106 NKLYPDNLPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPL 165
Query: 321 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 380
E Y++R VR++R +R GLIR R +GA S G+VI FLDAHCE WL PLLA I
Sbjct: 166 EQYVRRLEVPVRVVRMDQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLARIK 225
Query: 381 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 226 KDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 280
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 113/188 (60%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ R VR++R +R GLIR R +GA S G+VI FLDAHCE WL PL
Sbjct: 161 LKRPLEQYVRRLEVPVRVVRMDQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPL 220
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 221 LARIKKDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 277
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P + AGG R +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 278 GDRTLPVRWVRCAGGXXXXXRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 337
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 338 SHVGHVFR 345
>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Sarcophilus harrisii]
Length = 559
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +RT+PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVAIPKEDQEKMKEMFKINQFNLMASEMIALNRTLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ + V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|432098984|gb|ELK28470.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Myotis davidii]
Length = 501
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oreochromis niloticus]
Length = 557
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 101/231 (43%), Positives = 146/231 (63%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
++GPGEGGK +P+ + N+ S I+ +R++PD+R+E CK YP +
Sbjct: 53 RDGPGEGGKPVVIPKEQQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDN 112
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP+ SV++VFHNE +++L+RTVHS+I R+P LEEI+LVDD S + L Q+LE Y+++
Sbjct: 113 LPRTSVVIVFHNEAWTTLLRTVHSVIDRSPHTLLEEIVLVDDASERDFLKQQLERYVRKL 172
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
VR++R +R GLIR R +GA S G+VI FLDAHCE WL PLLA I DRK +
Sbjct: 173 EVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLARIKQDRKTVV 232
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 233 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 280
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+ + VR++R +R GLIR R +GA S G+VI FLDAHCE WL PL
Sbjct: 161 LKQQLERYVRKLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPL 220
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 221 LARIKQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 277
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 278 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 337
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 338 SHVGHVFR 345
>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Otolemur garnettii]
Length = 558
Score = 197 bits (502), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 107/239 (44%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L PP YK GPGE GKA +P+ + N+ S+ I+ +R++PD+R+E C
Sbjct: 49 LIPPQRDYK-GPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGC 107
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 108 KTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 167
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y++ + V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I
Sbjct: 168 LENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 227
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 228 KEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 283
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ + V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 164 LKLTLENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 223
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 224 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 280
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 281 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 340
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 341 SHVGHVFR 348
>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mustela putorius
furo]
Length = 461
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 103/241 (42%), Positives = 149/241 (61%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVLEPIQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+ LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Canis lupus familiaris]
gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Ailuropoda melanoleuca]
gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
Length = 559
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
catus]
Length = 559
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
Length = 559
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
aries]
Length = 559
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Equus caballus]
Length = 559
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
taurus]
gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
taurus]
gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
Length = 559
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Sus scrofa]
Length = 559
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQDKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
Length = 559
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQDKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Danio rerio]
Length = 556
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 103/242 (42%), Positives = 147/242 (60%), Gaps = 3/242 (1%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
L L + E PGE GKA +P+ + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVMSRAHEAPGEMGKAVVIPKEEQDKMKELFKINQFNLMASDMIALNRSLPDVRL 102
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
+ CK YP DLP S+++VFHNE +S+L+RTVHS I R+P Q L EI+LVDD S + L
Sbjct: 103 DGCKTKTYPDDLPNTSIVIVFHNEAWSTLLRTVHSAINRSPRQLLYEILLVDDASERDFL 162
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
+KLEDY+ VR++R +R GLIR R RGA +RG+VI FLDAHCE WL PL+
Sbjct: 163 KEKLEDYVATLEVPVRILRMEQRTGLIRARLRGAAATRGQVITFLDAHCECTTGWLEPLM 222
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
A I DR+ + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRRAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 437 NR 438
+R
Sbjct: 280 DR 281
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 88/188 (46%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLEDY+ VR++R +R GLIR R RGA +RG+VI FLDAHCE WL PL
Sbjct: 162 LKEKLEDYVATLEVPVRILRMEQRTGLIRARLRGAAATRGQVITFLDAHCECTTGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
+A I DR+ + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 MARIKEDRRAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oreochromis niloticus]
Length = 600
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 102/220 (46%), Positives = 143/220 (65%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK + L E R D + E G N+ S+ IS +R++PD+R E C+ Y
Sbjct: 81 RMGNGEQGKPFPLTENDRV--DQAYRENGFNIYVSDRISLNRSLPDIRHENCRQKLYAEK 138
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG+SSL+RTVHS++ R+P++ + E+ILVDDFS K L LE+Y++R
Sbjct: 139 LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPSRLITEVILVDDFSDKEHLKVALEEYMKRM 198
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPLL I +RK +
Sbjct: 199 -PKVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANVNWLPPLLDRIAQNRKAIV 257
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 258 CPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 94/187 (50%), Positives = 127/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++R KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPL
Sbjct: 187 LKVALEEYMKRM-PKVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LDRIAQNRKAIVCPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIPPE--MQRD 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 302 DPSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKLWMCGGRMEDIPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
[Bos taurus]
Length = 519
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 105/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 8 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 66
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 67 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 126
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 127 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 186
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 187 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 242
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 123 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 182
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 183 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 239
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 240 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 299
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 300 SHVGHVFR 307
>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Callithrix jacchus]
gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
boliviensis boliviensis]
Length = 559
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oreochromis niloticus]
Length = 587
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 141/220 (64%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK + L +A R D + E G N+ S+ IS +R++PD+R CK+ Y
Sbjct: 68 RSGNGEQGKPFPLTDADRV--DQAYRENGFNIYVSDRISLNRSVPDIRHPNCKHKLYAEK 125
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP ++I+ FHNEG+SSL+RTVHS++ R+P + EIILVDDFS K L LE+Y+ R
Sbjct: 126 LPNTTIIIPFHNEGWSSLLRTVHSVLNRSPPHLIAEIILVDDFSDKEHLKVALEEYMVRL 185
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA ++GEV+ FLD+HCE +NWLPPLL I +RK +
Sbjct: 186 -PKVRILRTKKREGLIRTRLLGAAAAKGEVLTFLDSHCEANVNWLPPLLDRIAQNRKTIV 244
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 245 CPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIP 282
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y+ R KVR++R +REGLIRTR GA ++GEV+ FLD+HCE +NWLPPL
Sbjct: 174 LKVALEEYMVRL-PKVRILRTKKREGLIRTRLLGAAAAKGEVLTFLDSHCEANVNWLPPL 232
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K
Sbjct: 233 LDRIAQNRKTIVCPMIDVIDHDNFGYET--QAGDAMRGAFDWEMYYKRIPIPTELQKDDP 290
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 291 --SEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCS 348
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 349 RVGHIYR 355
>gi|344268426|ref|XP_003406061.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Loxodonta africana]
Length = 560
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 108/249 (43%), Positives = 151/249 (60%), Gaps = 7/249 (2%)
Query: 194 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVHSVINRSPHYLLSEVILVDD 155
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L LE++++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENHVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKDDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 430 EAKKRKYNR 438
E +RK +R
Sbjct: 273 EMDRRKGDR 281
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 78/169 (46%), Positives = 110/169 (65%), Gaps = 4/169 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE++++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENHVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKDDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIW 169
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+ +
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRTY 327
>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
Length = 591
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 80 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 138
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 139 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 198
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 199 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 258
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 259 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 314
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 195 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 254
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 255 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 311
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 312 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 371
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 372 SHVGHVFR 379
>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Oryctolagus cuniculus]
Length = 557
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 105/241 (43%), Positives = 149/241 (61%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G L ++ +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E
Sbjct: 45 GELLELIKENQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 105 GCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 165 LTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 163 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Otolemur garnettii]
Length = 556
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 146/231 (63%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 DVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ + V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Loxodonta africana]
Length = 559
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP LP+ SV++VFHNE +S+L+RTVHS++ R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDALPRTSVVIVFHNEAWSTLLRTVHSVLNRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Taeniopygia guttata]
Length = 559
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 149/241 (61%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK Y +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+ LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|348585735|ref|XP_003478626.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cavia porcellus]
Length = 568
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 108/249 (43%), Positives = 151/249 (60%), Gaps = 7/249 (2%)
Query: 194 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 430 EAKKRKYNR 438
E +RK +R
Sbjct: 273 EMDRRKGDR 281
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Meleagris gallopavo]
gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
Length = 559
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 149/241 (61%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G++ P++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPEPIQKPHEGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK Y +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L
Sbjct: 105 GCKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+ LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Saimiri boliviensis boliviensis]
Length = 556
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASEREFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Cavia porcellus]
Length = 559
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|443720685|gb|ELU10336.1| hypothetical protein CAPTEDRAFT_176696 [Capitella teleta]
Length = 587
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 112/279 (40%), Positives = 163/279 (58%), Gaps = 24/279 (8%)
Query: 168 IWMC-----------GGSIEWVPCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGK 216
+W+C GS W+P SR S ++P ++ G PP K+ E G
Sbjct: 18 LWLCVIYFVLWDIEEDGSTRWLPSSRRRSHLKPRYQITGL-----PP--SAKDDLSEIGL 70
Query: 217 AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 276
+ + + L Y N S+ +SF R IPD+R + C+ +YP +LP ASV++
Sbjct: 71 VRNPQDQVKREQGYKL--YAFNELISDRLSFHRPIPDVRHQLCQSEEYPAELPSASVVIC 128
Query: 277 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 336
F+NE +S L+RTVHSII RTP+ L EIILVDDFS L ++L+ Y+ + +L+RN
Sbjct: 129 FYNEAWSVLLRTVHSIIDRTPSALLHEIILVDDFSDLDHLAEQLDAYVSEHLPQTKLVRN 188
Query: 337 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 396
T REGLIR R G++ + GEV+VFLD+HCEV + W+ PLL+ I+ + K + VP+ID ID
Sbjct: 189 TRREGLIRARVIGSEHATGEVLVFLDSHCEVNVEWIQPLLSHIHGNHKRVAVPIIDIIDQ 248
Query: 397 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
T FR YE RG F WG+ Y+ +++PE +K++
Sbjct: 249 DT--FR--YESSPLVRGGFNWGLFYRWDQIPESLLRKQE 283
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 129/187 (68%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L+ Y+ + +L+RNT REGLIR R G++ + GEV+VFLD+HCEV + W+ PL
Sbjct: 168 LAEQLDAYVSEHLPQTKLVRNTRREGLIRARVIGSEHATGEVLVFLDSHCEVNVEWIQPL 227
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I+ + K + VP+ID ID T FR YE RG F WG+ Y+ +++PE +K++
Sbjct: 228 LSHIHGNHKRVAVPIIDIIDQDT--FR--YESSPLVRGGFNWGLFYRWDQIPESLLRKQE 283
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P K+PT AGGLFAM+R +F +LG YD G+ VWGGEN E+SF++W CGGS+ +PCS
Sbjct: 284 DYVKPIKTPTMAGGLFAMNRKYFNDLGRYDTGMDVWGGENLEISFRVWQCGGSMHILPCS 343
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 344 RVGHIFR 350
>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 559
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 148/239 (61%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+ R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVTDRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGASVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGASVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKQDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pongo abelii]
Length = 612
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
aries]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Nomascus leucogenys]
gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pan troglodytes]
gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
paniscus]
gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
gorilla gorilla]
gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [synthetic
construct]
gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
fascicularis]
gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
Length = 559
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Callithrix jacchus]
Length = 561
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Callithrix jacchus]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
adamanteus]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 104/242 (42%), Positives = 149/242 (61%), Gaps = 3/242 (1%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
L L + +EGPGE GKA +P+ + N+ S+ I+F+R++PD+R+
Sbjct: 43 LPALRAVMSRSQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDMIAFNRSLPDVRL 102
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
E CK YP +LP SV++VFHNE +S+L+RT++S++ R+P L EIILVDD S + L
Sbjct: 103 EGCKTKVYPDELPTTSVVIVFHNEAWSTLLRTIYSVMNRSPHYLLSEIILVDDASERDFL 162
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
LE+Y++ V++IR +R GLIR R RGA S+G+VI FLDAHCE WL PLL
Sbjct: 163 KLPLENYVRNLQVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTTGWLEPLL 222
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
A I DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 437 NR 438
+R
Sbjct: 280 DR 281
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 88/184 (47%), Positives = 121/184 (65%), Gaps = 4/184 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y+ V++IR +R GLIR R RGA S+G+VI FLDAHCE WL PLLA I
Sbjct: 166 LENYVRNLQVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTTGWLEPLLARI 225
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 124
DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 226 KEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 282
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +
Sbjct: 283 LPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 342
Query: 185 SLIR 188
+ R
Sbjct: 343 HVFR 346
>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
taurus]
gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTRVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pan troglodytes]
gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Pongo abelii]
gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pongo abelii]
gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
paniscus]
gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
musculus]
Length = 594
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 56 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 115
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 116 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTL 175
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 176 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 235
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 236 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 283
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 164 LKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 223
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 224 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 280
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 281 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 340
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 341 SHVGHVFR 348
>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
Length = 556
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
1 [Ciona intestinalis]
Length = 573
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 104/236 (44%), Positives = 141/236 (59%), Gaps = 3/236 (1%)
Query: 203 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
P GPGE GKA +P+ N+ S I+ +R++PD+RME CK
Sbjct: 63 PKFSLTSGPGEMGKAVIIPKDKEKEKQEKFKINQFNLMASEMIALNRSLPDVRMEGCKSK 122
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
YP LP S+++VFHNE +S+L+RTVHSII R+P+ LEEIILVDD S + L LE
Sbjct: 123 KYPEKLPTTSIVIVFHNEAWSTLLRTVHSIINRSPSHLLEEIILVDDASERDFLGAPLER 182
Query: 323 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
Y+++ VR++R +R GLIR R RGA S G+VI FLDAHCE WL PLL+ I D
Sbjct: 183 YVRKLRTLVRVVRMEKRTGLIRARLRGASVSTGQVITFLDAHCECTEGWLEPLLSEIAKD 242
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
R + P+ID I +T+EF + D Y G F W + ++ +P+RE +RK +R
Sbjct: 243 RTTVVCPIIDVISDETFEF--MVGSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 295
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE Y+ + VR++R +R GLIR R RGA S G+VI FLDAHCE WL PL
Sbjct: 176 LGAPLERYVRKLRTLVRVVRMEKRTGLIRARLRGASVSTGQVITFLDAHCECTEGWLEPL 235
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I +T+EF + D Y G F W + ++ +P+RE +RK
Sbjct: 236 LSEIAKDRTTVVCPIIDVISDETFEF--MVGSDMTYGG-FNWKLNFRWYPVPQREMDRRK 292
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SPT AGGLF++D+++F ELG YD G+ +WGGEN E+SF+IW CGG++ V C
Sbjct: 293 GDRTLPVRSPTMAGGLFSIDKSYFEELGTYDAGMDIWGGENLEISFRIWQCGGTLLIVTC 352
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 353 SHVGHVFR 360
>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 559
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
norvegicus]
gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
Length = 556
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
Length = 559
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
Length = 561
Score = 195 bits (496), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 108/249 (43%), Positives = 151/249 (60%), Gaps = 7/249 (2%)
Query: 194 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
++PD+R+E CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 155
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 430 EAKKRKYNR 438
E +RK +R
Sbjct: 273 EMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Ailuropoda melanoleuca]
Length = 556
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
Length = 592
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
Length = 556
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLDFRWYPVPQREMDRRKGDR 282
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLDFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
Length = 516
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 14 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 73
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 74 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 133
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 134 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 193
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 194 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 241
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 122 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 181
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 182 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 238
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 239 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 298
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 299 SHVGHVFR 306
>gi|194222233|ref|XP_001490001.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Equus
caballus]
Length = 539
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 144/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDHEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 174 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 71/187 (37%), Positives = 99/187 (52%), Gaps = 19/187 (10%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+G + A+ P L++ +F +IW CGGS+E V CS
Sbjct: 279 -GDRTLPVSCFSGNMTAL------------PTGLLYNSCSFS---QIWQCGGSLEIVTCS 322
Query: 182 RIVSLIR 188
+ + R
Sbjct: 323 HVGHVFR 329
>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
Length = 516
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 145/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 9 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE 68
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 69 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNL 128
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 129 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 188
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 189 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 236
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 117 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 176
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 177 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 233
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 234 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 293
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 294 SHVGHVFR 301
>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
Length = 519
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 144/231 (62%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP
Sbjct: 17 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDQ 76
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++
Sbjct: 77 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 136
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK +
Sbjct: 137 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 196
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 197 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 244
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 125 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 184
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 185 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 241
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 242 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 301
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 302 SHVGHVFR 309
>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Anolis carolinensis]
Length = 556
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 104/242 (42%), Positives = 148/242 (61%), Gaps = 3/242 (1%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
L L + +EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVMSRSQEGPGEMGKAVIIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRL 102
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
E CK YP +LP SV++VFHNE +S+L+RT++S+I R P L EIILVDD S + L
Sbjct: 103 EGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFL 162
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
LE+Y++ V+++R +R GLIR R RGA S+G+VI FLDAHCE L WL PLL
Sbjct: 163 KVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
A I DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 437 NR 438
+R
Sbjct: 280 DR 281
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/184 (47%), Positives = 123/184 (66%), Gaps = 4/184 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y++ V+++R +R GLIR R RGA S+G+VI FLDAHCE L WL PLLA I
Sbjct: 166 LENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 124
DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 226 KEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 282
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +
Sbjct: 283 LPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 342
Query: 185 SLIR 188
+ R
Sbjct: 343 HVFR 346
>gi|345307949|ref|XP_001508273.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Ornithorhynchus anatinus]
Length = 593
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 143/223 (64%), Gaps = 5/223 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCNNKLYLEKLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ RF
Sbjct: 146 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKRLEDYMARF-P 204
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
+VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 205 RVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
+ID ID+ + + + + RG F+W M YK +P+ K
Sbjct: 265 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPQELQK 305
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 128/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ RF +VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKRLEDYMARF-PRVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P+ K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPQELQKPDP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+D+ +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDKKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Oreochromis niloticus]
Length = 556
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 145/230 (63%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GKA ++P+ + N+ S+ I+ +R++PD+R++ CK Y DL
Sbjct: 55 EGPGEMGKAVNIPKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYSDDL 114
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P S+++VFHNE +S+L+RTVHS+I R+P L EIILVDD S + L +KLE+Y++
Sbjct: 115 PNTSIVIVFHNEAWSTLLRTVHSVINRSPKHLLVEIILVDDASERDFLKKKLENYVRTLE 174
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R +R GLIR R RGA + G+VI FLDAHCE + WL PLLA I DR +
Sbjct: 175 VPVRILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTVGWLEPLLARIKEDRTAVVC 234
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 235 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 88/188 (46%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE+Y+ VR++R +R GLIR R RGA + G+VI FLDAHCE + WL PL
Sbjct: 162 LKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTVGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
anubis]
Length = 559
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 103/239 (43%), Positives = 149/239 (62%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LERYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLERYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Monodelphis domestica]
Length = 559
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 104/239 (43%), Positives = 148/239 (61%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LE +P+ EGPGE GK +P+ + N+ S I+ +RT+PD+R+E C
Sbjct: 48 LETVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRTLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ + V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKVDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 624
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 104/231 (45%), Positives = 144/231 (62%), Gaps = 5/231 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--LD 267
GPGE GK +P +A N+ S+ IS +R++PD RM+ CK YP D
Sbjct: 114 GPGEMGKPVVIPLDRQAESKEKFKINQFNLVASDMISLNRSLPDYRMDACKRKSYPPNSD 173
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTVHSII R+P + L EI+LVDD S + +L +KLEDYI R
Sbjct: 174 LPDTSVVIVFHNEAWSTLLRTVHSIINRSPRELLNEILLVDDASEREELGKKLEDYIARL 233
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
R+IR+ ER GLIR R +GAK++RG+VI FLDAHCE WL PLL I+ DR +
Sbjct: 234 PVSTRVIRSEERTGLIRARLKGAKQARGKVITFLDAHCECTEGWLEPLLYEIHKDRTAVV 293
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I ++E+ + D + G F W + ++ +P+RE +R +R
Sbjct: 294 CPIIDVIGDDSFEY--ITGSDMTWGG-FNWKLNFRWYPVPQRELDRRGGDR 341
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 125/189 (66%), Gaps = 4/189 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KKLEDYI R R+IR+ ER GLIR R +GAK++RG+VI FLDAHCE WL P
Sbjct: 221 ELGKKLEDYIARLPVSTRVIRSEERTGLIRARLKGAKQARGKVITFLDAHCECTEGWLEP 280
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I+ DR + P+ID I ++E+ + D + G F W + ++ +P+RE +R
Sbjct: 281 LLYEIHKDRTAVVCPIIDVIGDDSFEY--ITGSDMTWGG-FNWKLNFRWYPVPQRELDRR 337
Query: 121 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ S P K+PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++WMCGG + V
Sbjct: 338 GGDRSNPTKTPTMAGGLFSIDRDYFYEVGSYDEGMDIWGGENLEMSFRVWMCGGKVYIVT 397
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 398 CSRVGHVFR 406
>gi|291236246|ref|XP_002738051.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Saccoglossus kowalevskii]
Length = 321
Score = 194 bits (492), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 100/229 (43%), Positives = 138/229 (60%), Gaps = 3/229 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ GPGE G+ Y L + ++G N S+ +S +R +PD+R CK +Y +
Sbjct: 51 RTGPGEQGRPYILSPEEKKNEHQDFSKHGFNKHISDVLSVERALPDIRDPRCKTMEYLVK 110
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+++ FHNE S L RTVHSII R+P + L EIILVDDFS + + L DY+
Sbjct: 111 LPNTSIVIPFHNEALSVLKRTVHSIINRSPPELLHEIILVDDFSDHDECKEPLNDYMVTV 170
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR+IR T+REGLIRTR GA + G+V+VFLD+HCE +NWLPPLL I +RK +
Sbjct: 171 -PKVRIIRATKREGLIRTRLLGASRATGQVLVFLDSHCEANVNWLPPLLESIALNRKCIA 229
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
P+ID I + + + + RG F+W + YK L E E K+RK+
Sbjct: 230 CPMIDVIGNNDYHYET--QAGDAMRGAFDWELFYKRIPLTEEELKRRKH 276
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 81/162 (50%), Positives = 110/162 (67%), Gaps = 3/162 (1%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
L DY+ KVR+IR T+REGLIRTR GA + G+V+VFLD+HCE +NWLPPLL I
Sbjct: 163 LNDYMVTV-PKVRIIRATKREGLIRTRLLGASRATGQVLVFLDSHCEANVNWLPPLLESI 221
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 125
+RK + P+ID I + + + + RG F+W + YK L E E K+RK+ +E
Sbjct: 222 ALNRKCIACPMIDVIGNNDYHYET--QAGDAMRGAFDWELFYKRIPLTEEELKRRKHAAE 279
Query: 126 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 167
P+++P AGGLFA+DR +F E+GGYD GL +WGGE ++LSFK
Sbjct: 280 PFRTPIMAGGLFAVDRLYFNEIGGYDAGLEIWGGEQYDLSFK 321
>gi|148696679|gb|EDL28626.1| mCG6366 [Mus musculus]
Length = 431
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 97/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLEDY+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 2 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPL 60
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 61 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 116
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 117 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 176
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 177 RVGHIYR 183
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 55/112 (49%), Positives = 74/112 (66%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L KLEDY+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 2 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPL 60
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 61 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 110
>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
taurus]
Length = 620
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK + L A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 105 GDGEQGKPFPLTYAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLETLP 162
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 163 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 222
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 223 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 281
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 282 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 317
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 209 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 267
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 268 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 325
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 326 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 383
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 384 RVGHIYR 390
>gi|431918071|gb|ELK17299.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Pteropus alecto]
Length = 582
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ I+ +R++PD+R C Y LP
Sbjct: 67 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKIALNRSLPDIRHPNCNNKRYLETLP 124
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P Q + EI+LVDDFS + L + LEDY+ F
Sbjct: 125 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIVLVDDFSDREHLKKPLEDYMAHF-P 183
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 184 SVRILRTKKREGLIRTRMLGASAASGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 243
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 244 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 279
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 171 LKKPLEDYMAHFPS-VRILRTKKREGLIRTRMLGASAASGDVITFLDSHCEANVNWLPPL 229
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 230 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 287
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 288 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 345
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 346 RVGHIYR 352
>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 1 [Xenopus (Silurana) tropicalis]
Length = 600
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 141/223 (63%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D E G N+ SN I+ R++PD+R CK+ Y
Sbjct: 77 EALRSGKGEHGKPYPLTE--EEQDDTVYRENGFNIFVSNKIALARSLPDIRHPNCKHKLY 134
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HS+I RTP +EE+ILVDDFS + L +KLE+Y+
Sbjct: 135 LERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLREKLEEYM 194
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 195 AYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 253
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 254 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 294
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 128/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ + KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPL
Sbjct: 186 LREKLEEYMAYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 245 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRT 300
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+WMCGG + VPCS
Sbjct: 301 DPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCS 360
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 361 RVGHIYR 367
>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
Length = 652
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK + L A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 137 GDGEQGKPFPLTYAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLETLP 194
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 195 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 254
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 255 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 313
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 314 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 349
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 241 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 299
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 300 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 357
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 358 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 415
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 416 RVGHIYR 422
>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
gallus]
Length = 611
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 141/218 (64%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 93 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 150
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 151 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 209
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 210 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 269
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ + + + + RG F+W M YK +P
Sbjct: 270 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIP 305
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 126/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ +F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 197 LKKRLEDYMAQF-PNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 255
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K
Sbjct: 256 LDRIARNRKTIVCPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP 313
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 314 --SDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCS 371
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 372 RVGHIYR 378
>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
lupus familiaris]
Length = 603
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 139/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P++ + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPSELIAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 2 [Xenopus (Silurana) tropicalis]
Length = 601
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 102/223 (45%), Positives = 141/223 (63%), Gaps = 5/223 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK Y L E D E G N+ SN I+ R++PD+R CK+ Y
Sbjct: 78 EALRSGKGEHGKPYPLTE--EEQDDTVYRENGFNIFVSNKIALARSLPDIRHPNCKHKLY 135
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+ FHNEG++SL+RT+HS+I RTP +EE+ILVDDFS + L +KLE+Y+
Sbjct: 136 LERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREHLREKLEEYM 195
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPLL I + K
Sbjct: 196 AYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNHK 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 255 TIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 295
Score = 188 bits (478), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 128/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ + KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPL
Sbjct: 187 LREKLEEYMAYY-PKVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRT 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA+DR +F ELGGYDPGL +WGGE +ELSFK+WMCGG + VPCS
Sbjct: 302 DPSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
livia]
Length = 560
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 141/218 (64%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 31 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 88
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 89 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 147
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 148 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 207
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ + + + + RG F+W M YK +P
Sbjct: 208 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIP 243
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 126/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ +F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 135 LKKRLEDYMAQF-PNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 193
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K
Sbjct: 194 LDRIARNRKTIVCPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP 251
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 252 --SDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCS 309
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 310 RVGHIYR 316
>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Meleagris gallopavo]
Length = 562
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 141/218 (64%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 43 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 100
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 101 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQF-P 159
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 160 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 219
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ + + + + RG F+W M YK +P
Sbjct: 220 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIP 255
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 126/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ +F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 147 LKKRLEDYMAQF-PNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 205
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K
Sbjct: 206 LDRIARNRKTIVCPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKLDP 263
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 264 --SDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCS 321
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 322 RVGHIYR 328
>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Sarcophilus harrisii]
Length = 651
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 141/218 (64%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ I+ +R++PD+R C Y LP
Sbjct: 133 GNGEQGKPYPITDAERV--DQAYRENGFNIFVSDKIALNRSLPDIRHPNCNSKLYLEKLP 190
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P Q + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 191 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLVAEIVLVDDFSDREHLKKRLEDYMAQF-P 249
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I S+RK + P
Sbjct: 250 NVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIASNRKTIVCP 309
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID + +++ + RG F+W M YK +P
Sbjct: 310 MIDVIDNDHFGYKT--QAGDAMRGAFDWEMYYKRIPIP 345
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 128/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ +F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 237 LKKRLEDYMAQF-PNVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 295
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I S+RK + P+ID ID + +++ + RG F+W M YK +P K
Sbjct: 296 LDRIASNRKTIVCPMIDVIDNDHFGYKT--QAGDAMRGAFDWEMYYKRIPIPLELQKSDP 353
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 354 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 411
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 412 RVGHIYR 418
>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 143/230 (62%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GK +P+ + N+ S I+F+R++PD+R+E CK YP L
Sbjct: 56 EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIAFNRSLPDVRLEGCKTKVYPDSL 115
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L + LE Y+++
Sbjct: 116 PTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLK 175
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I DR+ +
Sbjct: 176 VPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARIKHDRRTVVC 235
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 236 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Anolis carolinensis]
Length = 612
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 141/220 (64%), Gaps = 5/220 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y
Sbjct: 92 RTGNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCNSKLYLEK 149
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L ++LEDY+ +F
Sbjct: 150 LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLRKRLEDYMAQF 209
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I + K +
Sbjct: 210 T-KVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNHKTIV 268
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 269 CPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIP 306
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 122/301 (40%), Positives = 167/301 (55%), Gaps = 22/301 (7%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ +F KVR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 198 LRKRLEDYMAQFT-KVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 256
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P K
Sbjct: 257 LDRIARNHKTIVCPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKPDP 314
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 315 --SDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCS 372
Query: 182 RIVSLIRPVFKADGKLG-----NLEPPLEPYKEGPGEGGKAYHLPEAYR--AAGD-ASLG 233
R+ + R G NL+ E + + E Y YR +AGD A+
Sbjct: 373 RVGHIYRKYVPYKVPTGVSLARNLKRVAEVWMDEYAE--YIYQRRPEYRHLSAGDVATQK 430
Query: 234 EYGMNMETSNHISFDRTIP-DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSI 292
E N+ + F + DLR K+ YP P A+ HN G S + T H
Sbjct: 431 ELRSNLNCKSFRWFMNEVAWDLR----KF--YPPVEPPAAAWGEIHNVGTSLCVDTKHGA 484
Query: 293 I 293
+
Sbjct: 485 L 485
>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
scrofa]
Length = 624
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 109 GNGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEMLP 166
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 167 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALF-P 225
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 226 NVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 285
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 286 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 321
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 213 LKKPLEDYMALF-PNVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 271
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 272 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 329
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 330 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 387
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 388 RVGHIYR 394
>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
norvegicus]
gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
norvegicus]
gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
Length = 603
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|118404262|ref|NP_001072444.1| polypeptide N-acetylgalactosaminyltransferase 10 [Xenopus
(Silurana) tropicalis]
gi|113197915|gb|AAI21701.1| GalNAc transferase 10 [Xenopus (Silurana) tropicalis]
Length = 603
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 137/218 (62%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK + + +A D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 85 GNGEQGKPFPMTDADHV--DQAYRENGFNIFVSDKISLNRSLPDIRNSNCKNKFYFSKLP 142
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDD+S KA L +LE Y+ F
Sbjct: 143 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDYSDKAHLKSRLEKYMANF-P 201
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KV+++R +REGLIRTR GA + GEV+ FLD+HCE +NWLPPLL P+ + K + P
Sbjct: 202 KVKIVRTKKREGLIRTRMLGATVASGEVLTFLDSHCEANVNWLPPLLDPLVQNYKTVVCP 261
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID F V + RG F+W M YK +P
Sbjct: 262 MIDVIDSDN--FGYVTQAGDAMRGAFDWEMFYKRIPIP 297
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 90/187 (48%), Positives = 120/187 (64%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+ F KV+++R +REGLIRTR GA + GEV+ FLD+HCE +NWLPPL
Sbjct: 189 LKSRLEKYMANF-PKVKIVRTKKREGLIRTRMLGATVASGEVLTFLDSHCEANVNWLPPL 247
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L P+ + K + P+ID ID F V + RG F+W M YK +P K
Sbjct: 248 LDPLVQNYKTVVCPMIDVIDSDN--FGYVTQAGDAMRGAFDWEMFYKRIPIPPELQKGDP 305
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P+ SP AGGLFA++R +F +LGGYDPGL +WGGE +E+SFK+WMCGG + PCS
Sbjct: 306 --SDPFDSPVMAGGLFAINREWFWQLGGYDPGLEIWGGEQYEISFKVWMCGGRMVDSPCS 363
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 364 RVGHIYR 370
>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 142/230 (61%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GK +P+ + N+ S I+ +R++PD+R+E CK YP +L
Sbjct: 56 EGPGEMGKPVVIPKEDQEKMKEMFKTNQFNLMASEMIALNRSLPDVRLEGCKTKVYPDNL 115
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L + LE Y+++
Sbjct: 116 PTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLK 175
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PLLA I DR+ +
Sbjct: 176 VPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVC 235
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 236 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|395542399|ref|XP_003773120.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Sarcophilus harrisii]
Length = 436
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA ++GEV+ FLD+HCEV +NWLPPL
Sbjct: 22 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPL 80
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 81 LNQIALNRKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 136
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 137 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCS 196
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 197 RVGHIYR 203
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/124 (45%), Positives = 81/124 (65%), Gaps = 3/124 (2%)
Query: 304 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
IL++ + K L KLE+Y+ RF+ KVR++R +REGLIRTR GA ++GEV+ FLD+
Sbjct: 10 FILLNYYREKKHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDS 68
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 423
HCEV +NWLPPLL I +RK + P+ID ID+ + + + + RG F+W M YK
Sbjct: 69 HCEVNVNWLPPLLNQIALNRKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKR 126
Query: 424 NELP 427
+P
Sbjct: 127 IPIP 130
>gi|395735509|ref|XP_002815348.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Pongo abelii]
Length = 366
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 26 LKNKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 84
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 85 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 140
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 141 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 200
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 201 RVGHIYR 207
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 55/118 (46%), Positives = 77/118 (65%), Gaps = 3/118 (2%)
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
++K L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +
Sbjct: 20 MTAKEHLKNKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNV 78
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
NWLPPLL I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 79 NWLPPLLNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 134
>gi|296484977|tpg|DAA27092.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
Length = 320
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 11 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 69
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 70 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 125
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 126 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 185
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 186 RVGHIYR 192
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L +KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 11 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 69
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 70 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 119
>gi|149726707|ref|XP_001501206.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Equus
caballus]
Length = 561
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 46 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 103
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 104 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 163
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 164 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 222
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 223 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 258
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 150 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 208
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 209 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 266
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 267 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 324
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 325 RVGHIYR 331
>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
catus]
Length = 603
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
Length = 603
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|449685123|ref|XP_002167708.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like,
partial [Hydra magnipapillata]
Length = 411
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 89/190 (46%), Positives = 128/190 (67%), Gaps = 2/190 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL L ++ + K++++RN ER GLIR+R GA++SRG+V++FLD+HCE W+ P
Sbjct: 1 DLKIPLVKHLNELSDKIQVVRNEERSGLIRSRLAGAEKSRGDVLIFLDSHCETTPGWIEP 60
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I + + VP I+ ID T E+R+ P+ RG F W ++Y N +PE E R
Sbjct: 61 LLARINEAKSNVVVPTIESIDADTLEYRASDNPEQ--RGGFSWDLMYDWNSIPENEKHLR 118
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S+P ++PT AGGLFA+D+++F E+G YD + +WGGEN ELSF+IWMCGG IE +PC
Sbjct: 119 QSPSDPIRTPTMAGGLFAIDKSYFFEMGSYDQEMDIWGGENLELSFRIWMCGGRIEILPC 178
Query: 181 SRIVSLIRPV 190
SR+ + R V
Sbjct: 179 SRVGHIFRKV 188
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 50/121 (41%), Positives = 75/121 (61%), Gaps = 2/121 (1%)
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
DL L ++ + K++++RN ER GLIR+R GA++SRG+V++FLD+HCE W+ P
Sbjct: 1 DLKIPLVKHLNELSDKIQVVRNEERSGLIRSRLAGAEKSRGDVLIFLDSHCETTPGWIEP 60
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LLA I + + VP I+ ID T E+R+ P+ RG F W ++Y N +PE E R
Sbjct: 61 LLARINEAKSNVVVPTIESIDADTLEYRASDNPEQ--RGGFSWDLMYDWNSIPENEKHLR 118
Query: 435 K 435
+
Sbjct: 119 Q 119
>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
3 [Canis lupus familiaris]
Length = 556
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 107/249 (42%), Positives = 150/249 (60%), Gaps = 7/249 (2%)
Query: 194 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
D K +L P L +EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DKKERSLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNR 95
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
++PD+R+E CK Y +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 96 SLPDVRLEGCKTKVYADELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDD 155
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 156 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 215
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+R
Sbjct: 216 GWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQR 272
Query: 430 EAKKRKYNR 438
E +RK +R
Sbjct: 273 EMDRRKGDR 281
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Saimiri boliviensis boliviensis]
Length = 682
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 167 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKHYLETLP 224
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 225 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFP- 283
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 284 SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 343
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 344 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 379
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 271 LKKPLEDYMALFP-SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 329
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 330 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 387
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 388 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 445
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 446 RVGHIYR 452
>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 599
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 84 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNRKRYLETLP 141
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 142 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMAHFPS 201
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 202 -VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 260
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 261 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 296
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 188 LKKPLEDYMAHFPS-VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 246
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 247 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 304
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 305 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 362
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 363 RVGHIYR 369
>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
Length = 634
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 119 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 176
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 177 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 236
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 237 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 295
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 296 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 331
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 223 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 281
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 282 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 339
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 340 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 397
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 398 RVGHIYR 404
>gi|380805795|gb|AFE74773.1| polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Macaca mulatta]
Length = 336
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 33 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 91
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 92 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 147
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 148 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 207
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 208 RVGHIYR 214
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/144 (51%), Positives = 98/144 (68%), Gaps = 3/144 (2%)
Query: 284 SLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLI 343
SL+RT+HSII RTP + EIILVDDFS + L KLE+Y+ RF+ KVR++R +REGLI
Sbjct: 1 SLLRTIHSIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLI 59
Query: 344 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 403
RTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P+ID ID+ + + +
Sbjct: 60 RTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPMIDVIDHNHFGYEA 119
Query: 404 VYEPDHHYRGIFEWGMLYKENELP 427
+ RG F+W M YK +P
Sbjct: 120 --QAGDAMRGAFDWEMYYKRIPIP 141
>gi|281345023|gb|EFB20607.1| hypothetical protein PANDA_005411 [Ailuropoda melanoleuca]
Length = 551
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 36 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 93
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 94 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 153
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIAQNRKTIVCP 212
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 248
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 140 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 198
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 199 LDRIAQNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 256
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 257 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 314
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 315 RVGHIYR 321
>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 559
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 102/239 (42%), Positives = 148/239 (61%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+R+E C
Sbjct: 48 LEPVQKPH-EGPGEMGKPVIIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGC 106
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L +
Sbjct: 107 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMVEEIVLVDDASERDFLKRP 166
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I
Sbjct: 167 LESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARI 226
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+I I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 227 KQDRRTVVCPIICVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+I I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKQDRRTVVCPIICVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Callithrix jacchus]
Length = 667
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 152 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 209
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 210 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 269
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 270 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 328
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 329 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 364
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 256 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 314
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 315 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 372
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 373 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 430
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 431 RVGHIYR 437
>gi|301763571|ref|XP_002917213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Ailuropoda melanoleuca]
Length = 598
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 83 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 140
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 141 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFP- 199
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 200 SVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIAQNRKTIVCP 259
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 260 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 295
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 187 LKKPLEDYMALFP-SVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 246 LDRIAQNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 303
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 304 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 361
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 362 RVGHIYR 368
>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
anubis]
Length = 637
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 122 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 179
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 180 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 239
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 240 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 298
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 299 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 334
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 226 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 284
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 285 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 342
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 343 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 400
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 401 RVGHIYR 407
>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
Length = 603
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+V+ FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+V+ FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
Length = 603
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 101/218 (46%), Positives = 137/218 (62%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE K Y + +A R D + E G NM S+ IS +R++PD+R C Y LP
Sbjct: 88 GYGEQAKPYPMTDAERV--DQAYRENGFNMYVSDKISLNRSLPDIRHPNCNSKLYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|444727225|gb|ELW67727.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Tupaia
chinensis]
Length = 406
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 130/187 (69%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 32 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 90
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 91 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 146
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 147 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 206
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 207 RVGHIYR 213
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 75/112 (66%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L +KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 32 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 90
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 91 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 140
>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Anolis carolinensis]
Length = 559
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 101/241 (41%), Positives = 149/241 (61%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G++ ++ EGPGE GK +P+ + N+ S I+ +R++PD+R+E
Sbjct: 45 GDVPELVQKPHEGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLE 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK Y +LP SV++VFHNE +S+L+RTVHS+I R+P LEEIILVDD S + L
Sbjct: 105 GCKTKVYSDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHILEEIILVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+ LE+Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA
Sbjct: 165 RLLENYVKKLQIPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 125/188 (66%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRLLENYVKKLQIPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKADRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 612
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 98/188 (52%), Positives = 126/188 (67%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDYI + KVR++R T REGLIR R GA+++ G+V++FLD+H E +NWLPPL
Sbjct: 207 LKKPLEDYIAKHWNKVRVVRATRREGLIRARLLGARQATGDVLIFLDSHTEANVNWLPPL 266
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKR 120
L PI D + + P ID IDY+T+ +R+ D RG F+W + YK LPE A
Sbjct: 267 LEPIAKDYRTVVCPFIDVIDYETFAYRAQ---DEGARGSFDWELYYKRLPLLPEDLANP- 322
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP+KSP AGGLFA+ R +F ELGGYD GL VWGGE +ELSFKIW CGG++ PC
Sbjct: 323 ---TEPFKSPVMAGGLFAISRRYFWELGGYDEGLDVWGGEQYELSFKIWQCGGTMVDAPC 379
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 380 SRVGHIYR 387
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 94/214 (43%), Positives = 136/214 (63%), Gaps = 3/214 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+GPGE G A+ LP D G N S+ I+ +R++PD+R C+ Y L
Sbjct: 100 KGPGEQGAAFFLPAGMEKKKDELYKVNGFNALASDFIALNRSLPDIRNPGCQKKRYVSKL 159
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ FHNE +++L+RT S++ R+P + ++EIIL DD+S+K L + LEDYI +
Sbjct: 160 PTVSVIVPFHNEHWTTLLRTATSVLNRSPPELIKEIILADDYSNKEQLKKPLEDYIAKHW 219
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R T REGLIR R GA+++ G+V++FLD+H E +NWLPPLL PI D + +
Sbjct: 220 NKVRVVRATRREGLIRARLLGARQATGDVLIFLDSHTEANVNWLPPLLEPIAKDYRTVVC 279
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID IDY+T+ +R+ D RG F+W + YK
Sbjct: 280 PFIDVIDYETFAYRAQ---DEGARGSFDWELYYK 310
>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/230 (43%), Positives = 142/230 (61%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GK +P+ + N+ S I+ +R++PD+R+E CK YP +L
Sbjct: 56 EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDNL 115
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SV++VFHNE +S+L+RTVHS+I R+P +EEI+LVDD S + L + LE Y+++
Sbjct: 116 PTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLK 175
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PLLA I DR+ +
Sbjct: 176 VPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLARIKHDRRTVVC 235
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 236 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 282
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PL
Sbjct: 163 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Taeniopygia guttata]
Length = 555
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 140/218 (64%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R CK Y LP
Sbjct: 37 GNGEQGKPYPMTDAERV--DQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEKLP 94
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI+ FHNEG+SSL+RTVHS++ R+P + + E++LVDDFS + L ++LEDY+ +F
Sbjct: 95 NTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEVVLVDDFSDREHLKKRLEDYMAQF-P 153
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 SVRILRTKRREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 213
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ + + + + RG F+W M YK +P
Sbjct: 214 MIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIP 249
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ +F VR++R REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 141 LKKRLEDYMAQF-PSVRILRTKRREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 199
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P K
Sbjct: 200 LDRIARNRKTIVCPMIDVIDHDHFGYET--QAGDAMRGAFDWEMYYKRIPIPPELQKPDP 257
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 258 --SDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCS 315
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 316 RVGHIYR 322
>gi|410255362|gb|JAA15648.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410303020|gb|JAA30110.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410355291|gb|JAA44249.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
Length = 603
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|109079473|ref|XP_001111560.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 4 [Macaca mulatta]
Length = 602
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 87 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 144
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 145 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 204
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 205 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 263
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 264 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 299
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 191 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 249
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 250 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 307
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 308 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 365
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 366 RVGHIYR 372
>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
Length = 626
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 97/189 (51%), Positives = 126/189 (66%), Gaps = 2/189 (1%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L KLE YIE + VRL+R ER+GLIR R GAKE+ GEVI+FLD+HCE L WL
Sbjct: 222 ELQGKLERYIETQLPSIVRLVRLKERQGLIRARLAGAKEATGEVIIFLDSHCEATLGWLE 281
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I D++ + VP+ID ID +T E+ P+ G F W + ++P+RE K+
Sbjct: 282 PLLQRIKEDKRAVLVPIIDVIDDKTLEYYH-GSPESFQIGSFTWSGHFTWMDIPKREIKR 340
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R P SPT AGGLFA+DR +F +LG YD G+ VWGGEN E+SF+IWMCGGS+E +P
Sbjct: 341 RGSRVGPTNSPTMAGGLFAIDRQYFWDLGSYDEGMDVWGGENLEMSFRIWMCGGSLETIP 400
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 401 CSRVGHIFR 409
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 103/232 (44%), Positives = 145/232 (62%), Gaps = 2/232 (0%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
L + G G GGKA L A + + + N+ SN IS++RT+PD+R CK
Sbjct: 111 LNKIENGLGAGGKAVKLFGAELQEAEEIMKKEAFNLFISNRISYNRTLPDVRDSMCKGLT 170
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y LP ASVI++F NE +S L+RT+ S+I R+P ++L+EI+L+DDFS + +L KLE Y
Sbjct: 171 YDTILPSASVIIIFTNEAWSPLIRTIWSVINRSPRKFLKEILLIDDFSDRVELQGKLERY 230
Query: 324 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
I+ + VRL+R ER+GLIR R GAKE+ GEVI+FLD+HCE L WL PLL I D
Sbjct: 231 IETQLPSIVRLVRLKERQGLIRARLAGAKEATGEVIIFLDSHCEATLGWLEPLLQRIKED 290
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
++ + VP+ID ID +T E+ P+ G F W + ++P+RE K+R
Sbjct: 291 KRAVLVPIIDVIDDKTLEYYH-GSPESFQIGSFTWSGHFTWMDIPKREIKRR 341
>gi|18676612|dbj|BAB84958.1| FLJ00205 protein [Homo sapiens]
Length = 359
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 81 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 138
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 139 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFP- 197
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 198 SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 257
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 258 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 293
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 85/167 (50%), Positives = 110/167 (65%), Gaps = 5/167 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 185 LKKPLEDYMALFP-SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 243
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 244 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 301
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 168
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+
Sbjct: 302 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKV 346
>gi|28268676|dbj|BAC56890.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Homo sapiens]
Length = 603
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
Length = 528
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 13 GYGEQGKPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 70
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 71 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 130
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+V+ FLD+HCE +NWLPPLL I +RK + P
Sbjct: 131 -VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPLLDRIARNRKTIVCP 189
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 190 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 225
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+V+ FLD+HCE +NWLPPL
Sbjct: 117 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPL 175
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 176 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 233
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 234 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 291
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 292 RVGHIYR 298
>gi|38195091|ref|NP_938080.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|51315962|sp|Q86SR1.2|GLT10_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|25809274|emb|CAD44532.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|151556534|gb|AAI48616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|157169754|gb|AAI53182.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|193785288|dbj|BAG54441.1| unnamed protein product [Homo sapiens]
gi|261858046|dbj|BAI45545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [synthetic
construct]
Length = 603
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|119389148|pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And
Mn2+
gi|119389151|pdb|2D7R|A Chain A, Crystal Structure Of Pp-galnac-t10 Complexed With
Galnac-ser On Lectin Domain
Length = 570
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 55 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 112
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 113 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 172
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 173 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 231
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 232 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 267
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 159 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 217
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 218 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 275
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 276 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 333
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 334 RVGHIYR 340
>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 5 [Macaca mulatta]
Length = 603
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 603
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 99/226 (43%), Positives = 141/226 (62%), Gaps = 3/226 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G + E + ++ N S+ IS R++ D R ++CK YP+DLP
Sbjct: 107 PGELGTGVTVEENEKEKEKLGYEKHAFNQLVSDKISIHRSLKDYRNDQCKVKKYPVDLPP 166
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
SVI+ FHNE +S+L+RTVHS+I RTP QYL+EIILVDD S+ DL Q+L+DYI
Sbjct: 167 TSVIICFHNEAWSTLLRTVHSVINRTPPQYLKEIILVDDASTSDDLKQRLDDYIPNLK-I 225
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V ++R +R+GLIR R GAK+++G ++ FLDAHCE L W PLLA I DR+ + +PV
Sbjct: 226 VSIVRLRDRQGLIRARLEGAKKAKGPILTFLDAHCECTLGWAEPLLAKIKEDRQNVVMPV 285
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
ID I + + +V EP RG+F+W + + +P E ++RK+
Sbjct: 286 IDEISETNFNYNAVPEP--FQRGVFKWRLEFTWRPIPSYEEQRRKH 329
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 89/191 (46%), Positives = 129/191 (67%), Gaps = 3/191 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL ++L+DYI V ++R +R+GLIR R GAK+++G ++ FLDAHCE L W P
Sbjct: 211 DLKQRLDDYIPNLK-IVSIVRLRDRQGLIRARLEGAKKAKGPILTFLDAHCECTLGWAEP 269
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + +PVID I + + +V EP RG+F+W + + +P E ++R
Sbjct: 270 LLAKIKEDRQNVVMPVIDEISETNFNYNAVPEP--FQRGVFKWRLEFTWRPIPSYEEQRR 327
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ S+ K+P AGGLF+++R +F E+G YD G+ +WGGEN E+SF+IWMCGGSIE +PC
Sbjct: 328 KHESDGIKTPVMAGGLFSINRDYFYEMGSYDTGMDIWGGENIEISFRIWMCGGSIEMLPC 387
Query: 181 SRIVSLIRPVF 191
SR+ + RP F
Sbjct: 388 SRVGHVFRPRF 398
>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
Length = 556
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 103/247 (41%), Positives = 144/247 (58%), Gaps = 12/247 (4%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
RPVF+ P PGE G+ +P+ Y+ + N S+ IS
Sbjct: 39 RPVFQP-------ALPQNHKPAAPGEYGRPVDVPKEYQQLSEELFQRNHFNQWVSDRISL 91
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
RT+PD R E CK YP+DLP SV++VF+NE +S+LMRTVHS++ R+P L E+ILV
Sbjct: 92 QRTLPDPRPEMCKSMTYPVDLPSTSVVIVFYNEAWSTLMRTVHSVLDRSPPDLLHEVILV 151
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DD S +L Q LE+Y+++ + KVRL RN++REGLIR R RG +++ ++ FLDAHCEV
Sbjct: 152 DD--SSDELHQPLEEYVRQLD-KVRLHRNSQREGLIRARLRGLEQTSAPIVTFLDAHCEV 208
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ WL PLL I+ DR + P ID ID + ++ Y P RG F W + +K + P
Sbjct: 209 TIGWLEPLLNRIHQDRTTVVCPEIDSIDLNNFAYK--YGPSGVLRGTFNWDLSFKWSIAP 266
Query: 428 EREAKKR 434
E +R
Sbjct: 267 TSERLRR 273
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 91/198 (45%), Positives = 126/198 (63%), Gaps = 3/198 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L + LE+Y+ + + KVRL RN++REGLIR R RG +++ ++ FLDAHCEV + WL P
Sbjct: 157 ELHQPLEEYVRQLD-KVRLHRNSQREGLIRARLRGLEQTSAPIVTFLDAHCEVTIGWLEP 215
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I+ DR + P ID ID + ++ Y P RG F W + +K + P E +R
Sbjct: 216 LLNRIHQDRTTVVCPEIDSIDLNNFAYK--YGPSGVLRGTFNWDLSFKWSIAPTSERLRR 273
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++P +SPT AGGLFA+DR +FLELG YD GL +WG EN ELSFK+W CGG +E +PC
Sbjct: 274 TSATDPMRSPTMAGGLFAIDREYFLELGTYDRGLEIWGAENMELSFKVWQCGGKLEIIPC 333
Query: 181 SRIVSLIRPVFKADGKLG 198
S + + R V D +
Sbjct: 334 SHVGHVFREVQPYDTSVS 351
>gi|410039926|ref|XP_518048.4| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
troglodytes]
Length = 551
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 36 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 93
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 94 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 153
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 212
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 248
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 140 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 198
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 199 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 256
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 257 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 314
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 315 RVGHIYR 321
>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Monodelphis domestica]
Length = 556
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 101/231 (43%), Positives = 143/231 (61%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +
Sbjct: 54 QEGPGEMGKAVRIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTV+S+I R+P L EIILVDD S + L LE+Y++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEIILVDDASERDFLKMALENYVKNL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V++IR +R GLIR R RGA S+G+VI FLDAHCE L WL PLLA I RK +
Sbjct: 174 EVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKESRKTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I +E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDLISDDNFEYTA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR +R GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 162 LKMALENYVKNLEVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I RK + P+ID I +E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKESRKTVVCPIIDLISDDNFEYTA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGAYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|380800197|gb|AFE71974.1| polypeptide N-acetylgalactosaminyltransferase 10, partial [Macaca
mulatta]
Length = 565
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 50 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 107
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 108 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 167
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 168 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 226
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 227 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 262
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 154 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 212
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 213 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 270
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 271 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 328
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 329 RVGHIYR 335
>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
Length = 551
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 36 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 93
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 94 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 153
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 154 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 212
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 213 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 248
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 140 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 198
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 199 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 256
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 257 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 314
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 315 RVGHIYR 321
>gi|281343220|gb|EFB18804.1| hypothetical protein PANDA_020124 [Ailuropoda melanoleuca]
Length = 275
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 3 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 61
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 62 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 117
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 118 DPSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCS 177
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 178 RVGHIYR 184
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 74/112 (66%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L KLE+Y+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 3 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 61
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 62 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 111
>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
Length = 600
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 102/218 (46%), Positives = 137/218 (62%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE GK Y L E D+ E G N+ SN+I+ DR++PD+R CK Y +LP
Sbjct: 82 GKGEHGKPYPLVED--ECDDSVYKENGFNIYVSNNIALDRSLPDIRHPNCKQKLYLENLP 139
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RT+HSI RTP + EIILVDD+S + L L +Y+ RF
Sbjct: 140 NTSIIIPFHNEGWSSLLRTLHSISNRTPDHLIAEIILVDDYSDREHLKAHLAEYMSRF-P 198
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KVR++R +REGLIRTR GA +RGEV+ FLD+HCE +NWLPPLL I + K + P
Sbjct: 199 KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEANINWLPPLLDQIAQNPKTIVCP 258
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ + + + + RG F+W M YK +P
Sbjct: 259 MIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 294
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 93/187 (49%), Positives = 123/187 (65%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +Y+ RF KVR++R +REGLIRTR GA +RGEV+ FLD+HCE +NWLPPL
Sbjct: 186 LKAHLAEYMSRF-PKVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEANINWLPPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 245 LDQIAQNPKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPELQGPDP 302
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+PY+SP AGGLFA++R +F ELGGYD GL +WGGE FE+SFK+WMCGGS+ VPCS
Sbjct: 303 --SDPYQSPVMAGGLFAVNRQWFWELGGYDTGLEIWGGEQFEISFKVWMCGGSMYDVPCS 360
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 361 RVGHIYR 367
>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
scrofa]
Length = 506
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 95 LKKPLEDYMALF-PNVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 153
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 154 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 211
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 212 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 269
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 270 RVGHIYR 276
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 93/199 (46%), Positives = 128/199 (64%), Gaps = 3/199 (1%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D + E G N+ S+ IS +R++PD+R C Y LP S+I+ FHNEG+SSL+RT
Sbjct: 8 DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEMLPNTSIIIPFHNEGWSSLLRT 67
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 348
VHS++ R+P + + EI+LVDDFS + L + LEDY+ F VR++R +REGLIRTR
Sbjct: 68 VHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALF-PNVRILRTKKREGLIRTRML 126
Query: 349 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 408
GA + G+VI FLD+HCE +NWLPPLL I +RK + P+ID ID+ +FR +
Sbjct: 127 GASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAG 184
Query: 409 HHYRGIFEWGMLYKENELP 427
RG F+W M YK +P
Sbjct: 185 DAMRGAFDWEMYYKRIPIP 203
>gi|355689586|gb|AER98882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mustela putorius
furo]
Length = 320
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 20 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLETLP 77
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 78 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMALFPS 137
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 138 -VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 196
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 197 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 232
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 124 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 182
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 183 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 240
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 241 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 298
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 299 RVGHIYR 305
>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cavia porcellus]
Length = 602
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 124/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 191 LKKPLEDYMALFPS-VRILRTKRREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 249
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 250 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 307
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 308 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 365
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 366 RVGHIYR 372
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 136/218 (62%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + E R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 87 GNGEQGRPYPMTEGERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLEVLP 144
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 145 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 204
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 205 -VRILRTKRREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 263
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 264 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 299
>gi|260789712|ref|XP_002589889.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
gi|229275074|gb|EEN45900.1| hypothetical protein BRAFLDRAFT_81982 [Branchiostoma floridae]
Length = 534
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 104/227 (45%), Positives = 143/227 (62%), Gaps = 7/227 (3%)
Query: 210 GPGEGGKAY-HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
GPGE G+ Y + E + LG G N S+ IS +R +PD R + CK YP L
Sbjct: 14 GPGEYGRPYVYTEEDNKRKSFGYLGN-GFNAHVSDKISVERALPDTRDQPCKDRLYPSRL 72
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ FHNE +S+L+RTVH +I RTP L E+ILVDDFSSK + + L +Y+ F
Sbjct: 73 PNVSVIIPFHNEHWSTLLRTVHGVIGRTPPHLLGEVILVDDFSSKENCGRPLNEYMATFP 132
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
+VR++R +REGLIR R RG + +RG V+VF+DAHCEV +NWLPPLL PI +T+
Sbjct: 133 -QVRILRMKQREGLIRARLRGVEVARGNVLVFMDAHCEVNVNWLPPLLEPISVSMTTVTI 191
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
P ID ID+ T+E++ + RG+F+W + YK +P + + RK
Sbjct: 192 PTIDVIDHATFEYKE--QQGGPMRGVFDWQLNYK--RIPVLDGRGRK 234
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 89/186 (47%), Positives = 125/186 (67%), Gaps = 6/186 (3%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
+ L +Y+ F +VR++R +REGLIR R RG + +RG V+VF+DAHCEV +NWLPPLL
Sbjct: 122 RPLNEYMATFP-QVRILRMKQREGLIRARLRGVEVARGNVLVFMDAHCEVNVNWLPPLLE 180
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 123
PI +T+P ID ID+ T+E++ + RG+F+W + YK +P + + RK
Sbjct: 181 PISVSMTTVTIPTIDVIDHATFEYKE--QQGGPMRGVFDWQLNYK--RIPVLDGRGRKVR 236
Query: 124 -SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+ P+ +P GG+FA+D+ FF LGGYD GL +WGGE FELSFKIW CGG ++ VPCSR
Sbjct: 237 PTLPFSTPVMPGGVFAIDKEFFHHLGGYDSGLEIWGGEQFELSFKIWQCGGVLQEVPCSR 296
Query: 183 IVSLIR 188
+ + R
Sbjct: 297 VGHVFR 302
>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oryctolagus cuniculus]
Length = 603
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMTDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKVDP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cricetulus griseus]
Length = 715
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 138/218 (63%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 200 GNGEQGRPYPMTDAERE--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLETLP 257
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 258 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPS 317
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 318 -VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 376
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 377 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 412
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 304 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 362
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 363 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 420
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 421 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 478
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 479 RVGHIYR 485
>gi|405977048|gb|EKC41520.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 635
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 98/232 (42%), Positives = 140/232 (60%), Gaps = 5/232 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E +EG GE G+A L +A D+ + N+ S+ I+ DR+IPD+R C+ Y
Sbjct: 57 EGKREGLGEKGRAVTLQGEEKALADSLFKKEAFNIIASDKIALDRSIPDVRDSRCREVKY 116
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
P +LP ASVI++FHNE +S L+RT HS++ R+P +YL E+IL+DDFS + +L KL DY+
Sbjct: 117 PKELPTASVIIIFHNEAWSPLLRTAHSVVNRSPPRYLHEVILLDDFSDRPELGSKLHDYV 176
Query: 325 QRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
+ +G V++IR ER GLIR R GAK + G+V++FLD+HCE WL P+LA I D
Sbjct: 177 AKTWPDGIVKIIRTKERSGLIRARLAGAKAATGDVLIFLDSHCETNTGWLEPMLARIKED 236
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
R + P ID ID T ++ G F W + + +PE E K+R
Sbjct: 237 RTAVLCPEIDLIDKNTLQYGGT---GSFSVGGFWWSLHFSWRPIPEHEQKRR 285
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 60/132 (45%), Positives = 83/132 (62%), Gaps = 6/132 (4%)
Query: 58 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPERE 116
L P+L+ I + P++D ID T E Y + Y+ G F W + + ++P R+
Sbjct: 294 LEPILSRIKEFPNSVVCPIVDAIDAHTLE----YSKNGGYQVGGFSWSLHFTWRDVPSRD 349
Query: 117 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
RKY ++P SPT AGGLFA DR FF E+G YDPG+ VWGGEN E+SF+ WMCGG +E
Sbjct: 350 LVHRKY-TDPVGSPTMAGGLFAADRKFFFEIGAYDPGMDVWGGENLEISFRTWMCGGKLE 408
Query: 177 WVPCSRIVSLIR 188
++PCSR+ + R
Sbjct: 409 FIPCSRVGHIFR 420
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 70/130 (53%), Gaps = 5/130 (3%)
Query: 1 DLDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
+L KL DY+ + +G V++IR ER GLIR R GAK + G+V++FLD+HCE WL
Sbjct: 167 ELGSKLHDYVAKTWPDGIVKIIRTKERSGLIRARLAGAKAATGDVLIFLDSHCETNTGWL 226
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
P+LA I DR + P ID ID T ++ G F W + + +PE E K
Sbjct: 227 EPMLARIKEDRTAVLCPEIDLIDKNTLQYGGT---GSFSVGGFWWSLHFSWRPIPEHEQK 283
Query: 119 KRKYNSEPYK 128
+R P +
Sbjct: 284 RRSSGIAPIR 293
>gi|432098843|gb|ELK28338.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Myotis davidii]
Length = 428
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 17 LKKPLEDYMAHFPS-VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 75
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 76 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 133
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 134 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 191
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 192 RVGHIYR 198
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 70/112 (62%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L + LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 17 LKKPLEDYMAHFPS-VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 75
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I +RK + P+ID ID+ +FR + RG F+W M YK +P
Sbjct: 76 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 125
>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Ornithorhynchus anatinus]
Length = 556
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 102/242 (42%), Positives = 145/242 (59%), Gaps = 3/242 (1%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRM 256
L L + +EGPGE GKA + + + N+ S+ I+ +R++PD+R+
Sbjct: 43 LPALRAVISRSQEGPGEMGKAVLISKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRL 102
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
E CK YP +LP V++VFHNE +S+L+RTV S+I R+P L E+ILVDD S + L
Sbjct: 103 EGCKTKIYPDELPNTRVVIVFHNEAWSTLLRTVFSVINRSPRSLLSEVILVDDASERDFL 162
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
LE+Y++ + V++IR +R GLIR R RGA SRG+VI FLDAHCE WL PLL
Sbjct: 163 KTSLENYVKNLDVPVKIIRMEQRSGLIRARLRGAAASRGQVITFLDAHCECTFGWLEPLL 222
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
A I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 ARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKG 279
Query: 437 NR 438
+R
Sbjct: 280 DR 281
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ + V++IR +R GLIR R RGA SRG+VI FLDAHCE WL PL
Sbjct: 162 LKTSLENYVKNLDVPVKIIRMEQRSGLIRARLRGAAASRGQVITFLDAHCECTFGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|71896287|ref|NP_001025547.1| polypeptide N-acetylgalactosaminyltransferase 1 [Xenopus (Silurana)
tropicalis]
gi|60649677|gb|AAH90583.1| galnt1 protein [Xenopus (Silurana) tropicalis]
Length = 452
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 98/230 (42%), Positives = 142/230 (61%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GK +P+ + N+ S I+ +R++PD+R+E CK YP L
Sbjct: 56 EGPGEMGKPVVIPKEEQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDSL 115
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SV++VFHNE +++L+RTVHS+I R+P L+EIILVDD S + L + LE Y+++
Sbjct: 116 PTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLQEIILVDDASEREFLKRPLETYVKKLT 175
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V ++R +R GLIR R RGA S+G+VI FLDAHCE + WL PLLA I DR+ +
Sbjct: 176 VPVHVLRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLARIKHDRRTVVC 235
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +R+ +R
Sbjct: 236 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRRGDR 282
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V ++R +R GLIR R RGA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRPLETYVKKLTVPVHVLRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +R+
Sbjct: 223 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRR 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|441596034|ref|XP_003276624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Nomascus leucogenys]
gi|119582046|gb|EAW61642.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
gi|119582047|gb|EAW61643.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
Length = 506
Score = 191 bits (484), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 95 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 153
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 154 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 211
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 212 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 269
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 270 RVGHIYR 276
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 93/199 (46%), Positives = 128/199 (64%), Gaps = 3/199 (1%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D + E G N+ S+ IS +R++PD+R C Y LP S+I+ FHNEG+SSL+RT
Sbjct: 8 DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRT 67
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 348
VHS++ R+P + + EI+LVDDFS + L + LEDY+ F VR++R +REGLIRTR
Sbjct: 68 VHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALF-PSVRILRTKKREGLIRTRML 126
Query: 349 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 408
GA + G+VI FLD+HCE +NWLPPLL I +RK + P+ID ID+ +FR +
Sbjct: 127 GASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAG 184
Query: 409 HHYRGIFEWGMLYKENELP 427
RG F+W M YK +P
Sbjct: 185 DAMRGAFDWEMYYKRIPIP 203
>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
griseus]
Length = 494
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 77 LKKPLEDYMALF-PSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 135
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 136 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 193
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 194 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 251
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 252 RVGHIYR 258
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 80/160 (50%), Positives = 108/160 (67%), Gaps = 3/160 (1%)
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LEDY+ F
Sbjct: 29 LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALF 88
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK +
Sbjct: 89 -PSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLDRIARNRKTIV 147
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ +FR + RG F+W M YK +P
Sbjct: 148 CPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 185
>gi|395736404|ref|XP_002816158.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pongo
abelii]
Length = 527
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 57 LKKPLEDYMALFP-SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 115
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 116 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 173
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 174 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 231
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 232 RVGHIYR 238
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 72/116 (62%), Gaps = 3/116 (2%)
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
++ L + LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NW
Sbjct: 53 NQKHLKKPLEDYMALFP-SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNW 111
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
LPPLL I +RK + P+ID ID+ +FR + RG F+W M YK +P
Sbjct: 112 LPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 165
>gi|195567537|ref|XP_002107316.1| GD15630 [Drosophila simulans]
gi|194204722|gb|EDX18298.1| GD15630 [Drosophila simulans]
Length = 411
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 94/166 (56%), Positives = 118/166 (71%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
R V K LGN EP + GPGE G+A+ L + DAS EYGMN+ S+ IS
Sbjct: 61 REVPKLVDGLGNFEPKDVKPRSGPGENGEAHSLSPDKKHMSDASEMEYGMNIACSDEISM 120
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++ D R+EEC++WDYP DLP+ SVI+VFHNEGFS LMRTVHS+I R+P L EIILV
Sbjct: 121 HRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVIDRSPTHMLHEIILV 180
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
DDFS K +L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+
Sbjct: 181 DDFSDKENLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEA 226
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/39 (58%), Positives = 32/39 (82%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKES 39
+L +L++Y+ +F G V++IRN EREGLIRTRSRGA E+
Sbjct: 188 NLRSQLDEYVLQFKGLVKVIRNKEREGLIRTRSRGAMEA 226
>gi|48734736|gb|AAH72450.1| GALNT10 protein, partial [Homo sapiens]
Length = 413
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 96/187 (51%), Positives = 125/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 2 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 60
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 61 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 118
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 119 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 176
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 177 RVGHIYR 183
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 53/112 (47%), Positives = 70/112 (62%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L + LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 2 LKKPLEDYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 60
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I +RK + P+ID ID+ +FR + RG F+W M YK +P
Sbjct: 61 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 110
>gi|449270895|gb|EMC81541.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Columba livia]
Length = 440
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 26 LKEKLEEYMVRF-AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 84
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 85 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 140
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 141 DPSDPFESPVMAGGLFAVNRQWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCS 200
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 201 RVGHIYR 207
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 74/112 (66%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L +KLE+Y+ RF KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 26 LKEKLEEYMVRF-AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 84
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 85 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 134
>gi|326918600|ref|XP_003205576.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Meleagris gallopavo]
Length = 303
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 129/187 (68%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ RF KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 28 LKEKLEEYMVRF-AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 86
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + K + P+ID ID+ + + + + RG F+W M YK +P +R
Sbjct: 87 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRA 142
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCS
Sbjct: 143 DPSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCS 202
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 203 RVGHIYR 209
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 54/112 (48%), Positives = 74/112 (66%), Gaps = 3/112 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L +KLE+Y+ RF KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPL
Sbjct: 28 LKEKLEEYMVRF-AKVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 86
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
L I + K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 87 LNQIALNHKTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 136
>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
17 [Ciona intestinalis]
Length = 870
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 104/228 (45%), Positives = 140/228 (61%), Gaps = 13/228 (5%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
GPGE G A HL R+ ++ E G N+ SN IS +R++PD+R + C Y LP
Sbjct: 355 GPGELGVAVHLSTEERSR--SAYSENGFNILVSNRISLNRSLPDIRHKNCASRKYLAQLP 412
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS----KADLDQKLEDYIQ 325
AS+I+ FHNEG ++L+RT+HSII RTP L EIILVDD S+ K+ LDQ+L Y Q
Sbjct: 413 DASIIIPFHNEGRTTLLRTIHSIINRTPKILLREIILVDDCSTVDHLKSSLDQELSKYRQ 472
Query: 326 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
V+L+R +REGLIR R G +++G IV LD+H EV NWLPPLL PI DRK+
Sbjct: 473 -----VKLVRLAKREGLIRARLAGVHQAKGNTIVILDSHVEVTNNWLPPLLEPIALDRKV 527
Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+T P+ID I+ +F + +P RG F+W + YK +P + K
Sbjct: 528 ITCPMIDIINKD--DFHYLTQPGDAMRGAFDWELYYKRIPIPPEKQLK 573
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 120/187 (64%), Gaps = 9/187 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
LD++L Y +V+L+R +REGLIR R G +++G IV LD+H EV NWLPPL
Sbjct: 463 LDQELSKY-----RQVKLVRLAKREGLIRARLAGVHQAKGNTIVILDSHVEVTNNWLPPL 517
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI DRK++T P+ID I+ +F + +P RG F+W + YK +P K+ K
Sbjct: 518 LEPIALDRKVITCPMIDIINKD--DFHYLTQPGDAMRGAFDWELYYKRIPIPPE--KQLK 573
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++ P AGGLFA+DR +F E+G YD GL +WGGE +ELSFK WMCGG I PCS
Sbjct: 574 DPSDPFEDPVMAGGLFAIDRLYFKEIGEYDDGLEIWGGEQYELSFKAWMCGGKILDAPCS 633
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 634 RVGHIYR 640
>gi|307186144|gb|EFN71869.1| N-acetylgalactosaminyltransferase 6 [Camponotus floridanus]
Length = 602
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 98/198 (49%), Positives = 128/198 (64%), Gaps = 8/198 (4%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KKL+DYI ++ KV ++R +R GLIR R GAK +R +V+VFLD+H E +NWLPP
Sbjct: 187 ELKKKLDDYIAQYLPKVSIVRLAKRSGLIRGRLAGAKAARAKVLVFLDSHSEANVNWLPP 246
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI + K P ID I Y+T+E+R+ D RG F+W + YK L + K+
Sbjct: 247 LLEPIAQNYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKR- 302
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP+KSP AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PC
Sbjct: 303 --PAEPFKSPIMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPC 360
Query: 181 SRIVSLIR--PVFKADGK 196
SR+ + R P F G+
Sbjct: 361 SRVGHIYRKFPPFPNPGR 378
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/229 (43%), Positives = 139/229 (60%), Gaps = 3/229 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE G+ L + A + G N S+ IS +R++PD+R +C+ Y
Sbjct: 77 EEKRTGMGEHGRPAFLSPSLDARKEKLYQVNGFNAALSDEISLNRSVPDIRHPDCRKKKY 136
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+LMRT S+I R+P LEEIILVDD S+K +L +KL+DYI
Sbjct: 137 SKNLDPVSVIVSFHNEHFSTLMRTCWSVINRSPPSLLEEIILVDDASTKVELKKKLDDYI 196
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
++ KV ++R +R GLIR R GAK +R +V+VFLD+H E +NWLPPLL PI + K
Sbjct: 197 AQYLPKVSIVRLAKRSGLIRGRLAGAKAARAKVLVFLDSHSEANVNWLPPLLEPIAQNYK 256
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
P ID I Y+T+E+R+ D RG F+W + YK L + K+
Sbjct: 257 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKR 302
>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
mutus]
Length = 564
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/244 (43%), Positives = 148/244 (60%), Gaps = 9/244 (3%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL----- 254
LEP +P+ EGPGE GK +P+ + N+ S I+ +R++PD+
Sbjct: 48 LEPVQKPH-EGPGEMGKPVVIPKEDQEKMKEMFKINQFNLMASEMIALNRSLPDVSLPDV 106
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R+E CK YP +LP SV++VFHNE +S+L+RTVHSII +P LEEI+LVDD S +
Sbjct: 107 RLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSIINHSPRHMLEEIVLVDDASERD 166
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL P
Sbjct: 167 FLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEP 226
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +R
Sbjct: 227 LLARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 283
Query: 435 KYNR 438
K +R
Sbjct: 284 KGDR 287
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 168 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 227
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 228 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 284
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 285 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 344
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 345 SHVGHVFR 352
>gi|404434384|ref|NP_001258248.1| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|404501473|ref|NP_955425.2| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|149031397|gb|EDL86387.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Rattus norvegicus]
Length = 609
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 108/247 (43%), Positives = 145/247 (58%), Gaps = 11/247 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D N+E P + + E G ++ E + D ++ NM
Sbjct: 67 VLEPQFKANRMDDLMNNNIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLI 124
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R EC+ YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA L
Sbjct: 125 SNRLGYHRDVPDTRNAECRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLL 184
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VF
Sbjct: 185 HEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVF 244
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D + PVID I T Y RG F WG+
Sbjct: 245 LDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 300
Query: 421 YKENELP 427
+K + +P
Sbjct: 301 FKWDLVP 307
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 198 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 257
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P +
Sbjct: 258 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSDLGG 313
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 314 ADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 373
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 374 CSRVGHIFR 382
>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Otolemur garnettii]
Length = 603
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 137/218 (62%), Gaps = 5/218 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GNGEQGRPYPMSDAERV--DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + LE Y+ F
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEAYMALFPS 205
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPLL I +RK + P
Sbjct: 206 -VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCP 264
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ID ID+ +FR + RG F+W M YK +P
Sbjct: 265 MIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 300
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 124/187 (66%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LE Y+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 192 LKKPLEAYMALFPS-VRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 309 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|149031396|gb|EDL86386.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_a [Rattus norvegicus]
Length = 356
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 108/247 (43%), Positives = 145/247 (58%), Gaps = 11/247 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D N+E P + + E G ++ E + D ++ NM
Sbjct: 67 VLEPQFKANRMDDLMNNNIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLI 124
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R EC+ YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA L
Sbjct: 125 SNRLGYHRDVPDTRNAECRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLL 184
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VF
Sbjct: 185 HEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVF 244
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D + PVID I T Y RG F WG+
Sbjct: 245 LDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 300
Query: 421 YKENELP 427
+K + +P
Sbjct: 301 FKWDLVP 307
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 68/149 (45%), Positives = 91/149 (61%), Gaps = 5/149 (3%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 198 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 257
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P +
Sbjct: 258 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSDLGG 313
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELG 148
+ P +SPT AGGLFAM+R +F +LG
Sbjct: 314 ADSATAPIRSPTMAGGLFAMNRQYFNDLG 342
>gi|51315700|sp|Q6P6V1.1|GLT11_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|38303875|gb|AAH62004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Rattus norvegicus]
Length = 608
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 108/247 (43%), Positives = 145/247 (58%), Gaps = 11/247 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D N+E P + + E G ++ E + D ++ NM
Sbjct: 66 VLEPQFKANRMDDLMNNNIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R EC+ YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAECRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELP 427
+K + +P
Sbjct: 300 FKWDLVP 306
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 197 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P +
Sbjct: 257 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSDLGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 ADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
>gi|432097047|gb|ELK27545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Myotis davidii]
Length = 558
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 127/189 (67%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++++++ GK++LIRNT+REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 182 DLKGELDEFVQKHLPGKIKLIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 241
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I DR+ + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 242 PLLAAIREDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELEG 297
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P KSPT AGGLFAM+R++F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 298 PEGATAPIKSPTMAGGLFAMNRSYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 357
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 358 CSRVGHIFR 366
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 128/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 96 DLGYQKHAFNLLISNRLGHHRDVPDTRNAACKDKIYPTDLPVASVVICFYNEALSALLRT 155
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA+ L EIILVDD S DL +L++++Q+ GK++LIRNT+REGLIR R
Sbjct: 156 VHSVLDRTPARLLHEIILVDDSSDFDDLKGELDEFVQKHLPGKIKLIRNTKREGLIRGRM 215
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 216 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 271
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 272 SPVVRGGFNWGLHFKWDLVPLSE 294
>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus laevis]
gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
Length = 556
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 97/230 (42%), Positives = 143/230 (62%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +L
Sbjct: 55 EGPGELGKAVIIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDIRLEGCKTKVYPDEL 114
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P S+++VFHNE +S+L+RTVHS+I R+P + + EIILVDD S + L LE+Y++
Sbjct: 115 PNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDASERDFLKTPLENYVKHLE 174
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V+++R +R GLIR R GA ++G++I FLDAHCE WL PLLA I DRK +
Sbjct: 175 VAVKILRMEQRSGLIRARLSGANVAKGKIITFLDAHCECTFGWLEPLLARIKEDRKTVVC 234
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 235 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V+++R +R GLIR R GA ++G++I FLDAHCE WL PL
Sbjct: 162 LKTPLENYVKHLEVAVKILRMEQRSGLIRARLSGANVAKGKIITFLDAHCECTFGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDKKYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|241622516|ref|XP_002407424.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
gi|215500988|gb|EEC10482.1| pp-GalNAc-transferase, putative [Ixodes scapularis]
Length = 471
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 96/188 (51%), Positives = 125/188 (66%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDYI + V ++R T+REGLIR R GA+++ G+V++FLD+H E NWLPPL
Sbjct: 66 LKKPLEDYIAKHFTNVHVVRATKREGLIRARLMGARQATGDVLIFLDSHTEANTNWLPPL 125
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKR 120
L PI D + + P ID IDY+T+ +R+ D RG F+W + YK LP+ AK
Sbjct: 126 LEPIAKDYRTVVCPFIDVIDYETFAYRA---QDEGARGSFDWELYYKRLPLLPDDLAKP- 181
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP+KSP AGGLFA+ R +F ELGGYD GL VWGGE +ELSFKIW CGG++ PC
Sbjct: 182 ---TEPFKSPVMAGGLFAISRKYFWELGGYDEGLDVWGGEQYELSFKIWQCGGTMVDAPC 238
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 239 SRVGHIYR 246
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/170 (42%), Positives = 104/170 (61%), Gaps = 5/170 (2%)
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y +LP ASV++ FHNE +++L+RT S++ RT + I D S L + LEDY
Sbjct: 15 YVKELPTASVVVPFHNEHWTTLLRTATSVLNRTTEVKIG-YIKRDAHRSLHQLKKPLEDY 73
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
I + V ++R T+REGLIR R GA+++ G+V++FLD+H E NWLPPLL PI D
Sbjct: 74 IAKHFTNVHVVRATKREGLIRARLMGARQATGDVLIFLDSHTEANTNWLPPLLEPIAKDY 133
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAK 432
+ + P ID IDY+T+ +R+ D RG F+W + YK LP+ AK
Sbjct: 134 RTVVCPFIDVIDYETFAYRA---QDEGARGSFDWELYYKRLPLLPDDLAK 180
>gi|432950788|ref|XP_004084611.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11-like [Oryzias
latipes]
Length = 574
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 90/178 (50%), Positives = 117/178 (65%), Gaps = 4/178 (2%)
Query: 11 ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 70
E GKVRL+RN +REGLIR R GA + G+V+VFLD+HCEV +WL PLLAPI DR+
Sbjct: 175 EELQGKVRLVRNRKREGLIRGRMIGAAHATGDVLVFLDSHCEVNQDWLQPLLAPIQKDRR 234
Query: 71 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSP 130
+ P+ID I T Y RG F WG+ +K + +P E + + P +SP
Sbjct: 235 TVVCPIIDIISADTL----TYSSSPIVRGGFNWGLHFKWDPVPPSEISGPEGAAGPIRSP 290
Query: 131 THAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
T AGGLFAM+R +F ELG YDPG+ +WGGEN E+SF+IWMCGG + +PCSR+ + R
Sbjct: 291 TMAGGLFAMNREYFNELGRYDPGMDIWGGENLEISFRIWMCGGQLLIIPCSRVGHIFR 348
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 94/210 (44%), Positives = 127/210 (60%), Gaps = 5/210 (2%)
Query: 222 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 281
EA + DA + N+ SN + R +PD R ++C+ YP LP ASV++ F NE
Sbjct: 71 EADQEVRDAGYHRHAFNVLISNRLGSHRELPDTRDKQCRKRSYPQALPSASVVICFFNEA 130
Query: 282 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTERE 340
S+L+RTVHS++ RTPA L EIILVDD S +L + L+ + + GKVRL+RN +RE
Sbjct: 131 LSALLRTVHSVLDRTPAYLLHEIILVDDQSELEELKEGLDRCVREELQGKVRLVRNRKRE 190
Query: 341 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 400
GLIR R GA + G+V+VFLD+HCEV +WL PLLAPI DR+ + P+ID I T
Sbjct: 191 GLIRGRMIGAAHATGDVLVFLDSHCEVNQDWLQPLLAPIQKDRRTVVCPIIDIISADTL- 249
Query: 401 FRSVYEPDHHYRGIFEWGMLYKENELPERE 430
Y RG F WG+ +K + +P E
Sbjct: 250 ---TYSSSPIVRGGFNWGLHFKWDPVPPSE 276
>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Danio rerio]
Length = 557
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 97/230 (42%), Positives = 143/230 (62%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+GPGE GK + + + N+ S I+ +R++PD+R+E CK YP DL
Sbjct: 54 DGPGEMGKPVVIAKDQQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDDL 113
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P+ SV++VFHNE +++L+RTVHS+I R+P LEEI+LVDD S + L ++LE Y+++
Sbjct: 114 PRTSVVIVFHNEAWTTLLRTVHSVIDRSPRHLLEEIVLVDDASERDFLKRQLEHYVRKLE 173
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R +R GLIR R +GA S G+VI FLDAHCE WL PLL+ I D+K +
Sbjct: 174 VPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLSRIKLDKKTVVC 233
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 PIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 280
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+ + VR++R +R GLIR R +GA S G+VI FLDAHCE WL PL
Sbjct: 161 LKRQLEHYVRKLEVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPL 220
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I D+K + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 221 LSRIKLDKKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 277
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 278 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 337
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 338 SHVGHVFR 345
>gi|126341064|ref|XP_001364304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Monodelphis domestica]
Length = 609
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 105/249 (42%), Positives = 144/249 (57%), Gaps = 12/249 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAY-------RAAGDASLGEYGMNMETS 242
+ + K+ +EP LE E G+ PE + D ++ N+ S
Sbjct: 66 LLEPQSKVNKIEPILENNGEDAGKEEDTELSPEMGMIFNERDQELRDLGYQKHAFNLLIS 125
Query: 243 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
N + + R +PD R ECK YP DLP AS+++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 126 NRLGYHRDVPDTRNAECKEKSYPSDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLH 185
Query: 303 EIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
EIILVDD S DL +L+ Y+Q++ GK++++RN +REGLIR R GA + GEV+VFL
Sbjct: 186 EIILVDDNSEFDDLKGELDKYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFL 245
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 421
D+HCEV WL PLL PI DR+ + PVID I T +Y RG F WG+ +
Sbjct: 246 DSHCEVNKMWLQPLLVPIQEDRRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHF 301
Query: 422 KENELPERE 430
K + +P E
Sbjct: 302 KWDLVPFSE 310
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 124/189 (65%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L+ Y++++ GK++++RN +REGLIR R GA + GEV+VFLD+HCEV WL
Sbjct: 198 DLKGELDKYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQ 257
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI DR+ + PVID I T +Y RG F WG+ +K + +P E +
Sbjct: 258 PLLVPIQEDRRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHFKWDLVPFSELEG 313
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 314 PEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 373
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 374 CSRVGHIFR 382
>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
Length = 507
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 126/189 (66%), Gaps = 5/189 (2%)
Query: 2 LDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L +KL+ YI+ +G V+++R ER+GLIR R GAK + GEV+VFLD+HCE + WL
Sbjct: 103 LGEKLDGYIKDTWPDGIVKVVRAPERQGLIRARVLGAKAATGEVLVFLDSHCECNVQWLE 162
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+A I R + P+ID ID + + + G F W + + LP+RE K+
Sbjct: 163 PLVARIKESRSALLCPMIDVIDAKAMSYNGIGAGS---VGGFWWSLHFSWRPLPQRERKR 219
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + E +SPT AGGLFA DR +F E+GGYDPG+ VWGGEN E+SF++WMCGG++E+VP
Sbjct: 220 RKSSVETIRSPTMAGGLFAADRKYFFEIGGYDPGMDVWGGENLEISFRVWMCGGTLEFVP 279
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 280 CSRVGHIFR 288
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 89/223 (39%), Positives = 134/223 (60%), Gaps = 5/223 (2%)
Query: 215 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 274
G+ L + D + N+ S+ I+ +R++ D R +C YP +P ASV+
Sbjct: 2 GRRVELSAEKQEEADKLFKKEAFNIVASDMIALNRSVSDNRDPQCSRVSYPKVMPNASVV 61
Query: 275 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF--NGKVR 332
++FHNE +S L+RTVHS++ R+P +YL E+IL+DDFS +A L +KL+ YI+ +G V+
Sbjct: 62 IIFHNEAWSPLLRTVHSVVNRSPPEYLHEVILLDDFSDRAGLGEKLDGYIKDTWPDGIVK 121
Query: 333 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVID 392
++R ER+GLIR R GAK + GEV+VFLD+HCE + WL PL+A I R + P+ID
Sbjct: 122 VVRAPERQGLIRARVLGAKAATGEVLVFLDSHCECNVQWLEPLVARIKESRSALLCPMID 181
Query: 393 GIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
ID + + + G F W + + LP+RE K+RK
Sbjct: 182 VIDAKAMSYNGIGAGS---VGGFWWSLHFSWRPLPQRERKRRK 221
>gi|226482458|emb|CAX73828.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
Length = 603
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 92/175 (52%), Positives = 121/175 (69%), Gaps = 6/175 (3%)
Query: 14 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 73
+GKVR+I ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI ++ + +
Sbjct: 196 DGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAANYRTVV 255
Query: 74 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 133
P ID ID +E+R+ D RG F+W + YK LP R + + EP+ SP A
Sbjct: 256 CPFIDVIDADNFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDNHHPEEPFDSPVMA 309
Query: 134 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRI + R
Sbjct: 310 GGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDTPCSRIGHIYR 364
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 97/228 (42%), Positives = 138/228 (60%), Gaps = 6/228 (2%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
LE + GPGE G L + ++ E G ++ S I DR+I D+R CK
Sbjct: 70 LENSRVGPGENGMPVKLSTHEKKIAAKTINENGFSVYVSTKIKTDRSIKDIRHPNCKGKL 129
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y LP ASVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 130 YSNKLPTASVIIPFFEEHWETLLRTVASVLNRAPSALIKEVILVDDGSSREYLKDRLDSH 189
Query: 324 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
I +GKVR+I ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 IISAYPDGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAA 249
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPE 428
+ + + P ID ID +E+R+ D RG F+W + YK LPE
Sbjct: 250 NYRTVVCPFIDVIDADNFEYRA---QDEGARGAFDWELYYKRLPRLPE 294
>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
Length = 536
Score = 188 bits (477), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 100/239 (41%), Positives = 144/239 (60%), Gaps = 8/239 (3%)
Query: 203 PLEPYKEGP---GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
P P+ P GE G++ +P+ +A D +G N S+H+S RT+PDLR C
Sbjct: 22 PTLPHNFNPNAIGENGESVIVPDKAKAESDKLFKNHGFNQWASDHMSLHRTLPDLRPSLC 81
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K +P DLP+ SV++VFHNE S+L+RTVHS++ R+ + +IILVDDFSS D
Sbjct: 82 KSQVFPKDLPQTSVVIVFHNEALSTLLRTVHSVLDRSAPDLIHQIILVDDFSSIKGHD-P 140
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
L+ YI KV L+RN +REGLIR+R G + ++ FLDAHCEV + WL PLL +
Sbjct: 141 LKKYIADLK-KVILVRNPKREGLIRSRIIGYSRATAPIVTFLDAHCEVTIGWLEPLLDRV 199
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR-KYN 437
+ +R ++ P ID ID +T+++R+ D RG+F W M ++ P +E K+R YN
Sbjct: 200 HQNRSVVVCPEIDVIDDKTFQYRAGSSGD--IRGVFNWDMKFRWRLTPSQEQKRRNNYN 256
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 85/187 (45%), Positives = 119/187 (63%), Gaps = 4/187 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
L+ YI KV L+RN +REGLIR+R G + ++ FLDAHCEV + WL PLL +
Sbjct: 141 LKKYIADLK-KVILVRNPKREGLIRSRIIGYSRATAPIVTFLDAHCEVTIGWLEPLLDRV 199
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK-YNS 124
+ +R ++ P ID ID +T+++R+ D RG+F W M ++ P +E K+R YN
Sbjct: 200 HQNRSVVVCPEIDVIDDKTFQYRAGSSGD--IRGVFNWDMKFRWRLTPSQEQKRRNNYNV 257
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
+SPT AGGLFA+DR +F E+G YD + +WGGEN ELSF+IW CGG +E +PCS +
Sbjct: 258 LFARSPTMAGGLFAIDRQYFQEIGLYDSQMDIWGGENLELSFRIWQCGGQLEIMPCSHVG 317
Query: 185 SLIRPVF 191
+ R V
Sbjct: 318 HVFRNVI 324
>gi|340713833|ref|XP_003395440.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like [Bombus
terrestris]
Length = 610
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 101/215 (46%), Positives = 134/215 (62%), Gaps = 3/215 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK L + A + G N S+ IS +R++PD+R +CK Y +
Sbjct: 88 RTGIGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLKN 147
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
L SVI+ FHNE FS+LMRT S+I R+PA L+EIILVDD S+KA+L + LEDYI
Sbjct: 148 LDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKAELKKPLEDYITER 207
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KV+L+R ER GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 208 FTKVKLVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYKTCV 267
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 299
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 101/199 (50%), Positives = 127/199 (63%), Gaps = 10/199 (5%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L K LEDYI ERF KV+L+R ER GLI+ R GAK ++ +V+VFLD+H E +NWLP
Sbjct: 195 ELKKPLEDYITERFT-KVKLVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLP 253
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D K P ID I Y+T+E+R+ D RG F+W + YK L + +
Sbjct: 254 PLLEPIAQDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN 310
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+EP+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + P
Sbjct: 311 ---PTEPFKSPVMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAP 367
Query: 180 CSRIVSLIR--PVFKADGK 196
CSR+ + R P F GK
Sbjct: 368 CSRVGHIYRKFPPFPNPGK 386
>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
laevis]
gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
Length = 559
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 99/235 (42%), Positives = 146/235 (62%), Gaps = 4/235 (1%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
L+P +EGPGE GK + + + N+ S I+ +R++PD+R+E CK
Sbjct: 52 LKP-QEGPGEMGKPVVILKEEQERMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKV 110
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
YP +LP SV++VFHNE +++L+RTVHS+I R+P L EI+LVDD S + L + LE Y
Sbjct: 111 YPDNLPTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLREIVLVDDASERDFLKRALETY 170
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+++ + V +IR +R GLIR R RGA S+G+VI FLDAHCE + WL PLLA I DR
Sbjct: 171 VKKLSVPVHVIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLARINHDR 230
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +R+ +R
Sbjct: 231 RTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRRGDR 282
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ + V +IR +R GLIR R RGA S+G+VI FLDAHCE + WL PL
Sbjct: 163 LKRALETYVKKLSVPVHVIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +R+
Sbjct: 223 LARINHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRR 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|149031398|gb|EDL86388.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_c [Rattus norvegicus]
Length = 560
Score = 187 bits (476), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 103/230 (44%), Positives = 138/230 (60%), Gaps = 7/230 (3%)
Query: 199 NLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 258
N+E P + + E G ++ E + D ++ NM SN + + R +PD R E
Sbjct: 35 NIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLISNRLGYHRDVPDTRNAE 92
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C+ YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA L EIILVDD S DL
Sbjct: 93 CRGKSYPTDLPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKG 152
Query: 319 KLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+L++YIQR+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL PLLA
Sbjct: 153 ELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 212
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
I D + PVID I T Y RG F WG+ +K + +P
Sbjct: 213 IILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVP 258
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 149 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 208
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P +
Sbjct: 209 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSDLGG 264
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 265 ADSATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 324
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 325 CSRVGHIFR 333
>gi|148671132|gb|EDL03079.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_c [Mus
musculus]
Length = 345
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 108/250 (43%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA+ +L N +E P + + E G ++ E + D ++ NM
Sbjct: 66 VLDPQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPVSE 309
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/151 (45%), Positives = 92/151 (60%), Gaps = 5/151 (3%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 197 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P E
Sbjct: 257 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSELGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGY 150
+ P +SPT AGGLFAM+R +F +LG +
Sbjct: 313 PDGATAPIRSPTMAGGLFAMNRQYFNDLGHF 343
>gi|221042448|dbj|BAH12901.1| unnamed protein product [Homo sapiens]
Length = 527
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 117 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 176
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 177 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGRA 232
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 233 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 292
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 293 SRVGHIFR 300
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 103/222 (46%), Positives = 138/222 (62%), Gaps = 7/222 (3%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G G+GG ++ E + D ++ NM S+ + + R +PD R CK YP DLP
Sbjct: 13 GCGQGGMIFN--ERDQELRDLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLP 70
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-N 328
ASV++ F+NE FS+L+RTVHS+I RTPA L EIILVDD S DL +L++Y+Q++
Sbjct: 71 AASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLP 130
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL PLLA I DR +
Sbjct: 131 GKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVC 190
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
PVID I T Y RG F WG+ +K + +P E
Sbjct: 191 PVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSE 228
>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
Length = 626
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 101/237 (42%), Positives = 141/237 (59%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P ++P +E PGE GKA ++P E N+ S+ IS +R++ D+R E+CK
Sbjct: 113 PFVKPQEETPGEMGKAVNIPIEQEKVMLEKFQENQFNLLASDMISLNRSLTDVRFEKCKA 172
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
YP LP SV++VFHNE +++L+RT+ S I R+P L+EIILVDD S K L +KLE
Sbjct: 173 KRYPTLLPTTSVVIVFHNEAWTTLLRTIWSTINRSPRPLLKEIILVDDASEKEHLGKKLE 232
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+YI+ RL R R GLIR R GAK +G+VI FLDAHCE WL PLL+ I
Sbjct: 233 EYIKTLPVSTRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWLEPLLSRIVE 292
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR + P+ID I T+E+ + D + G F W + ++ +PERE ++R +R
Sbjct: 293 DRSTVVCPIIDVISDTTFEY--IQASDMTWGG-FNWKLNFRWYRVPEREMQRRGGDR 346
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE+YI+ RL R R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 227 LGKKLEEYIKTLPVSTRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWLEPL 286
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ +PERE ++R
Sbjct: 287 LSRIVEDRSTVVCPIIDVISDTTFEY--IQASDMTWGG-FNWKLNFRWYRVPEREMQRRG 343
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLFA+DR +F ++G YD G+ +WGGEN E+SF++W CGG +E VPC
Sbjct: 344 GDRTAPLRTPTMAGGLFAIDREYFYKIGSYDEGMDIWGGENLEMSFRVWQCGGVLEIVPC 403
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 404 SHVGHVFR 411
>gi|322787059|gb|EFZ13283.1| hypothetical protein SINV_13249 [Solenopsis invicta]
Length = 540
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 98/198 (49%), Positives = 128/198 (64%), Gaps = 8/198 (4%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KKL+DY+ + KV ++R +R GLIR R GAK++R +V+VFLD+H E +NWLPP
Sbjct: 127 ELKKKLDDYVAQHLPKVLIVRLPKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPP 186
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI D K P ID I Y+T+E+R+ D RG F+W + YK L + K+
Sbjct: 187 LLEPIARDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKR- 242
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP+KSP AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PC
Sbjct: 243 --PAEPFKSPIMAGGLFAISTKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPC 300
Query: 181 SRIVSLIR--PVFKADGK 196
SR+ + R P F G+
Sbjct: 301 SRVGHIYRKFPPFPNPGR 318
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 134/218 (61%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE G+ L + + G N S+ IS +R++PD+R +CK Y
Sbjct: 17 EERRTGMGEHGRPAFLSPSLDVRKEKLYQVNGFNAALSDEISVNRSVPDIRHSDCKKKQY 76
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+L+RT S++ R+P LEEIILVDD S+K +L +KL+DY+
Sbjct: 77 LKNLDPVSVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKKKLDDYV 136
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+ KV ++R +R GLIR R GAK++R +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 137 AQHLPKVLIVRLPKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPPLLEPIARDYK 196
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 197 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 231
>gi|443727149|gb|ELU14019.1| hypothetical protein CAPTEDRAFT_197005 [Capitella teleta]
Length = 613
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 136/220 (61%), Gaps = 4/220 (1%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
+E + GPGE G A L DA G N S+ IS R++ D+R +C+
Sbjct: 86 IEKQRTGPGEQGAAVILSSDEEKKKDALYKVNGFNGFASDKISLQRSLKDIRHPQCRTQK 145
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y LP SV++ FHNE +S+L+RT S++ R+P + + EIILVDDFSSK + L+D+
Sbjct: 146 YWNKLPTVSVVVPFHNEHWSTLLRTAESVLVRSPPELIHEIILVDDFSSKEHCGKPLDDH 205
Query: 324 I-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
+ + GKV++I +REGLIRTR GA+E+ G+V++FLD+HCE +NWLPPLL PI D
Sbjct: 206 LATHYGGKVKVIHQPKREGLIRTRLAGAREATGDVLIFLDSHCEANVNWLPPLLDPIAED 265
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+ + P ID +DY+T+ +R+ D RG F+W YK
Sbjct: 266 YRTVVCPFIDVVDYETFAYRAQ---DEGARGAFDWEFFYK 302
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/187 (49%), Positives = 125/187 (66%), Gaps = 9/187 (4%)
Query: 4 KKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 62
K L+D++ + GKV++I +REGLIRTR GA+E+ G+V++FLD+HCE +NWLPPLL
Sbjct: 200 KPLDDHLATHYGGKVKVIHQPKREGLIRTRLAGAREATGDVLIFLDSHCEANVNWLPPLL 259
Query: 63 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREAKKRK 121
PI D + + P ID +DY+T+ +R+ D RG F+W YK LPE K
Sbjct: 260 DPIAEDYRTVVCPFIDVVDYETFAYRAQ---DEGARGAFDWEFFYKRLPLLPE----DLK 312
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ + P+KSP AGGLFA+ +F ELGGYDPGL +WGGE +ELSFK+W CGG + PCS
Sbjct: 313 HPARPFKSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSFKLWQCGGQMLDAPCS 372
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 373 RVGHIYR 379
>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
Length = 624
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 124/189 (65%), Gaps = 13/189 (6%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL I
Sbjct: 226 LEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDRI 285
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREAKK 119
D + PVID ID T+E+ HH + G F+WG+ + + +PER+ K
Sbjct: 286 KRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKN 338
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E VP
Sbjct: 339 RTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVP 398
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 399 CSHVGHVFR 407
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 148/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L +A D +
Sbjct: 89 NLAKPKFMVD--------PNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLN 140
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLE 200
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 201 RTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVAT 260
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 261 GEVLTYLDSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 313
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 314 FTSVGGFDWGLQFNWHSIPERDRKNR 339
>gi|26352932|dbj|BAC40096.1| unnamed protein product [Mus musculus]
Length = 608
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 108/250 (43%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA+ +L N +E P + + E G ++ E + D ++ NM
Sbjct: 66 VLDPQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPVSE 309
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 197 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P E
Sbjct: 257 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSELGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
>gi|21450297|ref|NP_659157.1| polypeptide N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|51316059|sp|Q921L8.1|GLT11_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|15030306|gb|AAH11428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|18204499|gb|AAH21504.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|21529335|emb|CAC79626.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Mus
musculus]
gi|21707973|gb|AAH34185.1| Galnt11 protein [Mus musculus]
gi|23274082|gb|AAH36143.1| Galnt11 protein [Mus musculus]
gi|23274085|gb|AAH36145.1| Galnt11 protein [Mus musculus]
gi|33321872|gb|AAQ06668.1| UDP-GalNAc:polypeptide N-Acetylgalactosaminyltransferase T11 [Mus
musculus]
gi|74149639|dbj|BAE36442.1| unnamed protein product [Mus musculus]
gi|148671131|gb|EDL03078.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_b [Mus
musculus]
Length = 608
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 108/250 (43%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA+ +L N +E P + + E G ++ E + D ++ NM
Sbjct: 66 VLDPQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPVSE 309
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 197 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P E
Sbjct: 257 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSELGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
Length = 624
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/191 (47%), Positives = 125/191 (65%), Gaps = 13/191 (6%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
+ LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL
Sbjct: 224 RPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLD 283
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREA 117
I D + PVID ID T+E+ HH + G F+WG+ + + +PER+
Sbjct: 284 RIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDR 336
Query: 118 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 177
K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E
Sbjct: 337 KNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEI 396
Query: 178 VPCSRIVSLIR 188
VPCS + + R
Sbjct: 397 VPCSHVGHVFR 407
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 148/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L +A D +
Sbjct: 89 NLAKPKFMVD--------PNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLN 140
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLE 200
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 201 RTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVAT 260
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 261 GEVLTYLDSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 313
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 314 FTSVGGFDWGLQFNWHSIPERDRKNR 339
>gi|301759365|ref|XP_002915552.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Ailuropoda melanoleuca]
Length = 448
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 95/193 (49%), Positives = 126/193 (65%), Gaps = 4/193 (2%)
Query: 230 ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 289
A +YG N S + +R +PD R + C YP DLP ASV++ FHNE F++L RT+
Sbjct: 100 AGFLKYGFNAILSKSLGSERDVPDTRNKMCLQKHYPADLPTASVVICFHNEEFNALFRTM 159
Query: 290 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 349
S+I TP LEEIILVDD SS DL +KL+ ++ F GK++LIRN +REGLIR+R G
Sbjct: 160 SSVINLTPHHILEEIILVDDLSSVDDLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIG 219
Query: 350 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 409
A + G+V+VFLD+HCEV WL PLLAPI D K++ P+ID ID++T E+R P
Sbjct: 220 ASRASGDVLVFLDSHCEVNHVWLQPLLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSP 275
Query: 410 HYRGIFEWGMLYK 422
RG F W + +K
Sbjct: 276 VVRGAFTWHLEFK 288
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 87/183 (47%), Positives = 123/183 (67%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ +E F GK++LIRN +REGLIR+R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIGASRASGDVLVFLDSHCEVNHVWLQP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI D K++ P+ID ID++T E+R P RG F W + +K + + E
Sbjct: 245 LLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSPVVRGAFTWHLEFKWDNVLSYEIDGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP AGG+FA++R +F E+G YD + +WG EN ELS +IWMCGG + +PC
Sbjct: 301 EGPTKPIRSPAMAGGVFAINRHYFNEIGKYDRDMELWGAENLELSLRIWMCGGQLFILPC 360
Query: 181 SRI 183
SR+
Sbjct: 361 SRV 363
>gi|193784963|dbj|BAG54116.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGRA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM S+ + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
[Tribolium castaneum]
gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
Length = 602
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 100/237 (42%), Positives = 142/237 (59%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + P PGE GKA H+P N+ S+ IS +R++ D+R+E CK
Sbjct: 86 PTVLPAHGLPGEMGKAVHIPPEQEGLMKEKFKLNQFNLLASDMISLNRSLADVRLEGCKD 145
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
YP LP S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L +KLE
Sbjct: 146 KKYPKLLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRPLLKEIILVDDASEREHLGRKLE 205
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+Y+Q V ++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I
Sbjct: 206 EYVQTLPVPVIVLRTHKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVQ 265
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DRK + P+ID I +T+E+ + D + G F W + ++ +P+RE ++R +R
Sbjct: 266 DRKTVVCPIIDVISDETFEY--ITASDMTWGG-FNWKLNFRWYRVPQREMERRNNDR 319
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y++ V ++R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 200 LGRKLEEYVQTLPVPVIVLRTHKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 259
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I +T+E+ + D + G F W + ++ +P+RE ++R
Sbjct: 260 LARIVQDRKTVVCPIIDVISDETFEY--ITASDMTWGG-FNWKLNFRWYRVPQREMERRN 316
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG +E +PC
Sbjct: 317 NDRTAPLRTPTMAGGLFSIDKEYFYELGSYDEGMDIWGGENLEMSFRVWQCGGKLEIIPC 376
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 377 SHVGHVFR 384
>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Oryzias latipes]
Length = 557
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 98/241 (40%), Positives = 145/241 (60%), Gaps = 3/241 (1%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
G + + EGPGE GKA ++ + + N+ S+ I+ +R++PD+R++
Sbjct: 45 GQVVTVISRSHEGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLD 104
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK Y DLP S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD S + L
Sbjct: 105 GCKTKVYADDLPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLK 164
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+KLE Y++ V+++R +R GLIR R RGA + G+VI FLDAHCE WL PLLA
Sbjct: 165 KKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLA 224
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
I DR + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK +
Sbjct: 225 RIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGD 281
Query: 438 R 438
R
Sbjct: 282 R 282
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE Y+ V+++R +R GLIR R RGA + G+VI FLDAHCE WL PL
Sbjct: 163 LKKKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPL 222
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 223 LARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 279
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 339
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 340 SHVGHVFR 347
>gi|21707970|gb|AAH34184.1| Galnt11 protein [Mus musculus]
Length = 411
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 108/250 (43%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKAD--GKLGN--LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA+ +L N +E P + + E G ++ E + D ++ NM
Sbjct: 66 VLDPQFKANRIDRLMNNHIEDPDKGLSKSSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RTVHS++ RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPVSE 309
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 197 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P E
Sbjct: 257 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSELGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
>gi|297682043|ref|XP_002818744.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11,
partial [Pongo abelii]
Length = 587
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELRGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 110/252 (43%), Positives = 149/252 (59%), Gaps = 11/252 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D +E P E + + E G ++ E + D ++ NM
Sbjct: 66 VLEPQFKANKIDDVIDSRVEDPEEGHLKFSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVVCFYNEAFSALLRTVHSVIDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I DR + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPEREAK 432
+K + +P E +
Sbjct: 300 FKWDLVPLSELR 311
>gi|10437774|dbj|BAB15105.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGRA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 109/250 (43%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D +E P E + + E G ++ E + D ++ NM
Sbjct: 66 VLEPQFKANKIDDVIDSRVEDPEEGHLKFSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
S+ + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 124 SDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I DR + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPLSE 309
>gi|153792095|ref|NP_071370.2| polypeptide N-acetylgalactosaminyltransferase 11 [Homo sapiens]
gi|51316030|sp|Q8NCW6.2|GLT11_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|5630076|gb|AAD45821.1|AC006017_1 N-acetylgalactosaminyltransferase; similar to Q10473 (PID:g1709559)
[Homo sapiens]
gi|51105934|gb|EAL24518.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Homo
sapiens]
gi|119574361|gb|EAW53976.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Homo sapiens]
gi|189442406|gb|AAI67834.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[synthetic construct]
gi|345500003|emb|CAC79625.3| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 608
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGRA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 109/250 (43%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D +E P E + + E G ++ E + D ++ NM
Sbjct: 66 VLEPQFKANKIDDVIDSRVEDPEEGHLKFSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
S+ + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 124 SDRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I DR + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPLSE 309
>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase 10-like [Metaseiulus
occidentalis]
Length = 590
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 127/188 (67%), Gaps = 7/188 (3%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+ +LE+Y+ F +V+L+R T+REGLIR R GA+ + G+V++FLD+H E +NWLPP
Sbjct: 184 VSTELENYLSSHFGSQVKLLRATKREGLIRARLLGARAAEGDVLIFLDSHTEANVNWLPP 243
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI +R+ + P ID I Y+T+ +RS D RG F+W + YK L + K+
Sbjct: 244 LLDPIARNRRTVVCPFIDVIHYETFAYRS---QDEGARGAFDWELYYKRLPLLSEDLKR- 299
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP++SP AGGLFA+DR++F ELGGYD GL VWGGE +ELSFKIW CGG + PC
Sbjct: 300 --PTEPFRSPVMAGGLFAIDRSYFWELGGYDEGLDVWGGEQYELSFKIWQCGGQMFDAPC 357
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 358 SRVGHIYR 365
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/233 (41%), Positives = 144/233 (61%), Gaps = 7/233 (3%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAG--DASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
E +GPGE G A LP+ + G N S+ I+ +R++PD+R EC+
Sbjct: 70 EKLAQGPGEQGAAVELPKDAETEQRKEKLYKVNGFNAAVSDLIALNRSLPDIRHSECQNI 129
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK-ADLDQKLE 321
Y LP AS+++ FHNE S L+RT+ S+++R+P ++EIILVDDFSSK + + +LE
Sbjct: 130 RYAARLPTASIVIPFHNEHLSVLLRTITSVLRRSPKSLIKEIILVDDFSSKKSXVSTELE 189
Query: 322 DYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 380
+Y+ F +V+L+R T+REGLIR R GA+ + G+V++FLD+H E +NWLPPLL PI
Sbjct: 190 NYLSSHFGSQVKLLRATKREGLIRARLLGARAAEGDVLIFLDSHTEANVNWLPPLLDPIA 249
Query: 381 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+R+ + P ID I Y+T+ +RS D RG F+W + YK L + K+
Sbjct: 250 RNRRTVVCPFIDVIHYETFAYRS---QDEGARGAFDWELYYKRLPLLSEDLKR 299
>gi|281339845|gb|EFB15429.1| hypothetical protein PANDA_003532 [Ailuropoda melanoleuca]
Length = 447
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 95/193 (49%), Positives = 126/193 (65%), Gaps = 4/193 (2%)
Query: 230 ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 289
A +YG N S + +R +PD R + C YP DLP ASV++ FHNE F++L RT+
Sbjct: 100 AGFLKYGFNAILSKSLGSERDVPDTRNKMCLQKHYPADLPTASVVICFHNEEFNALFRTM 159
Query: 290 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 349
S+I TP LEEIILVDD SS DL +KL+ ++ F GK++LIRN +REGLIR+R G
Sbjct: 160 SSVINLTPHHILEEIILVDDLSSVDDLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIG 219
Query: 350 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 409
A + G+V+VFLD+HCEV WL PLLAPI D K++ P+ID ID++T E+R P
Sbjct: 220 ASRASGDVLVFLDSHCEVNHVWLQPLLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSP 275
Query: 410 HYRGIFEWGMLYK 422
RG F W + +K
Sbjct: 276 VVRGAFTWHLEFK 288
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 87/183 (47%), Positives = 123/183 (67%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ +E F GK++LIRN +REGLIR+R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDHRLEIFRGKIKLIRNKKREGLIRSRLIGASRASGDVLVFLDSHCEVNHVWLQP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLAPI D K++ P+ID ID++T E+R P RG F W + +K + + E
Sbjct: 245 LLAPIAKDPKMVVCPLIDPIDHKTLEYR----PSPVVRGAFTWHLEFKWDNVLSYEIDGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP AGG+FA++R +F E+G YD + +WG EN ELS +IWMCGG + +PC
Sbjct: 301 EGPTKPIRSPAMAGGVFAINRHYFNEIGKYDRDMELWGAENLELSLRIWMCGGQLFILPC 360
Query: 181 SRI 183
SR+
Sbjct: 361 SRV 363
>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
Length = 597
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 124/189 (65%), Gaps = 13/189 (6%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL I
Sbjct: 234 LEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECMEGWIEPLLDRI 293
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREAKK 119
D + PVID ID T+E+ HH + G F+WG+ + + +PER+ K
Sbjct: 294 KRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKN 346
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E VP
Sbjct: 347 RTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVP 406
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 407 CSHVGHVFR 415
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 108/266 (40%), Positives = 148/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L +A D +
Sbjct: 97 NLAKPKFMVD--------PNDPIYKKGDTSQAGELGKAVVVDKSKLTSEQKAIYDKGMLN 148
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 149 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLE 208
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEEI+LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 209 RTPDHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIAT 268
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 269 GEVLTYLDSHCECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 321
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 322 FTSVGGFDWGLQFNWHSIPERDRKNR 347
>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Oryzias latipes]
Length = 556
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 97/230 (42%), Positives = 142/230 (61%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GKA ++ + + N+ S+ I+ +R++PD+R++ CK Y DL
Sbjct: 55 EGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADDL 114
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD S + L +KLE Y++
Sbjct: 115 PTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLE 174
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V+++R +R GLIR R RGA + G+VI FLDAHCE WL PLLA I DR +
Sbjct: 175 VPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVVC 234
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 235 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE Y+ V+++R +R GLIR R RGA + G+VI FLDAHCE WL PL
Sbjct: 162 LKKKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|405951291|gb|EKC19216.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Crassostrea
gigas]
Length = 613
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL ++E+Y+ + VRL+R EREGLIR R GA + G+V+VFLD+HCEV +WL P
Sbjct: 192 DLSCQIENYVNQHFNNVRLVRTPEREGLIRARVFGANLASGQVLVFLDSHCEVNTDWLEP 251
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D + VPVID I++ T E Y+ RG F WG+ + + LP+ E
Sbjct: 252 LLLRISHDPTTVVVPVIDIINHDTME----YQQSPLVRGGFNWGLHFSWDRLPDNEKNDP 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
S+P SPT AGGLFAM R +F LG YD G+ +WGGEN E+SF+IWMCGG +E +PC
Sbjct: 308 DLGSKPILSPTMAGGLFAMKRDYFHHLGEYDLGMDIWGGENLEISFRIWMCGGKLEIIPC 367
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 368 SRVGHIFR 375
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 121/206 (58%), Gaps = 4/206 (1%)
Query: 225 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 284
+ A D + N S+ I F R IPD R +C+ +P S+I+ F NE S+
Sbjct: 102 QIARDEGYQNFAFNALVSDKIGFHRAIPDTRYPKCQDVTFPAINLDTSIIVCFFNEQPSA 161
Query: 285 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 344
L+R VHSI +TP + ++EIILVDD S+ DL ++E+Y+ + VRL+R EREGLIR
Sbjct: 162 LLRLVHSINDQTPQELVKEIILVDDSSTLDDLSCQIENYVNQHFNNVRLVRTPEREGLIR 221
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
R GA + G+V+VFLD+HCEV +WL PLL I D + VPVID I++ T E
Sbjct: 222 ARVFGANLASGQVLVFLDSHCEVNTDWLEPLLLRISHDPTTVVVPVIDIINHDTME---- 277
Query: 405 YEPDHHYRGIFEWGMLYKENELPERE 430
Y+ RG F WG+ + + LP+ E
Sbjct: 278 YQQSPLVRGGFNWGLHFSWDRLPDNE 303
>gi|355748155|gb|EHH52652.1| hypothetical protein EGM_13122 [Macaca fascicularis]
Length = 608
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 126/188 (67%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T + S RG F WG+ +K + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTLAYSS----SPVVRGGFNWGLHFKWDLVPLSELGEA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 109/252 (43%), Positives = 148/252 (58%), Gaps = 15/252 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNM 239
L+ P FKA+ ++ ++ + E P EG + E + D ++ NM
Sbjct: 66 LLEPQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNM 121
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA
Sbjct: 122 LISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAH 181
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+
Sbjct: 182 LLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVL 241
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
VFLD+HCEV + WL PLLA I DR + PVID I T Y RG F WG
Sbjct: 242 VFLDSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWG 297
Query: 419 MLYKENELPERE 430
+ +K + +P E
Sbjct: 298 LHFKWDLVPLSE 309
>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
Length = 623
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 124/189 (65%), Gaps = 13/189 (6%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL I
Sbjct: 226 LEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDRI 285
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREAKK 119
D + PVID ID T+E+ HH + G F+WG+ + + +PER+ K
Sbjct: 286 KRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKN 338
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E VP
Sbjct: 339 RTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVP 398
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 399 CSHVGHVFR 407
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 148/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L +A D +
Sbjct: 89 NLAKPKFMVD--------PNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLN 140
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLE 200
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 201 RTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVAT 260
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 261 GEVLTYLDSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 313
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 314 FTSVGGFDWGLQFNWHSIPERDRKNR 339
>gi|380786043|gb|AFE64897.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|383411811|gb|AFH29119.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|384942402|gb|AFI34806.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
Length = 608
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGEA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 109/252 (43%), Positives = 148/252 (58%), Gaps = 15/252 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNM 239
L+ P FKA+ ++ ++ + E P EG + E + D ++ NM
Sbjct: 66 LLEPQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNM 121
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA
Sbjct: 122 LISNRLGYRRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAH 181
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+
Sbjct: 182 LLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVL 241
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
VFLD+HCEV + WL PLLA I DR + PVID I T Y RG F WG
Sbjct: 242 VFLDSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWG 297
Query: 419 MLYKENELPERE 430
+ +K + +P E
Sbjct: 298 LHFKWDLVPLSE 309
>gi|347971870|ref|XP_313714.5| AGAP004429-PA [Anopheles gambiae str. PEST]
gi|333469065|gb|EAA09257.5| AGAP004429-PA [Anopheles gambiae str. PEST]
Length = 663
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 94/186 (50%), Positives = 125/186 (67%), Gaps = 7/186 (3%)
Query: 14 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 73
KVRL+RNT+REGLIR+R GA+ + G+V++FLD+H EV ++W+ PLLA I DR I+
Sbjct: 256 TNKVRLLRNTDREGLIRSRVYGARNATGQVLIFLDSHIEVNVDWIEPLLARIKHDRTILA 315
Query: 74 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 133
+PVID I+ T+ VY RG F WG+ +K + LP+ ++ P+ SPT A
Sbjct: 316 MPVIDIINSDTF----VYTASPLVRGGFNWGLHFKWDNLPKGSLERDTDFVGPFNSPTMA 371
Query: 134 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPV 190
GGLFA+DRA+F ELG YD G+ VWGGEN E+SF+ W CGGSIE +PCSRI + RP
Sbjct: 372 GGLFAIDRAYFKELGEYDMGMDVWGGENLEISFRAWQCGGSIELLPCSRIGHVFRKRRPY 431
Query: 191 FKADGK 196
DG+
Sbjct: 432 GSPDGQ 437
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 88/203 (43%), Positives = 128/203 (63%), Gaps = 7/203 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + R IPD R + C+ Y LP ASV++ F+NE +L+R+
Sbjct: 154 DIGYRKHAFNVLVSNKLGPFRPIPDTRHKLCQAQVYDKVLPVASVVMCFYNEHLETLVRS 213
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLD---QKLEDYIQRFNGKVRLIRNTEREGLIRT 345
+H+++KRTPA L+E+ILVDD S DL Q ++ Q KVRL+RNT+REGLIR+
Sbjct: 214 IHTVLKRTPAYLLKELILVDDCSDFEDLTVGGQLEKELAQLGTNKVRLLRNTDREGLIRS 273
Query: 346 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 405
R GA+ + G+V++FLD+H EV ++W+ PLLA I DR I+ +PVID I+ T+ VY
Sbjct: 274 RVYGARNATGQVLIFLDSHIEVNVDWIEPLLARIKHDRTILAMPVIDIINSDTF----VY 329
Query: 406 EPDHHYRGIFEWGMLYKENELPE 428
RG F WG+ +K + LP+
Sbjct: 330 TASPLVRGGFNWGLHFKWDNLPK 352
>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
Length = 626
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 90/191 (47%), Positives = 125/191 (65%), Gaps = 13/191 (6%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
+ LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL
Sbjct: 224 RPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLD 283
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREA 117
I D + PVID ID T+E+ HH + G F+WG+ + + +PER+
Sbjct: 284 RIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDR 336
Query: 118 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 177
K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E
Sbjct: 337 KNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEI 396
Query: 178 VPCSRIVSLIR 188
VPCS + + R
Sbjct: 397 VPCSHVGHVFR 407
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 148/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L +A D +
Sbjct: 89 NLAKPKFMVD--------PNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLN 140
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLE 200
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 201 RTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVAT 260
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 261 GEVLTYLDSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 313
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 314 FTSVGGFDWGLQFNWHSIPERDRKNR 339
>gi|109068965|ref|XP_001105286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
6 [Macaca mulatta]
gi|355561195|gb|EHH17881.1| hypothetical protein EGK_14364 [Macaca mulatta]
Length = 608
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGEA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 109/252 (43%), Positives = 148/252 (58%), Gaps = 15/252 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNM 239
L+ P FKA+ ++ ++ + E P EG + E + D ++ NM
Sbjct: 66 LLEPQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNM 121
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA
Sbjct: 122 LISNRLGYRRNVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAH 181
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+
Sbjct: 182 LLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVL 241
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
VFLD+HCEV + WL PLLA I DR + PVID I T Y RG F WG
Sbjct: 242 VFLDSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWG 297
Query: 419 MLYKENELPERE 430
+ +K + +P E
Sbjct: 298 LHFKWDLVPLSE 309
>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
Length = 630
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 124/189 (65%), Gaps = 13/189 (6%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL I
Sbjct: 230 LEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECMEGWIEPLLDRI 289
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREAKK 119
D + PVID ID T+E+ HH + G F+WG+ + + +PER+ K
Sbjct: 290 KRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKN 342
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E VP
Sbjct: 343 RTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVP 402
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 403 CSHVGHVFR 411
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 147/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L + D +
Sbjct: 93 NLAKPKFMVD--------PNDPIYKKGDASQAGELGKAVIVDKTKLTPEQKGIYDKGMLN 144
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 145 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLE 204
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEEI+LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 205 RTPEHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIAT 264
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 265 GEVLTYLDSHCECMEGWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 317
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 318 FTSVGGFDWGLQFNWHSIPERDRKNR 343
>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Oryzias latipes]
Length = 558
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 97/230 (42%), Positives = 142/230 (61%), Gaps = 3/230 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
EGPGE GKA ++ + + N+ S+ I+ +R++PD+R++ CK Y DL
Sbjct: 57 EGPGEMGKAVNIAKDDQEKMKELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADDL 116
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P S+++VFHNE +S+L+RTVHS+I R+P L EI+LVDD S + L +KLE Y++
Sbjct: 117 PTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLE 176
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V+++R +R GLIR R RGA + G+VI FLDAHCE WL PLLA I DR +
Sbjct: 177 VPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVVC 236
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 237 PIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 283
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE Y+ V+++R +R GLIR R RGA + G+VI FLDAHCE WL PL
Sbjct: 164 LKKKLEGYVRTLEVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPL 223
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + P+ID I +T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 224 LARIKEDRTAVVCPIIDVISDETFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 280
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YDPG+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 281 GDRTLPVRTPTMAGGLFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTC 340
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 341 SHVGHVFR 348
>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
Length = 623
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 124/189 (65%), Gaps = 13/189 (6%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL I
Sbjct: 226 LEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDRI 285
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREAKK 119
D + PVID ID T+E+ HH + G F+WG+ + + +PER+ K
Sbjct: 286 KRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKN 338
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E VP
Sbjct: 339 RTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVP 398
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 399 CSHVGHVFR 407
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 148/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L +A D +
Sbjct: 89 NLAKPKFMVD--------PNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLN 140
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLE 200
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 201 RTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVAT 260
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 261 GEVLTYLDSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 313
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 314 FTSVGGFDWGLQFNWHSIPERDRKNR 339
>gi|348539520|ref|XP_003457237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oreochromis niloticus]
Length = 619
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 120/189 (63%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L +L+ Y+ GKV+L+RN REGLIR R GA + GEV+VFLD+HCEV WL
Sbjct: 173 ELKDELDRYVRAELQGKVQLVRNQRREGLIRGRMIGASHATGEVLVFLDSHCEVNQAWLQ 232
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLAPI D + + PVID I T Y P RG F WG+ +K + +P E
Sbjct: 233 PLLAPIQKDHRTVVCPVIDIISADTL----AYSPSPIVRGGFNWGLHFKWDPVPPSELSG 288
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ S P +SPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 289 PEGASGPIRSPTMAGGLFAMNRKYFNELGQYDAGMDIWGGENLEISFRIWMCGGQLFIIP 348
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 349 CSRVGHIFR 357
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/210 (45%), Positives = 127/210 (60%), Gaps = 5/210 (2%)
Query: 222 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 281
EA + D+ + N+ SN + F R +P+ R +C+ YP+ LP ASV++ F NE
Sbjct: 80 EADQEVRDSGYHRHAFNVLISNRLGFHRQLPETRDAQCREKSYPVALPSASVVICFFNEA 139
Query: 282 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 340
S+L+RTVHS++ RTPA L EIILVDD S +L +L+ Y++ GKV+L+RN RE
Sbjct: 140 LSALLRTVHSVLDRTPAYLLHEIILVDDHSELEELKDELDRYVRAELQGKVQLVRNQRRE 199
Query: 341 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 400
GLIR R GA + GEV+VFLD+HCEV WL PLLAPI D + + PVID I T
Sbjct: 200 GLIRGRMIGASHATGEVLVFLDSHCEVNQAWLQPLLAPIQKDHRTVVCPVIDIISADTL- 258
Query: 401 FRSVYEPDHHYRGIFEWGMLYKENELPERE 430
Y P RG F WG+ +K + +P E
Sbjct: 259 ---AYSPSPIVRGGFNWGLHFKWDPVPPSE 285
>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
Length = 415
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 93/224 (41%), Positives = 144/224 (64%), Gaps = 3/224 (1%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
G+ G+A +P+ + + + N+ S+ +S R +PD R + CK YPL LPK+
Sbjct: 1 GDMGEAVSVPKRLKEKEEEGYELHSFNLVASDMMSLYRRLPDYRNDACKAKKYPLHLPKS 60
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
S+I+ FHNE +S+L+RTVHS+I RTP + LEEI+L+DD S++ +L +KLE+Y+ + V
Sbjct: 61 SIIICFHNEAWSTLLRTVHSVINRTPPRLLEEILLIDDASNRDELKEKLEEYVAKLK-VV 119
Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 391
R+IR ++R+GLIR R +GA ++G ++ FLDAHCE WL PL A I + + +PVI
Sbjct: 120 RIIRLSKRQGLIRARLKGAAAAKGSILTFLDAHCECSKGWLEPLAAKIAENSSNVVMPVI 179
Query: 392 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
D I T+ + +V EP H RG+F W + + +P+ E ++RK
Sbjct: 180 DEISDTTFYYHAVPEPFH--RGVFRWRLEFGWKPVPQYEMERRK 221
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 129/191 (67%), Gaps = 3/191 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +KLE+Y+ + VR+IR ++R+GLIR R +GA ++G ++ FLDAHCE WL P
Sbjct: 104 ELKEKLEEYVAKLK-VVRIIRLSKRQGLIRARLKGAAAAKGSILTFLDAHCECSKGWLEP 162
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L A I + + +PVID I T+ + +V EP H RG+F W + + +P+ E ++R
Sbjct: 163 LAAKIAENSSNVVMPVIDEISDTTFYYHAVPEPFH--RGVFRWRLEFGWKPVPQYEMERR 220
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++ ++P AGGLF++D+ +F ++G YD G+ +WGGEN E+SF+IWMCGG+IE +PC
Sbjct: 221 KDEADGIRTPVMAGGLFSIDKNYFEKIGTYDTGMDIWGGENLEISFRIWMCGGAIEMLPC 280
Query: 181 SRIVSLIRPVF 191
SR+ + RP F
Sbjct: 281 SRVGHVFRPRF 291
>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
Length = 626
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 90/191 (47%), Positives = 125/191 (65%), Gaps = 13/191 (6%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
+ LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE W+ PLL
Sbjct: 224 RPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLD 283
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREA 117
I D + PVID ID T+E+ HH + G F+WG+ + + +PER+
Sbjct: 284 RIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDR 336
Query: 118 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 177
K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN ELSFKIWMCGG++E
Sbjct: 337 KNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEI 396
Query: 178 VPCSRIVSLIR 188
VPCS + + R
Sbjct: 397 VPCSHVGHVFR 407
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 148/266 (55%), Gaps = 31/266 (11%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +P F D P +P YK+G GE GKA L +A D +
Sbjct: 89 NLAKPKFMVD--------PNDPIYKKGDAAQAGELGKAVVVDKTKLSTEEKAKYDKGMLN 140
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNEAWSVLLRTVHSVLE 200
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEE++LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 201 RTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVAT 260
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
GEV+ +LD+HCE W+ PLL I D + PVID ID T+E+ HH +
Sbjct: 261 GEVLTYLDSHCECMEGWMEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAY 313
Query: 413 ----GIFEWGMLYKENELPEREAKKR 434
G F+WG+ + + +PER+ K R
Sbjct: 314 FTSVGGFDWGLQFNWHSIPERDRKNR 339
>gi|226482456|emb|CAX73827.1| polypeptide GalNAc transferase 6 [Schistosoma japonicum]
Length = 603
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 91/175 (52%), Positives = 121/175 (69%), Gaps = 6/175 (3%)
Query: 14 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 73
+GKVR+I ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI ++ + +
Sbjct: 196 DGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAANYRTVV 255
Query: 74 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHA 133
P ID ID +E+R+ D RG F+W + YK LP R + + +P+ SP A
Sbjct: 256 CPFIDVIDADNFEYRA---QDEGARGAFDWELYYK--RLP-RLPEDSHHPEKPFDSPVMA 309
Query: 134 GGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRI + R
Sbjct: 310 GGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDTPCSRIGHIYR 364
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 97/228 (42%), Positives = 138/228 (60%), Gaps = 6/228 (2%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
LE + GPGE G L + ++ E G ++ S I DR+I D+R CK
Sbjct: 70 LENSRVGPGENGMPVKLSTHEKKIAAKTINENGFSVYVSTKIKTDRSIKDIRHPNCKGKL 129
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y LP ASVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 130 YSNKLPTASVIIPFFEEHWETLLRTVASVLNRAPSALIKEVILVDDGSSREYLKDRLDSH 189
Query: 324 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
I +GKVR+I ER+GLIR ++ GAKE+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 IISAYPDGKVRVIHLKERQGLIRAKTAGAKEATGEVLIFLDSHCEAGINWLPPLLDPIAA 249
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPE 428
+ + + P ID ID +E+R+ D RG F+W + YK LPE
Sbjct: 250 NYRTVVCPFIDVIDADNFEYRA---QDEGARGAFDWELYYKRLPRLPE 294
>gi|405950576|gb|EKC18555.1| Putative polypeptide N-acetylgalactosaminyltransferase 10
[Crassostrea gigas]
Length = 526
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 89/183 (48%), Positives = 122/183 (66%), Gaps = 6/183 (3%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
L+DY++ V+++R +REGLIRTR GA+ + G+V++FLD+HCE +NWLPPLL PI
Sbjct: 122 LDDYVKEHFTNVKVVRAKKREGLIRTRLLGARAATGQVLIFLDSHCEANINWLPPLLEPI 181
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 125
D K + P ID ID++ + +R+ D RG F+W YK L E + K + +E
Sbjct: 182 AEDYKTVVCPFIDVIDFENFAYRA---QDEGARGAFDWEFFYKRLPLLEEDLK---HPAE 235
Query: 126 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVS 185
P+KSP AGGLFA+ +F E+GGYDPGL +WGGE +ELSFK+W CGG + PCSRI
Sbjct: 236 PFKSPVMAGGLFAISAKWFWEMGGYDPGLDIWGGEQYELSFKLWQCGGMMVDAPCSRIGH 295
Query: 186 LIR 188
+ R
Sbjct: 296 IYR 298
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 96/223 (43%), Positives = 136/223 (60%), Gaps = 5/223 (2%)
Query: 211 PGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
PGE G+A L P+ + GD G N S+ IS R++ D+R +CK Y L
Sbjct: 13 PGEQGQALILSPDEEKKKGDL-YKVNGFNAYASDKISLHRSLKDIRHSDCKKKKYLNHLM 71
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
ASVI+ FHNE +S+L+RT S++ R+P + E+ILVDD+SSK Q L+DY++
Sbjct: 72 NASVIVPFHNEHWSTLLRTAWSVLNRSPKHLIHEVILVDDYSSKEHCKQPLDDYVKEHFT 131
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
V+++R +REGLIRTR GA+ + G+V++FLD+HCE +NWLPPLL PI D K + P
Sbjct: 132 NVKVVRAKKREGLIRTRLLGARAATGQVLIFLDSHCEANINWLPPLLEPIAEDYKTVVCP 191
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
ID ID++ + +R+ D RG F+W YK L E + K
Sbjct: 192 FIDVIDFENFAYRA---QDEGARGAFDWEFFYKRLPLLEEDLK 231
>gi|148671130|gb|EDL03077.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_a [Mus
musculus]
Length = 529
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 127/203 (62%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R EC+ YP DLP AS+++ F+NE FS+L+RT
Sbjct: 32 DLGYQKHAFNMLISNRLGYHRDVPDTRNAECRRKSYPTDLPTASIVICFYNEAFSALLRT 91
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA L EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R
Sbjct: 92 VHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRM 151
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I D + PVID I T Y
Sbjct: 152 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSS 207
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 208 SPVVRGGFNWGLHFKWDLVPVSE 230
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 118 DLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 177
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P E
Sbjct: 178 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSELGG 233
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 234 PDGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILP 293
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 294 CSRVGHIFR 302
>gi|402865473|ref|XP_003896947.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Papio
anubis]
Length = 608
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T + S RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTLAYSS----SPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 109/252 (43%), Positives = 148/252 (58%), Gaps = 15/252 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNM 239
L+ P FKA+ ++ ++ + E P EG + E + D ++ NM
Sbjct: 66 LLEPQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNM 121
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA
Sbjct: 122 LISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAH 181
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+
Sbjct: 182 LLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVL 241
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
VFLD+HCEV + WL PLLA I DR + PVID I T Y RG F WG
Sbjct: 242 VFLDSHCEVNMMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWG 297
Query: 419 MLYKENELPERE 430
+ +K + +P E
Sbjct: 298 LHFKWDLVPLSE 309
>gi|426358557|ref|XP_004046575.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Gorilla gorilla gorilla]
Length = 527
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 117 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 176
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E
Sbjct: 177 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGA 232
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 233 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 292
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 293 SRVGHIFR 300
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 99/203 (48%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 30 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 89
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 90 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 149
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 205
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSE 228
>gi|332870119|ref|XP_003318977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Pan
troglodytes]
Length = 527
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 117 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 176
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E
Sbjct: 177 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGA 232
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 233 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 292
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 293 SRVGHIFR 300
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP AS+++ F+NE FS+L+RT
Sbjct: 30 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRT 89
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 90 VHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 149
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 205
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSE 228
>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
Length = 639
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/237 (42%), Positives = 146/237 (61%), Gaps = 4/237 (1%)
Query: 203 PLEPYKEG-PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P+ P + G PGE GK HLP + N+ S+ IS +R++PD+R+E C+
Sbjct: 123 PVVPEQAGQPGEMGKPVHLPADQESLMREKFRLNQFNLLASDSISLNRSLPDVRLEGCRD 182
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
YP LP S+++VFHNE +S+L+RTV SII R+P + L EIILVDD S + L ++LE
Sbjct: 183 KSYPGLLPTTSIVIVFHNEAWSTLLRTVWSIITRSPRELLAEIILVDDASERDYLGKELE 242
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
D++ F V ++R +R GLIR R GAK+ +G+VI FLDAHCE WL PLLA +
Sbjct: 243 DHVANFPVPVHVLRTHKRSGLIRARLIGAKQVKGQVITFLDAHCECTEGWLEPLLARVAE 302
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+RKI+ P+ID I +++E+ V D + G F W + ++ +P+RE +R +R
Sbjct: 303 NRKIVVCPIIDVISDESFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMDRRNGDR 356
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 125/188 (66%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LED++ F V ++R +R GLIR R GAK+ +G+VI FLDAHCE WL PL
Sbjct: 237 LGKELEDHVANFPVPVHVLRTHKRSGLIRARLIGAKQVKGQVITFLDAHCECTEGWLEPL 296
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA + +RKI+ P+ID I +++E+ V D + G F W + ++ +P+RE +R
Sbjct: 297 LARVAENRKIVVCPIIDVISDESFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMDRRN 353
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF++W CGG +E +PC
Sbjct: 354 GDRTQPLRTPTMAGGLFSIDKDYFEEIGTYDEGMDIWGGENLEMSFRVWQCGGELEIIPC 413
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 414 SHVGHVFR 421
>gi|114616856|ref|XP_001143140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Pan troglodytes]
gi|114616860|ref|XP_001143304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
4 [Pan troglodytes]
gi|410221964|gb|JAA08201.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410256658|gb|JAA16296.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301646|gb|JAA29423.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301648|gb|JAA29424.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410348810|gb|JAA41009.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
Length = 608
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 109/250 (43%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D +E P E + + E G ++ E + D ++ NM
Sbjct: 66 VLEPQFKANKIDDVIDSRVEDPEEGHLKFSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R CK YP DLP AS+++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I DR + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPLSE 309
>gi|426358553|ref|XP_004046573.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Gorilla gorilla gorilla]
gi|426358555|ref|XP_004046574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Gorilla gorilla gorilla]
Length = 608
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 110/250 (44%), Positives = 148/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D +E P E + + E G ++ E + D ++ NM
Sbjct: 66 VLEPQFKANKIDDVIDSRVEDPEEGHLKFSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VF
Sbjct: 184 HEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I DR + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPLSE 309
>gi|268564602|ref|XP_002647197.1| C. briggsae CBR-GLY-10 protein [Caenorhabditis briggsae]
Length = 623
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/228 (44%), Positives = 141/228 (61%), Gaps = 9/228 (3%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDY 264
+EGPGE GK LP+ +A L Y G N S+ IS +R+I D+R +ECK Y
Sbjct: 95 REGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKKMTY 153
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LED++
Sbjct: 154 SAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLEDFL 213
Query: 325 Q--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
+ + + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI D
Sbjct: 214 KKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPIADD 273
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
+ + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 274 YRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD 318
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/290 (37%), Positives = 158/290 (54%), Gaps = 37/290 (12%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LED++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLP
Sbjct: 205 LRQPLEDFLKKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLP 264
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 265 PLLDPIADDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD--- 318
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R+ ++P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + P
Sbjct: 319 RENPTKPFDSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGKMVDAP 378
Query: 180 CSRIVSLIRPVFK--ADGKLGNL---------EPPLEPYKEG-----PGEGGKAYHLPEA 223
CSR+ + R + + +G+ E ++ YKE PG G
Sbjct: 379 CSRVAHIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDEYKETLYKHRPGIGNA------- 431
Query: 224 YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASV 273
AGD L + G+ E SFD + ++ ++ KY YP PKAS
Sbjct: 432 --DAGDLKLMK-GIR-EKLQCKSFDWFMKEIAFDQDKY--YPAIEPKASA 475
>gi|350409603|ref|XP_003488790.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Bombus
impatiens]
Length = 610
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 99/215 (46%), Positives = 133/215 (61%), Gaps = 3/215 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK L + A + G N S+ IS +R++PD+R +CK Y +
Sbjct: 88 RTGIGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLRN 147
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
L SVI+ FHNE FS+LMRT S+I R+PA L+EIILVDD S+K +L + LEDYI
Sbjct: 148 LDSVSVIVSFHNEHFSTLMRTCWSVINRSPAFLLKEIILVDDASTKVELKKPLEDYITEH 207
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KV+++R ER GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 208 LTKVKIVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANVNWLPPLLEPIAQDYKTCV 267
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 299
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/198 (48%), Positives = 124/198 (62%), Gaps = 8/198 (4%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L K LEDYI KV+++R ER GLI+ R GAK ++ +V+VFLD+H E +NWLPP
Sbjct: 195 ELKKPLEDYITEHLTKVKIVRLEERSGLIKGRLAGAKIAKAKVLVFLDSHSEANVNWLPP 254
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI D K P ID I Y+T+E+R+ D RG F+W + YK L + +
Sbjct: 255 LLEPIAQDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN- 310
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PC
Sbjct: 311 --PTEPFKSPVMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPC 368
Query: 181 SRIVSLIR--PVFKADGK 196
SR+ + R P F GK
Sbjct: 369 SRVGHIYRKFPPFPNPGK 386
>gi|261260064|sp|A8Y236.2|GLT10_CAEBR RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
10; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 10; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
Length = 629
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 102/228 (44%), Positives = 141/228 (61%), Gaps = 9/228 (3%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDY 264
+EGPGE GK LP+ +A L Y G N S+ IS +R+I D+R +ECK Y
Sbjct: 101 REGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKKMTY 159
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LED++
Sbjct: 160 SAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLEDFL 219
Query: 325 Q--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
+ + + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI D
Sbjct: 220 KKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPIADD 279
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
+ + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 280 YRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD 324
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/290 (37%), Positives = 158/290 (54%), Gaps = 37/290 (12%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LED++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLP
Sbjct: 211 LRQPLEDFLKKNKIDHIVKILRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLP 270
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 271 PLLDPIADDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD--- 324
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R+ ++P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + P
Sbjct: 325 RENPTKPFDSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGKMVDAP 384
Query: 180 CSRIVSLIRPVFK--ADGKLGNL---------EPPLEPYKEG-----PGEGGKAYHLPEA 223
CSR+ + R + + +G+ E ++ YKE PG G
Sbjct: 385 CSRVAHIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDEYKETLYKHRPGIGNA------- 437
Query: 224 YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASV 273
AGD L + G+ E SFD + ++ ++ KY YP PKAS
Sbjct: 438 --DAGDLKLMK-GIR-EKLQCKSFDWFMKEIAFDQDKY--YPAIEPKASA 481
>gi|326427851|gb|EGD73421.1| GALNT4 protein [Salpingoeca sp. ATCC 50818]
Length = 537
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 84/174 (48%), Positives = 120/174 (68%), Gaps = 6/174 (3%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
K R+IR ER GLIR + GA+++RG+V+V+LD+HCEV WL PLL I +RK + +P
Sbjct: 265 KTRVIRLPERSGLIRAKVYGAQQARGDVLVYLDSHCEVNDGWLEPLLDRIRRNRKTVAMP 324
Query: 76 VIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 134
+ID IDY+TWE R+ + E RGIF+W +++K +L + + R +++P+ SP AG
Sbjct: 325 IIDAIDYETWEHRTGLLE-----RGIFDWSLVFKWKQLTADDKRGRPDDTDPFASPAMAG 379
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLFAMDR +F E+G YD G+ WGGEN E+S ++W CGG IE +PCS + + R
Sbjct: 380 GLFAMDRKYFFEVGAYDMGMETWGGENIEMSMRVWACGGRIEALPCSHVAHVFR 433
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/203 (43%), Positives = 127/203 (62%), Gaps = 13/203 (6%)
Query: 237 MNMETSNHISFDRTIPDLRMEEC---KYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSI 292
N S+ +S R D R EC KY YPL +LP SVIL+F+NE S+L+RTV S+
Sbjct: 171 FNQWISDRLSLHRRAYDTRPVECLHKKY--YPLSELPTVSVILIFYNEARSTLLRTVWSV 228
Query: 293 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKE 352
+ R+P ++EI+LVDD SS L L+ + K R+IR ER GLIR + GA++
Sbjct: 229 LDRSPRSLIKEILLVDDHSSMPHLGYPLDQEVAGIP-KTRVIRLPERSGLIRAKVYGAQQ 287
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHY 411
+RG+V+V+LD+HCEV WL PLL I +RK + +P+ID IDY+TWE R+ + E
Sbjct: 288 ARGDVLVYLDSHCEVNDGWLEPLLDRIRRNRKTVAMPIIDAIDYETWEHRTGLLE----- 342
Query: 412 RGIFEWGMLYKENELPEREAKKR 434
RGIF+W +++K +L + + R
Sbjct: 343 RGIFDWSLVFKWKQLTADDKRGR 365
>gi|397469939|ref|XP_003806595.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Pan paniscus]
gi|397469941|ref|XP_003806596.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Pan paniscus]
Length = 608
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLATIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 107/252 (42%), Positives = 148/252 (58%), Gaps = 15/252 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNM 239
++ P FKA+ ++ ++ + E P EG + E + D ++ NM
Sbjct: 66 VLEPQFKAN----KIDDVIDSHVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNM 121
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
SN + + R +PD R CK YP DLP AS+++ F+NE FS+L+RTVHS+I RTPA
Sbjct: 122 LISNRLGYHRDVPDTRNAACKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAH 181
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + GEV+
Sbjct: 182 LLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVL 241
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
VFLD+HCEV + WL PLLA I DR + PVID I T Y RG F WG
Sbjct: 242 VFLDSHCEVNVMWLQPLLATIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWG 297
Query: 419 MLYKENELPERE 430
+ +K + +P E
Sbjct: 298 LHFKWDLVPLSE 309
>gi|301759363|ref|XP_002915525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Ailuropoda melanoleuca]
gi|281339844|gb|EFB15428.1| hypothetical protein PANDA_003531 [Ailuropoda melanoleuca]
Length = 608
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +LE+Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D++ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIQQDQRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 104/248 (41%), Positives = 148/248 (59%), Gaps = 11/248 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 243
V ++ K+ ++ ++ + E P +G + E + D ++ NM SN
Sbjct: 66 VLESQFKVNRIDDMIDSHVEDPEKGNMKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125
Query: 244 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 303
+ + R +PD R CK YP+DLP ASV++ F+NE S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185
Query: 304 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
IILVDD S DL +LE+Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLD 245
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+HCEV + WL PLLA I D++ + PVID I T Y RG F WG+ +K
Sbjct: 246 SHCEVNVMWLQPLLAAIQQDQRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301
Query: 423 ENELPERE 430
+ +P E
Sbjct: 302 WDLVPLSE 309
>gi|115533032|ref|NP_001041036.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
gi|182676440|sp|O45947.3|GLT10_CAEEL RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 10; Short=pp-GaNTase
10; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 10; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|3880991|emb|CAA16378.1| Protein GLY-10, isoform a [Caenorhabditis elegans]
Length = 684
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 146/240 (60%), Gaps = 11/240 (4%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 261
E +EGPGE GK LPE +A L Y G N S+ IS +R+I D+R +ECK
Sbjct: 153 EKRREGPGEWGKPVKLPEDKEVEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKN 211
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 212 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 271
Query: 322 DYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
D+++ + + V+++R +REGLIR R GA+++ GE+++FLDAH E NWLPPLL PI
Sbjct: 272 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPI 331
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP--EREAKKRKYN 437
D + + P +D ID +T+E R D RG F+W YK L +RE+ + +N
Sbjct: 332 AEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKDRESPTKPFN 388
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 163/305 (53%), Gaps = 37/305 (12%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LED++++ + V+++R +REGLIR R GA+++ GE+++FLDAH E NWLP
Sbjct: 266 LRQPLEDFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLP 325
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 326 PLLDPIAEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKD--- 379
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R+ ++P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + P
Sbjct: 380 RESPTKPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAP 439
Query: 180 CSRIVSLIRPVFK--ADGKLGNL---------EPPLEPYKEG-----PGEGGKAYHLPEA 223
CSR+ + R + + +G+ E ++ YKE PG G
Sbjct: 440 CSRVAHIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDDYKETLYKHRPGVGNA------- 492
Query: 224 YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFS 283
AGD L + G+ E SFD + ++ ++ KY YP PKAS N G +
Sbjct: 493 --DAGDLKLMK-GIR-EKLQCKSFDWFMKEIAFDQDKY--YPAVEPKASAEGEIRNVGTN 546
Query: 284 SLMRT 288
+ T
Sbjct: 547 FCIDT 551
>gi|149032268|gb|EDL87174.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 7, isoform CRA_b
[Rattus norvegicus]
Length = 303
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 96/180 (53%), Positives = 130/180 (72%), Gaps = 5/180 (2%)
Query: 189 PVFKADGKLGNLEPP-LEPYK--EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
PV + G LGN EP EP+ GPGE K L Y+ A AS+ E+G NM S+ I
Sbjct: 125 PVLRP-GVLGNFEPKEPEPHGVVGGPGENAKPLVLGPEYKQAAQASIKEFGFNMAASDMI 183
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S DR++ DLR EECKYW Y +L +SV++VFHNEG+S+LMRTVHS+IKRTP +YL EI+
Sbjct: 184 SLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRTPRKYLAEIV 243
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 364
L+DDFS+K L +KL +YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 244 LIDDFSNKEHLKEKLTEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/50 (50%), Positives = 40/50 (80%), Gaps = 1/50 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAH 50
L +KL +YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAH
Sbjct: 254 LKEKLTEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAH 303
>gi|194210168|ref|XP_001915003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Equus
caballus]
Length = 609
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 199 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 258
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+++ PVID I T Y RG F WG+ +K + +P E
Sbjct: 259 LLAVIQEDRRMVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 314
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 315 EGATAPIKSPTMAGGLFAMSRRYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 374
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 375 SRVGHIFR 382
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 131/203 (64%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 112 DLGYQKHAFNMLISNRLGYHREVPDTRNAACKDKSYPTDLPVASVVICFYNEALSALLRT 171
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA+ L E+ILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 172 VHSVLDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 231
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+++ PVID I T Y
Sbjct: 232 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAVIQEDRRMVVCPVIDIISADTL----AYSS 287
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 288 SPVVRGGFNWGLHFKWDLVPLSE 310
>gi|148230993|ref|NP_001087490.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus laevis]
gi|51261644|gb|AAH80006.1| MGC81846 protein [Xenopus laevis]
Length = 603
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 104/245 (42%), Positives = 144/245 (58%), Gaps = 8/245 (3%)
Query: 188 RPVFKADGKLGN-LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHIS 246
+P+ G GN LE E + E G ++ E + D ++ N+ SN +
Sbjct: 66 QPIASHQGLNGNQLETKAEANADLSPELGMIFN--EQDQDVRDVGYQKHAFNLLISNRLG 123
Query: 247 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 306
+ R +PD R +C YP DLP AS+++ F+NE FS+L+RTVHS++ RTPAQ L EIIL
Sbjct: 124 YHRDVPDTRDSKCSKKTYPADLPHASIVICFYNEAFSALLRTVHSVLDRTPAQLLHEIIL 183
Query: 307 VDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 365
VDD S DL + L++Y+Q + KV+L+RN +REGLIR R GA + G+V+VFLD+HC
Sbjct: 184 VDDNSELDDLKKDLDNYMQENLSEKVKLVRNKQREGLIRGRMVGASRATGDVLVFLDSHC 243
Query: 366 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 425
EV WL PLLAPI + K + PVID I T +Y RG F WG+ +K +
Sbjct: 244 EVNEMWLQPLLAPIRENPKTVVCPVIDIISSDTL----IYSSSPVVRGGFNWGLHFKWDP 299
Query: 426 LPERE 430
+P E
Sbjct: 300 VPLSE 304
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 123/189 (65%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL K L++Y+ E + KV+L+RN +REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 192 DLKKDLDNYMQENLSEKVKLVRNKQREGLIRGRMVGASRATGDVLVFLDSHCEVNEMWLQ 251
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLAPI + K + PVID I T +Y RG F WG+ +K + +P E
Sbjct: 252 PLLAPIRENPKTVVCPVIDIISSDTL----IYSSSPVVRGGFNWGLHFKWDPVPLSELGG 307
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P++SPT AGGLF MDR +F LG YD G+ +WGGEN E+SF+IWMCGGS+ VP
Sbjct: 308 PEGYTAPFRSPTMAGGLFVMDREYFNTLGHYDSGMDIWGGENLEISFRIWMCGGSLLIVP 367
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 368 CSRVGHIFR 376
>gi|156544564|ref|XP_001602677.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Nasonia vitripennis]
Length = 637
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 95/188 (50%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +++ DY+++ F+ KV+ R REGLIR R GAK++ GEV+VFLD+H EV WL P
Sbjct: 219 LHEQVRDYVKQNFDNKVKYYRTERREGLIRARMFGAKKATGEVLVFLDSHIEVNKMWLEP 278
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I R I+ +PVID I+ T+++ S RG F WG+ +K + LP
Sbjct: 279 LLARISHSRTIVPMPVIDIINADTFQYSS----SPLVRGGFNWGLHFKWDSLPIGTLSLE 334
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P KSPT AGGLFAMDR +F ELG YD G+ VWGGEN E+SF+IWMCGGSIE +PC
Sbjct: 335 QDFVKPIKSPTMAGGLFAMDRKYFFELGEYDAGMDVWGGENLEISFRIWMCGGSIELIPC 394
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 395 SRVGHVFR 402
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/240 (39%), Positives = 142/240 (59%), Gaps = 11/240 (4%)
Query: 193 ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFD 248
+D L L P P++P +G E G ++ E + + + N+ S+++S
Sbjct: 94 SDKLLQQLMPVPVKPSVTVGQGLDELGLVKNMDEQKKR--EEGYKSFAFNVLVSDNLSLH 151
Query: 249 RTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVD 308
R IPD R + CK Y LP AS+++ F+NE +++L+R+++SI+ RTP L EIIL++
Sbjct: 152 RDIPDTRHKLCKNQTYDQKLPNASIVICFYNEHYNTLLRSLYSILDRTPKHLLHEIILIN 211
Query: 309 DFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DFS L +++ DY+ Q F+ KV+ R REGLIR R GAK++ GEV+VFLD+H EV
Sbjct: 212 DFSDSKSLHEQVRDYVKQNFDNKVKYYRTERREGLIRARMFGAKKATGEVLVFLDSHIEV 271
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
WL PLLA I R I+ +PVID I+ T+++ S RG F WG+ +K + LP
Sbjct: 272 NKMWLEPLLARISHSRTIVPMPVIDIINADTFQYSS----SPLVRGGFNWGLHFKWDSLP 327
>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Saccoglossus kowalevskii]
Length = 1082
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 96/226 (42%), Positives = 138/226 (61%), Gaps = 5/226 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
GPGE G+ L + D + +G N+ S+ IS +R+I D++ C Y DLP
Sbjct: 583 GPGENGQPVLLYGEQKKEADETFDVHGFNVVVSDMISLERSITDVKHSLCDTVRYNKDLP 642
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 328
ASVI+ FHNE +S+L+RT++S+I R+ + L+EIILVDD+S + +L L++YIQ FN
Sbjct: 643 TASVIISFHNEAWSTLLRTIYSVINRSKIKLLQEIILVDDYSDRDELKVALDEYIQSNFN 702
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KV+++ TEREGLIR R GA ++ G+++VFLD+HCEV NWL PL+ IY D +
Sbjct: 703 NKVKILHTTEREGLIRARLIGASKATGKILVFLDSHCEVNYNWLEPLIERIYRDSSTIAC 762
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
PVID ID ++ Y RG WG+ +K +P E +R
Sbjct: 763 PVIDIIDPDSF----AYSASPLVRGGVNWGLQFKWKNVPPVELLRR 804
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 88/189 (46%), Positives = 120/189 (63%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L++YI+ FN KV+++ TEREGLIR R GA ++ G+++VFLD+HCEV NWL
Sbjct: 688 ELKVALDEYIQSNFNNKVKILHTTEREGLIRARLIGASKATGKILVFLDSHCEVNYNWLE 747
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+ IY D + PVID ID ++ Y RG WG+ +K +P E +
Sbjct: 748 PLIERIYRDSSTIACPVIDIIDPDSF----AYSASPLVRGGVNWGLQFKWKNVPPVELLR 803
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R EP KSP AGGLFA+DR +F +G YD + +WGGE+ ELSF+IW CGG++E VP
Sbjct: 804 RNSEIEPIKSPIMAGGLFAVDRNYFEHIGSYDKDMQIWGGEHLELSFRIWQCGGTLEIVP 863
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 864 CSRVGHIFR 872
>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Saccoglossus kowalevskii]
Length = 524
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 90/187 (48%), Positives = 126/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y++RF +V+++R REGLIR R RG + ++GEV+ FLD+H E G+ WL P+
Sbjct: 124 LKAPLDKYMKRF-PQVKILRLENREGLIRGRLRGTEIAQGEVLTFLDSHIECGVGWLEPM 182
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I DR+ + P+IDGID + Y + RG F W M +K +P+ E K+RK
Sbjct: 183 LQRIKEDRRNVVAPMIDGIDATKFS----YAASNLIRGGFSWEMQFKWKPIPDYEMKRRK 238
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ P +SPT AGGLFA+D+++FLE+G YDPGL +WG EN ELSFKIWMCGG++E +PCS
Sbjct: 239 DETWPIRSPTMAGGLFAIDKSYFLEIGTYDPGLEIWGAENLELSFKIWMCGGNLEMIPCS 298
Query: 182 RIVSLIR 188
+ + R
Sbjct: 299 HVGHVFR 305
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 98/241 (40%), Positives = 141/241 (58%), Gaps = 6/241 (2%)
Query: 196 KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 254
++ NL+ P +GPGE G + A N S+ IS +R IPD+
Sbjct: 3 RVQNLDVTTAPRNPKGPGEYGVSVITRPEDEAKVKTGWKHASFNEFVSDMISVERAIPDV 62
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R EEC+ Y LP S+I+ F E +S+L+R+VHS+I R+P Q ++EIILVDDFSS+
Sbjct: 63 RPEECQDKLYSDSLPSTSIIICFTEESWSTLVRSVHSVINRSPPQLIKEIILVDDFSSRE 122
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L L+ Y++RF +V+++R REGLIR R RG + ++GEV+ FLD+H E G+ WL P
Sbjct: 123 YLKAPLDKYMKRF-PQVKILRLENREGLIRGRLRGTEIAQGEVLTFLDSHIECGVGWLEP 181
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+L I DR+ + P+IDGID + Y + RG F W M +K +P+ E K+R
Sbjct: 182 MLQRIKEDRRNVVAPMIDGIDATKFS----YAASNLIRGGFSWEMQFKWKPIPDYEMKRR 237
Query: 435 K 435
K
Sbjct: 238 K 238
>gi|170065987|ref|XP_001868085.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167862691|gb|EDS26074.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 639
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 94/189 (49%), Positives = 126/189 (66%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERFN-GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL LE +++FN K+RLIRN +REGL+R+R GA+ + G+V+VFLD+H EV ++W+
Sbjct: 220 DLGDDLEVGLKKFNNSKIRLIRNRDREGLMRSRVYGARNATGDVLVFLDSHIEVNVDWIE 279
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I +R I+ +PVID I+ T+ Y RG F WG+ +K + LP+ K
Sbjct: 280 PLLQRIKVNRTILAMPVIDIINSDTF----AYTSSPLVRGGFNWGLHFKWDNLPKGSLAK 335
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
P++SPT AGGLFAMDR +F ELG YD G+ VWGGEN E+SF+ W CGGSIE +P
Sbjct: 336 ETDFVGPFQSPTMAGGLFAMDRKYFKELGEYDMGMDVWGGENLEISFRAWQCGGSIELLP 395
Query: 180 CSRIVSLIR 188
CSRI + R
Sbjct: 396 CSRIGHVFR 404
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 126/206 (61%), Gaps = 5/206 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ S+ I R +PD R + C Y LP AS+I+ F+NE +L+R+
Sbjct: 134 DVGYRKHAFNVLVSSKIGPFREVPDTRHKLCPEQSYDKVLPSASIIMCFYNEHLQTLLRS 193
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRTRS 347
V+S++ RTPA L EIILVDD S DL LE +++FN K+RLIRN +REGL+R+R
Sbjct: 194 VNSVLGRTPAYLLHEIILVDDCSDFDDLGDDLEVGLKKFNNSKIRLIRNRDREGLMRSRV 253
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA+ + G+V+VFLD+H EV ++W+ PLL I +R I+ +PVID I+ T+ Y
Sbjct: 254 YGARNATGDVLVFLDSHIEVNVDWIEPLLQRIKVNRTILAMPVIDIINSDTF----AYTS 309
Query: 408 DHHYRGIFEWGMLYKENELPEREAKK 433
RG F WG+ +K + LP+ K
Sbjct: 310 SPLVRGGFNWGLHFKWDNLPKGSLAK 335
>gi|115533034|ref|NP_001041037.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
gi|87251651|emb|CAJ76949.1| Protein GLY-10, isoform b [Caenorhabditis elegans]
Length = 622
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 106/240 (44%), Positives = 146/240 (60%), Gaps = 11/240 (4%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 261
E +EGPGE GK LPE +A L Y G N S+ IS +R+I D+R +ECK
Sbjct: 91 EKRREGPGEWGKPVKLPEDKEVEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHKECKN 149
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 150 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 209
Query: 322 DYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
D+++ + + V+++R +REGLIR R GA+++ GE+++FLDAH E NWLPPLL PI
Sbjct: 210 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPI 269
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP--EREAKKRKYN 437
D + + P +D ID +T+E R D RG F+W YK L +RE+ + +N
Sbjct: 270 AEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKDRESPTKPFN 326
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 111/305 (36%), Positives = 163/305 (53%), Gaps = 37/305 (12%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LED++++ + V+++R +REGLIR R GA+++ GE+++FLDAH E NWLP
Sbjct: 204 LRQPLEDFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLP 263
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 264 PLLDPIAEDYRTVVCPFVDVIDCETYEVRP---QDEGARGSFDWAFNYKRLPLTKKD--- 317
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R+ ++P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + P
Sbjct: 318 RESPTKPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAP 377
Query: 180 CSRIVSLIRPVFK--ADGKLGNL---------EPPLEPYKEG-----PGEGGKAYHLPEA 223
CSR+ + R + + +G+ E ++ YKE PG G
Sbjct: 378 CSRVAHIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDDYKETLYKHRPGVGNA------- 430
Query: 224 YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFS 283
AGD L + G+ E SFD + ++ ++ KY YP PKAS N G +
Sbjct: 431 --DAGDLKLMK-GIR-EKLQCKSFDWFMKEIAFDQDKY--YPAVEPKASAEGEIRNVGTN 484
Query: 284 SLMRT 288
+ T
Sbjct: 485 FCIDT 489
>gi|256081587|ref|XP_002577050.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 469
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 92/174 (52%), Positives = 119/174 (68%), Gaps = 6/174 (3%)
Query: 15 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 74
G VR+I R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI ++ K +
Sbjct: 158 GIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAANYKTVVC 217
Query: 75 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 134
P ID ID T+E+R+ D RG F+W + YK LP R + R + EP+ SP AG
Sbjct: 218 PFIDVIDADTFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDRYHPEEPFDSPVMAG 271
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRI + R
Sbjct: 272 GLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMVDAPCSRIGHIYR 325
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/228 (41%), Positives = 137/228 (60%), Gaps = 6/228 (2%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
LE + GPGE G + L + + ++ E G ++ S I DR+I D+R CK
Sbjct: 31 LESLRVGPGENGMPFELSYHDKELSNKTVNENGFSVYVSGKIKIDRSIKDIRHPRCKGKL 90
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y +LP SVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 91 YSSNLPTVSVIIPFFEEHWETLLRTVSSVLNRAPSGLIKEVILVDDGSSRKYLKDRLDSH 150
Query: 324 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+ G VR+I R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 151 LATAYPGGIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAA 210
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPE 428
+ K + P ID ID T+E+R+ D RG F+W + YK LPE
Sbjct: 211 NYKTVVCPFIDVIDADTFEYRA---QDEGARGAFDWELYYKRLPRLPE 255
>gi|431895737|gb|ELK05156.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Pteropus alecto]
Length = 447
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 123/189 (65%), Gaps = 4/189 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N S + +R +PD R + C+ YP+ LP AS+++ FHNE F++L RTV S+I
Sbjct: 104 EYGFNAVVSTSLGRERLVPDTRDKMCRRKHYPVSLPTASIVICFHNEEFNALFRTVSSVI 163
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
TP LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR R GA +
Sbjct: 164 NLTPHHVLEEIILVDDMSEFDDLKEKLDHHLEMFRGKIKLIRNQKREGLIRARLIGASRA 223
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
G+V+VFLD+HCEV WL PLL I DRK++ PVID ID T E+R P RG
Sbjct: 224 SGDVLVFLDSHCEVNRVWLEPLLYAISKDRKMVVCPVIDVIDSTTLEYR----PSPLVRG 279
Query: 414 IFEWGMLYK 422
F+W + +K
Sbjct: 280 AFDWYLQFK 288
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 91/191 (47%), Positives = 124/191 (64%), Gaps = 4/191 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDHHLEMFRGKIKLIRNQKREGLIRARLIGASRASGDVLVFLDSHCEVNRVWLEP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I DRK++ PVID ID T E+R P RG F+W + +K + + E
Sbjct: 245 LLYAISKDRKMVVCPVIDVIDSTTLEYR----PSPLVRGAFDWYLQFKWDNVFSYELDGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP AGG+FA+ R +F E+G YD G+ +WGGEN ELS +IWMCGG I +PC
Sbjct: 301 EGLTRPIRSPAMAGGIFAIRRHYFNEIGQYDKGMDLWGGENLELSLRIWMCGGQIFILPC 360
Query: 181 SRIVSLIRPVF 191
SR+ + + F
Sbjct: 361 SRVGHITKQQF 371
>gi|311275138|ref|XP_003134591.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Sus
scrofa]
Length = 608
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++YI+++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYIQKYLTGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ ++ + +P E +
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSASPVVRGGFNWGLHFRWDLVPLSELEGP 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 128/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK Y DLP ASVI+ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYRTDLPVASVIICFYNEALSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLTGKIKVIRNTKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSA 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ ++ + +P E
Sbjct: 287 SPVVRGGFNWGLHFRWDLVPLSE 309
>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 565
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 139/231 (60%), Gaps = 5/231 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--LD 267
PGE GK + N+ SN IS +R++PD+RM+ CK YP
Sbjct: 61 APGEMGKGVVIAPEEEELKKEMFKINQFNLLASNKISVNRSLPDVRMDGCKKKTYPPHNT 120
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LPK S+++VFHNE +S+L+R VHSII R+P LEEIILVDD S + L ++LEDY+++
Sbjct: 121 LPKTSIVIVFHNEAWSTLIRNVHSIINRSPRMLLEEIILVDDASERDFLGKELEDYVKKL 180
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+VR+ R +R GLIR R RGA S GEVI FLDAHCE WL PL+A I DR +
Sbjct: 181 PVRVRVERMDKRSGLIRARLRGAGVSTGEVITFLDAHCECTQGWLEPLMARIAEDRSRVV 240
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I +T+EF + D Y G F W + ++ +P+RE +RK +R
Sbjct: 241 CPIIDVISDETFEFHA--GSDMTYGG-FNWKLNFRWYSVPKREMDRRKGDR 288
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+++ +VR+ R +R GLIR R RGA S GEVI FLDAHCE WL PL
Sbjct: 169 LGKELEDYVKKLPVRVRVERMDKRSGLIRARLRGAGVSTGEVITFLDAHCECTQGWLEPL 228
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
+A I DR + P+ID I +T+EF + D Y G F W + ++ +P+RE +RK
Sbjct: 229 MARIAEDRSRVVCPIIDVISDETFEFHA--GSDMTYGG-FNWKLNFRWYSVPKREMDRRK 285
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +PT AGGLFA+ + +F E+G YD G+ +WGGEN E+SF+IWMCGG++E V C
Sbjct: 286 GDRTIPLNTPTMAGGLFAIHKDYFEEIGTYDAGMDIWGGENLEMSFRIWMCGGTLEIVTC 345
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 346 SHVGHVFR 353
>gi|354478256|ref|XP_003501331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Cricetulus griseus]
gi|344235668|gb|EGV91771.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Cricetulus
griseus]
Length = 608
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 127/203 (62%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R +C+ YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAKCRGKSYPADLPTASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA L EIILVDD S DL +L++YIQR+ KV++IRN +REGLIR R
Sbjct: 171 VHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNRKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I D + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPVSE 309
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L++YI+R+ KV++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL
Sbjct: 197 DLKGELDEYIQRYLPAKVKVIRNRKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I D + PVID I T Y RG F WG+ +K + +P E
Sbjct: 257 PLLAIILEDPHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPVSELGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 ADGATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
Length = 632
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 106/272 (38%), Positives = 153/272 (56%), Gaps = 30/272 (11%)
Query: 194 DGKLGNLEPPLEPY----KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
D K G+L P L EGPGE GKA +P+ + N+ S+ I+ +R
Sbjct: 36 DRKDGSLLPALRAVISRRHEGPGEMGKAVVIPKDEQEKMKELFKINQFNLMASDMIALNR 95
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
++PD+R++ CK YP D+P SV++VFHNE +S+L+RTVHS+I R+P L EI+LVDD
Sbjct: 96 SLPDVRLDGCKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDD 155
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L +KLE+Y++ VR++R +R GLIR R RGA ++G+VI FLDAHCE +
Sbjct: 156 ASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTV 215
Query: 370 NWLPPLLAPIYSD-----------------------RKIMTVPVIDGIDYQTWEFRSVYE 406
WL PLLA I D R + P+ID I +T+E+ +
Sbjct: 216 GWLEPLLARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMA--G 273
Query: 407 PDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
D Y G F W + ++ +P+RE +RK +R
Sbjct: 274 SDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 304
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 88/211 (41%), Positives = 125/211 (59%), Gaps = 27/211 (12%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE+Y+ VR++R +R GLIR R RGA ++G+VI FLDAHCE + WL PL
Sbjct: 162 LKKKLENYVRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPL 221
Query: 62 LAPIYSDR-----------------------KIMTVPVIDGIDYQTWEFRSVYEPDHHYR 98
LA I DR + P+ID I +T+E+ + D Y
Sbjct: 222 LARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMA--GSDMTYG 279
Query: 99 GIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVW 157
G F W + ++ +P+RE +RK + + P ++PT AGGLF++D+ +F E+G YDPG+ +W
Sbjct: 280 G-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIW 338
Query: 158 GGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GGEN E+SF+IW CGGS+E V CS + + R
Sbjct: 339 GGENLEMSFRIWQCGGSLEIVTCSHVGHVFR 369
>gi|55742075|ref|NP_001006904.1| polypeptide N-acetylgalactosaminyltransferase 11 [Xenopus
(Silurana) tropicalis]
gi|49522064|gb|AAH75106.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus (Silurana) tropicalis]
Length = 563
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 94/189 (49%), Positives = 123/189 (65%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL K L+ Y+ E + KV+L+RN +REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 152 DLKKDLDGYMQENLSKKVKLVRNKQREGLIRGRMVGASHATGDVLVFLDSHCEVNEMWLQ 211
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLAPI + + + PVID I T +Y RG F WG+ +K + +P E
Sbjct: 212 PLLAPIKENPRTVVCPVIDIISADTL----IYSSSPVVRGGFNWGLHFKWDPVPLAELGG 267
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ S P++SPT AGGLFAMDR +F LG YD G+ +WGGEN E+SF+IWMCGGS+ VP
Sbjct: 268 PEGFSAPFRSPTMAGGLFAMDREYFNMLGQYDSGMDIWGGENLEISFRIWMCGGSLLIVP 327
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 328 CSRVGHIFR 336
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 106/263 (40%), Positives = 144/263 (54%), Gaps = 14/263 (5%)
Query: 177 WVPCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--------EAYRAAG 228
W P SR I P F D +G P + + E L E +
Sbjct: 7 WSPTSRFPKKIYPRFSRD-SMGQHSDPRKGHNGNQLETEANADLSPELGMIFNEQDQDVR 65
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R +C YP DLP AS+++ F+NE FS+L+RT
Sbjct: 66 DVGYQKHAFNLLISNRLGYHRDVPDTRDSKCAKKTYPPDLPMASIVICFYNEAFSALLRT 125
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 347
VHS++ RTPAQ L EIILVDD S DL + L+ Y+Q + KV+L+RN +REGLIR R
Sbjct: 126 VHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDGYMQENLSKKVKLVRNKQREGLIRGRM 185
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + G+V+VFLD+HCEV WL PLLAPI + + + PVID I T +Y
Sbjct: 186 VGASHATGDVLVFLDSHCEVNEMWLQPLLAPIKENPRTVVCPVIDIISADTL----IYSS 241
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 242 SPVVRGGFNWGLHFKWDPVPLAE 264
>gi|332030162|gb|EGI69956.1| N-acetylgalactosaminyltransferase 6 [Acromyrmex echinatior]
Length = 603
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 96/198 (48%), Positives = 128/198 (64%), Gaps = 8/198 (4%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KKL+DY+ + KV ++R ++R GLIR R GAK++R +V+VFLD+H E +NWLPP
Sbjct: 188 ELKKKLDDYVAQHLPKVSIVRLSKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPP 247
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI + K P ID I Y+T+E+ + D RG F+W + YK L + K+
Sbjct: 248 LLEPIAQNYKTCVCPFIDVIAYETFEYIA---QDEGSRGAFDWELYYKRLPLLPEDLKR- 303
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+EP+KSP AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PC
Sbjct: 304 --PTEPFKSPIMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPC 361
Query: 181 SRIVSLIR--PVFKADGK 196
SR+ + R P F G+
Sbjct: 362 SRVGHVYRKFPPFPNPGR 379
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 134/218 (61%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK L + + G N S+ IS +R++PD+R +C+ Y
Sbjct: 78 EEKRTGIGEHGKPAFLSPSLDVLKEKLYQVNGFNAAVSDEISMNRSVPDIRHPDCRKKKY 137
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+L+RT S++ R+P LEEIILVDD S+K +L +KL+DY+
Sbjct: 138 LKNLDPISVIVSFHNEHFSTLLRTCWSVVNRSPPSLLEEIILVDDASTKIELKKKLDDYV 197
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+ KV ++R ++R GLIR R GAK++R +V+VFLD+H E +NWLPPLL PI + K
Sbjct: 198 AQHLPKVSIVRLSKRSGLIRGRLAGAKKARAKVLVFLDSHSEANVNWLPPLLEPIAQNYK 257
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+ + D RG F+W + YK
Sbjct: 258 TCVCPFIDVIAYETFEYIA---QDEGSRGAFDWELYYK 292
>gi|350644736|emb|CCD60531.1| n-acetylgalactosaminyltransferase,putative [Schistosoma mansoni]
Length = 508
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 92/174 (52%), Positives = 119/174 (68%), Gaps = 6/174 (3%)
Query: 15 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 74
G VR+I R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI ++ K +
Sbjct: 197 GIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAANYKTVVC 256
Query: 75 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 134
P ID ID T+E+R+ D RG F+W + YK LP R + R + EP+ SP AG
Sbjct: 257 PFIDVIDADTFEYRA---QDEGARGAFDWELYYKR--LP-RLPEDRYHPEEPFDSPVMAG 310
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + PCSRI + R
Sbjct: 311 GLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMVDAPCSRIGHIYR 364
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/228 (41%), Positives = 137/228 (60%), Gaps = 6/228 (2%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
LE + GPGE G + L + + ++ E G ++ S I DR+I D+R CK
Sbjct: 70 LESLRVGPGENGMPFELSYHDKELSNKTINENGFSVYVSGKIKIDRSIKDIRHPRCKGKL 129
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y +LP SVI+ F E + +L+RTV S++ R P+ ++E+ILVDD SS+ L +L+ +
Sbjct: 130 YSSNLPTVSVIIPFFEEHWETLLRTVSSVLNRAPSGLIKEVILVDDGSSRKYLKDRLDSH 189
Query: 324 IQRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+ G VR+I R GLIR ++ GA+E+ GEV++FLD+HCE G+NWLPPLL PI +
Sbjct: 190 LATAYPGGIVRVIHLEHRGGLIRAKTAGAREATGEVLIFLDSHCEAGINWLPPLLDPIAA 249
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPE 428
+ K + P ID ID T+E+R+ D RG F+W + YK LPE
Sbjct: 250 NYKTVVCPFIDVIDADTFEYRA---QDEGARGAFDWELYYKRLPRLPE 294
>gi|241998138|ref|XP_002433712.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215495471|gb|EEC05112.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 653
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L +Y+ + VRLIR +REGLIR R GA+ + GEV+VFLD+HCEV + WL
Sbjct: 235 DLGPQLAEYVSSQLPSHVRLIRTRDREGLIRARMFGARNASGEVLVFLDSHCEVNVGWLE 294
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I ++R +T P+ID I+ T+E Y RG F WG+ +K P A+K
Sbjct: 295 PLLERIRANRATVTCPIIDIINADTFE----YTASPIVRGGFNWGLHFKWESPPAGLARK 350
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P SPT AGGLFAMDR FF LG YD G+ +WGGEN E+SF+IWMCGG +E +P
Sbjct: 351 GRGAIAPIPSPTMAGGLFAMDRKFFHRLGEYDDGMDIWGGENLEISFRIWMCGGQLEIIP 410
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 411 CSRVGHVFR 419
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/201 (43%), Positives = 128/201 (63%), Gaps = 5/201 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
++ N+ SN + F R++PD R C+ ++ +LP ASV++ F+NE +S+L+RTVH+++
Sbjct: 154 QHAFNLLISNRLGFYRSLPDTRNPLCRSEEHGAELPTASVVVCFYNEAWSTLLRTVHTVL 213
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
RTP L E+ILVDD S++ DL +L +Y+ + VRLIR +REGLIR R GA+
Sbjct: 214 GRTPRHLLHEVILVDDNSTQVDLGPQLAEYVSSQLPSHVRLIRTRDREGLIRARMFGARN 273
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ GEV+VFLD+HCEV + WL PLL I ++R +T P+ID I+ T+E Y R
Sbjct: 274 ASGEVLVFLDSHCEVNVGWLEPLLERIRANRATVTCPIIDIINADTFE----YTASPIVR 329
Query: 413 GIFEWGMLYKENELPEREAKK 433
G F WG+ +K P A+K
Sbjct: 330 GGFNWGLHFKWESPPAGLARK 350
>gi|410909548|ref|XP_003968252.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Takifugu rubripes]
Length = 580
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 122/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y++ GKV+++RN REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 170 ELKGDLDRYVQAELQGKVKVLRNQRREGLIRGRMIGAAHASGQVLVFLDSHCEVNQMWLE 229
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLA I+ DR+ + PVID I T Y P RG F WG+ +K + +P E K
Sbjct: 230 PLLASIHEDRRTVVCPVIDIISADTLS----YSPSPIVRGGFNWGLHFKWDPVPPSELKS 285
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
K +P +SPT AGGLFA++R +F E+G YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 286 PKGPVDPIRSPTMAGGLFAINRKYFNEMGQYDAGMDIWGGENLEISFRIWMCGGQLLIIP 345
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 346 CSRVGHIFR 354
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/215 (43%), Positives = 126/215 (58%), Gaps = 5/215 (2%)
Query: 222 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 281
EA + D+ + N+ S + R +PD R +C+ YP DLP ASV++ F NE
Sbjct: 77 EADQQLRDSGYHRHAFNLLISTRLGPHRDLPDTRDPQCRDRIYPRDLPPASVVICFFNEA 136
Query: 282 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 340
S+L+RTVHS++ RT L EIILVDD+S +L L+ Y+Q GKV+++RN RE
Sbjct: 137 LSALLRTVHSVLDRTAPFLLHEIILVDDYSELEELKGDLDRYVQAELQGKVKVLRNQRRE 196
Query: 341 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 400
GLIR R GA + G+V+VFLD+HCEV WL PLLA I+ DR+ + PVID I T
Sbjct: 197 GLIRGRMIGAAHASGQVLVFLDSHCEVNQMWLEPLLASIHEDRRTVVCPVIDIISADTLS 256
Query: 401 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
Y P RG F WG+ +K + +P E K K
Sbjct: 257 ----YSPSPIVRGGFNWGLHFKWDPVPPSELKSPK 287
>gi|395838351|ref|XP_003792079.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Otolemur garnettii]
Length = 608
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 106/250 (42%), Positives = 153/250 (61%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P F+A D K G++E P++ + + E G ++ E + D ++ N+
Sbjct: 66 VLEPQFRANRIDDMKDGHVEDPVKDHLKFSSELGMIFN--ERDQELRDLGYQKHAFNVLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTP L
Sbjct: 124 SNRLGYHRDVPDTRNAACKEQSYPTDLPVASVVICFYNEAFSALLRTVHSVIDRTPVHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
E+ILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA ++ GEV+VF
Sbjct: 184 HEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAQATGEVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D++ + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAAIREDQQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLH 299
Query: 421 YKENELPERE 430
+K + +P E
Sbjct: 300 FKWDLVPLSE 309
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 126/188 (67%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA ++ GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAQATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D++ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDQQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGE 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
>gi|291397404|ref|XP_002715111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Oryctolagus cuniculus]
Length = 608
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDRHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSEQGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRLYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRNAACKDKSYPADLPVASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 VHSVLDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 641
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 103/251 (41%), Positives = 146/251 (58%), Gaps = 13/251 (5%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHIS 246
RPV+ K + P K GEGG+A +L EA + + + N S+ IS
Sbjct: 110 RPVYDISAKKN-----INPMK---GEGGEASYLDTEAEKQYAEKIFANHSFNSVLSDKIS 161
Query: 247 FDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
DRT+ D+R + C K+ YP LP ASVI+ FHNE +S L+RTVHS++ RTP L +I
Sbjct: 162 LDRTMRDVRGDLCIEKHKTYPRKLPTASVIICFHNEAYSVLLRTVHSVLNRTPPDLLTDI 221
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
ILVDD S +L + L+D++ + + K+++IRN +R GLIR+R GA SRG+V++FLD+H
Sbjct: 222 ILVDDKSEYENLKRPLDDHVAQLSKKIKIIRNAKRSGLIRSRINGADLSRGDVLIFLDSH 281
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKEN 424
CE W PLLA I + VP+I+ I+ T ++ + PD RG F W + YK
Sbjct: 282 CETTPGWAEPLLARIAEKSSNVVVPIIEVINADTLQYAAAANPDQ--RGGFSWDLFYKWK 339
Query: 425 ELPEREAKKRK 435
+P E RK
Sbjct: 340 PIPLDEQHLRK 350
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 122/188 (64%), Gaps = 2/188 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L + L+D++ + + K+++IRN +R GLIR+R GA SRG+V++FLD+HCE W P
Sbjct: 232 NLKRPLDDHVAQLSKKIKIIRNAKRSGLIRSRINGADLSRGDVLIFLDSHCETTPGWAEP 291
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I + VP+I+ I+ T ++ + PD RG F W + YK +P E R
Sbjct: 292 LLARIAEKSSNVVVPIIEVINADTLQYAAAANPDQ--RGGFSWDLFYKWKPIPLDEQHLR 349
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + ++PT AGGLFA+DR +F ++G YD + +WGGEN E+SF+IWMCGG I+ +PC
Sbjct: 350 KSPIDVIRTPTMAGGLFAIDRKYFYDMGTYDEEMDIWGGENLEMSFRIWMCGGRIDIIPC 409
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 410 SRVGHIFR 417
>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
ricinus]
Length = 582
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 96/228 (42%), Positives = 139/228 (60%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G+ + + A N+ S+ I+ +R++PD+R+E+CK YP LP
Sbjct: 81 PGENGRGVEIGKDEEALKKEKFKLNQFNLLASDRIALNRSLPDVRLEKCKDKVYPEKLPT 140
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
SV +VFHNE +S+L+RTVHS+I+ +P LEEIILVDD S + L ++LEDY+ + +
Sbjct: 141 TSVDIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKQLEDYVVKLDTP 200
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V+++R +R GLIR R GA +G+VI FLDAHCE NWL PLLA I DR + PV
Sbjct: 201 VKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLARIAEDRTRVVCPV 260
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I +T+E+ S + G F W + ++ +P+RE +R +R
Sbjct: 261 IDVISDETFEYISASDLTW---GGFNWKLNFRGYRVPQRELDRRGGDR 305
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + + V+++R +R GLIR R GA +G+VI FLDAHCE NWL PL
Sbjct: 186 LGKQLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + PVID I +T+E+ S + G F W + ++ +P+RE +R
Sbjct: 246 LARIAEDRTRVVCPVIDVISDETFEYISASDLTW---GGFNWKLNFRGYRVPQRELDRRG 302
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLFA+D+ +F+ELG YD G+ +WGGEN ELSF+IWMCGG +E VPC
Sbjct: 303 GDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPC 362
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 363 SHVGHVFR 370
>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 95/187 (50%), Positives = 126/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE + +F KV+LIRN R+GLIR+R RG ++G V+V LD+H EV NWLPPL
Sbjct: 200 LYKELESSLAKF-PKVKLIRNPTRQGLIRSRVRGVHLAKGGVVVILDSHVEVSTNWLPPL 258
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI DRK + P+ID ID + +++ V +P RG F+W + YK +P K+ K
Sbjct: 259 LHPISLDRKTVVCPMIDIIDNENFQY--VTQPGDAMRGAFDWELYYKRIPIPNE--KRPK 314
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
SEP++SP AGGLFA++R +F E+G YD GL +WGGE +ELSFK+WMCGG I PCS
Sbjct: 315 DPSEPFESPVMAGGLFAIERNYFYEIGLYDEGLEIWGGEQYELSFKVWMCGGRILDSPCS 374
Query: 182 RIVSLIR 188
RI + R
Sbjct: 375 RIGHIYR 381
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 100/229 (43%), Positives = 141/229 (61%), Gaps = 6/229 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK L + ++ + G N+ S+ IS DR++ D+R CK Y
Sbjct: 92 EAARTGLGEQGKPVTLFGHEKL--HSAYKDNGFNILVSDRISLDRSLHDIRHASCKSKKY 149
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
DLP SVI+ FHNEG S+L+RT+HS+ R+P L+EI+LVDD SS+ L ++LE +
Sbjct: 150 YSDLPDVSVIIPFHNEGLSTLLRTIHSLHNRSPESLLKEIVLVDDASSRP-LYKELESSL 208
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+F KV+LIRN R+GLIR+R RG ++G V+V LD+H EV NWLPPLL PI DRK
Sbjct: 209 AKF-PKVKLIRNPTRQGLIRSRVRGVHLAKGGVVVILDSHVEVSTNWLPPLLHPISLDRK 267
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+ P+ID ID + +++ V +P RG F+W + YK +P + K
Sbjct: 268 TVVCPMIDIIDNENFQY--VTQPGDAMRGAFDWELYYKRIPIPNEKRPK 314
>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
Length = 604
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 86/173 (49%), Positives = 117/173 (67%), Gaps = 2/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
K R++R +ER GLIR + GA++++G+V+ FLD+HCE + WL PLL IY DR + P
Sbjct: 214 KTRVLRLSERSGLIRAKVFGAEQAKGKVVTFLDSHCECNVGWLEPLLERIYLDRTTVVTP 273
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID ID +T+ + P RGIF W + + +LP E KKRK P SPT AGG
Sbjct: 274 VIDNIDKKTFAYTG--SPTVITRGIFTWSLTFSWLDLPWFEQKKRKDPIAPLPSPTMAGG 331
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF+MDR +F E+G YD G+ VWGGEN E+SF+IW CGG++E++PCSR+ + R
Sbjct: 332 LFSMDREYFFEIGSYDMGMDVWGGENLEISFRIWQCGGTLEFIPCSRVGHVYR 384
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 86/201 (42%), Positives = 120/201 (59%), Gaps = 4/201 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS R I D R CK YPLD LP +VI+ FHNE ++L+RTV SI+
Sbjct: 120 NFNQYISDRISLHRPIKDTRHAMCKDRTYPLDKLPDTTVIIPFHNEARTTLLRTVWSILD 179
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
R+P + EI+L+DD S+ L L++ + K R++R +ER GLIR + GA++++
Sbjct: 180 RSPPSLINEILLIDDASTMEHLKAPLDEELATI-PKTRVLRLSERSGLIRAKVFGAEQAK 238
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+ FLD+HCE + WL PLL IY DR + PVID ID +T+ + P RGI
Sbjct: 239 GKVVTFLDSHCECNVGWLEPLLERIYLDRTTVVTPVIDNIDKKTFAYTG--SPTVITRGI 296
Query: 415 FEWGMLYKENELPEREAKKRK 435
F W + + +LP E KKRK
Sbjct: 297 FTWSLTFSWLDLPWFEQKKRK 317
>gi|47221374|emb|CAF97292.1| unnamed protein product [Tetraodon nigroviridis]
Length = 352
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 92/187 (49%), Positives = 126/187 (67%), Gaps = 5/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++R KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NWLPPL
Sbjct: 13 LKVPLEEYMKRMP-KVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNWLPPL 71
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ + + + + RG F+W M YK +P +
Sbjct: 72 LDRIAQNRKSIVCPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIPAEMQGEDP 129
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S+P++SP AGGLFA+DR +F ELGGYD GL +WGGE +E+SFK+WMCGG +E +PCS
Sbjct: 130 --SQPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCS 187
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 188 RVGHIYR 194
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 73/116 (62%), Gaps = 3/116 (2%)
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
S L LE+Y++R KVR++R +REGLIRTR GA ++GEVI FLD+HCE +NW
Sbjct: 9 STEHLKVPLEEYMKRMP-KVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNW 67
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
LPPLL I +RK + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 68 LPPLLDRIAQNRKSIVCPMIDVIDHDNFGYDT--QAGDAMRGAFDWEMYYKRIPIP 121
>gi|358331987|dbj|GAA50722.1| putative polypeptide N-acetylgalactosaminyltransferase 10
[Clonorchis sinensis]
Length = 738
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 95/189 (50%), Positives = 126/189 (66%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L L+ Y+E+ +G VR+IR+ +REGLI R RGA+ + GEV+VFLD+HCE NWLP
Sbjct: 318 LKGPLDRYLEQHYPDGLVRVIRSPKREGLITARIRGARAATGEVLVFLDSHCEANPNWLP 377
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+ PI D K++T P ID I T+E+R+ D RG F+W + YK LP + +
Sbjct: 378 PLVDPIARDYKVVTCPFIDVISADTFEYRA---QDEGARGAFDWELFYKR--LP-KLPQD 431
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P+ SP AGGLFA+ +F ELGGYDPGL++WGGE +ELSFKIWMCGG + +P
Sbjct: 432 LPHPERPFDSPVMAGGLFAISAKWFWELGGYDPGLVIWGGEQYELSFKIWMCGGRMIDIP 491
Query: 180 CSRIVSLIR 188
CSRI + R
Sbjct: 492 CSRIGHIYR 500
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 91/220 (41%), Positives = 135/220 (61%), Gaps = 5/220 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + GPGE G A L + + L + G N S+ I+ DR++ D+R +CK Y
Sbjct: 207 EANRVGPGEQGAAVRLFGEQKVESEKFLNQNGFNTYISDMIAIDRSVADIRHPKCKAMLY 266
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+++ F E +++L+RT S +KR+P ++E+ILVDD S++ L L+ Y+
Sbjct: 267 LAKLPSVSLVIPFFQENWNALLRTFVSSLKRSPPGLIKEVILVDDGSTREYLKGPLDRYL 326
Query: 325 QRF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
++ +G VR+IR+ +REGLI R RGA+ + GEV+VFLD+HCE NWLPPL+ PI D
Sbjct: 327 EQHYPDGLVRVIRSPKREGLITARIRGARAATGEVLVFLDSHCEANPNWLPPLVDPIARD 386
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
K++T P ID I T+E+R+ D RG F+W + YK
Sbjct: 387 YKVVTCPFIDVISADTFEYRA---QDEGARGAFDWELFYK 423
>gi|341897758|gb|EGT53693.1| CBN-GLY-10 protein [Caenorhabditis brenneri]
Length = 620
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 102/231 (44%), Positives = 141/231 (61%), Gaps = 9/231 (3%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 261
E +EGPGE GK LP+ +A L Y G N S+ IS +R+I D+R +CK
Sbjct: 89 EKAREGPGEWGKPVKLPDDKETEKEA-LSLYKANGYNAYISDMISLNRSIKDIRHRDCKK 147
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 148 MTYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207
Query: 322 DYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
D+++ + + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD 315
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/189 (44%), Positives = 119/189 (62%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LED++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLP
Sbjct: 202 LRQPLEDFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLP 261
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 262 PLLDPIAEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD--- 315
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R+ + P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + P
Sbjct: 316 RENPTTPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAP 375
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 376 CSRVAHIYR 384
>gi|351714167|gb|EHB17086.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Heterocephalus
glaber]
Length = 330
Score = 184 bits (468), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 98/224 (43%), Positives = 139/224 (62%), Gaps = 3/224 (1%)
Query: 215 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 274
GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +LP SV+
Sbjct: 2 GKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVV 61
Query: 275 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 334
+VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++ V++I
Sbjct: 62 IVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKFTLENYVKNLEVPVKII 121
Query: 335 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 394
R ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK + P+ID I
Sbjct: 122 RMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVI 181
Query: 395 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 182 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 222
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 103 LKFTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 162
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 163 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 219
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 220 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEISFRIWQCGGSLEIVTC 279
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 280 SHVGHVFR 287
>gi|332243650|ref|XP_003270991.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Nomascus leucogenys]
Length = 527
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 117 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 176
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 177 LLAAIREDQHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGA 232
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 233 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 292
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 293 SRVGHIFR 300
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 134/221 (60%), Gaps = 5/221 (2%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PG G + E + D ++ NM SN + + R +PD R C+ YP DLP
Sbjct: 12 PGCGQRGMIFNERDQELRDLGYQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPDLPS 71
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NG 329
ASV++ F+NE FS+L+RT HS+I RTPA L EIILVDD S DL +L++Y+Q++ G
Sbjct: 72 ASVVICFYNEAFSALLRTAHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPG 131
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
K+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL PLLA I D+ + P
Sbjct: 132 KIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCP 191
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
VID I T Y RG F WG+ +K + +P E
Sbjct: 192 VIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSE 228
>gi|332243648|ref|XP_003270990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Nomascus leucogenys]
Length = 608
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDQHTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 128/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R C+ YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPDLPSASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
HS+I RTPA L EIILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 171 AHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I D+ + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 621
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 97/228 (42%), Positives = 140/228 (61%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G+ + A N+ S+ I+ +R++PD+R+E+CK YP LP
Sbjct: 120 PGERGRGVEIGPEEEALKKEKFKLNQFNLLASDRIALNRSLPDVRLEKCKDKVYPEKLPT 179
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
SV++VFHNE +S+L+RTVHS+I+ +P LEEIILVDD S + L +KLEDY+ +
Sbjct: 180 TSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKKLEDYVVKLEVP 239
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V+++R +R GLIR R GA +G+VI FLDAHCE +WL PLLA I DR + PV
Sbjct: 240 VKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQHWLEPLLARIAEDRTRVVCPV 299
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I +T+E+ S D + G F W + ++ +P+RE ++R +R
Sbjct: 300 IDVISDETFEYISA--SDMTWGG-FNWKLNFRWYRVPQREVERRGGDR 344
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLEDY+ + V+++R +R GLIR R GA +G+VI FLDAHCE +WL PL
Sbjct: 225 LGKKLEDYVVKLEVPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQHWLEPL 284
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + PVID I +T+E+ S D + G F W + ++ +P+RE ++R
Sbjct: 285 LARIAEDRTRVVCPVIDVISDETFEYISA--SDMTWGG-FNWKLNFRWYRVPQREVERRG 341
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN ELSF+IWMCGG +E VPC
Sbjct: 342 GDRTLPIRTPTMAGGLFSIDKDYFNELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPC 401
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 402 SHVGHVFR 409
>gi|157117587|ref|XP_001658839.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108875983|gb|EAT40208.1| AAEL008037-PA [Aedes aegypti]
Length = 662
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 92/189 (48%), Positives = 125/189 (66%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL LE + N KVRLIRN EREGL+R+R GA+ + G+V++FLD+H EV ++W+
Sbjct: 243 DLRDNLEHELNALKNSKVRLIRNAEREGLMRSRVYGARNATGDVLIFLDSHIEVNVDWVE 302
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I +++ I+ +PVID I+ T+ +Y RG F WG+ +K + LP+ K
Sbjct: 303 PLLQRIKTNKTILAMPVIDIINSDTF----IYSSSPLVRGGFNWGLHFKWDNLPKGTLAK 358
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
P++SPT AGGLFA+DR +F +LG YD G+ VWGGEN E+SF+ W CGGSIE VP
Sbjct: 359 ESDFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMGMDVWGGENLEISFRTWQCGGSIELVP 418
Query: 180 CSRIVSLIR 188
CSRI + R
Sbjct: 419 CSRIGHVFR 427
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 87/201 (43%), Positives = 124/201 (61%), Gaps = 5/201 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN I R +PD R + C Y LP AS+I+ F+NE +L+R+
Sbjct: 157 DVGYRKHAFNVLVSNKIGPFRGVPDTRHKLCHEQSYDKVLPSASIIMCFYNEHLETLVRS 216
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
V SII+RTP+ L EIILVDD S DL LE + N KVRLIRN EREGL+R+R
Sbjct: 217 VTSIIRRTPSYLLHEIILVDDCSDLDDLRDNLEHELNALKNSKVRLIRNAEREGLMRSRV 276
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA+ + G+V++FLD+H EV ++W+ PLL I +++ I+ +PVID I+ T+ +Y
Sbjct: 277 YGARNATGDVLIFLDSHIEVNVDWVEPLLQRIKTNKTILAMPVIDIINSDTF----IYSS 332
Query: 408 DHHYRGIFEWGMLYKENELPE 428
RG F WG+ +K + LP+
Sbjct: 333 SPLVRGGFNWGLHFKWDNLPK 353
>gi|444724231|gb|ELW64842.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Tupaia chinensis]
Length = 654
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 123/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKVIRNKKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDRRTVVCPVIDIISADTL----AYSSSPAVRGGFNWGLHFKWDLVPLSELAGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 GGATAPIKSPTMAGGLFAMNRQYFSELGQYDSGMDIWGGENLEISFRIWMCGGQLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 130/203 (64%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRSAACKGKSYPADLPVASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS+I RTPA+ L E+ILVDD S DL +L++Y+Q++ GK+++IRN +REGLIR R
Sbjct: 171 VHSVIDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNKKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPAVRGGFNWGLHFKWDLVPLSE 309
>gi|348519900|ref|XP_003447467.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oreochromis niloticus]
Length = 777
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 126/188 (67%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+L+DY+ +F KVR++R ER+GLIR R GA ++GEV+ FLD+H E + WL PL
Sbjct: 382 LKKQLDDYMAQF-PKVRIVRLKERQGLIRARLAGAAVAKGEVLTFLDSHIECNVGWLEPL 440
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L +Y DRK + PVI+ I + + V D+ RGIF+W +++ + +P + KK
Sbjct: 441 LERVYLDRKKVPCPVIEVISDKDMSYMMV---DNFQRGIFKWPLVFGWSAVPPEDIKKFN 497
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
S+P + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFKIWMCGG IE +PC
Sbjct: 498 LTISDPIRCPVMAGGLFSIDKQYFFELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPC 557
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 558 SRVGHIFR 565
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 127/200 (63%), Gaps = 4/200 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
E N+ S+ I DR IPD R + C+ DLP SVI F +E +S+L+R+VHS++
Sbjct: 300 EGHFNVYLSDKIPVDRAIPDTRPQMCEQSLVHDDLPSTSVIFCFVDEVWSTLLRSVHSVL 359
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
R+P L+EIILVDDFS+K L ++L+DY+ +F KVR++R ER+GLIR R GA +
Sbjct: 360 NRSPPHLLKEIILVDDFSTKDYLKKQLDDYMAQF-PKVRIVRLKERQGLIRARLAGAAVA 418
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
+GEV+ FLD+H E + WL PLL +Y DRK + PVI+ I + + V D+ RG
Sbjct: 419 KGEVLTFLDSHIECNVGWLEPLLERVYLDRKKVPCPVIEVISDKDMSYMMV---DNFQRG 475
Query: 414 IFEWGMLYKENELPEREAKK 433
IF+W +++ + +P + KK
Sbjct: 476 IFKWPLVFGWSAVPPEDIKK 495
>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cricetulus griseus]
Length = 497
Score = 184 bits (467), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 98/224 (43%), Positives = 139/224 (62%), Gaps = 3/224 (1%)
Query: 215 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 274
GKA +P+ + N+ S+ I+ +R++PD+R+E CK YP +LP SV+
Sbjct: 2 GKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVV 61
Query: 275 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 334
+VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L LE+Y++ V++I
Sbjct: 62 IVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKII 121
Query: 335 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 394
R ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK + P+ID I
Sbjct: 122 RMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVI 181
Query: 395 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 182 SDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 222
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 103 LKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 162
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 163 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 219
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 220 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 279
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 280 SHVGHVFR 287
>gi|348513278|ref|XP_003444169.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Oreochromis niloticus]
Length = 584
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 105/236 (44%), Positives = 144/236 (61%), Gaps = 10/236 (4%)
Query: 205 EPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC-- 259
+P PGE G+A HL PE + D S+ Y +N+ S+ IS R I D RM+EC
Sbjct: 74 QPDNNAPGEWGRATHLNLSPEEKKQEQD-SVERYAINIYVSDKISLHRHIQDHRMKECRS 132
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K +DY LP SVI+ F+NE +S+L+RT+HS+++ TPA L+EIILVDDFS + L K
Sbjct: 133 KKFDY-RHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILVDDFSDRGYLKSK 191
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
L DYI +VRLIR +REGL+R R GA + G+V+ FLD HCE W+ PLL I
Sbjct: 192 LADYISDLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPLLERI 250
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ + PVID ID+ T+EF + D G F+W + ++ + +PE E K+RK
Sbjct: 251 SENASTIVCPVIDTIDWNTFEF--YMQTDEPMIGGFDWRLTFQWHSVPEMERKRRK 304
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 87/182 (47%), Positives = 116/182 (63%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL DYI +VRLIR +REGL+R R GA + G+V+ FLD HCE W+ PL
Sbjct: 188 LKSKLADYISDLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPL 246
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + PVID ID+ T+EF + D G F+W + ++ + +PE E K+RK
Sbjct: 247 LERISENASTIVCPVIDTIDWNTFEF--YMQTDEPMIGGFDWRLTFQWHSVPEMERKRRK 304
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ +A+F LG YD G+ VWGGEN ELSF++W CGGS+E PCS
Sbjct: 305 SRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLELSFRVWQCGGSLEIHPCS 364
Query: 182 RI 183
+
Sbjct: 365 HV 366
>gi|426228255|ref|XP_004008229.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Ovis
aries]
Length = 448
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 90/197 (45%), Positives = 126/197 (63%), Gaps = 4/197 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDYHLEIFRGKIKLIRNKKREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID IDY T E Y+P RG F W + +K + + E +
Sbjct: 245 LLNAIAKDPKMVVCPLIDVIDYMTLE----YQPSPIVRGAFNWHLEFKWDHVLSYEIEGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP AGG+FA+ R +F E+G YD G+ +WGGEN ELS +IWMCGG + +PC
Sbjct: 301 EGPTTPIRSPAMAGGIFAISRNYFNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPC 360
Query: 181 SRIVSLIRPVFKADGKL 197
SR+ + R D ++
Sbjct: 361 SRVGHINRQHMTNDSEI 377
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
YG N S ++ R++PD R C+ YP LP AS+I+ FHNE FS+L RT+ SI+
Sbjct: 105 YGFNHIISKNLGHYRSVPDTRNVMCRKKTYPARLPTASIIICFHNEEFSALFRTLSSIMA 164
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR R GA +
Sbjct: 165 LTPQYILEEIILVDDTSDFDDLKEKLDYHLEIFRGKIKLIRNKKREGLIRARMTGASHAS 224
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY T E Y+P RG
Sbjct: 225 GDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYMTLE----YQPSPIVRGA 280
Query: 415 FEWGMLYK 422
F W + +K
Sbjct: 281 FNWHLEFK 288
>gi|189217666|ref|NP_001121278.1| uncharacterized protein LOC100158361 [Xenopus laevis]
gi|115528277|gb|AAI24896.1| LOC100158361 protein [Xenopus laevis]
Length = 600
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/189 (49%), Positives = 122/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL K L+ Y+ E + KV+L+RN REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 189 DLKKDLDYYMQENLSKKVKLVRNKRREGLIRGRMVGASHATGDVLVFLDSHCEVNEMWLQ 248
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLAPI + K + PVID I T +Y RG F WG+ +K + +P E
Sbjct: 249 PLLAPIRENPKTVVCPVIDIISADTL----IYSQSPVVRGGFNWGLHFKWDPVPLSELGG 304
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P++SPT AGGLFAMDR +F LG YD G+ +WGGEN E+SF+IWMCGGS+ VP
Sbjct: 305 PEGFTAPFRSPTMAGGLFAMDREYFNTLGQYDSGMDIWGGENLEISFRIWMCGGSLLIVP 364
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 365 CSRVGHIFR 373
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 125/203 (61%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R +C YP DLP AS+++ F+NE S+L+RT
Sbjct: 103 DVGYQKHAFNLLISNRLGYHRDLPDTRDSKCSKKTYPADLPLASIVICFYNEASSALLRT 162
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 347
VHS++ RTPAQ L EIILVDD S DL + L+ Y+Q + KV+L+RN REGLIR R
Sbjct: 163 VHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDYYMQENLSKKVKLVRNKRREGLIRGRM 222
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + G+V+VFLD+HCEV WL PLLAPI + K + PVID I T +Y
Sbjct: 223 VGASHATGDVLVFLDSHCEVNEMWLQPLLAPIRENPKTVVCPVIDIISADTL----IYSQ 278
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 279 SPVVRGGFNWGLHFKWDPVPLSE 301
>gi|195377912|ref|XP_002047731.1| GJ13596 [Drosophila virilis]
gi|194154889|gb|EDW70073.1| GJ13596 [Drosophila virilis]
Length = 675
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 94/213 (44%), Positives = 131/213 (61%), Gaps = 3/213 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+A L E+ R E G N S+ IS +R++PD+R +EC+ Y LP
Sbjct: 152 GIGEHGEAAKLDESLRDKEQVLSLENGFNALLSDSISVNRSLPDIRHKECRKKQYLSKLP 211
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS +A L + LEDY+
Sbjct: 212 NVSVIIIFYNEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAPLFKPLEDYVAEHFS 271
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VR++R +R GLI RS GA+ + +V++FLD+H E NWLPPLL PI +++ P
Sbjct: 272 MVRIVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLPPLLDPIAQNKRAAVCP 331
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
ID ID+ + +R+ D RG F+W YK
Sbjct: 332 FIDVIDHSNFNYRA---QDEGARGAFDWDFFYK 361
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 118/188 (62%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ VR++R +R GLI RS GA+ + +V++FLD+H E NWLPPL
Sbjct: 258 LFKPLEDYVAEHFSMVRIVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLPPL 317
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 318 LDPIAQNKRAAVCPFIDVIDHSNFNYRA---QDEGARGAFDWDFFYKRLPLLPE----DL 370
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ S+P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 371 KHPSDPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 430
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 431 SRVGHIYR 438
>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 632
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 125/188 (66%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LE+Y+ KVR++R +REGLIR R +GA S+G V+ FLD+HCE W+ PL
Sbjct: 229 LKKPLEEYMSALK-KVRIVRMDKREGLIRARLKGAAVSKGAVVTFLDSHCECMEGWIEPL 287
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREAKKR 120
L I + + PVID ID +T+E+ Y + G F+W + + + +PER+ K R
Sbjct: 288 LDRIKRNSSTVVCPVIDVIDDETFEYH--YSKAYFTNVGGFDWSLQFNWHAIPERDRKNR 345
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + +P +SPT AGGLF++DRA+F +LG YDPG +WGGEN ELSFKIWMCGG++E VPC
Sbjct: 346 KRHIDPVRSPTMAGGLFSIDRAYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPC 405
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 406 SHVGHVFR 413
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 102/247 (41%), Positives = 141/247 (57%), Gaps = 15/247 (6%)
Query: 199 NLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGD-----ASLGEYGMNMETSNHISFDR 249
N + P+ YK+G GEGGK + + +A + A N S+ IS R
Sbjct: 105 NADSPI--YKKGDKNQAGEGGKPVKINQEQLSAQEREKYAAGFRNNAFNQYVSDMISIHR 162
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
++P EECK Y DLP SVI+ FHNE +S L+RTVHS+I+RTP L E+ILVDD
Sbjct: 163 SLPSTIDEECKTEKYLDDLPSTSVIICFHNEAWSVLLRTVHSVIERTPEHLLTEVILVDD 222
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS L + LE+Y+ KVR++R +REGLIR R +GA S+G V+ FLD+HCE
Sbjct: 223 FSDMDHLKKPLEEYMSALK-KVRIVRMDKREGLIRARLKGAAVSKGAVVTFLDSHCECME 281
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPE 428
W+ PLL I + + PVID ID +T+E+ Y + G F+W + + + +PE
Sbjct: 282 GWIEPLLDRIKRNSSTVVCPVIDVIDDETFEYH--YSKAYFTNVGGFDWSLQFNWHAIPE 339
Query: 429 REAKKRK 435
R+ K RK
Sbjct: 340 RDRKNRK 346
>gi|119582043|gb|EAW61639.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_a [Homo sapiens]
Length = 269
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 93/199 (46%), Positives = 128/199 (64%), Gaps = 3/199 (1%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D + E G N+ S+ IS +R++PD+R C Y LP S+I+ FHNEG+SSL+RT
Sbjct: 8 DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRT 67
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 348
VHS++ R+P + + EI+LVDDFS + L + LEDY+ F VR++R +REGLIRTR
Sbjct: 68 VHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALF-PSVRILRTKKREGLIRTRML 126
Query: 349 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 408
GA + G+VI FLD+HCE +NWLPPLL I +RK + P+ID ID+ +FR +
Sbjct: 127 GASVATGDVITFLDSHCEANVNWLPPLLDRIARNRKTIVCPMIDVIDHD--DFRYETQAG 184
Query: 409 HHYRGIFEWGMLYKENELP 427
RG F+W M YK +P
Sbjct: 185 DAMRGAFDWEMYYKRIPIP 203
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 85/167 (50%), Positives = 110/167 (65%), Gaps = 5/167 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ F VR++R +REGLIRTR GA + G+VI FLD+HCE +NWLPPL
Sbjct: 95 LKKPLEDYMALF-PSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 153
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 154 LDRIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP 211
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 168
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+
Sbjct: 212 --SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKV 256
>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
Length = 615
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 133/218 (61%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GKA L + D G N S+ IS +R+IPD+R +CK Y
Sbjct: 83 EEKRTGTGEQGKAATLSPSMEDLKDRLYKVNGFNAALSDLISLNRSIPDIRHPDCKNKRY 142
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
DL SV++ FHNE FS+LMRT S+I R+P L EIILVDD S+K +L KL++Y+
Sbjct: 143 LKDLDPVSVVVSFHNEHFSTLMRTCWSVINRSPPSLLHEIILVDDASTKVELKDKLDEYV 202
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
++ KV+++R R GLIR R GA+++ +++VFLD+H E +NWLPPLL PI D K
Sbjct: 203 KKNLPKVKIVRLPRRSGLIRGRLAGARKATAKILVFLDSHSEANVNWLPPLLEPIAKDYK 262
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 263 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 297
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/198 (47%), Positives = 126/198 (63%), Gaps = 8/198 (4%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KL++Y+++ KV+++R R GLIR R GA+++ +++VFLD+H E +NWLPP
Sbjct: 193 ELKDKLDEYVKKNLPKVKIVRLPRRSGLIRGRLAGARKATAKILVFLDSHSEANVNWLPP 252
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI D K P ID I Y+T+E+R+ D RG F+W + YK L + K
Sbjct: 253 LLEPIAKDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLKN- 308
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
SEP+KSP AGGLFA+ FF ELGGYDPGL +WGGE +ELSFKIW CGG + PC
Sbjct: 309 --PSEPFKSPVMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPC 366
Query: 181 SRIVSLIR--PVFKADGK 196
SR+ + R P F G+
Sbjct: 367 SRVGHIYRKFPPFPNPGR 384
>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
Length = 579
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/251 (39%), Positives = 150/251 (59%), Gaps = 7/251 (2%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
L+R +G+L + P +EGPGE GK L G A + ++ MN+ S+ I
Sbjct: 51 LVRLPETWNGELHQI-PNYTAPREGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKI 109
Query: 246 SFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 303
S DR +PD R++ CK +DY LPK SVI++F +E ++ L+RTVHS+I R+P + L+E
Sbjct: 110 SLDRDVPDPRIQACKDIKYDYAA-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQE 168
Query: 304 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
+IL+DD S + +L + L+++I+RF GKVRLIR +R GLIR + GA+E+ G++IVFLD+
Sbjct: 169 VILLDDNSKRQELQEPLDEHIKRFGGKVRLIRKHDRHGLIRAKLAGAREAVGDIIVFLDS 228
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 423
HCE WL P++ I +R + P+ID I T + + G F W + +
Sbjct: 229 HCEANHGWLEPIVQRISDERTAIVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTW 285
Query: 424 NELPEREAKKR 434
L E E K+R
Sbjct: 286 EGLSEEEQKRR 296
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 119/188 (63%), Gaps = 3/188 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L + L+++I+RF GKVRLIR +R GLIR + GA+E+ G++IVFLD+HCE WL P
Sbjct: 180 ELQEPLDEHIKRFGGKVRLIRKHDRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
++ I +R + P+ID I T + + G F W + + L E E K+R
Sbjct: 240 IVQRISDERTAIVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTWEGLSEEEQKRR 296
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ +SPT AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PC
Sbjct: 297 TKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPC 356
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 357 SHVGHIFR 364
>gi|73979014|ref|XP_539924.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Canis
lupus familiaris]
Length = 608
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +LE+Y++++ GK+++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELEEYVQKYLPGKIKVIRNIKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D++ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIQEDQQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 95/203 (46%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R C+ +P DLP ASV++ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACRDKSFPADLPAASVVICFYNEALSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPAQ L EIILVDD S DL +LE+Y+Q++ GK+++IRN +REGLIR R
Sbjct: 171 VHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNIKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I D++ + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIQEDQQTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
>gi|268572569|ref|XP_002641355.1| C. briggsae CBR-GLY-9 protein [Caenorhabditis briggsae]
Length = 579
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/235 (41%), Positives = 142/235 (60%), Gaps = 6/235 (2%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK- 260
P +EGPGE GK L G A + ++ MN+ S+ IS DR +PD R++ CK
Sbjct: 66 PDYSQPREGPGEKGKPVVLSGKEAELGHADMKKWFMNVHASDKISLDRDVPDPRIQACKD 125
Query: 261 -YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+DY LPK SVI++F +E ++ L+RTVHS+I R+P + L+EIIL+DD S + +L +
Sbjct: 126 IKYDYAT-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEIILLDDNSKRQELQEP 184
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
L+++I+RF GKVRLIR R GLIR + GA+E+ G++IVFLD+HCE WL P++ I
Sbjct: 185 LDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRI 244
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+R + P+ID I T + + G F W + + LP+ E K+R
Sbjct: 245 SDERTAIVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTWEGLPDEELKRR 296
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 119/188 (63%), Gaps = 3/188 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L + L+++I+RF GKVRLIR R GLIR + GA+E+ G++IVFLD+HCE WL P
Sbjct: 180 ELQEPLDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
++ I +R + P+ID I T + + G F W + + LP+ E K+R
Sbjct: 240 IVQRISDERTAIVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTWEGLPDEELKRR 296
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ +SPT AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PC
Sbjct: 297 TKVTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPC 356
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 357 SHVGHIFR 364
>gi|395539756|ref|XP_003771832.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11 [Sarcophilus
harrisii]
Length = 970
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 124/189 (65%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L+DY++++ GK++++RN +REGLIR R GA + GEV+VFLD+HCEV WL
Sbjct: 559 DLKGELDDYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQ 618
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI+ D + + PVID I T +Y RG F WG+ +K + +P E
Sbjct: 619 PLLVPIHEDHRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHFKWDLVPFSELGG 674
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 675 PEGAIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 734
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 735 CSRVGHIFR 743
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 90/200 (45%), Positives = 123/200 (61%), Gaps = 5/200 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R ECK YP LP AS+++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRNAECKEKSYPTGLPAASIVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS+I RTPA L EIILVDD S DL +L+DY+Q++ GK++++RN + EGLI R
Sbjct: 171 VHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDDYVQKYLPGKIQVVRNEKGEGLIXGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA GEV+VFLD+HCEV WL PLL PI+ D + + PVID I T +Y
Sbjct: 231 IGAAHGTGEVLVFLDSHCEVNKMWLQPLLVPIHEDHRTVVCPVIDIISADTL----MYSS 286
Query: 408 DHHYRGIFEWGMLYKENELP 427
G F W + +K + +P
Sbjct: 287 SPIVCGGFNWDLHFKWDLVP 306
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 118/189 (62%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL +L+DY++++ GK++++RN + EGLI R GA GEV+VFLD+HCEV WL
Sbjct: 197 DLKGELDDYVQKYLPGKIQVVRNEKGEGLIXGRMIGAAHGTGEVLVFLDSHCEVNKMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI+ D + + PVID I T +Y G F W + +K + +P +
Sbjct: 257 PLLVPIHEDHRTVVCPVIDIISADTL----MYSSSPIVCGGFNWDLHFKWDLVPFSKLGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P KSP AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PEGAIAPIKSPAMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 84/169 (49%), Positives = 112/169 (66%), Gaps = 5/169 (2%)
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
YP LP AS+++ F+NE FS+L+RTVHS+I RTPA L EIILVDD S DL +L+D
Sbjct: 507 SYPTGLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDD 566
Query: 323 YIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
Y+Q++ GK++++RN +REGLIR R GA + GEV+VFLD+HCEV WL PLL PI+
Sbjct: 567 YVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQPLLVPIHE 626
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
D + + PVID I T +Y RG F WG+ +K + +P E
Sbjct: 627 DHRTVVCPVIDIISADTL----MYSSSPIVRGGFNWGLHFKWDLVPFSE 671
>gi|383863685|ref|XP_003707310.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like [Megachile
rotundata]
Length = 610
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 95/215 (44%), Positives = 135/215 (62%), Gaps = 3/215 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK L + + + G N S+ IS +R++PD+R +CK Y +
Sbjct: 88 RSGTGEHGKPAFLSPSLDSLKEKLYQVNGFNAALSDEISMNRSVPDIRHPDCKKKKYLKN 147
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
L SVI+ FHNE FS+LMRT S+I R+PA LEEIILVDD S+K +L ++L+DY+ +
Sbjct: 148 LDAVSVIVSFHNEHFSTLMRTCWSVINRSPASLLEEIILVDDASTKVELKKELDDYVAQR 207
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KV++IR +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI + +
Sbjct: 208 LPKVKIIRLPQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANVNWLPPLLEPIAQNYRTCV 267
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 268 CPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 299
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 95/199 (47%), Positives = 127/199 (63%), Gaps = 10/199 (5%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L K+L+DY+ + KV++IR +R GLI+ R GAK ++ +V+VFLD+H E +NWLPP
Sbjct: 195 ELKKELDDYVAQRLPKVKIIRLPQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANVNWLPP 254
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 119
LL PI + + P ID I Y+T+E+R+ D RG F+W + YK L PE
Sbjct: 255 LLEPIAQNYRTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPE----D 307
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
K+ + P+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + P
Sbjct: 308 LKHPTLPFKSPVMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGEMYDAP 367
Query: 180 CSRIVSLIR--PVFKADGK 196
CSR+ + R P F GK
Sbjct: 368 CSRVGHIYRKFPPFPNPGK 386
>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
Length = 579
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 100/252 (39%), Positives = 151/252 (59%), Gaps = 7/252 (2%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
L+R +G+L + P +EGPGE GK L G A + ++ MN+ S+ I
Sbjct: 51 LVRLPETWNGQLHQI-PDYSIPREGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKI 109
Query: 246 SFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 303
S DR +PD R++ CK +DY LPK SVI++F +E ++ L+RTVHS+I R+P + L+E
Sbjct: 110 SLDRDVPDPRIQACKDIKYDYS-SLPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQE 168
Query: 304 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
+IL+DD S + +L + L+++I+RF GKV+LIR R GLIR + GA+E+ G++IVFLD+
Sbjct: 169 VILLDDNSKRQELQEPLDEHIKRFGGKVKLIRKHVRHGLIRAKLAGAREAVGDIIVFLDS 228
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 423
HCE WL P++ I +R + P+ID I T + + G F W + +
Sbjct: 229 HCEANHGWLEPIVQRISDERTAIVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTW 285
Query: 424 NELPEREAKKRK 435
+PE E K+RK
Sbjct: 286 EGIPEDEQKRRK 297
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 120/188 (63%), Gaps = 3/188 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L + L+++I+RF GKV+LIR R GLIR + GA+E+ G++IVFLD+HCE WL P
Sbjct: 180 ELQEPLDEHIKRFGGKVKLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
++ I +R + P+ID I T + + G F W + + +PE E K+R
Sbjct: 240 IVQRISDERTAIVCPMIDSISDSTLAYHGDWSLS---VGGFSWALHFTWEGIPEDEQKRR 296
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++ +SPT AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PC
Sbjct: 297 KKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPC 356
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 357 SHVGHIFR 364
>gi|351712481|gb|EHB15400.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 399
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/239 (41%), Positives = 144/239 (60%), Gaps = 4/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
LEP +P+ EGPGE GK +P+ + +N+ S I+ +R++P+ R+E C
Sbjct: 27 LEPVQKPH-EGPGEMGKPVDIPKEDQEKMKEMFKINQVNLMASEMIALNRSLPNDRLEGC 85
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K YP +LP SV++VFHNE +S+L+RTVHS+I +P +EEI+LVDD + + L +
Sbjct: 86 KTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINCSPRHMVEEIVLVDDANERDFLKRT 145
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE Y+++ V +IR R GLIR R +G S+G+VI+FLDAHCE + WL PLL I
Sbjct: 146 LESYVKKLKVPVHVIRMEHRSGLIRDRLKGDAVSKGQVIIFLDAHCECTVGWLEPLLTRI 205
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
DR+ + P+ID I T F + D Y G F W + ++ +P+RE +RK +R
Sbjct: 206 KQDRRTVVCPIIDVISDDT--FECMAGSDMTYGG-FNWKLNFRWYLVPQREMDRRKGDR 261
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/183 (43%), Positives = 115/183 (62%), Gaps = 4/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR R GLIR R +G S+G+VI+FLDAHCE + WL PL
Sbjct: 142 LKRTLESYVKKLKVPVHVIRMEHRSGLIRDRLKGDAVSKGQVIIFLDAHCECTVGWLEPL 201
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I DR+ + P+ID I T F + D Y G F W + ++ +P+RE +RK
Sbjct: 202 LTRIKQDRRTVVCPIIDVISDDT--FECMAGSDMTYGG-FNWKLNFRWYLVPQREMDRRK 258
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGG F++DR +F E+G YD G+ +WG EN E+SF+IW CGG++E V C
Sbjct: 259 GDRTLPVRTPTMAGGCFSIDRDYFQEIGTYDAGMDIWGRENLEISFRIWQCGGTLEIVTC 318
Query: 181 SRI 183
S +
Sbjct: 319 SHV 321
>gi|115497708|ref|NP_001069909.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Bos taurus]
gi|83405338|gb|AAI11261.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Bos taurus]
gi|440895696|gb|ELR47826.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Bos grunniens mutus]
Length = 448
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/183 (48%), Positives = 122/183 (66%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDYHLEIFRGKIKLIRNKKREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID IDY T E Y+P RG F W + +K + + E +
Sbjct: 245 LLNAIAKDPKMVVCPLIDVIDYMTLE----YQPSPIVRGAFNWRLEFKWDHVLSYEIEGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP AGG+FA++R +F E+G YD G+ +WGGEN ELS +IWMCGG + +PC
Sbjct: 301 EGPTTPIRSPAMAGGIFAINRHYFNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPC 360
Query: 181 SRI 183
SR+
Sbjct: 361 SRV 363
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 122/205 (59%), Gaps = 9/205 (4%)
Query: 218 YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 277
Y +PE YG N S ++ R +PD R C+ YP LP AS+I+ F
Sbjct: 93 YSIPEVIHG-----YSTYGFNSIISKNLGHYRNVPDTRNVMCQKKMYPAKLPTASIIICF 147
Query: 278 HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 337
HNE F++L RT+ SI+ T LEEIILVDD S DL +KL+ +++ F GK++LIRN
Sbjct: 148 HNEEFNALFRTLSSIMTLTQQYILEEIILVDDMSDFDDLKEKLDYHLEIFRGKIKLIRNK 207
Query: 338 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 397
+REGLIR R GA + G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY
Sbjct: 208 KREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYM 267
Query: 398 TWEFRSVYEPDHHYRGIFEWGMLYK 422
T E Y+P RG F W + +K
Sbjct: 268 TLE----YQPSPIVRGAFNWRLEFK 288
>gi|296488205|tpg|DAA30318.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 5 [Bos
taurus]
Length = 447
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/183 (48%), Positives = 122/183 (66%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDYHLEIFRGKIKLIRNKKREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID IDY T E Y+P RG F W + +K + + E +
Sbjct: 245 LLNAIAKDPKMVVCPLIDVIDYMTLE----YQPSPIVRGAFNWRLEFKWDHVLSYEIEGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP AGG+FA++R +F E+G YD G+ +WGGEN ELS +IWMCGG + +PC
Sbjct: 301 EGPTTPIRSPAMAGGIFAINRHYFNEIGQYDKGMNLWGGENLELSLRIWMCGGQLYVIPC 360
Query: 181 SRI 183
SR+
Sbjct: 361 SRV 363
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/205 (45%), Positives = 122/205 (59%), Gaps = 9/205 (4%)
Query: 218 YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVF 277
Y +PE YG N S ++ R +PD R C+ YP LP AS+I+ F
Sbjct: 93 YSIPEVIHG-----YSTYGFNSIISKNLGHYRNVPDTRNVMCQKKMYPAKLPTASIIICF 147
Query: 278 HNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNT 337
HNE F++L RT+ SI+ T LEEIILVDD S DL +KL+ +++ F GK++LIRN
Sbjct: 148 HNEEFNALFRTLSSIMTLTQQYILEEIILVDDMSDFDDLKEKLDYHLEIFRGKIKLIRNK 207
Query: 338 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 397
+REGLIR R GA + G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY
Sbjct: 208 KREGLIRARMTGASHASGDVLVFLDSHCEVNKVWLEPLLNAIAKDPKMVVCPLIDVIDYM 267
Query: 398 TWEFRSVYEPDHHYRGIFEWGMLYK 422
T E Y+P RG F W + +K
Sbjct: 268 TLE----YQPSPIVRGAFNWRLEFK 288
>gi|296210174|ref|XP_002751861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Callithrix jacchus]
Length = 607
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 197 LKGELDEYVQKYLPGKIKIIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 256
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D+ + PVID I T + S RG F WG+ ++ + +P E
Sbjct: 257 LLAAIREDQHTVVCPVIDIISADTLAYSS----SPIVRGGFNWGLHFRWDLVPLSELGGA 312
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 313 EGATTPIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 372
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 373 SRVGHIFR 380
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 110 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRT 169
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS+I RTPA L E+ILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 170 VHSVIDRTPAHLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKIIRNTKREGLIRGRM 229
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I D+ + PVID I T Y
Sbjct: 230 IGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTL----AYSS 285
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ ++ + +P E
Sbjct: 286 SPIVRGGFNWGLHFRWDLVPLSE 308
>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Takifugu rubripes]
Length = 583
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/272 (41%), Positives = 158/272 (58%), Gaps = 14/272 (5%)
Query: 173 GSIEWVPCSRIVSLIRPVFKADGKLGNLEPPL----EPYKEGPGEGGKAYHL---PEAYR 225
SIE +R V R + G G L PL P PGE G+A HL P+ +
Sbjct: 38 SSIEEDEKARHVVAQRMSLEEMGPEGQLARPLYVKPPPDTNAPGELGRAAHLNLSPDEKK 97
Query: 226 AAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFS 283
D S+ Y +N+ S+ IS R I D RM+EC K ++Y LP SVI+ F+NE +S
Sbjct: 98 QEED-SIERYAINIFVSDKISLHRHIQDHRMKECRSKTFNY-RRLPTTSVIIAFYNEAWS 155
Query: 284 SLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLI 343
+L+RT+HS+++ TPA L+EIIL+DDFS +A L +L DYI +VRLIR +REGL+
Sbjct: 156 TLLRTIHSVLETTPAILLKEIILIDDFSDRAYLKSQLADYISNLE-RVRLIRTKKREGLV 214
Query: 344 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 403
R R GA + GEV+ FLD HCE W+ PLL I + + PVID ID+ T+EF
Sbjct: 215 RARLIGATYATGEVLTFLDCHCECVPGWIEPLLERIGENSSTIVCPVIDTIDWNTFEF-- 272
Query: 404 VYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ + G F+W + ++ + +PERE K+RK
Sbjct: 273 YMQTEEPMIGGFDWRLTFQWHSVPERERKRRK 304
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 87/182 (47%), Positives = 117/182 (64%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L DYI +VRLIR +REGL+R R GA + GEV+ FLD HCE W+ PL
Sbjct: 188 LKSQLADYISNLE-RVRLIRTKKREGLVRARLIGATYATGEVLTFLDCHCECVPGWIEPL 246
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + PVID ID+ T+EF + + G F+W + ++ + +PERE K+RK
Sbjct: 247 LERIGENSSTIVCPVIDTIDWNTFEF--YMQTEEPMIGGFDWRLTFQWHSVPERERKRRK 304
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+++ FF LG YD G+ VWGGEN ELSF++W CGGS+E PCS
Sbjct: 305 SPVDPIRSPTMAGGLFAVNKNFFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCS 364
Query: 182 RI 183
+
Sbjct: 365 HV 366
>gi|308457549|ref|XP_003091148.1| CRE-GLY-10 protein [Caenorhabditis remanei]
gi|308258137|gb|EFP02090.1| CRE-GLY-10 protein [Caenorhabditis remanei]
Length = 620
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/231 (43%), Positives = 142/231 (61%), Gaps = 9/231 (3%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 261
E +EGPGE GK +P+ +A L Y G N S+ IS +R+I D+R ++CK
Sbjct: 89 EKAREGPGEWGKPVKVPDDKETEKEA-LSLYKANGYNAYVSDMISLNRSIKDIRHKDCKK 147
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 148 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207
Query: 322 DYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
D+++ + + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD 315
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 109/290 (37%), Positives = 157/290 (54%), Gaps = 37/290 (12%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LED++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLP
Sbjct: 202 LRQPLEDFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLP 261
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 262 PLLDPIAEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD--- 315
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R+ + P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + P
Sbjct: 316 RENPTTPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAP 375
Query: 180 CSRIVSLIRPVFK--ADGKLGNL---------EPPLEPYKEG-----PGEGGKAYHLPEA 223
CSR+ + R + + +G+ E ++ YKE PG G
Sbjct: 376 CSRVAHIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDEYKETLYKHRPGVGSA------- 428
Query: 224 YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASV 273
AGD L + G+ E SFD + ++ ++ KY YP PKAS
Sbjct: 429 --DAGDLKLMK-GVR-EKLQCKSFDWFMKEIAFDQDKY--YPAVEPKASA 472
>gi|308452095|ref|XP_003088913.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
gi|308244364|gb|EFO88316.1| hypothetical protein CRE_04439 [Caenorhabditis remanei]
Length = 620
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 101/231 (43%), Positives = 142/231 (61%), Gaps = 9/231 (3%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKY 261
E +EGPGE GK +P+ +A L Y G N S+ IS +R+I D+R ++CK
Sbjct: 89 EKAREGPGEWGKPVKVPDDKETEKEA-LSLYKANGYNAYVSDMISLNRSIKDIRHKDCKK 147
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y LP SVI FH E S+L+R+V+S+I R+P + L+EIILVDDFS K L Q LE
Sbjct: 148 MMYSAKLPTVSVIFPFHEEHNSTLLRSVYSVINRSPPELLKEIILVDDFSEKPALRQPLE 207
Query: 322 DYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
D+++ + + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLPPLL PI
Sbjct: 208 DFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLPPLLDPI 267
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 268 AEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD 315
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 109/290 (37%), Positives = 157/290 (54%), Gaps = 37/290 (12%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LED++++ + V+++R +REGLIR R GA+E+ GE+++FLDAH E NWLP
Sbjct: 202 LRQPLEDFLKKNKIDHIVKVLRTKKREGLIRGRQLGAQEATGEILIFLDAHSECNYNWLP 261
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI D + + P +D ID +T+E R D RG F+W YK L +++
Sbjct: 262 PLLDPIAEDYRTVVCPFVDVIDCETYEIRP---QDEGARGSFDWAFNYKRLPLTKKD--- 315
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R+ + P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFK+W C G + P
Sbjct: 316 RENPTTPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAP 375
Query: 180 CSRIVSLIRPVFK--ADGKLGNL---------EPPLEPYKEG-----PGEGGKAYHLPEA 223
CSR+ + R + + +G+ E ++ YKE PG G
Sbjct: 376 CSRVAHIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDEYKETLYKHRPGVGSA------- 428
Query: 224 YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASV 273
AGD L + G+ E SFD + ++ ++ KY YP PKAS
Sbjct: 429 --DAGDLKLMK-GVR-EKLQCKSFDWFMKEIAFDQDKY--YPAVEPKASA 472
>gi|296488074|tpg|DAA30187.1| TPA: polypeptide N-acetylgalactosaminyltransferase 11-like [Bos
taurus]
Length = 605
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++YI+++ GK+++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 196 LKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 255
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 256 LLAAIREDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 311
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 312 EGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 371
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 372 SRVGHIFR 379
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP AS+++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASIVICFYNEALSALLRT 168
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 284
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSE 307
>gi|358412070|ref|XP_870404.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Bos taurus]
gi|359064998|ref|XP_002687097.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Bos
taurus]
Length = 606
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++YI+++ GK+++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 196 LKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 255
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 256 LLAAIREDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 311
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 312 EGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 371
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 372 SRVGHIFR 379
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 96/203 (47%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP AS+++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASIVICFYNEALSALLRT 168
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTVVCPVIDIISADTL----AYSS 284
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSE 307
>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
Length = 592
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/228 (41%), Positives = 137/228 (60%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE GK +P + + E N+ S+ I +R++ D+R +CK YP LP
Sbjct: 82 PGELGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPTKLPT 141
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L Q+LEDY+Q
Sbjct: 142 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDHLGQQLEDYVQTLPVH 201
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 202 TYVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 261
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I +T+E+ V D + G F W + ++ +P RE ++R ++R
Sbjct: 262 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDR 306
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY++ ++R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 187 LGQQLEDYVQTLPVHTYVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 246
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I +T+E+ V D + G F W + ++ +P RE ++R
Sbjct: 247 LARIVLDRKTVVCPIIDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRN 303
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PC
Sbjct: 304 HDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPC 363
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 364 SHVGHVFR 371
>gi|440895697|gb|ELR47827.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Bos grunniens
mutus]
Length = 606
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++YI+++ GK+++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 196 LKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 255
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 256 LLAAIREDRQTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 311
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 312 EGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 371
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 372 SRVGHIFR 379
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRT 168
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRQTVVCPVIDIISADTL----AYSS 284
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSE 307
>gi|390361781|ref|XP_790897.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Strongylocentrotus purpuratus]
Length = 521
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 128/188 (68%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+DY+ F KV++IR +REGLIRTR GA + G+V++FLD+HCE NWLPPL
Sbjct: 106 LKGPLDDYMSAFP-KVKIIRLEKREGLIRTRLLGAGPATGDVVLFLDSHCEANYNWLPPL 164
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R+ + P+ID I + + + S + RG F+W + YK + E E K+R
Sbjct: 165 LERIALNRRRIVCPMIDVISNEDFHYES--QAGDVMRGAFDWELYYKRIPISEAENKRRS 222
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFL-ELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S+P+++P AGGLFA+DR +F+ ELGGYD GL +WGGE ++LSFK+WMCGG +E +PC
Sbjct: 223 HESDPFRTPIMAGGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVWMCGGEMEEIPC 282
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 283 SRVGHIYR 290
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 99/226 (43%), Positives = 132/226 (58%), Gaps = 3/226 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G A L + G N S+ IS DR +PD+R CK Y LP
Sbjct: 1 PGERGVAVKLTPEMKKTEKKDTSANGFNERVSDMISMDRALPDIRNPRCKEITYLAKLPN 60
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
SVI+ FHNE S+L RTVHSI R+P + + EIILVDDFS +A L L+DY+ F K
Sbjct: 61 VSVIIPFHNEALSTLKRTVHSIFNRSPPELIHEIILVDDFSDRAYLKGPLDDYMSAFP-K 119
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V++IR +REGLIRTR GA + G+V++FLD+HCE NWLPPLL I +R+ + P+
Sbjct: 120 VKIIRLEKREGLIRTRLLGAGPATGDVVLFLDSHCEANYNWLPPLLERIALNRRRIVCPM 179
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
ID I + + + S + RG F+W + YK + E E K+R +
Sbjct: 180 IDVISNEDFHYES--QAGDVMRGAFDWELYYKRIPISEAENKRRSH 223
>gi|426228257|ref|XP_004008230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Ovis
aries]
Length = 606
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 92/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++YI+++ GK+++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 196 LKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 255
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 256 LLAAIREDRRAVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 311
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 312 EGATAPIKSPTMAGGLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 371
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 372 SRVGHIFR 379
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 97/203 (47%), Positives = 130/203 (64%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP+DLP ASV++ F+NE S+L+RT
Sbjct: 109 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRT 168
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA+ L EIILVDD S DL +L++YIQ++ GK+++IRN +REGLIR R
Sbjct: 169 VHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRM 228
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 229 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRAVVCPVIDIISADTL----AYSS 284
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 285 SPVVRGGFNWGLHFKWDLVPLSE 307
>gi|256083753|ref|XP_002578103.1| peptidase [Schistosoma mansoni]
Length = 1860
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 94/189 (49%), Positives = 124/189 (65%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L ++L++Y+ R G V +I EREGLIR R GAK + G+V++FLD+HCE +NWLP
Sbjct: 110 LKERLDNYLSRTYPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLP 169
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI + + +T P ID ID T+E+R+ D RG F+W YK LP R +
Sbjct: 170 PLLDPISKNYRTVTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYKR--LP-RLSTD 223
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P++SP AGGLFA+ R +F ELGGYDP L +WGGE +ELSFKIWMCGG + VP
Sbjct: 224 SLHPETPFESPVMAGGLFAISRKWFWELGGYDPLLHIWGGEQYELSFKIWMCGGRLIDVP 283
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 284 CSRVGHIFR 292
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 132/217 (60%), Gaps = 5/217 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
++GPGE G L + +A +L G N+ S I DR++ D+R CK Y
Sbjct: 2 RQGPGENGLPVRLSNSQKALSKKTLNFNGFNIFVSEKIKTDRSVKDIRYPNCKGALYSKQ 61
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ + E + +L+RTV S++ R+P + ++E+ILVDD SS+ L ++L++Y+ R
Sbjct: 62 LPLVSIIIPVYEEHWETLIRTVVSVLNRSPLELIKEVILVDDGSSRRYLKERLDNYLSRT 121
Query: 328 --NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
G V +I EREGLIR R GAK + G+V++FLD+HCE +NWLPPLL PI + +
Sbjct: 122 YPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLPPLLDPISKNYRT 181
Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+T P ID ID T+E+R+ D RG F+W YK
Sbjct: 182 VTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYK 215
>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6-like [Hydra
magnipapillata]
Length = 591
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/236 (42%), Positives = 140/236 (59%), Gaps = 6/236 (2%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
L P G G+A +A D S +YG N S+ IS +R+IPD R C D
Sbjct: 67 LNPEPGSAGMEGQAVSNSVNEKAIEDKSFDDYGFNELASSKISLERSIPDNRDSSCFNVD 126
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK---ADLDQKL 320
YP+ L SVI++FHNE +S L+RTVH+++ R+P L+EIILVDD S K L +KL
Sbjct: 127 YPVKLSTTSVIVIFHNEAWSVLLRTVHTVLARSPPHMLKEIILVDDASVKEKYGHLGEKL 186
Query: 321 EDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIY 380
E+Y+ + KV+LIR+ R GL + R GA + GEV+VFLD+HCE WL PLLA +
Sbjct: 187 ENYVNTLS-KVKLIRSPVRVGLTQARLIGADNAVGEVLVFLDSHCEASFGWLEPLLARLQ 245
Query: 381 SDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 436
+ K+ VP I+ I ++ +E+ S E + RGIF W +++ LP RE +RKY
Sbjct: 246 ENPKLAVVPDIEVISFKNFEYSS--EKGSYNRGIFSWELMFNWGPLPPREKMRRKY 299
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 91/196 (46%), Positives = 123/196 (62%), Gaps = 12/196 (6%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ + KV+LIR+ R GL + R GA + GEV+VFLD+HCE WL PL
Sbjct: 182 LGEKLENYVNTLS-KVKLIRSPVRVGLTQARLIGADNAVGEVLVFLDSHCEASFGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA + + K+ VP I+ I ++ +E+ S E + RGIF W +++ LP RE +RK
Sbjct: 241 LARLQENPKLAVVPDIEVISFKNFEYSS--EKGSYNRGIFSWELMFNWGPLPPREKMRRK 298
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYD---------PGLLVWGGENFELSFKIWMCG 172
Y S+P KSPT AGGLFAM+R +F E G YD L WGGEN E+SF++WMCG
Sbjct: 299 YESDPIKSPTMAGGLFAMNRKYFFESGAYDRQNILGRXXXXLTYWGGENVEMSFRLWMCG 358
Query: 173 GSIEWVPCSRIVSLIR 188
IE +PCSR+ + R
Sbjct: 359 EGIEIIPCSRVGHVFR 374
>gi|417403257|gb|JAA48441.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 608
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 122/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L+++++ + GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV WL P
Sbjct: 198 LKGQLDEFVQTQLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNTMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLATIQEDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLIPPSELGGP 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 GGATAPIKSPTMAGGLFAMNRDYFDELGRYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 95/203 (46%), Positives = 128/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNLLISNRLGYHRDVPDTRSAACKDETYPEDLPVASVVICFYNEALSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 347
VHS++ RTPAQ L E+ILVDD S DL +L++++Q + GK+++IRNT+REGLIR R
Sbjct: 171 VHSVLDRTPAQLLREVILVDDDSDFDDLKGQLDEFVQTQLPGKIKVIRNTKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV WL PLLA I DR+ + PVID I T Y
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNTMWLQPLLATIQEDRRTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLIPPSE 309
>gi|355689592|gb|AER98884.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mustela putorius
furo]
Length = 609
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 94/191 (49%), Positives = 124/191 (64%), Gaps = 8/191 (4%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG---LNW 57
L +LE+Y++++ GK+++IRN +REGLIR R GA S GEV+VFLD+HCEV L W
Sbjct: 198 LKGELEEYVQKYLPGKIKVIRNAKREGLIRGRMIGAAHSTGEVLVFLDSHCEVNVMWLMW 257
Query: 58 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 117
L PLLA I DR+ + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LQPLLAAIQQDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSEL 313
Query: 118 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 177
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG +
Sbjct: 314 GGPEGATAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 373
Query: 178 VPCSRIVSLIR 188
+PCSR+ + R
Sbjct: 374 IPCSRVGHIFR 384
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 106/251 (42%), Positives = 147/251 (58%), Gaps = 14/251 (5%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 243
V ++ K+ ++ ++ + E P +G + E + D ++ NM SN
Sbjct: 66 VLESQFKVNKIDDTVDNHVEDPEKGNMKFSSELGMIFNERDQELRDLGYQKHAFNMLISN 125
Query: 244 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 303
+ + R +PD R CK YP+DLP ASV++ F+NE S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYHRDVPDTRNAACKDKSYPVDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185
Query: 304 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
IILVDD S DL +LE+Y+Q++ GK+++IRN +REGLIR R GA S GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNAKREGLIRGRMIGAAHSTGEVLVFLD 245
Query: 363 AHCEVG---LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 419
+HCEV L WL PLLA I DR+ + PVID I T Y RG F WG+
Sbjct: 246 SHCEVNVMWLMWLQPLLAAIQQDRRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGL 301
Query: 420 LYKENELPERE 430
+K + +P E
Sbjct: 302 HFKWDLVPLSE 312
>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Anolis carolinensis]
Length = 498
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 93/202 (46%), Positives = 132/202 (65%), Gaps = 3/202 (1%)
Query: 237 MNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRT 296
N+ S+ I+ +R++PD+R+E CK YP +LP SV++VFHNE +S+L+RT++S+I R
Sbjct: 10 FNLMASDMIALNRSLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTIYSVINRA 69
Query: 297 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 356
P L EIILVDD S + L LE+Y++ V+++R +R GLIR R RGA S+G+
Sbjct: 70 PHYLLAEIILVDDASERDFLKVPLENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQ 129
Query: 357 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 416
VI FLDAHCE L WL PLLA I DRKI+ P+ID I T+E+ + D Y G F
Sbjct: 130 VITFLDAHCECTLGWLEPLLARIKEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FN 186
Query: 417 WGMLYKENELPEREAKKRKYNR 438
W + ++ +P+RE +RK +R
Sbjct: 187 WKLNFRWYPVPQREMDRRKGDR 208
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 88/184 (47%), Positives = 123/184 (66%), Gaps = 4/184 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+Y++ V+++R +R GLIR R RGA S+G+VI FLDAHCE L WL PLLA I
Sbjct: 93 LENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 152
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 124
DRKI+ P+ID I T+E+ + D Y G F W + ++ +P+RE +RK + +
Sbjct: 153 KEDRKIVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRT 209
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS +
Sbjct: 210 LPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVG 269
Query: 185 SLIR 188
+ R
Sbjct: 270 HVFR 273
>gi|348568063|ref|XP_003469818.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Cavia porcellus]
Length = 499
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 99/212 (46%), Positives = 128/212 (60%), Gaps = 9/212 (4%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PG+ Y PE + EYG N+ S + DR +PD R + C++ YPL LP
Sbjct: 138 PGDQNINYSDPELFNG-----YLEYGFNVIVSRSLGHDREVPDTRDKSCRHRHYPLHLPT 192
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
ASVI+ FHNE F++L+RTV S++ TP LEEIILVDD S DL KL Y++ F K
Sbjct: 193 ASVIICFHNEEFNALLRTVSSVVYLTPPYLLEEIILVDDMSKFDDLKSKLNYYLESFRDK 252
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V+L+RN +REGLIR R GA + GEV+VFLD+HCEV WL PLLA I D + + PV
Sbjct: 253 VQLVRNKKREGLIRARMIGAWYASGEVLVFLDSHCEVNRVWLEPLLAAISKDSRTVVTPV 312
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
ID ID + + Y P RG F+W + +K
Sbjct: 313 IDIIDGISLQ----YLPSPLVRGAFDWKLQFK 340
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/198 (46%), Positives = 119/198 (60%), Gaps = 4/198 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL Y+E F KV+L+RN +REGLIR R GA + GEV+VFLD+HCEV WL P
Sbjct: 237 DLKSKLNYYLESFRDKVQLVRNKKREGLIRARMIGAWYASGEVLVFLDSHCEVNRVWLEP 296
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D + + PVID ID + + Y P RG F+W + +K + + E
Sbjct: 297 LLAAISKDSRTVVTPVIDIIDGISLQ----YLPSPLVRGAFDWKLQFKWDSVFSYETDSE 352
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P +SP AGG+FAM R FF ELG YD + +WGGEN ELS +IWMCGG + +PC
Sbjct: 353 GSPTNPIRSPAMAGGIFAMHRPFFYELGEYDKDMDLWGGENLELSLRIWMCGGQLLIIPC 412
Query: 181 SRIVSLIRPVFKADGKLG 198
SR+ + + K D L
Sbjct: 413 SRVGHITKLYSKPDSALS 430
>gi|354478320|ref|XP_003501363.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Cricetulus griseus]
Length = 435
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 90/202 (44%), Positives = 129/202 (63%), Gaps = 9/202 (4%)
Query: 221 PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNE 280
PE Y+ +YG+N+ S + R +PD R + C YP +LP AS+I+ FHNE
Sbjct: 76 PEFYKG-----FAQYGLNVVISRRLGIQREVPDSRDKICHQKHYPFNLPTASIIICFHNE 130
Query: 281 GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE 340
F++L+RTV S+I TP+ +LEEIILVDD S DL +KL+ +++ F GK++LIRN +RE
Sbjct: 131 EFNTLLRTVSSVINLTPSHFLEEIILVDDMSDTDDLKEKLDYHLELFRGKIKLIRNKKRE 190
Query: 341 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 400
GLIR+R GA + G+++VFLD+HCEV WL PLL I D K++ P+ID ID T +
Sbjct: 191 GLIRSRMIGASRASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPMIDVIDDTTLD 250
Query: 401 FRSVYEPDHHYRGIFEWGMLYK 422
Y RG F+W ++++
Sbjct: 251 ----YTAAPLVRGAFDWDLMFR 268
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 121/183 (66%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN +REGLIR+R GA + G+++VFLD+HCEV WL P
Sbjct: 165 DLKEKLDYHLELFRGKIKLIRNKKREGLIRSRMIGASRASGDILVFLDSHCEVNRVWLEP 224
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID T + Y RG F+W ++++ + + E
Sbjct: 225 LLHAIAKDHKMVVCPMIDVIDDTTLD----YTAAPLVRGAFDWDLMFRWDNVFSYEMDGP 280
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S+P +SP AGG+FA+DR +F ELG YD + +WGGEN ELS +IWMCGG + +PC
Sbjct: 281 EGTSKPIRSPAMAGGIFAIDRHYFTELGQYDKDMDLWGGENVELSLRIWMCGGQLFILPC 340
Query: 181 SRI 183
SR+
Sbjct: 341 SRV 343
>gi|156364641|ref|XP_001626455.1| predicted protein [Nematostella vectensis]
gi|156213331|gb|EDO34355.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 90/193 (46%), Positives = 128/193 (66%), Gaps = 5/193 (2%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
+G N+ SN +S RTI D R E C+ YP +LP AS+++ F+NE ++ L+RT+HS++
Sbjct: 21 HGFNLLISNRLSLHRTIKDTRHELCRGKTYPKNLPVASIVICFYNEAWTILLRTIHSVLD 80
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP Q+L EIILVDDFS+ +L KL+ Y+ K+R++RN +REGLIR R GA+ +
Sbjct: 81 RTPHQFLHEIILVDDFSNMLELKSKLDRYLSTMP-KIRIVRNNKREGLIRGRIIGAEAAT 139
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VFLD+HCEV +NWL PLL I+ D+K + PVID I T+E+ S RG
Sbjct: 140 GQVLVFLDSHCEVNINWLQPLLQHIHDDQKAVACPVIDVISSDTFEYSS----SPMVRGG 195
Query: 415 FEWGMLYKENELP 427
F WG+ + +P
Sbjct: 196 FNWGLHFTWEPIP 208
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L KL+ Y+ K+R++RN +REGLIR R GA+ + G+V+VFLD+HCEV +NWL P
Sbjct: 101 ELKSKLDRYLSTMP-KIRIVRNNKREGLIRGRIIGAEAATGQVLVFLDSHCEVNINWLQP 159
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I+ D+K + PVID I T+E+ S RG F WG+ + +P K
Sbjct: 160 LLQHIHDDQKAVACPVIDVISSDTFEYSS----SPMVRGGFNWGLHFTWEPIPPSLLVKP 215
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P +SPT AGGLFA+DR +F +LG YD G+ +WG EN E+SF+IWMCGGS++ +PC
Sbjct: 216 EDYVKPIRSPTMAGGLFAVDREYFTQLGKYDSGMDIWGAENLEISFRIWMCGGSLDILPC 275
Query: 181 SRIVSLIR 188
SR+ L R
Sbjct: 276 SRVGHLFR 283
>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
Length = 579
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 100/251 (39%), Positives = 149/251 (59%), Gaps = 7/251 (2%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
L+R +G+L + P +EGPGE GK L G A + ++ MN+ S+ I
Sbjct: 51 LVRLPETWNGELHQI-PNYTAPREGPGEKGKPVVLTGKDAELGQADMKKWFMNVHASDKI 109
Query: 246 SFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 303
S DR +PD R++ CK +DY LPK SVI++F +E ++ L+RTVHS+I R+P + L+E
Sbjct: 110 SLDRDVPDPRIQACKDIKYDYAA-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQE 168
Query: 304 IILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDA 363
+IL+DD S + +L + L+++I+RF GKVRLIR R GLIR + GA+E+ G++IVFLD+
Sbjct: 169 VILLDDNSKRQELQEPLDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDS 228
Query: 364 HCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 423
HCE WL P++ I +R + P+ID I T + + G F W + +
Sbjct: 229 HCEANHGWLEPIVQRISDERTAIVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTW 285
Query: 424 NELPEREAKKR 434
L E E K+R
Sbjct: 286 EGLSEEEQKRR 296
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 3/188 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L + L+++I+RF GKVRLIR R GLIR + GA+E+ G++IVFLD+HCE WL P
Sbjct: 180 ELQEPLDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
++ I +R + P+ID I T + + G F W + + L E E K+R
Sbjct: 240 IVQRISDERTAIVCPMIDSISDNTLAYHGDWSLS---TGGFSWALHFTWEGLSEEEQKRR 296
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ +SPT AGGL A +R +F E+GGYD + +WGGEN E+SF+ WMCGGSIE++PC
Sbjct: 297 TKPTDYIRSPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPC 356
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 357 SHVGHIFR 364
>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
Length = 499
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 91/201 (45%), Positives = 132/201 (65%), Gaps = 3/201 (1%)
Query: 238 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
N+ S I+ +R++PD+R+E CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P
Sbjct: 25 NLMASEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSP 84
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
+EEI+LVDD S + L + LE Y+++ V +IR +R GLIR R +GA S+G+V
Sbjct: 85 RHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQV 144
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 417
I FLDAHCE + WL PLLA I DR+ + P+ID I T+E+ + D Y G F W
Sbjct: 145 ITFLDAHCECTVGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 201
Query: 418 GMLYKENELPEREAKKRKYNR 438
+ ++ +P+RE +RK +R
Sbjct: 202 KLNFRWYPVPQREMDRRKGDR 222
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 103 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 162
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 163 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 219
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 220 GDRTLPVRTPTMAGGLFSIDIDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 279
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 280 SHVGHVFR 287
>gi|410953276|ref|XP_003983298.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Felis catus]
Length = 527
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +LE+Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 117 LKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 176
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D + + PVID I T Y RG F WG+ +K + +P E
Sbjct: 177 LLAAIREDPRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 232
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 233 EGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 292
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 293 SRVGHIFR 300
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 98/203 (48%), Positives = 129/203 (63%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 30 DLGYQKHAFNMLISNRLGYRRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRT 89
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPAQ L EIILVDD S DL +LE+Y+Q++ GK+++IRNT+REGLIR R
Sbjct: 90 VHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRM 149
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I D + + PVID I T Y
Sbjct: 150 IGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTL----AYSS 205
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 206 SPVVRGGFNWGLHFKWDLVPLSE 228
>gi|410953274|ref|XP_003983297.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Felis catus]
Length = 608
Score = 182 bits (461), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +LE+Y++++ GK+++IRNT+REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D + + PVID I T Y RG F WG+ +K + +P E
Sbjct: 258 LLAAIREDPRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFKWDLVPLSELGGP 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIRSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 104/248 (41%), Positives = 146/248 (58%), Gaps = 11/248 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYH------LPEAYRAAGDASLGEYGMNMETSN 243
V ++ K+ ++ ++ + E P +G + E + D ++ NM SN
Sbjct: 66 VLESQFKVNRIDDMIDNHVEDPEKGNTKFSSELGMIFDERDQELRDLGYQKHAFNMLISN 125
Query: 244 HISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEE 303
+ + R +PD R CK YP DLP ASV++ F+NE S+L+RTVHS++ RTPAQ L E
Sbjct: 126 RLGYRRDVPDTRNAACKDKSYPADLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHE 185
Query: 304 IILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
IILVDD S DL +LE+Y+Q++ GK+++IRNT+REGLIR R GA + GEV+VFLD
Sbjct: 186 IILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLD 245
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+HCEV + WL PLLA I D + + PVID I T Y RG F WG+ +K
Sbjct: 246 SHCEVNVLWLQPLLAAIREDPRTVVCPVIDIISADTL----AYSSSPVVRGGFNWGLHFK 301
Query: 423 ENELPERE 430
+ +P E
Sbjct: 302 WDLVPLSE 309
>gi|311275140|ref|XP_003134592.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Sus scrofa]
Length = 446
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 87/197 (44%), Positives = 126/197 (63%), Gaps = 4/197 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+++VFLD+HCEV WL P
Sbjct: 184 DLKEKLDYHLEIFRGKIKVIRNKKREGLIRARLVGASRASGDILVFLDSHCEVNKIWLEP 243
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P++D IDY T E Y+P RG+F W + ++ + + E
Sbjct: 244 LLDAIVKDPKMVVCPIMDVIDYVTLE----YKPSPVVRGVFNWHLQFEWDRVFSYEMDGP 299
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P +SP GGLFA+ R +F E+G YD G+ +WGGEN ELS +IWMCGG + +PC
Sbjct: 300 DGPTRPIRSPAMVGGLFAIHRHYFNEIGQYDKGMNLWGGENLELSLRIWMCGGQLFLLPC 359
Query: 181 SRIVSLIRPVFKADGKL 197
SR+ + +P F G++
Sbjct: 360 SRVGHINKPYFTNQGEI 376
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 122/189 (64%), Gaps = 4/189 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
+YG N S + R +PD R + C YP +LP AS+I+ FHNE F++L+RTV SI+
Sbjct: 103 KYGFNHIVSKSLGNYRNVPDSRNKMCHQKHYPANLPTASIIICFHNEEFNALLRTVSSIM 162
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
TP +EEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA +
Sbjct: 163 TLTPHHIIEEIILVDDMSEYDDLKEKLDYHLEIFRGKIKVIRNKKREGLIRARLVGASRA 222
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
G+++VFLD+HCEV WL PLL I D K++ P++D IDY T E Y+P RG
Sbjct: 223 SGDILVFLDSHCEVNKIWLEPLLDAIVKDPKMVVCPIMDVIDYVTLE----YKPSPVVRG 278
Query: 414 IFEWGMLYK 422
+F W + ++
Sbjct: 279 VFNWHLQFE 287
>gi|449270901|gb|EMC81545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Columba livia]
Length = 608
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 121/189 (64%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL K L++Y++ + +L+RN +REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 197 DLKKDLDEYVKTQLPKTTKLVRNEKREGLIRGRMIGASHATGQVLVFLDSHCEVNEMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI DR+ + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 257 PLLTPIREDRRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLSELEG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 102/241 (42%), Positives = 141/241 (58%), Gaps = 8/241 (3%)
Query: 192 KADGKLGN-LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 250
K LGN ++ P++ E E G ++ E + D ++ NM SN + + R
Sbjct: 75 KIGNALGNHVQDPVKGEVEFSPEMGMIFN--EEDQEVRDLGYQKHAFNMLISNRLGYHRE 132
Query: 251 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
+PD R +C+ YP DLP ASVI+ F+NE S+L+RTVHS++ RTPA L EIILVDD
Sbjct: 133 VPDTRDVKCREKSYPSDLPSASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDN 192
Query: 311 SSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S ADL + L++Y++ + +L+RN +REGLIR R GA + G+V+VFLD+HCEV
Sbjct: 193 SELADLKKDLDEYVKTQLPKTTKLVRNEKREGLIRGRMIGASHATGQVLVFLDSHCEVNE 252
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLL PI DR+ + PVID I T Y RG F WG+ +K + +P
Sbjct: 253 MWLQPLLTPIREDRRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLS 308
Query: 430 E 430
E
Sbjct: 309 E 309
>gi|380024967|ref|XP_003696256.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
florea]
Length = 611
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 132/218 (60%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 86 EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 145
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+LMRT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 146 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 205
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 206 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 265
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 266 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 300
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 123/197 (62%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPL
Sbjct: 197 LKKTLDDYVATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPL 256
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D K P ID I Y+T+E+R+ D RG F+W + YK L + +
Sbjct: 257 LEPIAQDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN-- 311
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCS
Sbjct: 312 -PTEPFKSPIMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCS 370
Query: 182 RIVSLIR--PVFKADGK 196
R+ + R P F GK
Sbjct: 371 RVGHIYRKFPPFPNPGK 387
>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Acyrthosiphon pisum]
Length = 615
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 138/237 (58%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
PP+ + GEGG+ + A E N+ S+ IS +R++ D+R ECK
Sbjct: 103 PPVREKRGKHGEGGRGVTMKPEQEALMKQKFKENQFNIIASDMISLNRSLQDIRQGECKS 162
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
YP +P S+++VFHNE +S+L+RTV S+I R+P L+EI+LVDD S + L +KLE
Sbjct: 163 KQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGKKLE 222
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
DY+ + +++R +R GLIR R GAK G+VI FLDAHCE WL PLLA I
Sbjct: 223 DYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLARIVL 282
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+RK + PVID I T+E+ V D + G F W + ++ +P+RE +R +R
Sbjct: 283 NRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRNQDR 336
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLEDY+ + +++R +R GLIR R GAK G+VI FLDAHCE WL PL
Sbjct: 217 LGKKLEDYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPL 276
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +RK + PVID I T+E+ V D + G F W + ++ +P+RE +R
Sbjct: 277 LARIVLNRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRN 333
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F +LG YD G+ +WGGEN E+SF+IWMCGG++E PC
Sbjct: 334 QDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRIWMCGGTLEISPC 393
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 394 SHVGHVFR 401
>gi|350400046|ref|XP_003485719.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus impatiens]
Length = 643
Score = 181 bits (460), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +K+E YI FNGKV+ + +REGLIR R GA+++ GE+++FLD+H EV W+ P
Sbjct: 221 LHEKIETYIANNFNGKVKFFKTEKREGLIRARMFGARKATGEILIFLDSHIEVNKRWIEP 280
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I + I+ +PVID I+ T++ Y RG F WG+ +K + +P
Sbjct: 281 LLSQIAHSKTIIAMPVIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVPVGTFAHD 336
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P KSPT AGGLFAMDR +F +LG YD G+ +WGGEN E+SF+IWMCGGSIE +PC
Sbjct: 337 EDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPC 396
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 397 SRVGHVFR 404
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/249 (38%), Positives = 146/249 (58%), Gaps = 12/249 (4%)
Query: 185 SLIRPVFK-ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNM 239
+L R FK +D L L+P P++P +G E G + + + D Y N+
Sbjct: 87 NLRRNAFKNSDKLLQQLQPVPVKPAVTLGQGLDELGMVKNFEDQRKR--DEGYKNYSFNI 144
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S++I R +PD R + C+ Y LP AS+++ F+NE + +L+R++HSII RTPA
Sbjct: 145 LVSDNIGLHRELPDTRHKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPAS 204
Query: 300 YLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILV+D+S L +K+E YI FNGKV+ + +REGLIR R GA+++ GE++
Sbjct: 205 LLHEIILVNDWSDSKALHEKIETYIANNFNGKVKFFKTEKREGLIRARMFGARKATGEIL 264
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
+FLD+H EV W+ PLL+ I + I+ +PVID I+ T++ Y RG F WG
Sbjct: 265 IFLDSHIEVNKRWIEPLLSQIAHSKTIIAMPVIDIINPDTFQ----YTGSPLVRGGFNWG 320
Query: 419 MLYKENELP 427
+ +K + +P
Sbjct: 321 LHFKWDNVP 329
>gi|260793003|ref|XP_002591503.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
gi|229276709|gb|EEN47514.1| hypothetical protein BRAFLDRAFT_105269 [Branchiostoma floridae]
Length = 618
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 97/225 (43%), Positives = 136/225 (60%), Gaps = 5/225 (2%)
Query: 215 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 274
G + L E + DA ++ N S+ IS R + + R ECK YP DLP SVI
Sbjct: 178 GVSLTLTEEEKKLEDAGYKQHAFNEYASSKISLHRRLHEARHAECKDTRYPHDLPTTSVI 237
Query: 275 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 334
+ FHNE +S+L+RTVHS+++ +PA+ LEE++LVDDFS L + LE Y+ + VRL+
Sbjct: 238 ITFHNEAWSTLLRTVHSVLETSPARLLEEVVLVDDFSELDHLKEPLEMYLSQLRA-VRLV 296
Query: 335 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 394
R T REGLIR R GA +RGEV+ FLD+HCE WL PLLA I +R + PVID +
Sbjct: 297 RTTRREGLIRARLLGAAHARGEVLTFLDSHCECHEQWLEPLLARIAENRSNVVTPVIDVL 356
Query: 395 DYQTWEFRSVYEPDHH----YRGIFEWGMLYKENELPEREAKKRK 435
D++T+E++ E RGIF+W + + +P+ E +RK
Sbjct: 357 DWKTFEYQHTMEVYLFGLGVQRGIFDWRLTFTWGLIPDYERSRRK 401
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 78/170 (45%), Positives = 108/170 (63%), Gaps = 5/170 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ + VRL+R T REGLIR R GA +RGEV+ FLD+HCE WL PL
Sbjct: 279 LKEPLEMYLSQLRA-VRLVRTTRREGLIRARLLGAAHARGEVLTFLDSHCECHEQWLEPL 337
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH----YRGIFEWGMLYKENELPEREA 117
LA I +R + PVID +D++T+E++ E RGIF+W + + +P+ E
Sbjct: 338 LARIAENRSNVVTPVIDVLDWKTFEYQHTMEVYLFGLGVQRGIFDWRLTFTWGLIPDYER 397
Query: 118 KKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 167
+RK +P +SPT AGGLFA+D+ +F +G YD G+ VWGGEN E+SF+
Sbjct: 398 SRRKSPVDPVRSPTMAGGLFAIDKWYFEHIGTYDAGMDVWGGENLEMSFR 447
>gi|380024969|ref|XP_003696257.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
florea]
Length = 598
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 132/218 (60%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 73 EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 132
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+LMRT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 133 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 192
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 193 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 252
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 253 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 287
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 123/197 (62%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPL
Sbjct: 184 LKKTLDDYVATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPL 243
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D K P ID I Y+T+E+R+ D RG F+W + YK L + +
Sbjct: 244 LEPIAQDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN-- 298
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCS
Sbjct: 299 -PTEPFKSPIMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCS 357
Query: 182 RIVSLIR--PVFKADGK 196
R+ + R P F GK
Sbjct: 358 RVGHIYRKFPPFPNPGK 374
>gi|328781647|ref|XP_003250009.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 1 [Apis
mellifera]
Length = 611
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 123/197 (62%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPL
Sbjct: 197 LKKTLDDYVATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPL 256
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D K P ID I Y+T+E+R+ D RG F+W + YK L + +
Sbjct: 257 LEPIAQDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQNP- 312
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCS
Sbjct: 313 --TEPFKSPVMAGGLFAISSKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCS 370
Query: 182 RIVSLIR--PVFKADGK 196
R+ + R P F GK
Sbjct: 371 RVGHIYRKFPPFPNPGK 387
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 132/218 (60%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 86 EAKRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 145
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+L+RT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 146 LRNLDSVSVIVSFHNEHFSTLIRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 205
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 206 ATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 265
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 266 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 300
>gi|432092278|gb|ELK24901.1| Polypeptide N-acetylgalactosaminyltransferase-like 6 [Myotis
davidii]
Length = 363
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 90/179 (50%), Positives = 123/179 (68%), Gaps = 5/179 (2%)
Query: 10 IERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 69
+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I +
Sbjct: 1 MARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNH 59
Query: 70 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKS 129
K + P+ID ID+ + + + + RG F+W M YK +P +R S+P++S
Sbjct: 60 KTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSDPFES 115
Query: 130 PTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
P AGGLFA++R +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+ + R
Sbjct: 116 PVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYR 174
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 49/104 (47%), Positives = 68/104 (65%), Gaps = 3/104 (2%)
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+ RF+ KVR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I +
Sbjct: 1 MARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPLLNQIALNH 59
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
K + P+ID ID+ + + + + RG F+W M YK +P
Sbjct: 60 KTIVCPMIDVIDHNHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 101
>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
Length = 588
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 101/232 (43%), Positives = 137/232 (59%), Gaps = 5/232 (2%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-- 266
+GPGE G +P+ E N+ SN IS +RT+ D+RM CK DY
Sbjct: 82 KGPGEMGAPVKIPKDKEKESKKMFQENQFNLMASNMISLNRTLKDVRMSGCKKHDYANLG 141
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LPK S+I VFHNE +S+L+R++HS+I R+P + LEEIILVDD S K L ++L+DY++
Sbjct: 142 ALPKTSIIFVFHNEAWSTLLRSIHSVINRSPREMLEEIILVDDKSEKDFLGKQLDDYVKN 201
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
V +IR REGLIR R GAK ++GEV+ FLDAH E WL PLL I DR +
Sbjct: 202 LPVPVHIIRQQHREGLIRARLEGAKIAKGEVLTFLDAHIEASPGWLEPLLYEIKKDRTNV 261
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+EF + D Y G F W + ++ +P+RE +R +R
Sbjct: 262 ICPIIDVISDDTFEF--LTGSDLTYGG-FNWKLNFRWYPVPQREVDRRGGDR 310
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+L+DY++ V +IR REGLIR R GAK ++GEV+ FLDAH E WL PL
Sbjct: 191 LGKQLDDYVKNLPVPVHIIRQQHREGLIRARLEGAKIAKGEVLTFLDAHIEASPGWLEPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I DR + P+ID I T+EF + D Y G F W + ++ +P+RE +R
Sbjct: 251 LYEIKKDRTNVICPIIDVISDDTFEF--LTGSDLTYGG-FNWKLNFRWYPVPQREVDRRG 307
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P ++PT AGGLF++D+++F E+G YD G+ +WGGEN E+SF+IWMCGG++ C
Sbjct: 308 GDRSLPMQTPTMAGGLFSIDKSYFYEIGSYDSGMDIWGGENLEMSFRIWMCGGTVLIATC 367
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 368 SHVGHVFR 375
>gi|260787295|ref|XP_002588689.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
gi|229273857|gb|EEN44700.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
Length = 415
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 91/190 (47%), Positives = 124/190 (65%), Gaps = 7/190 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+ Y+ +F +VR++ ER GLIR R +GA+ + G V+ FLD+H E + WL PL
Sbjct: 136 LKQKLDQYMSKF-PQVRVVHLKERAGLIRARLKGAELATGTVLTFLDSHIECNVGWLEPL 194
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I DR + P ID ++ T+ + E + RG F+W + ++ LP EAK+R
Sbjct: 195 LDRIREDRTRVVCPSIDRVNEATFAYEVANE---NVRGGFDWELFFQWVSLPAVEAKRRT 251
Query: 122 YN---SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+N E +SPT AGGLF++DR FF ELGGYDPG +WGGEN ELSFKIWMCGGS+E +
Sbjct: 252 HNVFQHEVIRSPTMAGGLFSIDRGFFYELGGYDPGFQIWGGENLELSFKIWMCGGSLEIL 311
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 312 PCSRVGHVFR 321
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/244 (36%), Positives = 141/244 (57%), Gaps = 8/244 (3%)
Query: 199 NLEPPLEPYK-EGPGEGGKAYH--LPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
N++ EP PG G+A +P+ ++A +A N S+ I ++R++PD R
Sbjct: 17 NIDATTEPRDPHAPGARGRAVEDAMPQ-HQADIEAGWKAASFNQFVSDLIPYERSLPDTR 75
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
C + DLP S+I+ F E +S+L+R+VHS+I R+P +EEI+L+DD S ++
Sbjct: 76 PPRCAEQEVADDLPTTSIIMCFCEESWSTLLRSVHSVINRSPPHLVEEILLIDDASRRSH 135
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L QKL+ Y+ +F +VR++ ER GLIR R +GA+ + G V+ FLD+H E + WL PL
Sbjct: 136 LKQKLDQYMSKF-PQVRVVHLKERAGLIRARLKGAELATGTVLTFLDSHIECNVGWLEPL 194
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
L I DR + P ID ++ T+ + E + RG F+W + ++ LP EAK+R
Sbjct: 195 LDRIREDRTRVVCPSIDRVNEATFAYEVANE---NVRGGFDWELFFQWVSLPAVEAKRRT 251
Query: 436 YNRY 439
+N +
Sbjct: 252 HNVF 255
>gi|328781649|ref|XP_003250010.1| PREDICTED: n-acetylgalactosaminyltransferase 6-like isoform 2 [Apis
mellifera]
Length = 598
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 123/197 (62%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPL
Sbjct: 184 LKKTLDDYVATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPL 243
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D K P ID I Y+T+E+R+ D RG F+W + YK L + +
Sbjct: 244 LEPIAQDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN-- 298
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCS
Sbjct: 299 -PTEPFKSPVMAGGLFAISSKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCS 357
Query: 182 RIVSLIR--PVFKADGK 196
R+ + R P F GK
Sbjct: 358 RVGHIYRKFPPFPNPGK 374
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 95/218 (43%), Positives = 132/218 (60%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 73 EAKRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 132
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+L+RT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 133 LRNLDSVSVIVSFHNEHFSTLIRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 192
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 193 ATHLPKVKIVRLKQRSGLIKGRLAGAKVAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 252
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 253 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 287
>gi|225007540|ref|NP_001070030.2| polypeptide N-acetylgalactosaminyltransferase 11 [Danio rerio]
Length = 590
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL + L+ Y+++ KV+++RN +REGLIR R GA + GEV+VFLD+HCEV WL
Sbjct: 183 DLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRMIGASHATGEVLVFLDSHCEVNEAWLQ 242
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI +RK + PVID I T VY P RG F WG+ +K + +P E
Sbjct: 243 PLLTPIKENRKTVVCPVIDIISADTL----VYTPSPIVRGGFNWGLHFKWDPVPMSELNS 298
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+SPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG + VP
Sbjct: 299 ---PDGAIRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISFRIWMCGGQLLIVP 355
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 356 CSRVGHIFR 364
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 126/203 (62%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R ++C+ Y + LP AS+++ F NE FS+L+RT
Sbjct: 97 DMGYHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVSLPTASIVICFFNEAFSALLRT 156
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 347
VHS++ RTP L EIILVDD S DL + L+ Y+Q+ KV+++RN +REGLIR R
Sbjct: 157 VHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRM 216
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV WL PLL PI +RK + PVID I T VY P
Sbjct: 217 IGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTVVCPVIDIISADTL----VYTP 272
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 273 SPIVRGGFNWGLHFKWDPVPMSE 295
>gi|149032269|gb|EDL87175.1| similar to GalNAc transferase 10 isoform a (predicted) [Rattus
norvegicus]
Length = 267
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 89/172 (51%), Positives = 118/172 (68%), Gaps = 4/172 (2%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P+
Sbjct: 7 VRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPM 66
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID ID+ + + + + RG F+W M YK +P +R SEP++SP AGGL
Sbjct: 67 IDVIDHSHFGYEA--QAGDAMRGAFDWEMYYKRIPIPPE--LQRADPSEPFESPVMAGGL 122
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
FA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG + VPCSR+ + R
Sbjct: 123 FAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSRVGHIYR 174
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 46/97 (47%), Positives = 63/97 (64%), Gaps = 2/97 (2%)
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
VR++R +REGLIRTR GA +RGEV+ FLD+HCEV +NWLPPLL I + K + P+
Sbjct: 7 VRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPLLNQIALNHKTIVCPM 66
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
ID ID+ + + + + RG F+W M YK +P
Sbjct: 67 IDVIDHSHFGYEA--QAGDAMRGAFDWEMYYKRIPIP 101
>gi|195129477|ref|XP_002009182.1| GI11401 [Drosophila mojavensis]
gi|193920791|gb|EDW19658.1| GI11401 [Drosophila mojavensis]
Length = 673
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 94/188 (50%), Positives = 118/188 (62%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDYI + KVR++R R GLI RS GA+ + EV++FLD+H E NWLPPL
Sbjct: 253 LFKPLEDYIAQHFTKVRVVRLPRRTGLIGARSAGARNATAEVLIFLDSHVEANYNWLPPL 312
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 313 LEPIAQNKRTAVCPFIDVIDHSNFNYRA---QDEGARGAFDWDFFYKRLPLLPE----DL 365
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ +EP+KSP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 366 KHPAEPFKSPVMAGGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGQMYDAPC 425
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 426 SRIGHIYR 433
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/214 (43%), Positives = 132/214 (61%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GE G A L + + + +L E G N S+ IS +R++PD+R ++C+ Y L
Sbjct: 146 GIGEQGVAAKLEDESQREYERALSLENGFNALLSDSISVNRSVPDIRHKDCRKKLYLSKL 205
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS + L + LEDYI +
Sbjct: 206 PTVSVIIIFYNEYMSVLMRSVHSLINRSPPELLKEIILVDDFSDRDYLFKPLEDYIAQHF 265
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 266 TKVRVVRLPRRTGLIGARSAGARNATAEVLIFLDSHVEANYNWLPPLLEPIAQNKRTAVC 325
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 326 PFIDVIDHSNFNYRA---QDEGARGAFDWDFFYK 356
>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Acyrthosiphon pisum]
Length = 615
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 138/237 (58%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
PP+ + GEGG+ + A E N+ S+ IS +R++ D+R ECK
Sbjct: 103 PPVREKRGKHGEGGRGVTMKPEQEALMKQKFKENQFNIIASDMISLNRSLQDIRQGECKS 162
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
YP +P S+++VFHNE +S+L+RTV S+I R+P L+EI+LVDD S + L +KLE
Sbjct: 163 KQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLGKKLE 222
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
DY+ + +++R +R GLIR R GAK G+VI FLDAHCE WL PLLA I
Sbjct: 223 DYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLARIVL 282
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+RK + PVID I T+E+ V D + G F W + ++ +P+RE +R +R
Sbjct: 283 NRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRNQDR 336
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLEDY+ + +++R +R GLIR R GAK G+VI FLDAHCE WL PL
Sbjct: 217 LGKKLEDYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPL 276
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +RK + PVID I T+E+ V D + G F W + ++ +P+RE +R
Sbjct: 277 LARIVLNRKTVVCPVIDVISDDTFEY--VTASDMTWGG-FNWKLNFRWYRVPQREMTRRN 333
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F +LG YD G+ +WGGEN E+SF++W CGG++E +PC
Sbjct: 334 QDRTAPLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRVWQCGGTLEIIPC 393
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 394 SHVGHVFR 401
>gi|115313271|gb|AAI24298.1| Zgc:153274 [Danio rerio]
Length = 590
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 93/189 (49%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL + L+ Y+++ KV+++RN +REGLIR R GA + GEV+VFLD+HCEV WL
Sbjct: 183 DLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRMIGASHATGEVLVFLDSHCEVNEAWLQ 242
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI +RK + PVID I T VY P RG F WG+ +K + +P E
Sbjct: 243 PLLTPIKENRKTVVCPVIDIISADTL----VYTPSPIVRGGFNWGLHFKWDPVPMSELNS 298
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+SPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG + VP
Sbjct: 299 ---PDGAIRSPTMAGGLFAMDRNYFYELGQYDRGMDIWGGENLEISFRIWMCGGQLLIVP 355
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 356 CSRVGHIFR 364
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/203 (45%), Positives = 126/203 (62%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R ++C+ Y + LP AS+++ F NE FS+L+RT
Sbjct: 97 DMGYHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVSLPTASIVICFFNEAFSALLRT 156
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 347
VHS++ RTP L EIILVDD S DL + L+ Y+Q+ KV+++RN +REGLIR R
Sbjct: 157 VHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRM 216
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV WL PLL PI +RK + PVID I T VY P
Sbjct: 217 IGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTVVCPVIDIISADTL----VYTP 272
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 273 SPIVRGGFNWGLHFKWDPVPMSE 295
>gi|118085566|ref|XP_418541.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Gallus
gallus]
Length = 608
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 92/189 (48%), Positives = 120/189 (63%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL K L +Y++ R +L+RN +REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 197 DLKKDLVEYVKTRLPKTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHCEVNEMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI DR+ + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 257 PLLTPIKEDRRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLSELEG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 123/203 (60%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R +C+ YP DLP ASVI+ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHREVPDTRDAKCREKSYPSDLPFASVIICFYNEALSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA L EIILVDD S DL + L +Y++ R +L+RN +REGLIR R
Sbjct: 171 VHSVLDRTPAHLLHEIILVDDNSELDDLKKDLVEYVKTRLPKTTKLVRNEKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + G+V+VFLD+HCEV WL PLL PI DR+ + PVID I T Y
Sbjct: 231 IGASHATGKVLVFLDSHCEVNEMWLQPLLTPIKEDRRTVVCPVIDIISADTL----TYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
>gi|380024971|ref|XP_003696258.1| PREDICTED: N-acetylgalactosaminyltransferase 6-like isoform 3 [Apis
florea]
Length = 590
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 132/218 (60%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK L + A + G N S+ IS +R++PD+R CK Y
Sbjct: 65 EARRIGKGEHGKPAFLSPSLDALKEKLYQVNGFNAALSDEISVNRSVPDIRHPGCKDKKY 124
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SVI+ FHNE FS+LMRT S++ R+PA L+EIILVDD S+K L + L+DY+
Sbjct: 125 LRNLDSVSVIVSFHNEHFSTLMRTCWSVVNRSPASLLQEIILVDDASTKVGLKKTLDDYV 184
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPLL PI D K
Sbjct: 185 ATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPLLEPIAQDYK 244
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W + YK
Sbjct: 245 TCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYK 279
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 123/197 (62%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ KV+++R +R GLI+ R GAK ++ +V+VFLD+H E +NWLPPL
Sbjct: 176 LKKTLDDYVATHLPKVKIVRLKQRSGLIKGRLAGAKIAKAKVLVFLDSHSEANINWLPPL 235
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D K P ID I Y+T+E+R+ D RG F+W + YK L + +
Sbjct: 236 LEPIAQDYKTCVCPFIDVIAYETFEYRA---QDEGARGAFDWELYYKRLPLLPEDLQN-- 290
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP+KSP AGGLFA+ FF ELGGYDP L +WGGE +ELSFKIW CGG + PCS
Sbjct: 291 -PTEPFKSPIMAGGLFAISAKFFWELGGYDPELDIWGGEQYELSFKIWQCGGQMYDAPCS 349
Query: 182 RIVSLIR--PVFKADGK 196
R+ + R P F GK
Sbjct: 350 RVGHIYRKFPPFPNPGK 366
>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
Length = 594
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 93/228 (40%), Positives = 137/228 (60%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE GK +P + + E N+ S+ I +R++ D+R +CK YP LP
Sbjct: 92 PGEMGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPAKLPT 151
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L ++LEDY++
Sbjct: 152 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGRQLEDYVKTLPVS 211
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 212 TIVLRTVKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 271
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I +T+E+ V D + G F W + ++ +P RE ++R ++R
Sbjct: 272 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDR 316
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY++ ++R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 197 LGRQLEDYVKTLPVSTIVLRTVKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 256
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I +T+E+ V D + G F W + ++ +P RE ++R
Sbjct: 257 LARIVLDRKTVVCPIIDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRN 313
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PC
Sbjct: 314 HDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPC 373
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 374 SHVGHVFR 381
>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
Length = 483
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 89/201 (44%), Positives = 133/201 (66%), Gaps = 3/201 (1%)
Query: 238 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
N+ S+ I+ +R++PD+R+E CK YP +LP S+++VFHNE +S+L+RTVHS+I R+P
Sbjct: 11 NLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRSP 70
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
+ + EIILVDD S + L LE+Y++ V+++R +R GLIR R RGA ++G++
Sbjct: 71 HRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQI 130
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 417
I FLDAHCE + WL PLLA I DRK + P+ID I T+E+ + D Y G F W
Sbjct: 131 ITFLDAHCECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNW 187
Query: 418 GMLYKENELPEREAKKRKYNR 438
+ ++ +P+RE +RK +R
Sbjct: 188 KLNFRWYPVPQREMDRRKGDR 208
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V+++R +R GLIR R RGA ++G++I FLDAHCE + WL PL
Sbjct: 89 LKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQIITFLDAHCECTIGWLEPL 148
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 149 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 205
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 206 GDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTC 265
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 266 SHVGHVFR 273
>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
(Silurana) tropicalis]
gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus (Silurana) tropicalis]
Length = 498
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 133/202 (65%), Gaps = 3/202 (1%)
Query: 237 MNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRT 296
N+ S+ I+ +R++PD+R+E CK YP +LP S+++VFHNE +S+L+RTVHS+I R+
Sbjct: 10 FNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSIVIVFHNEAWSTLLRTVHSVINRS 69
Query: 297 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 356
P + + EIILVDD S + L LE+Y++ V+++R +R GLIR R RGA ++G+
Sbjct: 70 PHRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQ 129
Query: 357 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 416
+I FLDAHCE + WL PLLA I DRK + P+ID I T+E+ + D Y G F
Sbjct: 130 IITFLDAHCECTIGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FN 186
Query: 417 WGMLYKENELPEREAKKRKYNR 438
W + ++ +P+RE +RK +R
Sbjct: 187 WKLNFRWYPVPQREMDRRKGDR 208
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V+++R +R GLIR R RGA ++G++I FLDAHCE + WL PL
Sbjct: 89 LKSPLENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQIITFLDAHCECTIGWLEPL 148
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 149 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 205
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 206 GDRTLPVRTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTC 265
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 266 SHVGHVFR 273
>gi|312068074|ref|XP_003137043.1| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 547
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 92/227 (40%), Positives = 139/227 (61%), Gaps = 5/227 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 267
G GE G+ L EA D + N+ S+ I+ +R++PD+R +C+ Y +
Sbjct: 73 GAGEDGRPVKLSEADERLSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSE 132
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++ L +L++++ +
Sbjct: 133 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQL 192
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+++IR ER GLIR R GA E++G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 193 GTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 252
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 253 CPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSR 296
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L++++ + +++IR ER GLIR R GA E++G+V+ FLD+HCE W+ PL
Sbjct: 181 LKAELDNFVAQLGTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +RK + PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 241 LARIKENRKAVVCPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRS 297
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P SPT AGGLF++DR +F E+G YD + +WGGEN E+S ++W CGG IE +PC
Sbjct: 298 NDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPC 357
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 358 SHVGHVFR 365
>gi|156407314|ref|XP_001641489.1| predicted protein [Nematostella vectensis]
gi|156228628|gb|EDO49426.1| predicted protein [Nematostella vectensis]
Length = 353
Score = 181 bits (458), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 87/190 (45%), Positives = 120/190 (63%), Gaps = 4/190 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL Y+ + VR++R ++REGLIR R GA+ ++G+VI FLDAHCE ++WL PL
Sbjct: 83 LKSKLTAYVAKLR-NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPL 141
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I+SDR I+ VPVID I + + G F W M + + LP +RK
Sbjct: 142 LSRIHSDRTIVAVPVIDIISSTNFMYSGTPSA---VIGGFSWDMQFTWHSLPNNRQSERK 198
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ P ++PT AGGLF++DR +F E G YD G+ VWGGEN E+SF+IW CGG +E +PCS
Sbjct: 199 DRTAPIRTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPCS 258
Query: 182 RIVSLIRPVF 191
R+ + R F
Sbjct: 259 RVGHVFRTRF 268
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/178 (44%), Positives = 111/178 (62%), Gaps = 4/178 (2%)
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
+C YP LP +V++ FHNE +S+L+RTVHS+I R+PA L EI+L+DDFS+ L
Sbjct: 25 KCSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYLK 84
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
KL Y+ + VR++R ++REGLIR R GA+ ++G+VI FLDAHCE ++WL PLL+
Sbjct: 85 SKLTAYVAKLR-NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPLLS 143
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
I+SDR I+ VPVID I + + G F W M + + LP +RK
Sbjct: 144 RIHSDRTIVAVPVIDIISSTNFMYSGTPSA---VIGGFSWDMQFTWHSLPNNRQSERK 198
>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 573
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 93/188 (49%), Positives = 122/188 (64%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKL+ Y++R KV++IR ER GLIR R GAK+++ +V++FLD+H E +NWLPPL
Sbjct: 170 LKKKLDIYVDRHLPKVKIIRLPERMGLIRARLAGAKKAKAQVLLFLDSHTEANVNWLPPL 229
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI + K P ID I + T+E+R+ D RG F+W YK L PE
Sbjct: 230 LEPIAENYKTCVCPFIDVIAHDTFEYRA---QDEGRRGAFDWEFFYKRLPLLPE----DL 282
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ +EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PC
Sbjct: 283 KHPTEPFQSPVMAGGLFAISAKFFWELGGYDEGLAIWGGEQYELSFKIWQCGGKMVDAPC 342
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 343 SRVGHIYR 350
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 136/218 (62%), Gaps = 4/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E ++ G GE GK LP+ + +A G N S+ I + ++PD+R CK Y
Sbjct: 60 ESHRIGVGEQGKPAFLPDKEKVQKEALYAVNGFNALLSDKIYLN-SLPDIRHPGCKEKKY 118
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+L SV++ FHNE +S+L+RTV+S++ R+P+ L+EIILVDD+SSK L +KL+ Y+
Sbjct: 119 RKNLNTVSVVVPFHNEHWSTLLRTVYSVLNRSPSHLLKEIILVDDYSSKPFLKKKLDIYV 178
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
R KV++IR ER GLIR R GAK+++ +V++FLD+H E +NWLPPLL PI + K
Sbjct: 179 DRHLPKVKIIRLPERMGLIRARLAGAKKAKAQVLLFLDSHTEANVNWLPPLLEPIAENYK 238
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I + T+E+R+ D RG F+W YK
Sbjct: 239 TCVCPFIDVIAHDTFEYRA---QDEGRRGAFDWEFFYK 273
>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 561
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/229 (43%), Positives = 140/229 (61%), Gaps = 5/229 (2%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP--L 266
+GPGE G+ +P N+ SN IS +RT+PD+R++ CK YP
Sbjct: 52 KGPGEMGQPVIIPPEEEELKKEMFKINQFNLLASNKISVNRTLPDVRIDGCKKKIYPPSQ 111
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LP S+I+VFHNE +S+L+R +HSII R+P + LEEIILVDD S + L ++L+DY++
Sbjct: 112 KLPTTSIIIVFHNEAWSTLIRNIHSIINRSPREILEEIILVDDASERDFLGKQLDDYVRG 171
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
+ +VR++R ER G++ R RGA S GEV+ FLDAHCE WL PL+A I DR +
Sbjct: 172 LSVRVRVVRMAERSGIVGARLRGAAISTGEVLTFLDAHCECTKGWLEPLIARIAEDRTRV 231
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID I +T+E+ SV E G F W + ++ + +RE K+RK
Sbjct: 232 VSPVIDSISDETFEYNSVPELGC---GGFNWRLNFRWYPMSKREKKRRK 277
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+L+DY+ + +VR++R ER G++ R RGA S GEV+ FLDAHCE WL PL
Sbjct: 161 LGKQLDDYVRGLSVRVRVVRMAERSGIVGARLRGAAISTGEVLTFLDAHCECTKGWLEPL 220
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
+A I DR + PVID I +T+E+ SV E G F W + ++ + +RE K+RK
Sbjct: 221 IARIAEDRTRVVSPVIDSISDETFEYNSVPELGC---GGFNWRLNFRWYPMSKREKKRRK 277
Query: 122 YNSE-PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ P +PT AGGLF++ + +F +G YD G+ +WGGEN E+SF+IWMCGG++E VPC
Sbjct: 278 GDATIPINTPTMAGGLFSIHKEYFYRIGTYDEGMDIWGGENLEMSFRIWMCGGTLEIVPC 337
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 338 SHVGHVFR 345
>gi|360043880|emb|CCD81426.1| putative n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 526
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 94/189 (49%), Positives = 124/189 (65%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L ++L++Y+ R G V +I EREGLIR R GAK + G+V++FLD+HCE +NWLP
Sbjct: 110 LKERLDNYLSRTYPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLP 169
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI + + +T P ID ID T+E+R+ D RG F+W YK LP R +
Sbjct: 170 PLLDPISKNYRTVTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYKR--LP-RLSTD 223
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P++SP AGGLFA+ R +F ELGGYDP L +WGGE +ELSFKIWMCGG + VP
Sbjct: 224 SLHPETPFESPVMAGGLFAISRKWFWELGGYDPLLHIWGGEQYELSFKIWMCGGRLIDVP 283
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 284 CSRVGHIFR 292
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 132/217 (60%), Gaps = 5/217 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
++GPGE G L + +A +L G N+ S I DR++ D+R CK Y
Sbjct: 2 RQGPGENGLPVRLSNSQKALSKKTLNFNGFNIFVSEKIKTDRSVKDIRYPNCKGALYSKQ 61
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+ + E + +L+RTV S++ R+P + ++E+ILVDD SS+ L ++L++Y+ R
Sbjct: 62 LPLVSIIIPVYEEHWETLIRTVVSVLNRSPLELIKEVILVDDGSSRRYLKERLDNYLSRT 121
Query: 328 --NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
G V +I EREGLIR R GAK + G+V++FLD+HCE +NWLPPLL PI + +
Sbjct: 122 YPGGLVWVIHLKEREGLIRARLSGAKLATGDVLIFLDSHCETNVNWLPPLLDPISKNYRT 181
Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+T P ID ID T+E+R+ D RG F+W YK
Sbjct: 182 VTCPFIDVIDADTFEYRA---QDDGARGAFDWSFYYK 215
>gi|393911417|gb|EFO27036.2| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 597
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/227 (40%), Positives = 139/227 (61%), Gaps = 5/227 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 267
G GE G+ L EA D + N+ S+ I+ +R++PD+R +C+ Y +
Sbjct: 62 GAGEDGRPVKLSEADERLSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSE 121
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++ L +L++++ +
Sbjct: 122 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQL 181
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+++IR ER GLIR R GA E++G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 182 GTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 241
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 242 CPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSR 285
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L++++ + +++IR ER GLIR R GA E++G+V+ FLD+HCE W+ PL
Sbjct: 170 LKAELDNFVAQLGTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPL 229
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +RK + PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 230 LARIKENRKAVVCPVIDVINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRS 286
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P SPT AGGLF++DR +F E+G YD + +WGGEN E+S ++W CGG IE +PC
Sbjct: 287 NDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPC 346
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 347 SHVGHVFR 354
>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
catus]
Length = 546
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA SRG+VI FLDAHCE L WL PL
Sbjct: 152 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASRGQVITFLDAHCECTLGWLEPL 211
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 212 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 268
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 269 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 328
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 329 SHVGHVFR 336
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/184 (47%), Positives = 118/184 (64%), Gaps = 3/184 (1%)
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R + CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S +
Sbjct: 91 RFDRCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERD 150
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L LE+Y++ V++IR ER GLIR R RGA SRG+VI FLDAHCE L WL P
Sbjct: 151 FLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASRGQVITFLDAHCECTLGWLEP 210
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +R
Sbjct: 211 LLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRR 267
Query: 435 KYNR 438
K +R
Sbjct: 268 KGDR 271
>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
Length = 632
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 92/237 (38%), Positives = 140/237 (59%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + + PGE GK +P + E N+ S+ IS +R++ D+R E C++
Sbjct: 122 PTVRESRGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 181
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
YP LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE
Sbjct: 182 KHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 241
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 242 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 301
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P+RE +R +R
Sbjct: 302 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDR 355
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 236 LGKQLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPL 295
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P+RE +R
Sbjct: 296 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRN 352
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 353 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 412
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 413 SHVGHVFR 420
>gi|431895736|gb|ELK05155.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Pteropus alecto]
Length = 608
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L+ +++++ GK+++IRN +REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDAFVQKYLPGKIKVIRNRKREGLIRGRMIGASHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DR+ + PVID I T + S RG F WG+ +K + +P E
Sbjct: 258 LLAAIQEDRRTVVCPVIDIISADTLAYSS----SPVVRGGFNWGLHFKWDLVPLPEPGGP 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 EGATAPIKSPTMAGGLFAMNRDYFSELGQYDRGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/200 (46%), Positives = 127/200 (63%), Gaps = 5/200 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM S+ + + R +PD R CK YP DLP ASV++ F+NE S+L+RT
Sbjct: 111 DLGYQKHAFNMLISDRLGYHRDVPDTRNAACKDKTYPADLPVASVVICFYNEALSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS++ RTPAQ L E+ILVDD S DL +L+ ++Q++ GK+++IRN +REGLIR R
Sbjct: 171 VHSVLDRTPAQLLHEVILVDDDSDFDDLKGELDAFVQKYLPGKIKVIRNRKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV + WL PLLA I DR+ + PVID I T Y
Sbjct: 231 IGASHATGEVLVFLDSHCEVNVMWLQPLLAAIQEDRRTVVCPVIDIISADTL----AYSS 286
Query: 408 DHHYRGIFEWGMLYKENELP 427
RG F WG+ +K + +P
Sbjct: 287 SPVVRGGFNWGLHFKWDLVP 306
>gi|351695439|gb|EHA98357.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
glaber]
Length = 608
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++YI+++ K++LIRN REGLIR R GA + GEV+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYIQKYLPAKIKLIRNPRREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA ++ D + PVID I T + S RG F WG+ +K + +P E
Sbjct: 258 LLAVVHGDPHTVVCPVIDIISADTLAYSS----SPVVRGGFNWGLHFKWDLVPLSELGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 DSATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/252 (42%), Positives = 147/252 (58%), Gaps = 8/252 (3%)
Query: 181 SRIVSLIRPVFKADGKLGN-LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNM 239
SRI+ L K D + N +E P + + + E G + E + D ++ NM
Sbjct: 64 SRILDLQFKANKIDDMIDNQIEDPEKGHLKFSSELGMIFS--EHDQELRDLGYQKHAFNM 121
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS++ RTPA
Sbjct: 122 LISNRLGYHRDVPDTRNAVCKEKSYPTDLPVASVVICFYNEAFSALLRTVHSVLDRTPAY 181
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILVDD S DL +L++YIQ++ K++LIRN REGLIR R GA + GEV+
Sbjct: 182 LLHEIILVDDDSDFDDLKGELDEYIQKYLPAKIKLIRNPRREGLIRGRMIGAAHATGEVL 241
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
VFLD+HCEV + WL PLLA ++ D + PVID I T + S RG F WG
Sbjct: 242 VFLDSHCEVNVMWLQPLLAVVHGDPHTVVCPVIDIISADTLAYSS----SPVVRGGFNWG 297
Query: 419 MLYKENELPERE 430
+ +K + +P E
Sbjct: 298 LHFKWDLVPLSE 309
>gi|260823684|ref|XP_002606210.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
gi|229291550|gb|EEN62220.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
Length = 595
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 91/211 (43%), Positives = 134/211 (63%), Gaps = 5/211 (2%)
Query: 219 HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFH 278
H PE + D + N+ S+ I F R IPD R ++C+ YP LPK S+++ F
Sbjct: 92 HSPED-QETRDMGYRRHAFNLLISDRIGFHRNIPDTRNDKCRGKSYPSGLPKTSIVICFF 150
Query: 279 NEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTE 338
NE +S+L+RTVHS++ RTP + L+EIIL+DDFS ++ L ++LE+YI+ V+L R +
Sbjct: 151 NEAWSTLLRTVHSVLDRTPRELLQEIILIDDFSDQSHLKEELEEYIRDHLPMVQLYRTDK 210
Query: 339 REGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQT 398
REGLIR R +GA + G+V++FLD+HCEV WL PLLA I DR + P+ID I+ T
Sbjct: 211 REGLIRARVKGATHASGDVLMFLDSHCEVSKQWLEPLLARIAEDRTRVVCPIIDIINSDT 270
Query: 399 WEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
+E Y RG F WG+ +K +++P++
Sbjct: 271 FE----YTASPLVRGGFNWGLHFKWDQVPQQ 297
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 123/187 (65%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE+YI V+L R +REGLIR R +GA + G+V++FLD+HCEV WL PL
Sbjct: 188 LKEELEEYIRDHLPMVQLYRTDKREGLIRARVKGATHASGDVLMFLDSHCEVSKQWLEPL 247
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + P+ID I+ T+E Y RG F WG+ +K +++P++ +
Sbjct: 248 LARIAEDRTRVVCPIIDIINSDTFE----YTASPLVRGGFNWGLHFKWDQVPQQLLQGPD 303
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ P SPT AGGLFA+DR +F ELG YD G+ +WGGEN E+SF+IWMCGG++E +PCS
Sbjct: 304 GAAAPINSPTMAGGLFAIDREYFDELGRYDEGMDIWGGENLEISFRIWMCGGTLEIIPCS 363
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 364 RVGHVFR 370
>gi|443683118|gb|ELT87486.1| hypothetical protein CAPTEDRAFT_155466 [Capitella teleta]
Length = 644
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 84/187 (44%), Positives = 123/187 (65%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+++ + V L+R REGLIR R G ++G+V+VFLD+HCE WLPPL
Sbjct: 251 LGEPLENHLSQLEN-VYLLRTKIREGLIRARLLGVSYAKGDVLVFLDSHCECAEGWLPPL 309
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +DR + P++D I++QT+E+R+ E H G F+W + + +LPE E K+R
Sbjct: 310 LLAIEADRTKIVCPLVDVIEFQTFEYRAAKEELH---GAFDWNLQFIWKDLPEHEMKRRT 366
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++ ++PT GGLFA+DR +F +G YD G+ +WG EN ELSF++WMCGGS+E PCS
Sbjct: 367 SPADNIRAPTIIGGLFAVDRLYFKRIGSYDSGMDIWGSENLELSFRVWMCGGSLEISPCS 426
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 427 RVGHVFR 433
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 127/206 (61%), Gaps = 5/206 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
++ + + N+ S + DR IPD R +C + L +VI+ FHNE +S+L+RT
Sbjct: 165 ESGMQRHSFNVRASELLPLDRPIPDYRPTQCPSINQS-TLSPTTVIICFHNEAWSTLLRT 223
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 348
+HS+I R+P+ + EIILVDD S+ L + LE+++ + V L+R REGLIR R
Sbjct: 224 LHSVINRSPSHLIMEIILVDDASTFDYLGEPLENHLSQLEN-VYLLRTKIREGLIRARLL 282
Query: 349 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 408
G ++G+V+VFLD+HCE WLPPLL I +DR + P++D I++QT+E+R+ E
Sbjct: 283 GVSYAKGDVLVFLDSHCECAEGWLPPLLLAIEADRTKIVCPLVDVIEFQTFEYRAAKEEL 342
Query: 409 HHYRGIFEWGMLYKENELPEREAKKR 434
H G F+W + + +LPE E K+R
Sbjct: 343 H---GAFDWNLQFIWKDLPEHEMKRR 365
>gi|327274386|ref|XP_003221958.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Anolis carolinensis]
Length = 608
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 125/203 (61%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R +CK YPLDLP AS+I+ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRDAKCKGKKYPLDLPSASIIICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 347
VHS++ RTP+ L EIILVDD S DL + L+ Y+++ V+L+RN +REGLIR R
Sbjct: 171 VHSVLDRTPSHLLHEIILVDDNSELVDLKEDLDVYLRKNLPNNVKLVRNGKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + G+V+VFLD+HCEV WL PLL PI RK + PVID I T Y
Sbjct: 231 IGASHATGKVLVFLDSHCEVNELWLQPLLTPIRESRKTVVCPVIDIISADTL----TYSS 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 119/189 (62%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL + L+ Y+ + V+L+RN +REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 197 DLKEDLDVYLRKNLPNNVKLVRNGKREGLIRGRMIGASHATGKVLVFLDSHCEVNELWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI RK + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 257 PLLTPIRESRKTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLSELEG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLLIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
Acetylgalactosaminyltransferase-T1
Length = 472
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 91/198 (45%), Positives = 129/198 (65%), Gaps = 3/198 (1%)
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S I+ +R++PD+R+E CK YP +LP SV++VFHNE +S+L+RTVHS+I R+P
Sbjct: 1 ASEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHM 60
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
+EEI+LVDD S + L + LE Y+++ V +IR +R GLIR R +GA SRG+VI F
Sbjct: 61 IEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITF 120
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LDAHCE WL PLLA I DR+ + P+ID I T+E+ + D Y G F W +
Sbjct: 121 LDAHCECTAGWLEPLLARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLN 177
Query: 421 YKENELPEREAKKRKYNR 438
++ +P+RE +RK +R
Sbjct: 178 FRWYPVPQREMDRRKGDR 195
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PL
Sbjct: 76 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPL 135
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 136 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 192
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 193 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 252
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 253 SHVGHVFR 260
>gi|383862333|ref|XP_003706638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Megachile rotundata]
Length = 637
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/189 (47%), Positives = 126/189 (66%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L +K++ +I F+ KV+ + +REGLIR R GA+++ GEV++FLD+H EV W+
Sbjct: 215 ELHEKIKAFINNNFDRKVKFFKTEKREGLIRARMFGARKATGEVLIFLDSHIEVNKMWIE 274
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL+ I + I+ +PVID I+ T++ Y RG F WG+ +K +LP +
Sbjct: 275 PLLSRIAHSKTIVAMPVIDIINADTFQ----YTASPLVRGGFNWGLHFKWEQLPTKLVHD 330
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ +P KSPT AGGLFAMDR +F+ELG YD G+ VWGGEN E+SF+IWMCGGSIE +P
Sbjct: 331 EDF-IKPIKSPTMAGGLFAMDREYFVELGEYDAGMDVWGGENLEISFRIWMCGGSIELIP 389
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 390 CSRVGHVFR 398
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 81/196 (41%), Positives = 126/196 (64%), Gaps = 5/196 (2%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
Y N+ S++I DR +PD R + C+ YP LP AS+++ F+NE + +L+R++HSII+
Sbjct: 135 YAFNVLISDNIGLDRKLPDTRHKLCQMQQYPNKLPNASIVICFYNEHYMTLLRSIHSIIE 194
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKES 353
RTP L EIILV+D+S +L +K++ +I F+ KV+ + +REGLIR R GA+++
Sbjct: 195 RTPKHLLHEIILVNDWSDSKELHEKIKAFINNNFDRKVKFFKTEKREGLIRARMFGARKA 254
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
GEV++FLD+H EV W+ PLL+ I + I+ +PVID I+ T++ Y RG
Sbjct: 255 TGEVLIFLDSHIEVNKMWIEPLLSRIAHSKTIVAMPVIDIINADTFQ----YTASPLVRG 310
Query: 414 IFEWGMLYKENELPER 429
F WG+ +K +LP +
Sbjct: 311 GFNWGLHFKWEQLPTK 326
>gi|340727930|ref|XP_003402286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus terrestris]
Length = 643
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 88/188 (46%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +K++ YI FNGKV+ + +REGLIR R GA+++ GEV++FLD+H EV W+ P
Sbjct: 221 LHEKIKTYIVNNFNGKVKFYKTEKREGLIRARMFGARKATGEVLIFLDSHIEVNKRWIEP 280
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I + I+ +P+ID I+ T++ Y RG F WG+ +K + +P
Sbjct: 281 LLSQIAQSKTIVAMPIIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVPVGTFAHD 336
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P KSPT AGGLFAMDR +F +LG YD G+ +WGGEN E+SF+IWMCGGSIE +PC
Sbjct: 337 EDFIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPC 396
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 397 SRVGHVFR 404
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 95/249 (38%), Positives = 146/249 (58%), Gaps = 12/249 (4%)
Query: 185 SLIRPVFK-ADGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNM 239
+L R +K +D L L+P P++P +G E G + + + D Y N+
Sbjct: 87 NLRRNAYKNSDKLLQQLQPVPVKPAVTLGQGLDELGMVKNFEDQRKR--DEGYKNYSFNI 144
Query: 240 ETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S++I R IPD R + C+ Y LP AS+++ F+NE + +L+R++HSII RTPA
Sbjct: 145 LVSDNIGLHREIPDTRHKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPAS 204
Query: 300 YLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVI 358
L EIILV+D+S L +K++ YI FNGKV+ + +REGLIR R GA+++ GEV+
Sbjct: 205 LLHEIILVNDWSDSKALHEKIKTYIVNNFNGKVKFYKTEKREGLIRARMFGARKATGEVL 264
Query: 359 VFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWG 418
+FLD+H EV W+ PLL+ I + I+ +P+ID I+ T++ Y RG F WG
Sbjct: 265 IFLDSHIEVNKRWIEPLLSQIAQSKTIVAMPIIDIINPDTFQ----YTGSPLVRGGFNWG 320
Query: 419 MLYKENELP 427
+ +K + +P
Sbjct: 321 LHFKWDNVP 329
>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
Length = 675
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 140/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R + C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKDKFKENQFNLLASDMISLNRSLTDVRHDGC 177
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ YP LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LEDY+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEDYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 353
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 234 LGKQLEDYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 293
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 294 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 350
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 351 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 410
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 411 SHVGHVFR 418
>gi|115704811|ref|XP_788279.2| PREDICTED: probable N-acetylgalactosaminyltransferase 9-like,
partial [Strongylocentrotus purpuratus]
Length = 405
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 98/242 (40%), Positives = 138/242 (57%), Gaps = 7/242 (2%)
Query: 194 DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 253
D LG L+ + + GPGEGG+ L S ++ N+ S+ IS +RT+ D
Sbjct: 77 DSVLGQLDKDIPRF--GPGEGGQPLILSGKDMEKAKKSRDQHNFNLVVSDKISLERTVKD 134
Query: 254 LRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
R C+ Y P ASVI+ FHNE +S+LMRTVHS++ RTP L E++LVDD S+
Sbjct: 135 TRDSRCQDITYRFSKFPTASVIIAFHNEAWSTLMRTVHSVVNRTPRDILTEVVLVDDAST 194
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
DL +KL + + GKVRL+ T REG+ R + RGA+E+RGEV+VFLDAHCEV +WL
Sbjct: 195 DEDLKKKLLNIPKSVRGKVRLVHTTHREGVARAKMRGAREARGEVLVFLDAHCEVNTHWL 254
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
P+L ++ + P+ID ID +T+ F E R F W + + L + E
Sbjct: 255 EPMLDLVHQGPTTVVSPIIDKIDPETFGF----EDGSLARVTFRWSLETRRIPLSQIEKA 310
Query: 433 KR 434
+R
Sbjct: 311 ER 312
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 69/167 (41%), Positives = 98/167 (58%), Gaps = 4/167 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KKL + + GKVRL+ T REG+ R + RGA+E+RGEV+VFLDAHCEV +WL P
Sbjct: 197 DLKKKLLNIPKSVRGKVRLVHTTHREGVARAKMRGAREARGEVLVFLDAHCEVNTHWLEP 256
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
+L ++ + P+ID ID +T+ F E R F W + + L + E +R
Sbjct: 257 MLDLVHQGPTTVVSPIIDKIDPETFGF----EDGSLARVTFRWSLETRRIPLSQIEKAER 312
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 167
EP +SP GG+FA+ ++FF ++GG D GL WG + + S K
Sbjct: 313 LNPLEPVRSPLTNGGIFAVSKSFFEKIGGIDAGLDGWGADGLDFSMK 359
>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Nasonia vitripennis]
Length = 635
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 136/231 (58%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
++ PGE GKA H+P A N+ S+ IS +R++ D+R+ CK +P
Sbjct: 123 RDSPGEMGKAVHIPPEQDAIQQELFKLNQFNLMASDMISLNRSLKDVRLSGCKSKKFPKL 182
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L QKLEDY++
Sbjct: 183 LPDTSIVIVFHNEAWSTLLRTVWSVINRSPRALLKEIILVDDASEREHLKQKLEDYVETL 242
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I D+K +
Sbjct: 243 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAHDKKTVV 302
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 303 CPIIDVISDDTFEY--ITASDMTWGG-FNWKLNFRWYRVAQREMDRRNGDR 350
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 117/188 (62%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLEDY+E + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 231 LKQKLEDYVETLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I D+K + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 291 LARIAHDKKTVVCPIIDVISDDTFEY--ITASDMTWGG-FNWKLNFRWYRVAQREMDRRN 347
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG +E PC
Sbjct: 348 GDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGILEISPC 407
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 408 SHVGHVFR 415
>gi|195454523|ref|XP_002074278.1| GK18434 [Drosophila willistoni]
gi|194170363|gb|EDW85264.1| GK18434 [Drosophila willistoni]
Length = 646
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/217 (43%), Positives = 135/217 (62%), Gaps = 3/217 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE G+ H+ + D G N S+ IS +R++PD+R EECK Y
Sbjct: 121 RTGMGEHGEPSHIDAQEKELEDKIYRMNGFNGLLSDRISINRSVPDVRREECKSRKYLAK 180
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-R 326
LP+ASVI +F+NE F++L+R+++S+I RTP + L++I+LVDD S L Q+L+DY+
Sbjct: 181 LPQASVIFIFYNEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSDWEVLKQQLDDYVSLH 240
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
F V ++RN ER GLI R GAK + GEV+VF D+H EV NWLPPLL PI D KI
Sbjct: 241 FPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPPLLEPIAIDSKIS 300
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE 423
T P++D I++ T+ + ++ RG F+W YK+
Sbjct: 301 TCPIVDSIEHSTFAYSGGHQ--EGSRGGFDWRFYYKQ 335
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L ++L+DY+ F V ++RN ER GLI R GAK + GEV+VF D+H EV NWLPP
Sbjct: 229 LKQQLDDYVSLHFPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPP 288
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI D KI T P++D I++ T+ + ++ RG F+W YK+ LPE K
Sbjct: 289 LLEPIAIDSKISTCPIVDSIEHSTFAYSGGHQEGS--RGGFDWRFYYKQLPVLPEDSLDK 346
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY++P GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 347 ----SLPYRNPVMMGGLFAINTKFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVP 402
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 403 CSRVAHIFR 411
>gi|410953294|ref|XP_003983307.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Felis
catus]
Length = 443
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 122/189 (64%), Gaps = 4/189 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
+YG N S + +R +PD R ++C YP +LP ASV++ FHNE FS+L RT+ S++
Sbjct: 99 KYGFNTVLSKSLGSEREVPDTRNKKCFQKHYPANLPTASVVVCFHNEEFSALFRTMFSVV 158
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
TP +LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR+R GA +
Sbjct: 159 NLTPRHFLEEIILVDDMSDSDDLKEKLDHHLEVFRGKIKLIRNKKREGLIRSRMIGASRA 218
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
G+V+VFLD+HCEV WL PLL I D K++ P+ID ID T E Y P RG
Sbjct: 219 SGDVLVFLDSHCEVNKVWLEPLLHAIAKDPKMVVCPLIDVIDSVTLE----YWPSPVVRG 274
Query: 414 IFEWGMLYK 422
F W + +K
Sbjct: 275 AFNWHLQFK 283
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 86/183 (46%), Positives = 119/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN +REGLIR+R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDHHLEVFRGKIKLIRNKKREGLIRSRMIGASRASGDVLVFLDSHCEVNKVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID T E Y P RG F W + +K + + E
Sbjct: 240 LLHAIAKDPKMVVCPLIDVIDSVTLE----YWPSPVVRGAFNWHLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP AGG+FA++R +F E+G YD G+ +WG EN ELS +IWMCGG + +PC
Sbjct: 296 EGPTLPIRSPAMAGGIFAINRHYFREIGQYDKGMNLWGAENLELSLRIWMCGGQLFVLPC 355
Query: 181 SRI 183
SR+
Sbjct: 356 SRV 358
>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 756
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 133/231 (57%), Gaps = 6/231 (2%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
E PG GK +P ++ D N+ S+ I +R++PD+R ++C Y Y L
Sbjct: 248 ELPGANGKPVQIPSELQSEADDLFIINSFNLMASDMIGINRSLPDVRPKQCLYKQYSSAL 307
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+VFHNE +S+L+RTVHS+I RTP QYL EIILVDD S A L +L+ Y+ +
Sbjct: 308 PNTSVIIVFHNEAWSALLRTVHSVINRTPRQYLSEIILVDDASIHAHLGHQLDSYVAKLP 367
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V + R R GLIR R RGA ++G+V+ FLD+HCE WL PLLA I DR +
Sbjct: 368 VPVHVERMGVRSGLIRARMRGALVAQGQVLTFLDSHCEASHGWLEPLLARIAEDRSNVVT 427
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYKENELPEREAKKRKYN 437
PVID I+ Q YE D+ G+F+W + ++ + R+ K++
Sbjct: 428 PVIDVINAQNL----AYEADNQTPAIGVFDWSLTFRWQSIQRRDLPLLKHD 474
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/190 (43%), Positives = 114/190 (60%), Gaps = 7/190 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+ + V + R R GLIR R RGA ++G+V+ FLD+HCE WL PL
Sbjct: 355 LGHQLDSYVAKLPVPVHVERMGVRSGLIRARMRGALVAQGQVLTFLDSHCEASHGWLEPL 414
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR--GIFEWGMLYKENELPEREAKK 119
LA I DR + PVID I+ Q YE D+ G+F+W + ++ + R+
Sbjct: 415 LARIAEDRSNVVTPVIDVINAQNL----AYEADNQTPAIGVFDWSLTFRWQSIQRRDLPL 470
Query: 120 RKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
K++ + P SPT AGGLFA+DR++F+E G YD G +WG EN E+SFK WMCGG IE +
Sbjct: 471 LKHDPTHPIPSPTMAGGLFAIDRSYFIETGMYDSGFEIWGAENLEISFKTWMCGGRIEIL 530
Query: 179 PCSRIVSLIR 188
PCS + + R
Sbjct: 531 PCSHVGHIFR 540
>gi|195467145|ref|XP_002076010.1| GK16099 [Drosophila willistoni]
gi|194172095|gb|EDW86996.1| GK16099 [Drosophila willistoni]
Length = 348
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 98/228 (42%), Positives = 139/228 (60%), Gaps = 4/228 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE G+ H+ + D G N S+ IS +R++PD+R EECK Y
Sbjct: 36 RTGMGEHGEPSHIDAQEKELEDKIYRMNGFNGLLSDRISINRSVPDVRREECKTRKYLAK 95
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-R 326
LP+ASVI +F+NE F++L+R+++S+I RTP + L++I+LVDD S L Q+L+DY+
Sbjct: 96 LPQASVIFIFYNEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSDWEVLKQQLDDYVSLH 155
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
F V ++RN ER GLI R GAK + GEV+VF D+H EV NWLPPLL PI D KI
Sbjct: 156 FPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPPLLEPIAIDSKIS 215
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 433
T P++D I++ T+ + ++ RG F+W YK+ LPE K
Sbjct: 216 TCPIVDSIEHSTFAYSGGHQ--EGSRGGFDWRFYYKQLPVLPEDSLDK 261
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L ++L+DY+ F V ++RN ER GLI R GAK + GEV+VF D+H EV NWLPP
Sbjct: 144 LKQQLDDYVSLHFPQLVHVVRNPERRGLIGARIAGAKVATGEVLVFFDSHIEVNYNWLPP 203
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI D KI T P++D I++ T+ + ++ RG F+W YK+ LPE K
Sbjct: 204 LLEPIAIDSKISTCPIVDSIEHSTFAYSGGHQEGS--RGGFDWRFYYKQLPVLPEDSLDK 261
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY++P GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 262 ----SLPYRNPVMMGGLFAINTKFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVP 317
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 318 CSRVAHIFR 326
>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
Length = 633
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/237 (38%), Positives = 139/237 (58%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + + PGE GK +P + E N+ S+ IS +R++ D+R E C++
Sbjct: 123 PTVRESRGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 182
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
YP LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE
Sbjct: 183 KHYPSKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 242
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 243 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 302
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I T+E+ + D + G F W + ++ +P+RE +R +R
Sbjct: 303 NRRTVVCPIIDVISDDTFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDR 356
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 237 LGKQLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPL 296
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I T+E+ + D + G F W + ++ +P+RE +R
Sbjct: 297 LARIVQNRRTVVCPIIDVISDDTFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRN 353
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 354 NDRTAPLRTPTMAGGLFSIDKEYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 413
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 414 SHVGHVFR 421
>gi|195167889|ref|XP_002024765.1| GL22638 [Drosophila persimilis]
gi|194108170|gb|EDW30213.1| GL22638 [Drosophila persimilis]
Length = 676
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/188 (49%), Positives = 120/188 (63%), Gaps = 7/188 (3%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +LE YI E F+ VR++R R GLI RS GA+ + EV++FLD+H E NWLPP
Sbjct: 257 LHAELELYIKEHFSKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPP 316
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI +++ P ID ID+ T+ +R+ D RG F+W YK L + + K
Sbjct: 317 LLEPIAKNKRTAVCPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK-- 371
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
Y ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 372 -YPADPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 430
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 431 SRIGHIYR 438
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 98/225 (43%), Positives = 138/225 (61%), Gaps = 5/225 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GEGGKA L + + + E G N S+ IS +R++PD+R + C+ DY +L
Sbjct: 150 GIGEGGKAAKLEDEATLEQERRMSLENGFNALLSDSISVNRSLPDIRHKLCRQKDYLANL 209
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRF 327
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS + L +LE YI + F
Sbjct: 210 PTVSVIIIFYNEYLSVLMRSVHSLINRSPKELLKEIILVDDFSDRDYLHAELELYIKEHF 269
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ VR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 270 SKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPPLLEPIAKNKRTAV 329
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
P ID ID+ T+ +R+ D RG F+W YK L + + K
Sbjct: 330 CPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK 371
>gi|125977364|ref|XP_001352715.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
gi|54641464|gb|EAL30214.1| GA15243 [Drosophila pseudoobscura pseudoobscura]
Length = 676
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 93/188 (49%), Positives = 120/188 (63%), Gaps = 7/188 (3%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +LE YI E F+ VR++R R GLI RS GA+ + EV++FLD+H E NWLPP
Sbjct: 257 LHAELELYIKEHFSKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPP 316
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI +++ P ID ID+ T+ +R+ D RG F+W YK L + + K
Sbjct: 317 LLEPIAKNKRTAVCPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK-- 371
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
Y ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 372 -YPADPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 430
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 431 SRIGHIYR 438
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 98/225 (43%), Positives = 138/225 (61%), Gaps = 5/225 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GEGGKA L + + + E G N S+ IS +R++PD+R + C+ DY +L
Sbjct: 150 GIGEGGKAAKLEDEATLEQERRMSLENGFNALLSDSISVNRSLPDIRHKLCRQKDYLANL 209
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRF 327
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS + L +LE YI + F
Sbjct: 210 PTVSVIIIFYNEYLSVLMRSVHSLINRSPKELLKEIILVDDFSDRDYLHAELELYIKEHF 269
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ VR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 270 SKIVRVVRLPNRTGLIGARSAGARNATAEVLLFLDSHVEANYNWLPPLLEPIAKNKRTAV 329
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
P ID ID+ T+ +R+ D RG F+W YK L + + K
Sbjct: 330 CPFIDVIDHATFNYRA---QDEGARGAFDWEFYYKRLPLLDEDLK 371
>gi|195429102|ref|XP_002062603.1| GK16570 [Drosophila willistoni]
gi|194158688|gb|EDW73589.1| GK16570 [Drosophila willistoni]
Length = 679
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 92/184 (50%), Positives = 116/184 (63%), Gaps = 8/184 (4%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE+YI VR++R T+R GLI RS GA+ + GEV++FLD+H E NWLPPLL PI
Sbjct: 262 LENYIAEHFKNVRVVRLTKRTGLIGARSEGARNATGEVLIFLDSHVEANYNWLPPLLEPI 321
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKYNS 124
+ + P ID ID+ + +R+ D RG F+W YK L PE K+ S
Sbjct: 322 AINERTAVCPFIDVIDHSNFNYRA---QDEGARGAFDWEFFYKRLPLLPE----DLKHPS 374
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
EP+KSP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCSR+
Sbjct: 375 EPFKSPVMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRVG 434
Query: 185 SLIR 188
+ R
Sbjct: 435 HIYR 438
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 94/214 (43%), Positives = 130/214 (60%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GE GK L + + + + E G N S+ IS +R+I D+R + C+ +Y L
Sbjct: 151 GIGEQGKPAKLDDENQRELERKMSLENGFNALLSDSISVNRSIADIRHKSCRKKEYLAKL 210
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + L+EIILVDDFS +A L LE+YI
Sbjct: 211 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAYLYVPLENYIAEHF 270
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R T+R GLI RS GA+ + GEV++FLD+H E NWLPPLL PI + +
Sbjct: 271 KNVRVVRLTKRTGLIGARSEGARNATGEVLIFLDSHVEANYNWLPPLLEPIAINERTAVC 330
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 331 PFIDVIDHSNFNYRA---QDEGARGAFDWEFFYK 361
>gi|260794623|ref|XP_002592308.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
gi|229277524|gb|EEN48319.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
Length = 374
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 88/190 (46%), Positives = 125/190 (65%), Gaps = 4/190 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L DY+ KV+LIR +REGLIR R GA +R +V+VFLD+HCE + WL PL
Sbjct: 110 LKAQLTDYVAGLP-KVKLIRTHQREGLIRARLLGASHARADVLVFLDSHCECNIGWLEPL 168
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R + PVID ID++T+E+R + RG F+W ++++ ++P K+R
Sbjct: 169 LDRIVQNRSHVVTPVIDVIDFKTFEYRHL--AIIQVRG-FDWRLIFRWEKIPASYEKRRG 225
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ +P SPT AGGLFA+D+ +F LG YD G+ +WGGEN ELSF+IW CGG++E +PCS
Sbjct: 226 LSVDPILSPTMAGGLFAIDKEYFHHLGLYDTGMEIWGGENLELSFRIWQCGGTLEIMPCS 285
Query: 182 RIVSLIRPVF 191
R+ + R F
Sbjct: 286 RVGHVFRQRF 295
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 94/228 (41%), Positives = 134/228 (58%), Gaps = 8/228 (3%)
Query: 211 PGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
PGE G+ +L R + +Y N S I RTIPD R CK +Y +
Sbjct: 1 PGELGQGVVLRNLSPQDRKQLEEGYKKYAFNEFASTKIPLTRTIPDGRHWLCKSKEYDVS 60
Query: 268 -LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LP SVI+ FHNE +S+LMRTVHS+++ P++ L E+I+VDD S L +L DY+
Sbjct: 61 RLPAVSVIICFHNEAWSTLMRTVHSVLRTAPSELLTEVIMVDDDSQYDHLKAQLTDYVAG 120
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
KV+LIR +REGLIR R GA +R +V+VFLD+HCE + WL PLL I +R +
Sbjct: 121 LP-KVKLIRTHQREGLIRARLLGASHARADVLVFLDSHCECNIGWLEPLLDRIVQNRSHV 179
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
PVID ID++T+E+R + RG F+W ++++ ++P K+R
Sbjct: 180 VTPVIDVIDFKTFEYRHL--AIIQVRG-FDWRLIFRWEKIPASYEKRR 224
>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 460
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + + V+++R +R GLIR R GA +G+VI FLDAHCE NWL PL
Sbjct: 64 LGKQLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPL 123
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR + PVID I +T+E+ S + G F W + ++ +P+RE +R
Sbjct: 124 LARIAEDRTRVVCPVIDVISDETFEYISASDLTW---GGFNWKLNFRWYRVPQRELDRRG 180
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLFA+D+ +F+ELG YD G+ +WGGEN ELSF+IWMCGG +E VPC
Sbjct: 181 GDRTLPVRTPTMAGGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPC 240
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 241 SHVGHVFR 248
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 83/180 (46%), Positives = 115/180 (63%), Gaps = 3/180 (1%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
CK YP LP SV++VFHNE +S+L+RTVHS+I+ +P LEEIILVDD S + L +
Sbjct: 7 CKDKVYPEKLPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGK 66
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+LEDY+ + + V+++R +R GLIR R GA +G+VI FLDAHCE NWL PLLA
Sbjct: 67 QLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLAR 126
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
I DR + PVID I +T+E+ S + G F W + ++ +P+RE +R +R
Sbjct: 127 IAEDRTRVVCPVIDVISDETFEYISASDLTW---GGFNWKLNFRWYRVPQRELDRRGGDR 183
>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
partial [Papio anubis]
Length = 444
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 96 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 155
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 156 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 212
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 213 GDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 272
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 273 SHVGHVFR 280
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 86/180 (47%), Positives = 116/180 (64%), Gaps = 3/180 (1%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 39 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 98
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 99 TLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 158
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 159 IKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 215
>gi|296210176|ref|XP_002751862.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Callithrix jacchus]
Length = 443
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 87/183 (47%), Positives = 121/183 (66%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ PVID IDY+T E Y+P RG F+W + +K + + E
Sbjct: 240 LLHAIAKDPKMVVCPVIDVIDYRTLE----YKPSPVVRGAFDWNLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP AGG+FA+ R +F E+G YD + WGGEN ELS +IWMCGG + +PC
Sbjct: 296 EGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSLRIWMCGGQLFIIPC 355
Query: 181 SRI 183
SR+
Sbjct: 356 SRV 358
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/189 (46%), Positives = 123/189 (65%), Gaps = 4/189 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
+YG N+ S + +R +PD R + C YP+ LP AS+++ F+NE F++L RT+ S+
Sbjct: 99 KYGFNIIISRSLGIEREVPDTRNKMCLQKRYPVRLPTASIVICFYNEEFNALFRTMSSVW 158
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
TP LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA +
Sbjct: 159 NLTPHHLLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHA 218
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
G+V+VFLD+HCEV WL PLL I D K++ PVID IDY+T E Y+P RG
Sbjct: 219 SGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDYRTLE----YKPSPVVRG 274
Query: 414 IFEWGMLYK 422
F+W + +K
Sbjct: 275 AFDWNLQFK 283
>gi|403276614|ref|XP_003929989.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Saimiri boliviensis boliviensis]
Length = 566
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 106/250 (42%), Positives = 149/250 (59%), Gaps = 11/250 (4%)
Query: 186 LIRPVFKA----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
++ P FKA D ++E P E + + E G ++ E + D ++ NM
Sbjct: 66 VLEPQFKANKIDDVIDSHIEEPEEGHLKFSSELGMIFN--ERDQELRDLGYQKHAFNMLI 123
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RTVHS+I RTPA L
Sbjct: 124 SNRLGYHRDVPDTRNAACKDKFYPSDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLL 183
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
E+ILVDD S DL +L++Y+Q++ GK+++IRNT+REGLIR R GA + G+V+VF
Sbjct: 184 HEVILVDDDSDFDDLKGELDEYVQKYLPGKIKIIRNTKREGLIRGRMIGAAHATGDVLVF 243
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD+HCEV + WL PLLA I D+ + PVID I T Y RG F WG+
Sbjct: 244 LDSHCEVNVMWLQPLLAAIREDQHTVVCPVIDIISADTL----AYSSSPIVRGGFNWGLH 299
Query: 421 YKENELPERE 430
++ + +P E
Sbjct: 300 FRWDLVPLSE 309
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 79/170 (46%), Positives = 111/170 (65%), Gaps = 5/170 (2%)
Query: 2 LDKKLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y++++ GK+++IRNT+REGLIR R GA + G+V+VFLD+HCEV + WL P
Sbjct: 198 LKGELDEYVQKYLPGKIKIIRNTKREGLIRGRMIGAAHATGDVLVFLDSHCEVNVMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D+ + PVID I T Y RG F WG+ ++ + +P E
Sbjct: 258 LLAAIREDQHTVVCPVIDIISADTL----AYSSSPIVRGGFNWGLHFRWDLVPLSELGGA 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWM 170
+ + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF++ +
Sbjct: 314 EGATTPIKSPTMAGGLFAMDRQYFHELGQYDSGMDIWGGENLEISFRVIL 363
>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 576
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 94/188 (50%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE Y+ +F+ KVR++R +REGLIR R RGA S+G VI +LD+HCE W+ PL
Sbjct: 137 LKADLEIYMRQFS-KVRILRLEKREGLIRARIRGAAISKGSVITYLDSHCECLEGWVEPL 195
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREAKKR 120
L I + K + PVID ID T+E+ Y + G F+W + + + +PE++ K R
Sbjct: 196 LDRIKRNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAIPEKDRKGR 253
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + +P KSPT AGGLF++DR FF ELG YDPGL +WGGEN ELSFKIWMCGG +E VPC
Sbjct: 254 R-DIDPVKSPTMAGGLFSIDRTFFEELGSYDPGLDIWGGENLELSFKIWMCGGILEIVPC 312
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 313 SHVGHIFR 320
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 108/254 (42%), Positives = 146/254 (57%), Gaps = 15/254 (5%)
Query: 192 KADGKLGNLEPPLEPYKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETS 242
K + L N + P+ YK G PGEGGKA L R D + N S
Sbjct: 6 KPNKALFNPDSPI--YKSGDENQPGEGGKAVVIDRNKLSLDERKIYDDGFTKNAFNQYIS 63
Query: 243 NHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
+ IS R++P EECK Y DLP SVI+ FHNE +S L+RTVHS+++RTP L
Sbjct: 64 DMISIHRSLPSYIDEECKNEKYTSDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLA 123
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
E+ILVDDFS A L LE Y+++F+ KVR++R +REGLIR R RGA S+G VI +LD
Sbjct: 124 ELILVDDFSDMAHLKADLEIYMRQFS-KVRILRLEKREGLIRARIRGAAISKGSVITYLD 182
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLY 421
+HCE W+ PLL I + K + PVID ID T+E+ Y + G F+W + +
Sbjct: 183 SHCECLEGWVEPLLDRIKRNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQF 240
Query: 422 KENELPEREAKKRK 435
+ +PE++ K R+
Sbjct: 241 NWHAIPEKDRKGRR 254
>gi|224044641|ref|XP_002188932.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Taeniopygia guttata]
Length = 608
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 101/245 (41%), Positives = 142/245 (57%), Gaps = 12/245 (4%)
Query: 192 KADGKLGN-----LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHIS 246
+ D K+GN ++ P++ E E G ++ E + D ++ NM SN +
Sbjct: 71 QKDNKIGNSFGNHIQDPVKGEIEFSPEMGMIFN--EEDQEVRDLGYQKHAFNMLISNRLG 128
Query: 247 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 306
+ R +PD R +C+ YP DLP ASV++ F+NE S+L+RTVHS++ RTPA L EIIL
Sbjct: 129 YHREVPDTRDAKCREKSYPADLPSASVVICFYNEALSALLRTVHSVLDRTPAHLLHEIIL 188
Query: 307 VDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 365
VDD S ADL + L +Y++ + +L+RN +REGLIR R GA + G+V+VFLD+HC
Sbjct: 189 VDDNSELADLKKDLSEYVKTQLPRTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHC 248
Query: 366 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 425
EV WL PLLAPI D + + PVID I T Y RG F WG+ +K +
Sbjct: 249 EVNEMWLQPLLAPIREDPRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDL 304
Query: 426 LPERE 430
+P E
Sbjct: 305 VPLAE 309
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 120/189 (63%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL K L +Y++ + +L+RN +REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 197 DLKKDLSEYVKTQLPRTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHCEVNEMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLLAPI D + + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 257 PLLAPIREDPRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLAELEG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P KSPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PEGATAPIKSPTMAGGLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
>gi|387017710|gb|AFJ50973.1| Polypeptide N-acetylgalactosaminyltransferase 11-like [Crotalus
adamanteus]
Length = 608
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 118/189 (62%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL + L+ Y+ + KV+L+RN REGLIR R GA + G+V+VFLD+HCEV WL
Sbjct: 197 DLKEDLDIYLTKDLPNKVKLVRNENREGLIRGRMVGASHATGKVLVFLDSHCEVNEMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI R+ + PVID I T Y RG F WG+ +K + +P E +
Sbjct: 257 PLLTPIQESRRTVVCPVIDIISADTL----TYSSSPVVRGGFNWGLHFKWDLVPLLEMEG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P KSPT AGGLFAMDR +F LG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PEQATAPIKSPTMAGGLFAMDREYFNALGQYDSGMDIWGGENLEISFRIWMCGGKLVIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 91/200 (45%), Positives = 123/200 (61%), Gaps = 5/200 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R +CK YP DLP AS+I+ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRDRKCKEKIYPHDLPSASIIICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 347
+HS++ RTP+ L EIILVDD S ADL + L+ Y+ + KV+L+RN REGLIR R
Sbjct: 171 IHSVLDRTPSHLLHEIILVDDRSELADLKEDLDIYLTKDLPNKVKLVRNENREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + G+V+VFLD+HCEV WL PLL PI R+ + PVID I T Y
Sbjct: 231 VGASHATGKVLVFLDSHCEVNEMWLQPLLTPIQESRRTVVCPVIDIISADTL----TYSS 286
Query: 408 DHHYRGIFEWGMLYKENELP 427
RG F WG+ +K + +P
Sbjct: 287 SPVVRGGFNWGLHFKWDLVP 306
>gi|449507774|ref|XP_004186276.1| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
partial [Taeniopygia guttata]
Length = 402
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 126/188 (67%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y+++ +R++R +R GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 2 LKASLENYVKKLGVPIRILRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 61
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE ++RK
Sbjct: 62 LSRIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMERRK 118
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR++F E+G YD G+ +WGGEN E+SF+IW CGGS+E V C
Sbjct: 119 GDRTLPVRTPTMAGGLFSIDRSYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTC 178
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 179 SHVGHVFR 186
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 77/123 (62%), Gaps = 3/123 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L LE+Y+++ +R++R +R GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 2 LKASLENYVKKLGVPIRILRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 61
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
L+ I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE ++RK
Sbjct: 62 LSRIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMERRK 118
Query: 436 YNR 438
+R
Sbjct: 119 GDR 121
>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
4 [Ciona intestinalis]
Length = 582
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 121/183 (66%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL D+++ +VRL+R T+REGL+R R GA ++GEV+ FLD HCE WL PL
Sbjct: 188 LKDKLADFLKAL-ARVRLVRTTKREGLVRARLLGASLAKGEVLTFLDCHCECVEGWLEPL 246
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREAKKR 120
L I D ++ VPVID ID+ T+E+ Y H + G F+W + ++ + +P+ E K+R
Sbjct: 247 LERIMEDESVIVVPVIDTIDWNTFEY---YYGGHEPQIGGFDWRLTFQWHTIPDHERKRR 303
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F +G YD G+ +WGGEN ELSF+ WMCGG +E +PC
Sbjct: 304 KSPVDPIRSPTMAGGLFAVSKRYFTRIGTYDAGMEIWGGENLELSFRTWMCGGKLETIPC 363
Query: 181 SRI 183
S +
Sbjct: 364 SHV 366
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 104/244 (42%), Positives = 144/244 (59%), Gaps = 14/244 (5%)
Query: 200 LEPPLEPYKEGPGEGGKAYHL----PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
L P + GPGEGG A L PE + D S+ Y +N S IS R + D R
Sbjct: 67 LSRPADIDPRGPGEGGSAVRLLNLSPEVSKQQED-SIQTYAVNQFVSERISLHRRLQDPR 125
Query: 256 MEECKY---WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
E CK +DY LP SV++ F+NEG+S+L+RTV S++ +P L EIILVDD+S
Sbjct: 126 HEMCKSRRPFDY-RSLPTTSVVIAFYNEGWSTLIRTVFSVLHNSPDALLTEIILVDDYSD 184
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
K L KL D+++ +VRL+R T+REGL+R R GA ++GEV+ FLD HCE WL
Sbjct: 185 KVYLKDKLADFLKAL-ARVRLVRTTKREGLVRARLLGASLAKGEVLTFLDCHCECVEGWL 243
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREA 431
PLL I D ++ VPVID ID+ T+E+ Y H + G F+W + ++ + +P+ E
Sbjct: 244 EPLLERIMEDESVIVVPVIDTIDWNTFEY---YYGGHEPQIGGFDWRLTFQWHTIPDHER 300
Query: 432 KKRK 435
K+RK
Sbjct: 301 KRRK 304
>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
Length = 693
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 93/194 (47%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLEDY + V+++R +REGLIR R GAK ++ VI +LD+HCE WL PL
Sbjct: 265 LKKKLEDYFADYP-MVKIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPL 323
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID ID T EF HYR G F+W + + + +PE
Sbjct: 324 LDRIARNSTTVVCPVIDVIDDATLEF--------HYRDSSGVNVGGFDWNLQFSWHSVPE 375
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ SEP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 376 REKKRHNSTSEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 435
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 436 LEIVPCSHVGHIFR 449
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 131/237 (55%), Gaps = 21/237 (8%)
Query: 209 EGPGEGGKAYHLPEAY----RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK-YWD 263
+ GE GK LP+ + A D N S+ IS R++PD R CK
Sbjct: 153 DNAGEMGKPVVLPKEMAPDMKKAVDEGWTNNAFNQYVSDLISVHRSLPDPRDAWCKDSAR 212
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y +LPK VI+ FHNE +S L+RTVHS++ R+P++ + EIILVDD+S L +KLEDY
Sbjct: 213 YLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPSELIGEIILVDDYSDMTHLKKKLEDY 272
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+ V+++R +REGLIR R GAK ++ VI +LD+HCE WL PLL I +
Sbjct: 273 FADY-PMVKIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPLLDRIARNS 331
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKK 433
+ PVID ID T EF HYR G F+W + + + +PERE K+
Sbjct: 332 TTVVCPVIDVIDDATLEF--------HYRDSSGVNVGGFDWNLQFSWHSVPEREKKR 380
>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
Length = 621
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 92/237 (38%), Positives = 139/237 (58%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + K PGE GK +P + E N+ S+ IS +R++ D+R E C++
Sbjct: 111 PTIRESKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHENCRH 170
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE
Sbjct: 171 KHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLE 230
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 231 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVAGEVITFLDAHCECTEGWLEPLLARIVQ 290
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P+RE +R +R
Sbjct: 291 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRNNDR 344
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 225 LGKQLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVAGEVITFLDAHCECTEGWLEPL 284
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P+RE +R
Sbjct: 285 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPQREMARRN 341
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 342 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 401
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 402 SHVGHVFR 409
>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
Length = 499
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 121/188 (64%), Gaps = 3/188 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL + L D+I N KV+LIR R+GL+ R RGA+E+RGEV+ FLD+HCE WL P
Sbjct: 108 DLKQPLIDHISMLN-KVKLIRMPSRQGLVPARLRGAEEARGEVLTFLDSHCEATPGWLEP 166
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I DR+ + PVI+ I+ +FR H RG F W + + +PE E K+R
Sbjct: 167 LLVRIAEDRRNVVCPVIEVINAD--DFRYQASDVIHERGGFTWDLFFTWKAIPEAEKKRR 224
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++ +SPT AGGLFA+ + +F +LG YD + +WGGEN E+SF+IWMCGG +E VPC
Sbjct: 225 KDETDYIRSPTMAGGLFAIHKKYFYDLGSYDSKMEIWGGENLEMSFRIWMCGGQLEIVPC 284
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 285 SRVGHVFR 292
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 97/228 (42%), Positives = 135/228 (59%), Gaps = 5/228 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYPLD 267
G G+ G+A LP ++ + + N+ S+ IS DR + D+R +CK + YP
Sbjct: 1 GLGDLGEAATLPTRFKEHAAHAFDNHSFNVMLSDRISLDRRLKDVRGPKCKRKHKLYPRA 60
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+ FHNE S L+RTVHS++ +P + + +IILVDD+S DL Q L D+I
Sbjct: 61 LPTTSVIICFHNEALSVLLRTVHSVLNESPPRLIADIILVDDYSEYDDLKQPLIDHISML 120
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
N KV+LIR R+GL+ R RGA+E+RGEV+ FLD+HCE WL PLL I DR+ +
Sbjct: 121 N-KVKLIRMPSRQGLVPARLRGAEEARGEVLTFLDSHCEATPGWLEPLLVRIAEDRRNVV 179
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVI+ I+ +FR H RG F W + + +PE E K+RK
Sbjct: 180 CPVIEVINAD--DFRYQASDVIHERGGFTWDLFFTWKAIPEAEKKRRK 225
>gi|149634819|ref|XP_001513114.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Ornithorhynchus anatinus]
Length = 608
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 91/189 (48%), Positives = 119/189 (62%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
DL L++YI +++IRN +REGLIR R GA ++ GEV+VFLD+HCEV WL
Sbjct: 197 DLKSGLDEYIRLHLPRNIQVIRNEKREGLIRGRMIGAAQATGEVLVFLDSHCEVNAMWLQ 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI DR+ + PVID I T + S RG F WG+ +K + +P E
Sbjct: 257 PLLVPIREDRRTVVCPVIDIIGADTLAYSS----SPVVRGGFNWGLHFKWDLVPLSELGG 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +P
Sbjct: 313 PGRATAPIKSPTMAGGLFAMNREYFRELGQYDSGMDIWGGENLEISFRIWMCGGQLFIIP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVGHIFR 381
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 93/203 (45%), Positives = 124/203 (61%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + R +PD R ECK YP LP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGSHRDVPDTRDAECKEKSYPPHLPAASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 347
+HS++ RTPA L EIILVDD S DL L++YI+ +++IRN +REGLIR R
Sbjct: 171 IHSVLDRTPAHLLHEIILVDDNSELDDLKSGLDEYIRLHLPRNIQVIRNEKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA ++ GEV+VFLD+HCEV WL PLL PI DR+ + PVID I T + S
Sbjct: 231 IGAAQATGEVLVFLDSHCEVNAMWLQPLLVPIREDRRTVVCPVIDIIGADTLAYSS---- 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
Length = 670
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 92/194 (47%), Positives = 118/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY + KV+++R +REGLIR R GA+ ++ VI +LD+HCE WL PL
Sbjct: 276 LKKQLEDYFASY-PKVKIVRGPQREGLIRARLLGAEYAKSPVITYLDSHCECAEGWLEPL 334
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID ID T EF HYR G F+W + + + +PE
Sbjct: 335 LDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGFDWNLQFSWHAVPE 386
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ SEP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 387 REKKRHNSTSEPVYSPTMAGGLFSIDRKFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 446
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 447 LEIVPCSHVGHIFR 460
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 100/246 (40%), Positives = 138/246 (56%), Gaps = 21/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
L+PP ++ PGE GK LP+ + A D + N S+ IS R++PD R
Sbjct: 155 LDPPAANLEDSPGELGKPVILPKDMSPEMKKAVDDGWTKNAFNQYVSDLISVRRSLPDPR 214
Query: 256 MEECKYWD-YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LPK VI+ FHNE +S L+RTVHS++ R+P + + EIILVDDFS
Sbjct: 215 DAWCKDSALYLSNLPKTDVIICFHNEAWSVLIRTVHSVLDRSPPELIGEIILVDDFSDMP 274
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV+++R +REGLIR R GA+ ++ VI +LD+HCE WL P
Sbjct: 275 HLKKQLEDYFASY-PKVKIVRGPQREGLIRARLLGAEYAKSPVITYLDSHCECAEGWLEP 333
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID ID T EF HYR G F+W + + + +P
Sbjct: 334 LLDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGFDWNLQFSWHAVP 385
Query: 428 EREAKK 433
ERE K+
Sbjct: 386 EREKKR 391
>gi|297289711|ref|XP_001107363.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Macaca mulatta]
Length = 449
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 89/191 (46%), Positives = 123/191 (64%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 27 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 86
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 87 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 146
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 147 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 202
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 203 RGAFDWNLQFK 213
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 72/108 (66%), Gaps = 4/108 (3%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 110 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 169
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 108
LL I D K++ P+ID ID +T E Y+P RG F+W + +K
Sbjct: 170 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVVRGAFDWNLQFK 213
>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDYI + V +++ R GLIR R GA ++G+V+ FLD+HCEV WL P+
Sbjct: 103 LGKSLEDYIAKLPVSVYVVKMKGRSGLIRARMAGAAVAKGQVLTFLDSHCEVTEGWLEPM 162
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR PVID I T++++ +P G F W + +K +P+RE +RK
Sbjct: 163 LARIAEDRTTSVCPVIDVISDDTFQYQHGNDPQ---MGGFGWSLFFKWFPVPKREQIRRK 219
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +EP + T AGGLFA+D+++F ELG YDPG +WGGEN ELSFK+WMCGG +E++PC
Sbjct: 220 GDPTEPVRVSTMAGGLFAIDKSYFEELGQYDPGFNIWGGENLELSFKLWMCGGKLEFIPC 279
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 280 SHVGHVFR 287
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 237 MNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRT 296
N+ S+ I+ +R++PD+R C YP LP SVILV+HNE S+L+R VHSII R+
Sbjct: 24 FNLMASDRIALNRSLPDVRPRGCANKVYPKKLPTTSVILVYHNEARSTLLRNVHSIINRS 83
Query: 297 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 356
P L EIILVDD S + L + LEDYI + V +++ R GLIR R GA ++G+
Sbjct: 84 PHDLLAEIILVDDASDQEHLGKSLEDYIAKLPVSVYVVKMKGRSGLIRARMAGAAVAKGQ 143
Query: 357 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 416
V+ FLD+HCEV WL P+LA I DR PVID I T++++ +P G F
Sbjct: 144 VLTFLDSHCEVTEGWLEPMLARIAEDRTTSVCPVIDVISDDTFQYQHGNDPQ---MGGFG 200
Query: 417 WGMLYKENELPEREAKKRK 435
W + +K +P+RE +RK
Sbjct: 201 WSLFFKWFPVPKREQIRRK 219
>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
laevis]
gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
Length = 582
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 106/253 (41%), Positives = 149/253 (58%), Gaps = 16/253 (6%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSN 243
L +PV+K +PP +P PGE GKA L + + S+ +Y +N+ S+
Sbjct: 63 LKQPVYK--------KPPPDP--NMPGEWGKAARLELGPTEKKMQEESIEKYALNIYLSD 112
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SVI+ F+NE S+L+RT+HS+++ +PA L
Sbjct: 113 QISLHRHIMDNRMYECKSKTFSYRKLPTTSVIIAFYNEALSTLLRTIHSVLESSPAVLLR 172
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDDFS K L +LEDYI + +VRLIR T+REGL+R R GA + G+V+ FLD
Sbjct: 173 EIILVDDFSDKVYLKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLD 231
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
HCE WL PLL I + + PVID ID+ T+EF + G F+W + ++
Sbjct: 232 CHCECVTGWLEPLLERIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQ 289
Query: 423 ENELPEREAKKRK 435
+ +PE+E ++RK
Sbjct: 290 WHAVPEKERQRRK 302
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 118/182 (64%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LEDYI + +VRLIR T+REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLDCHCECVTGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + PVID ID+ T+EF + G F+W + ++ + +PE+E ++RK
Sbjct: 245 LERIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHAVPEKERQRRK 302
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG++E PCS
Sbjct: 303 SRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEIEPCS 362
Query: 182 RI 183
+
Sbjct: 363 HV 364
>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
(Silurana) tropicalis]
gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
Length = 582
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 106/253 (41%), Positives = 149/253 (58%), Gaps = 16/253 (6%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSN 243
L +PV+K +PP +P PGE GKA L + D S+ +Y +N+ S+
Sbjct: 63 LRQPVYK--------KPPPDP--NMPGEWGKAARLELGPTEKKMQDESIEKYALNIYLSD 112
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SV++ F+NE S+L+RT+HS+++ +PA L
Sbjct: 113 QISLHRHIMDNRMYECKSKTFNYRKLPTTSVVIAFYNEALSTLLRTIHSVLETSPAVLLR 172
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDDFS K L +LEDYI + +VRLIR T+REGL+R R GA + G+V+ FLD
Sbjct: 173 EIILVDDFSDKVYLKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLD 231
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
HCE WL PLL I + + PVID ID+ T+EF + G F+W + ++
Sbjct: 232 CHCECISGWLEPLLQRIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQ 289
Query: 423 ENELPEREAKKRK 435
+ +PE+E ++RK
Sbjct: 290 WHAVPEKERQRRK 302
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 118/182 (64%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LEDYI + +VRLIR T+REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKSQLEDYIGGLD-RVRLIRTTKREGLVRARIIGATYAIGDVLTFLDCHCECISGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + PVID ID+ T+EF + G F+W + ++ + +PE+E ++RK
Sbjct: 245 LQRIGENETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHAVPEKERQRRK 302
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG++E PCS
Sbjct: 303 SRIDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEIEPCS 362
Query: 182 RI 183
+
Sbjct: 363 HV 364
>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
Length = 613
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 93/186 (50%), Positives = 117/186 (62%), Gaps = 8/186 (4%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
K L+DY+ KVR I ER GLIR R GA+ + +V++FLD+H E +NWLPPLL
Sbjct: 203 KPLDDYLAANLTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLE 262
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKY 122
PI D K P ID I Y+T+E+R+ D RG F+W YK L PE ++
Sbjct: 263 PIAQDYKTCVCPFIDVIQYETFEYRA---QDEGARGAFDWEFFYKRLPLLPE----DLEH 315
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+EP+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR
Sbjct: 316 PTEPFKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSR 375
Query: 183 IVSLIR 188
+ + R
Sbjct: 376 VGHIYR 381
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 124/215 (57%), Gaps = 3/215 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK L A + G N S+ I+ DR +PD+R CK Y D
Sbjct: 93 RRGTGEQGKPAFLTAAESDNYEKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD 152
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++ FHNE +++L+RT S++ R+P L+E+ILVDD S+K + L+DY+
Sbjct: 153 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 212
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR I ER GLIR R GA+ + +V++FLD+H E +NWLPPLL PI D K
Sbjct: 213 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 272
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W YK
Sbjct: 273 CPFIDVIQYETFEYRA---QDEGARGAFDWEFFYK 304
>gi|395838452|ref|XP_003792129.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Otolemur garnettii]
Length = 869
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 91/191 (47%), Positives = 121/191 (63%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N TS ++ F R +PD R + C Y LP ASVI+ FHNE F++L RT+ S
Sbjct: 283 LSKYGFNTITSTNVGFKREVPDTRHKMCLQNHYSTHLPTASVIICFHNEEFNALFRTMFS 342
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP LEEIILVDD S DL +KL+ ++ F GK++LIRN +REGLIR R GA
Sbjct: 343 VVNLTPNSLLEEIILVDDMSEFDDLKEKLDYVLEVFRGKIKLIRNQKREGLIRGRMIGAA 402
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID T E+R+
Sbjct: 403 RASGDVLVFLDSHCEVNKGWLEPLLYSIAKDHKMVVCPLIDVIDETTLEYRA----SPVV 458
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 459 RGAFDWELKFK 469
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 88/192 (45%), Positives = 122/192 (63%), Gaps = 4/192 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ +E F GK++LIRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 366 DLKEKLDYVLEVFRGKIKLIRNQKREGLIRGRMIGAARASGDVLVFLDSHCEVNKGWLEP 425
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID T E+R+ RG F+W + +K + + E
Sbjct: 426 LLYSIAKDHKMVVCPLIDVIDETTLEYRA----SPVVRGAFDWELKFKWDNVFSYEMDGP 481
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SP AGG+FA+ R +F E+G YD G+ +WGGEN ELS +IWMCGG + +PC
Sbjct: 482 DRPIKPIRSPAMAGGIFAIYRHYFNEIGQYDKGMDLWGGENLELSLRIWMCGGQLFIIPC 541
Query: 181 SRIVSLIRPVFK 192
SR+ + + FK
Sbjct: 542 SRVGHITKKQFK 553
>gi|338724473|ref|XP_001495495.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Equus caballus]
Length = 448
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 89/191 (46%), Positives = 122/191 (63%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
YG N S + +R +PD R + C YP LP AS+++ FHNE F++L+RTV S
Sbjct: 102 FSRYGFNAMISQRLGNEREVPDTRNKMCLQKHYPTRLPSASIVICFHNEEFNALLRTVSS 161
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++K TP + LEEIILVDD S DL +KL+ +++ F GK++LIRN ++EGLIR R GA
Sbjct: 162 VMKLTPYRVLEEIILVDDMSEFDDLKEKLDHHLEFFRGKIKLIRNKKKEGLIRARLIGAS 221
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID IDY T + Y+P
Sbjct: 222 LASGDVLVFLDSHCEVNKVWLEPLLLAIAKDPKMVVCPLIDVIDYMTLK----YKPSPVV 277
Query: 412 RGIFEWGMLYK 422
RG F W + +K
Sbjct: 278 RGAFNWHLQFK 288
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 86/183 (46%), Positives = 119/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN ++EGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDHHLEFFRGKIKLIRNKKKEGLIRARLIGASLASGDVLVFLDSHCEVNKVWLEP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID IDY T + Y+P RG F W + +K + + E
Sbjct: 245 LLLAIAKDPKMVVCPLIDVIDYMTLK----YKPSPVVRGAFNWHLQFKWDNVFSYEMDGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P +SP AGG+FA+DR +F E+G YD + +WGGEN ELS +IWMCGG + +PC
Sbjct: 301 EGPIAPIRSPAMAGGIFAIDRQYFNEIGRYDKDMNLWGGENLELSLRIWMCGGQLFVLPC 360
Query: 181 SRI 183
SR+
Sbjct: 361 SRV 363
>gi|242020636|ref|XP_002430758.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515955|gb|EEB18020.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 623
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 125/188 (66%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + + +Y+ F KV+L ++ +R GLIR R G++++ G+V+VFLD+H EV +NWL P
Sbjct: 205 LHRNISNYVNTNFTDKVKLFKSKKRLGLIRARIFGSRKASGDVLVFLDSHIEVNVNWLQP 264
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I +K + VP+ID I+ T++ Y RG F WG+ +K LP+ K
Sbjct: 265 LLSRIVDSKKNVVVPIIDIINADTFK----YSSSPLVRGGFNWGLHFKWENLPKSTLKSN 320
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P SPT AGGLFA++RA+F ELG YD G+ +WGGEN E+SF+IWMCGG++E +PC
Sbjct: 321 EDFVKPILSPTMAGGLFAINRAYFKELGEYDNGMNIWGGENLEISFRIWMCGGNLELIPC 380
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 381 SRVGHVFR 388
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 129/209 (61%), Gaps = 5/209 (2%)
Query: 225 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 284
R D + N+ S+ I R +PD R CK Y +LP ASVI+ F+NE F++
Sbjct: 114 RRKRDEGYKNFAFNILVSDAIGIHRELPDTRHNLCKKKKYSKNLPTASVIICFYNEHFTT 173
Query: 285 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLI 343
L+R+++S+++RTP+ L+EIILV+DFS A L + + +Y+ F KV+L ++ +R GLI
Sbjct: 174 LLRSIYSVLERTPSYLLKEIILVNDFSDLAGLHRNISNYVNTNFTDKVKLFKSKKRLGLI 233
Query: 344 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 403
R R G++++ G+V+VFLD+H EV +NWL PLL+ I +K + VP+ID I+ T++
Sbjct: 234 RARIFGSRKASGDVLVFLDSHIEVNVNWLQPLLSRIVDSKKNVVVPIIDIINADTFK--- 290
Query: 404 VYEPDHHYRGIFEWGMLYKENELPEREAK 432
Y RG F WG+ +K LP+ K
Sbjct: 291 -YSSSPLVRGGFNWGLHFKWENLPKSTLK 318
>gi|47228512|emb|CAG05332.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 123/202 (60%), Gaps = 18/202 (8%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRG-------------EVIVF 46
+L L+ Y++ GKVR++RN +REGLIR R GA ++ G EV+VF
Sbjct: 172 ELKGDLDRYVQAELRGKVRVLRNQKREGLIRGRMIGAAQASGVSPDPQILDLCSGEVLVF 231
Query: 47 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 106
LD+HCEV WL PLLAPI DR+ + PVID I T Y P RG F WG+
Sbjct: 232 LDSHCEVNQMWLQPLLAPIRQDRRTVVCPVIDIISADTLS----YSPSPIVRGGFNWGLH 287
Query: 107 YKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSF 166
+K + +P E K + P +SPT AGGLFA++R +F E+G YD G+ +WGGEN E+SF
Sbjct: 288 FKWDPVPPAELKSPQGPVGPIRSPTMAGGLFAINRKYFNEIGQYDAGMDIWGGENLEISF 347
Query: 167 KIWMCGGSIEWVPCSRIVSLIR 188
+IWMCGG + +PCSR+ + R
Sbjct: 348 RIWMCGGQLFIIPCSRVGHIFR 369
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 97/226 (42%), Positives = 130/226 (57%), Gaps = 18/226 (7%)
Query: 222 EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEG 281
EA + D+ + N+ S + + R +PD R +C+ YP DLP+ASV++ F NE
Sbjct: 79 EADQQLRDSGYHRHAFNLLISTRLGYHRELPDTRDPQCRDRTYPGDLPRASVVICFFNEA 138
Query: 282 FSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTERE 340
S+L+RTVHS++ RTP L EIILVDD+S +L L+ Y+Q GKVR++RN +RE
Sbjct: 139 LSALLRTVHSVLDRTPPFLLHEIILVDDYSELEELKGDLDRYVQAELRGKVRVLRNQKRE 198
Query: 341 GLIRTRSRGAKES-------------RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
GLIR R GA ++ GEV+VFLD+HCEV WL PLLAPI DR+ +
Sbjct: 199 GLIRGRMIGAAQASGVSPDPQILDLCSGEVLVFLDSHCEVNQMWLQPLLAPIRQDRRTVV 258
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
PVID I T Y P RG F WG+ +K + +P E K
Sbjct: 259 CPVIDIISADTLS----YSPSPIVRGGFNWGLHFKWDPVPPAELKS 300
>gi|291243602|ref|XP_002741690.1| PREDICTED: polypeptide GalNAc transferase 5-like [Saccoglossus
kowalevskii]
Length = 753
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 99/242 (40%), Positives = 147/242 (60%), Gaps = 10/242 (4%)
Query: 198 GNLEP-----PLEP--YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 250
GN+ P PLE Y + PGEGG L A+ ++ N+ S+ I+ +RT
Sbjct: 221 GNILPQLGHRPLEQPWYPDSPGEGGMPVDLTPQEARLSKATFYQFEFNIIASDKIALNRT 280
Query: 251 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
+PD R C++ +YP LPK SVI+VFHNE +++L+RTV S+I R+P Q LEEI+LVDD
Sbjct: 281 LPDSRPVACEHREYPHILPKTSVIIVFHNEAWTTLLRTVISVIDRSPWQLLEEILLVDDA 340
Query: 311 SSKAD--LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
S+ L +L++Y+ + R+IR +R GLI+ R RG +E+RGEV+ FLD+HCE
Sbjct: 341 STSEKYWLQSELDEYVAKLPVITRVIRTGKRVGLIQGRLRGVEEARGEVLTFLDSHCECN 400
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
+ WL PLL+ I +DR + P +D I +T+ + + +P+ G F W + +K LP+
Sbjct: 401 IGWLEPLLSEIVNDRTTVVAPNLDVISDKTFGY-TFIKPEQTMIGGFGWLVDFKWYSLPK 459
Query: 429 RE 430
RE
Sbjct: 460 RE 461
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 119/188 (63%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L++Y+ + R+IR +R GLI+ R RG +E+RGEV+ FLD+HCE + WL PL
Sbjct: 348 LQSELDEYVAKLPVITRVIRTGKRVGLIQGRLRGVEEARGEVLTFLDSHCECNIGWLEPL 407
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I +DR + P +D I +T+ + + +P+ G F W + +K LP+RE +
Sbjct: 408 LSEIVNDRTTVVAPNLDVISDKTFGY-TFIKPEQTMIGGFGWLVDFKWYSLPKRERLRVN 466
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P ++PT AGGLFA+D +F +G YDPG WG EN ELSF++W CGG++E VPC
Sbjct: 467 NDMSRPLRTPTIAGGLFAIDADYFHRIGLYDPGFDTWGAENLELSFRVWQCGGTLEIVPC 526
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 527 SHVGHVFR 534
>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
Length = 682
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 90/190 (47%), Positives = 121/190 (63%), Gaps = 8/190 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K LEDY+ +F GKVR+IR R GLIR R +GA + G+V+ +LD+HCE WL PL
Sbjct: 278 LKKPLEDYMSQF-GKVRIIRLENRMGLIRARLKGASVATGKVLTYLDSHCECMNRWLEPL 336
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR---GIFEWGMLYKENELPEREAK 118
L I + + PVID I+ +T + Y H R G F WG+++ + LP+R+ +
Sbjct: 337 LDRIAQNSTNVVTPVIDTINLETLQ----YHLSSHRRLSVGGFNWGLVFNWHILPDRDYQ 392
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
K +P SPT AGGLF++DR +F +LGGYDPG +WG EN E+SFKIWMCGG +E V
Sbjct: 393 AMKSRIDPIPSPTMAGGLFSIDRGYFEKLGGYDPGFDIWGSENLEISFKIWMCGGRLEVV 452
Query: 179 PCSRIVSLIR 188
PCS + + R
Sbjct: 453 PCSHVGHIFR 462
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 100/236 (42%), Positives = 135/236 (57%), Gaps = 13/236 (5%)
Query: 207 YKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
YK+G PGE G P+ R D N SN IS R++P+ E C+
Sbjct: 166 YKQGDPNQPGEFGTGKLSPKE-RKLFDEGFKRNSFNEYVSNMISIHRSLPNNTDELCQKA 224
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
Y DLP SVI+ FHNE +S L+RTVHS+++RTP L+EIILVDDFS L + LED
Sbjct: 225 SYRNDLPDTSVIICFHNEAWSVLLRTVHSVLERTPDHLLKEIILVDDFSDFDHLKKPLED 284
Query: 323 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
Y+ +F GKVR+IR R GLIR R +GA + G+V+ +LD+HCE WL PLL I +
Sbjct: 285 YMSQF-GKVRIIRLENRMGLIRARLKGASVATGKVLTYLDSHCECMNRWLEPLLDRIAQN 343
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYR---GIFEWGMLYKENELPEREAKKRK 435
+ PVID I+ +T + Y H R G F WG+++ + LP+R+ + K
Sbjct: 344 STNVVTPVIDTINLETLQ----YHLSSHRRLSVGGFNWGLVFNWHILPDRDYQAMK 395
>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
Length = 591
Score = 177 bits (449), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 93/186 (50%), Positives = 117/186 (62%), Gaps = 8/186 (4%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
K L+DY+ KVR I ER GLIR R GA+ + +V++FLD+H E +NWLPPLL
Sbjct: 181 KPLDDYLAANLTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLE 240
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKRKY 122
PI D K P ID I Y+T+E+R+ D RG F+W YK L PE ++
Sbjct: 241 PIAQDYKTCVCPFIDVIQYETFEYRA---QDEGARGAFDWEFFYKRLPLLPE----DLEH 293
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+EP+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR
Sbjct: 294 PTEPFKSPVMAGGLFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSR 353
Query: 183 IVSLIR 188
+ + R
Sbjct: 354 VGHIYR 359
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 124/215 (57%), Gaps = 3/215 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE GK L A + G N S+ I+ DR +PD+R CK Y D
Sbjct: 71 RRGTGEQGKPAFLTAAESDNYEKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD 130
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++ FHNE +++L+RT S++ R+P L+E+ILVDD S+K + L+DY+
Sbjct: 131 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 190
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR I ER GLIR R GA+ + +V++FLD+H E +NWLPPLL PI D K
Sbjct: 191 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 250
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID I Y+T+E+R+ D RG F+W YK
Sbjct: 251 CPFIDVIQYETFEYRA---QDEGARGAFDWEFFYK 282
>gi|307186272|gb|EFN71935.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Camponotus
floridanus]
Length = 667
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +K+ YI+ F KVRL + +REGLIR R GA+++ G+V++FLD+H EV W+ P
Sbjct: 245 LHEKIHAYIKNNFGAKVRLFKTKKREGLIRARVFGARKATGDVLIFLDSHIEVNEIWIEP 304
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I + I+ +PVID I+ T++ Y RG F WG+ +K + LP K
Sbjct: 305 LLSRIAYSKTIVPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTLKHE 360
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P KSPT AGGLFA+DR +F+++G YD G+ VWGGEN E+SF+IWMCGGSIE +PC
Sbjct: 361 NDFVKPIKSPTMAGGLFAIDREYFIKIGEYDTGMDVWGGENLEISFRIWMCGGSIELIPC 420
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 421 SRVGHVFR 428
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/236 (38%), Positives = 141/236 (59%), Gaps = 11/236 (4%)
Query: 197 LGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
L L+P P++P +G E G ++ + + Y N+ S+++ R +P
Sbjct: 124 LKQLQPAPVKPAVTLDQGLDELGMVKNMEDQQKRT--IGYKNYAFNVLISDNLGVRRNVP 181
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
D R + CK Y +LP AS+I+ F+NE +++L+R++HSI++RTPA L EIILV+DFS
Sbjct: 182 DTRHKLCKTQKYSSNLPNASIIICFYNEHYTTLLRSLHSILERTPAALLHEIILVNDFSD 241
Query: 313 KADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L +K+ YI+ F KVRL + +REGLIR R GA+++ G+V++FLD+H EV W
Sbjct: 242 SDILHEKIHAYIKNNFGAKVRLFKTKKREGLIRARVFGARKATGDVLIFLDSHIEVNEIW 301
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ PLL+ I + I+ +PVID I+ T++ Y RG F WG+ +K + LP
Sbjct: 302 IEPLLSRIAYSKTIVPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLP 353
>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
Length = 630
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P +E K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPTVEEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LEDY+ + + ++R +R GLIR R GA+ G+VI FLDAHCE WL PLLA I
Sbjct: 238 LEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARI 297
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRNNDR 353
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + + ++R +R GLIR R GA+ G+VI FLDAHCE WL PL
Sbjct: 234 LGKQLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPL 293
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 294 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRN 350
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 351 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 410
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 411 SHVGHVFR 418
>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
Length = 630
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P +E K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPTVEEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LEDY+ + + ++R +R GLIR R GA+ G+VI FLDAHCE WL PLLA I
Sbjct: 238 LEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARI 297
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRNNDR 353
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + + ++R +R GLIR R GA+ G+VI FLDAHCE WL PL
Sbjct: 234 LGKQLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPL 293
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 294 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMSRRN 350
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 351 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 410
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 411 SHVGHVFR 418
>gi|432934421|ref|XP_004081934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oryzias latipes]
Length = 758
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 123/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + L+ Y+ +F KVR++R ER+GLIR R GA + GEV+ FLD+H E + WL PL
Sbjct: 361 LKEPLDKYMSQF-PKVRIVRLKERQGLIRARLAGAAVATGEVLTFLDSHVECNVGWLEPL 419
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L IY DR+ + PVI+ I+ + + + D+ RGIF+W +++ N L E +K
Sbjct: 420 LERIYLDRRKVPCPVIEVINDKDMSYMLI---DNFQRGIFKWPLVFGWNALSEDYIRKHN 476
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
S+P + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFKIWMCGG IE +PC
Sbjct: 477 ITVSDPIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPC 536
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 537 SRVGHIFR 544
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 91/222 (40%), Positives = 130/222 (58%), Gaps = 4/222 (1%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
G+ G+ LP + E N+ S+ I DR IPD R E C DLP
Sbjct: 257 GQFGRGVILPSSEDEEVRKRWDEGHFNVYLSDRIPVDRAIPDTRPEVCSQAVVHDDLPST 316
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
SVI F +E +S+L+R+VHS++ R+P L+EIILVDDFS+K L + L+ Y+ +F KV
Sbjct: 317 SVIFCFVDEVWSTLLRSVHSVLNRSPPHLLKEIILVDDFSTKDYLKEPLDKYMSQF-PKV 375
Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 391
R++R ER+GLIR R GA + GEV+ FLD+H E + WL PLL IY DR+ + PVI
Sbjct: 376 RIVRLKERQGLIRARLAGAAVATGEVLTFLDSHVECNVGWLEPLLERIYLDRRKVPCPVI 435
Query: 392 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+ I+ + + + D+ RGIF+W +++ N L E +K
Sbjct: 436 EVINDKDMSYMLI---DNFQRGIFKWPLVFGWNALSEDYIRK 474
>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
gallus]
Length = 556
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/231 (42%), Positives = 147/231 (63%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R++ CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLDGCKTKVYPDE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTVHS++ R+P + L EIILVDD S + L LE+Y+++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVHSVVARSPRRLLAEIILVDDASEREFLKASLENYVKKL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V+++R +R GLIR R RGA +RG+VI FLDAHCE WL PLLA I+ DR+ +
Sbjct: 174 EVPVKILRMEQRSGLIRARLRGAAAARGQVITFLDAHCECTRGWLEPLLARIWEDRRTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y+++ V+++R +R GLIR R RGA +RG+VI FLDAHCE WL PL
Sbjct: 162 LKASLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVITFLDAHCECTRGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I+ DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIWEDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++W CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|357625888|gb|EHJ76177.1| hypothetical protein KGM_07902 [Danaus plexippus]
Length = 535
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 119/188 (63%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L+DY+ + KV++IR R GLI R GA+++ +V+VFLD+H E +NWLPPL
Sbjct: 120 LKEQLDDYVSKHMPKVKIIRLKSRSGLIAARLAGAEKATADVLVFLDSHTEANVNWLPPL 179
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKR 120
L PI + K + P ID + Y T+ +R+ D RG F+W + YK LP EA
Sbjct: 180 LEPIALNYKTVVCPFIDVVAYDTFAYRA---QDEGARGAFDWELFYKRLPVLPADEANM- 235
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
EP+ SP AGGLFA+ R FF ELGGYDPGL +WGGE +ELSFK+W CGG + PC
Sbjct: 236 ---PEPFPSPVMAGGLFAISRVFFWELGGYDPGLDIWGGEQYELSFKLWQCGGKMLDAPC 292
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 293 SRVGHIYR 300
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 129/223 (57%), Gaps = 4/223 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G HLP G N S+ I +R++PD+R C+ Y LP
Sbjct: 14 GIGEHGLPAHLPIKDSEIEKDLYAVNGFNGALSDKIPLNRSLPDIRHPGCQNRLYIESLP 73
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SV++ FHNE +S+L+RT +S++ R+P ++E+ LVDD S+K L ++L+DY+ +
Sbjct: 74 TVSVVVPFHNEHWSTLLRTAYSVLNRSPTFLIKEVFLVDDASTKDFLKEQLDDYVSKHMP 133
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KV++IR R GLI R GA+++ +V+VFLD+H E +NWLPPLL PI + K + P
Sbjct: 134 KVKIIRLKSRSGLIAARLAGAEKATADVLVFLDSHTEANVNWLPPLLEPIALNYKTVVCP 193
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREA 431
ID + Y T+ +R+ D RG F+W + YK LP EA
Sbjct: 194 FIDVVAYDTFAYRA---QDEGARGAFDWELFYKRLPVLPADEA 233
>gi|15207811|dbj|BAB62930.1| hypothetical protein [Macaca fascicularis]
Length = 373
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/191 (46%), Positives = 123/191 (64%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 27 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 86
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 87 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 146
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 147 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 202
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 203 RGAFDWNLQFK 213
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 120/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 110 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 169
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 170 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVVRGAFDWNLQFKWDNVFSYEMDGP 225
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP +GG+FA+ R +F E+G YD + WGGEN ELS +IWMCGG + +PC
Sbjct: 226 EGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSLRIWMCGGQLFIIPC 285
Query: 181 SRI 183
SR+
Sbjct: 286 SRV 288
>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
Length = 630
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 353
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 234 LGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 293
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 294 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 350
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 351 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 410
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 411 SHVGHVFR 418
>gi|321469963|gb|EFX80941.1| hypothetical protein DAPPUDRAFT_224457 [Daphnia pulex]
Length = 498
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 96/270 (35%), Positives = 152/270 (56%), Gaps = 22/270 (8%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
++ ++E+++ + F KV+L++ +REGLIR R GAK++ G+V++FLD+HCEV W+
Sbjct: 71 NIHNEVEEFVGKNFPPKVQLLKTMKREGLIRARIFGAKKATGQVLIFLDSHCEVNREWVQ 130
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+A I +R + P+ID I+ T+++ S RG F WG+ +K + LP+ K
Sbjct: 131 PLIARIQENRTFVVTPIIDIINSDTFQYTS----SPLVRGGFNWGLHFKWDSLPDDTLKT 186
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ +P SPT AGGLFA++R +F ++G YD G+ VWGGEN E+SF+IWMCGG +E +P
Sbjct: 187 NEDFVKPILSPTMAGGLFAIEREYFFDIGEYDAGMNVWGGENLEISFRIWMCGGRLEIIP 246
Query: 180 CSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNM 239
CSR+ + R PY GE Y+ A D + +
Sbjct: 247 CSRVGHVFR--------------RRRPYGSPNGEDTMTYNSLRAAHVWLDDYIEHFFHVR 292
Query: 240 ETSNHISFDRTIPDLRME---ECKYWDYPL 266
+ H+S+ P R+ +C+ +D+ L
Sbjct: 293 PDARHVSYGDVGPRQRLRRLMKCQSFDWYL 322
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 114/184 (61%), Gaps = 10/184 (5%)
Query: 254 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
L M C+ +YP SVI+ F+NE S+L RTVHS++ +TPA+ L E++LVDD SS
Sbjct: 7 LLMSLCQTQEYPKLYLNTSVIICFYNEDPSTLFRTVHSVLDQTPAELLHEVLLVDD-SSD 65
Query: 314 ADLDQKLEDYIQRFNGK-----VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
+L + + ++ F GK V+L++ +REGLIR R GAK++ G+V++FLD+HCEV
Sbjct: 66 GELLGNIHNEVEEFVGKNFPPKVQLLKTMKREGLIRARIFGAKKATGQVLIFLDSHCEVN 125
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
W+ PL+A I +R + P+ID I+ T+++ S RG F WG+ +K + LP+
Sbjct: 126 REWVQPLIARIQENRTFVVTPIIDIINSDTFQYTS----SPLVRGGFNWGLHFKWDSLPD 181
Query: 429 REAK 432
K
Sbjct: 182 DTLK 185
>gi|402865469|ref|XP_003896945.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Papio
anubis]
Length = 475
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/191 (46%), Positives = 123/191 (64%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 129 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 188
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 189 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 248
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 249 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 304
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 305 RGAFDWNLQFK 315
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 120/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 212 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 271
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 272 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVVRGAFDWNLQFKWDNVFSYEMDGP 327
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP +GG+FA+ R +F E+G YD + WGGEN ELS +IWMCGG + +PC
Sbjct: 328 EGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSLRIWMCGGQLFIIPC 387
Query: 181 SRI 183
SR+
Sbjct: 388 SRV 390
>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
Length = 481
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 99/240 (41%), Positives = 137/240 (57%), Gaps = 9/240 (3%)
Query: 200 LEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 258
P L Y+ GE G+A +P Y+ D N S+ IS RT+PD R
Sbjct: 5 FRPTLPHYRRNSYGENGQAVVVPAVYKEESDRLFSRNRFNQWASDRISLHRTLPDQRPAA 64
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS---SKAD 315
C+ +P +LP AS+++VFHNE +S+L+RTVHS++ R+ + + EIILVDD S +
Sbjct: 65 CRKQLFPTNLPPASLVIVFHNEAWSTLLRTVHSVLDRSDPRLMREIILVDDCSEIKGHEE 124
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L LE YIQ+ V+L+RN +R+GLIR R RG KE VIVFLDAHCEV WL PL
Sbjct: 125 LQAPLEKYIQKLK-IVKLVRNKKRQGLIRARLRGYKEVTSPVIVFLDAHCEVVDGWLEPL 183
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LA I+ +R + P ID I ++ + Y RG+F W + ++ LP E ++RK
Sbjct: 184 LARIHENRSNVVCPEIDVISFENFG----YSYASGIRGVFNWNLHFRWRTLPAVEQQRRK 239
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 119/188 (63%), Gaps = 5/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L LE YI++ V+L+RN +R+GLIR R RG KE VIVFLDAHCEV WL P
Sbjct: 124 ELQAPLEKYIQKLK-IVKLVRNKKRQGLIRARLRGYKEVTSPVIVFLDAHCEVVDGWLEP 182
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I+ +R + P ID I ++ + Y RG+F W + ++ LP E ++R
Sbjct: 183 LLARIHENRSNVVCPEIDVISFENFG----YSYASGIRGVFNWNLHFRWRTLPAVEQQRR 238
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F ++G YD + +WGGEN E+SF+IW CGG++E +PC
Sbjct: 239 KSVIDPIRSPTMAGGLFAIHKKYFEDIGLYDDEMDIWGGENLEMSFRIWQCGGNLEIIPC 298
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 299 SHVGHVFR 306
>gi|355561193|gb|EHH17879.1| hypothetical protein EGK_14362 [Macaca mulatta]
gi|355748153|gb|EHH52650.1| hypothetical protein EGM_13120 [Macaca fascicularis]
Length = 443
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/191 (46%), Positives = 123/191 (64%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 272
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 273 RGAFDWNLQFK 283
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/184 (40%), Positives = 106/184 (57%), Gaps = 6/184 (3%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 240 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVVRGAFDWNLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAG-GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ ++P + + GL M FLE+ LL G L+ IWMCGG + +P
Sbjct: 296 EGPTKPIRDYSFKFLGLMKMFCYLFLEVFNTVMQLLSRRGFTM-LARLIWMCGGQLFIIP 354
Query: 180 CSRI 183
CSR+
Sbjct: 355 CSRV 358
>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
Length = 630
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 353
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 234 LGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 293
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 294 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 350
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF++WMCGG +E PC
Sbjct: 351 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRVWMCGGVLEIAPC 410
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 411 SRVGHVFR 418
>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 617
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 105 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 164
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 165 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 224
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 225 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 284
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 285 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 340
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 221 LGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 280
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 281 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 337
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 338 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 397
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 398 SHVGHVFR 405
>gi|390341984|ref|XP_003725567.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Strongylocentrotus purpuratus]
Length = 654
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L K+L+ Y+ + FNG V +I N +REGLIR R+ GA+ + G+V++FLD+HCEV WL P
Sbjct: 236 LKKELDQYMSKNFNGLVHVIHNGQREGLIRARTIGARYATGDVLMFLDSHCEVNEQWLEP 295
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I +D + P+ID I++ T+ Y +G F WGM +K + + R+ +
Sbjct: 296 LLERIKADSHTVVCPIIDIINHDTF----AYTASPLVKGGFNWGMHFKWDTIRSRQLVGK 351
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P +SPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IW CGG +E VPC
Sbjct: 352 EDYVKPIESPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGGKLEIVPC 411
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 412 SRVGHVFR 419
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 91/251 (36%), Positives = 143/251 (56%), Gaps = 6/251 (2%)
Query: 182 RIVSLIRPVFKADGKLGNLEPPLEPYKEGP-GEGGKAYHLPEAYRAAGDASLGEYGMNME 240
++++ V+K GK + L+ +G GE + R+ D ++ N
Sbjct: 101 KVLNPKNAVYKKQGKPMRAKQRLKADAQGDWGEDELGMVRTDEERSIRDGGYRQHAFNEL 160
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S I F R + D R CKY Y +LP S+++ F+NE +S+L+RTV+S++ RTP +
Sbjct: 161 ISQRIGFHRNVTDTRNPLCKYQVYSEELPTVSIVICFYNEAWSTLLRTVYSVLDRTPRRL 220
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
+ E+ILVDDFS L ++L+ Y+ + FNG V +I N +REGLIR R+ GA+ + G+V++
Sbjct: 221 IHELILVDDFSELTHLKKELDQYMSKNFNGLVHVIHNGQREGLIRARTIGARYATGDVLM 280
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 419
FLD+HCEV WL PLL I +D + P+ID I++ T+ Y +G F WGM
Sbjct: 281 FLDSHCEVNEQWLEPLLERIKADSHTVVCPIIDIINHDTF----AYTASPLVKGGFNWGM 336
Query: 420 LYKENELPERE 430
+K + + R+
Sbjct: 337 HFKWDTIRSRQ 347
>gi|115497012|ref|NP_001070072.1| polypeptide N-acetylgalactosaminyltransferase 10 precursor [Danio
rerio]
gi|115313072|gb|AAI24253.1| Zgc:153114 [Danio rerio]
Length = 261
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 92/191 (48%), Positives = 122/191 (63%), Gaps = 6/191 (3%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ G GE G+ Y L +A R D + E G N+ S+ I+ +R++PD+R CK Y D
Sbjct: 77 RTGAGEQGRPYPLTDAERV--DQAYRENGFNIFVSDRIALNRSLPDIRHPNCKLKLYTAD 134
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+ FHNEG+SSL+RTVHS++ R+P + EIILVDDFS K L LE Y+ R
Sbjct: 135 LPNTSVIIPFHNEGWSSLLRTVHSVLDRSPPSLIAEIILVDDFSDKGHLKAPLEQYMVRL 194
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVR++R +REGLIRTR GA +RG+VI FLD+HCE +NWLPPLL I +
Sbjct: 195 -PKVRILRTQKREGLIRTRLLGAAAARGQVITFLDSHCEANVNWLPPLLDRIAQN---TN 250
Query: 388 VPVIDGIDYQT 398
V+D + +T
Sbjct: 251 SSVLDSFNQRT 261
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/83 (48%), Positives = 52/83 (62%), Gaps = 4/83 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE Y+ R KVR++R +REGLIRTR GA +RG+VI FLD+HCE +NWLPPL
Sbjct: 183 LKAPLEQYMVRL-PKVRILRTQKREGLIRTRLLGAAAARGQVITFLDSHCEANVNWLPPL 241
Query: 62 LAPIYSDRKIMTVPVIDGIDYQT 84
L I + V+D + +T
Sbjct: 242 LDRIAQN---TNSSVLDSFNQRT 261
>gi|15207947|dbj|BAB62998.1| hypothetical protein [Macaca fascicularis]
Length = 443
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/191 (46%), Positives = 123/191 (64%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 272
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 273 RGAFDWNLQFK 283
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 120/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 240 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVVRGAFDWNLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP +GG+FA+ R +F E+G YD + WGGEN ELS +IWMCGG + +PC
Sbjct: 296 EGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSLRIWMCGGQLFIIPC 355
Query: 181 SRI 183
SR+
Sbjct: 356 SRV 358
>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
Length = 675
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 353
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 234 LGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 293
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 294 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 350
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 351 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 410
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 411 SHVGHVFR 418
>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Metaseiulus occidentalis]
Length = 583
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 94/228 (41%), Positives = 133/228 (58%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G+ +PE A + N+ S I+ +R++PD+R+ EC+ YP LP
Sbjct: 78 PGENGEGVEIPEKETALKNEKFKINQFNLLASERIALNRSLPDVRLAECRKKTYPDRLPT 137
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +++L+RTVHSII+ +P + + EIILVDD S L QKLEDY+ +
Sbjct: 138 TSIVIVFHNEAWTTLLRTVHSIIQMSPRELIAEIILVDDASEFDHLGQKLEDYVAKLPVP 197
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V ++R +R GLIR R GA+ G+VI FLDAHCE WL PLLA I D + PV
Sbjct: 198 VHVLRTGKRSGLIRARLIGAETVTGQVITFLDAHCECTEGWLEPLLARIAEDNTRVVCPV 257
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I + F V D + G F W + ++ +P+RE +R +R
Sbjct: 258 IDVISDEN--FAYVPASDQTWGG-FNWKLNFRWYRVPQRENDRRGGDR 302
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 118/188 (62%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLEDY+ + V ++R +R GLIR R GA+ G+VI FLDAHCE WL PL
Sbjct: 183 LGQKLEDYVAKLPVPVHVLRTGKRSGLIRARLIGAETVTGQVITFLDAHCECTEGWLEPL 242
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I D + PVID I + F V D + G F W + ++ +P+RE +R
Sbjct: 243 LARIAEDNTRVVCPVIDVISDEN--FAYVPASDQTWGG-FNWKLNFRWYRVPQRENDRRG 299
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLFAMD+A+F +LG YD G+ +WGGEN E+SF+IWMCGG++E V C
Sbjct: 300 GDRTLPVRTPTMAGGLFAMDKAYFEKLGKYDEGMDIWGGENLEMSFRIWMCGGTLEIVTC 359
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 360 SHVGHVFR 367
>gi|348585731|ref|XP_003478624.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Cavia porcellus]
Length = 937
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 540 LKDKLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 598
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RG+F W M + +P E AK R
Sbjct: 599 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPEVVAKNR 655
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE VPC
Sbjct: 656 IKETDVIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIVPC 715
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 716 SRVGHIFR 723
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 91/246 (36%), Positives = 134/246 (54%), Gaps = 11/246 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V K D L +P + PG+ G+ +P E N+ S+ I DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVIVPPGKEKEAQKRWKEGNFNVYLSDLIPVDR 473
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R C LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNQLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPQHLIKEILLVDD 533
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS+K L KL+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 534 FSTKDYLKDKLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PE 428
WL PLL +Y RK + PVI+ I+ + + +V D+ RG+F W M + + PE
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPE 649
Query: 429 REAKKR 434
AK R
Sbjct: 650 VVAKNR 655
>gi|449281639|gb|EMC88675.1| Polypeptide N-acetylgalactosaminyltransferase-like protein 2
[Columba livia]
Length = 640
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/250 (40%), Positives = 146/250 (58%), Gaps = 18/250 (7%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLG--EYGMNMETSNHI 245
RP +A+G + + P P + P EG AAG LG +G N S I
Sbjct: 123 RPEARAEGDAESPQLPARPLQ--PAEGA----------AAGQRPLGLETHGFNEALSERI 170
Query: 246 SFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
S R +P++R C +Y LP ASVI+ FH+E +S+L+RTVHSI+ P L++II
Sbjct: 171 SLRRDLPEVRHPLCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSIMDTAPKASLKDII 230
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 365
LVDD S + L L +YI + +G V+LIR+ +R G+IR R GA + G+V+VF+D+HC
Sbjct: 231 LVDDLSQQGPLKSALSEYISKLDG-VKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHC 289
Query: 366 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE 425
E WL PLLA + S+R + PVID ID++T+++ Y +RG+F+W + +
Sbjct: 290 ECQKGWLEPLLARLSSNRNSVVSPVIDVIDWKTFQY---YHSVGLHRGVFDWKLDFHWEP 346
Query: 426 LPEREAKKRK 435
+PERE K R+
Sbjct: 347 VPEREEKVRQ 356
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 82/190 (43%), Positives = 120/190 (63%), Gaps = 4/190 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +YI + +G V+LIR+ +R G+IR R GA + G+V+VF+D+HCE WL PL
Sbjct: 241 LKSALSEYISKLDG-VKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPL 299
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA + S+R + PVID ID++T+++ Y +RG+F+W + + +PERE K R+
Sbjct: 300 LARLSSNRNSVVSPVIDVIDWKTFQY---YHSVGLHRGVFDWKLDFHWEPVPEREEKVRQ 356
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SP AG + AMDR +F G YD + +WG EN ELS + W+CGGS+E +PCS
Sbjct: 357 SPISPIRSPVVAGAVVAMDRHYFQNTGAYDSDMTMWGAENLELSIRTWLCGGSVEIIPCS 416
Query: 182 RIVSLIRPVF 191
R+ + R F
Sbjct: 417 RVGHVYRNHF 426
>gi|395739263|ref|XP_002818743.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 [Pongo
abelii]
Length = 437
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP ASV++ F+NE F++L RTV S
Sbjct: 97 LLQYGFNVIISRSLGIEREVPDTRSKMCLQKHYPARLPTASVVICFYNEEFNALFRTVSS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F GKV++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGAS 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNREWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVV 272
Query: 412 RGIFEWGM 419
RG F+W +
Sbjct: 273 RGTFDWNL 280
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/184 (37%), Positives = 101/184 (54%), Gaps = 12/184 (6%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GKV++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNREWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + + + E
Sbjct: 240 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPVVRGTFDWNLHLNGDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK-IWMCGGSIEWVP 179
+ ++P S + + + M L G + + + L IWMCGG + +P
Sbjct: 296 EGPTKPIWSSSDSQHILLM-------LNGRNSKVKPNHTAHLYLCMHDIWMCGGQLFIIP 348
Query: 180 CSRI 183
CSR+
Sbjct: 349 CSRV 352
>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 657
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 118/187 (63%), Gaps = 2/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ + KV++IR ++REGLIR R GA + V FLD+HCE + WL PL
Sbjct: 228 LKKQLEDYMSHYP-KVKIIRASKREGLIRARLLGATRATAPVTTFLDSHCECTVGWLEPL 286
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D + PVID ID T E+ + + G F+W + + + +PERE K+ K
Sbjct: 287 LDRIAKDPTTVVCPVIDVIDDTTLEY-NFRDSGGVNVGGFDWNLQFNWHAVPEREKKRHK 345
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP SPT AGGLFA+D+ FF +G YD G +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 346 NTAEPVWSPTMAGGLFAIDKNFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCS 405
Query: 182 RIVSLIR 188
+ + R
Sbjct: 406 HVGHIFR 412
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 94/229 (41%), Positives = 131/229 (57%), Gaps = 7/229 (3%)
Query: 212 GEGGKAYHLPEAY----RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY-WDYPL 266
GE G+ HLP + D + N S+ IS R +PD R + CK +
Sbjct: 119 GEMGRPVHLPANLTGEIKKLVDEGWSKNAFNQYVSDLISVHRKLPDPRDKWCKEPGRFLQ 178
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
DLP+ SV++ FHNE +S L+RTVHS++ R+P L+EIILVDDFS L ++LEDY+
Sbjct: 179 DLPQTSVVICFHNEAWSVLLRTVHSVLDRSPPNLLKEIILVDDFSDMIHLKKQLEDYMSH 238
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
+ KV++IR ++REGLIR R GA + V FLD+HCE + WL PLL I D +
Sbjct: 239 YP-KVKIIRASKREGLIRARLLGATRATAPVTTFLDSHCECTVGWLEPLLDRIAKDPTTV 297
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID ID T E+ + + G F+W + + + +PERE K+ K
Sbjct: 298 VCPVIDVIDDTTLEY-NFRDSGGVNVGGFDWNLQFNWHAVPEREKKRHK 345
>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
Length = 630
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 118 LAPTVKEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 177
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 178 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 237
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 238 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 297
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 298 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 353
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 234 LGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 293
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 294 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 350
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 351 NDRTAPLRTPTMAGGLFSIDKDYFYELGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 410
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 411 SHVGHVFR 418
>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
Length = 536
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 139/239 (58%), Gaps = 3/239 (1%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
L P ++ K PGE GK +P + E N+ S+ IS +R++ D+R E C
Sbjct: 24 LAPSVQEAKGKPGEMGKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGC 83
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++
Sbjct: 84 RRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQ 143
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 144 LEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARI 203
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 204 VQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 259
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 140 LGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 199
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 200 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 256
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 257 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 316
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 317 SHVGHVFR 324
>gi|344276550|ref|XP_003410071.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5-like
[Loxodonta africana]
Length = 448
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 86/183 (46%), Positives = 119/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK++LIRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 185 DLKEKLDYHLEVFRGKIKLIRNKKREGLIRARLIGASRASGDVLVFLDSHCEVNRVWLEP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID I+ T E Y P RG F W + +K + + E +
Sbjct: 245 LLFAISKDPKVVVCPLIDVINDTTLE----YTPSPVVRGAFNWKLQFKWDNVLSYEMEGP 300
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP AGG+FA+ R +F E+G YD G+ +WGGEN ELS +IWMCGG + +PC
Sbjct: 301 EGPTGPIRSPAMAGGIFAIQRKYFNEIGQYDKGMYLWGGENLELSLRIWMCGGQLFIIPC 360
Query: 181 SRI 183
SR+
Sbjct: 361 SRV 363
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 91/191 (47%), Positives = 120/191 (62%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP ASVI+ FHNE F++L RTV S
Sbjct: 102 LLQYGFNIIISRSLGKEREVPDTRNKMCLEKHYPKYLPTASVIICFHNEEFNALFRTVSS 161
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP LEEIILVDD S DL +KL+ +++ F GK++LIRN +REGLIR R GA
Sbjct: 162 VMNLTPHYILEEIILVDDMSEFDDLKEKLDYHLEVFRGKIKLIRNKKREGLIRARLIGAS 221
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID I+ T E Y P
Sbjct: 222 RASGDVLVFLDSHCEVNRVWLEPLLFAISKDPKVVVCPLIDVINDTTLE----YTPSPVV 277
Query: 412 RGIFEWGMLYK 422
RG F W + +K
Sbjct: 278 RGAFNWKLQFK 288
>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 136/228 (59%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE GK +P + E N+ S+ I +R++ D+R +CK YP LP
Sbjct: 85 PGEMGKPVKIPANQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPAKLPT 144
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L ++LE+Y++
Sbjct: 145 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGRQLEEYVRTLPVP 204
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 205 TFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 264
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I +T+E+ V D + G F W + ++ +P RE ++R ++R
Sbjct: 265 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRNHDR 309
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE+Y+ ++R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 190 LGRQLEEYVRTLPVPTFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 249
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I +T+E+ V D + G F W + ++ +P RE ++R
Sbjct: 250 LARIVLDRKTVVCPIIDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPAREMQRRN 306
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PC
Sbjct: 307 HDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEISPC 366
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 367 SHVGHVFR 374
>gi|308483704|ref|XP_003104053.1| hypothetical protein CRE_01109 [Caenorhabditis remanei]
gi|308258361|gb|EFP02314.1| hypothetical protein CRE_01109 [Caenorhabditis remanei]
Length = 277
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 88/196 (44%), Positives = 128/196 (65%), Gaps = 3/196 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYP 265
+EGPGE GK L G A + ++ MN+ S+ IS DR +PD R++ CK +DY
Sbjct: 72 REGPGEKGKPVVLSGKEAELGQADMKKWFMNVHVSDKISLDRDVPDPRIQACKDIKYDYA 131
Query: 266 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 325
LPK SVI++F +E ++ L+RTVHS+I R+P + L+EIIL+DD S + +L + L+++I+
Sbjct: 132 T-LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEIILLDDNSKRQELQEPLDEHIK 190
Query: 326 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
RF GKVRLIR R GLIR + GA+E+ G++IVFLD+HCE WL P++ I +R
Sbjct: 191 RFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTA 250
Query: 386 MTVPVIDGIDYQTWEF 401
+ P+ID I T +
Sbjct: 251 IVCPMIDSISDSTLAY 266
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 39/87 (44%), Positives = 57/87 (65%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L + L+++I+RF GKVRLIR R GLIR + GA+E+ G++IVFLD+HCE WL P
Sbjct: 180 ELQEPLDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEF 87
++ I +R + P+ID I T +
Sbjct: 240 IVQRISDERTAIVCPMIDSISDSTLAY 266
>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 600
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 135/228 (59%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE GK +P + + E N+ S+ I +R++ D+R +CK Y LP
Sbjct: 87 PGELGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYSAKLPT 146
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RT+ S+I R+P L+EIILVDD S + L ++LEDY+
Sbjct: 147 TSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDHLGKQLEDYVSTLPVS 206
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
++R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I DRK + P+
Sbjct: 207 TFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPI 266
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I +T+E+ V D + G F W + ++ +P RE ++R ++R
Sbjct: 267 IDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPSREMQRRNHDR 311
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY+ ++R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 192 LGKQLEDYVSTLPVSTFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 251
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I +T+E+ V D + G F W + ++ +P RE ++R
Sbjct: 252 LARIVLDRKTVVCPIIDVISDETFEY--VTASDQTWGG-FNWKLNFRWYRVPSREMQRRN 308
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG +E PC
Sbjct: 309 HDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPC 368
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 369 SHVGHVFR 376
>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
Length = 677
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 92/237 (38%), Positives = 136/237 (57%), Gaps = 3/237 (1%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKY 261
P + PGE GK +P + E N+ S+ IS +R++ D+R E C+
Sbjct: 122 PTVREQHGQPGEMGKPVKIPADMKEVMKEKFKENQFNLLASDMISLNRSLTDVRHENCRR 181
Query: 262 WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE 321
Y LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L +KLE
Sbjct: 182 KHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKKLE 241
Query: 322 DYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
DY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I
Sbjct: 242 DYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQ 301
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 302 NRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 355
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLEDY+ + + ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 236 LGKKLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPL 295
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 296 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 352
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 353 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 412
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 413 SHVGHVFR 420
>gi|170591418|ref|XP_001900467.1| Polypeptide N-acetylgalactosaminyltransferase [Brugia malayi]
gi|158592079|gb|EDP30681.1| Polypeptide N-acetylgalactosaminyltransferase, putative [Brugia
malayi]
Length = 575
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/228 (39%), Positives = 138/228 (60%), Gaps = 5/228 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 267
G GE G+ L E + + ++ S+ I+ +R++PD+R +C+ Y +
Sbjct: 28 GAGEDGRPVRLSEEDERLSEDTFVINQFSLVVSDRIALNRSLPDIRKHQCRTKTYLPSSE 87
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+V+HNE FS+LMRTV S+I+R+P + L+EIILVDDFS++ L +LE ++ +
Sbjct: 88 LPTTSVIIVYHNEAFSTLMRTVMSVIQRSPRENLKEIILVDDFSTRTFLKVELEKFVAQL 147
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
++++IR ER GLIR R GA E+ G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 148 GTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 207
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 208 CPVIDIINDRTFAYQKSIEL---FRGGFNWNLQFRWYALPSEMIKSRS 252
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 114/188 (60%), Gaps = 10/188 (5%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE ++ + ++++IR ER GLIR R GA E+ G+V+ FLD+HCE W+ PL
Sbjct: 136 LKVELEKFVAQLGTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPL 195
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +RK + PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 196 LARIKENRKAVVCPVIDIINDRTFAYQKSIEL---FRGGFNWNLQFRWYALPSEMIKSRS 252
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P SPT AGGLF++DR +F E+G YD + +WGGEN E+S +++ E +PC
Sbjct: 253 DDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRVF------EILPC 306
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 307 SHVGHVFR 314
>gi|196001849|ref|XP_002110792.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
gi|190586743|gb|EDV26796.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
Length = 515
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 96/230 (41%), Positives = 132/230 (57%), Gaps = 5/230 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
K+ PGE GKA +P+ + N S+ IS R +PD R + CK YP D
Sbjct: 7 KDAPGENGKAVDIPKEFLIESKRLFERNKFNQWASDKISLHRILPDARPKLCKDKVYPGD 66
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RT+HS++ RT L EIILVDD S +L L+ YI +
Sbjct: 67 LPPTSVVIVFHNEAWSTLLRTIHSVLDRTAPDLLIEIILVDDKSVVKELHAPLDAYIAKL 126
Query: 328 NGKVRLIRNTEREGLIRTRSRGAK--ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
KV++IRN +REGLIR+R G S+ V+ FLDAHCE WL PLL IY+DR
Sbjct: 127 -AKVKIIRNKKREGLIRSRLNGKSFAASKAPVVTFLDAHCEANTGWLEPLLERIYNDRST 185
Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ P ID I + + ++ Y P RGIF W + ++ + E K+R+
Sbjct: 186 VVCPEIDVISDENFAYQ--YGPSGLMRGIFNWDLHFRWRAVSTEEQKRRQ 233
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 83/190 (43%), Positives = 119/190 (62%), Gaps = 5/190 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAK--ESRGEVIVFLDAHCEVGLNWL 58
+L L+ YI + KV++IRN +REGLIR+R G S+ V+ FLDAHCE WL
Sbjct: 114 ELHAPLDAYIAKL-AKVKIIRNKKREGLIRSRLNGKSFAASKAPVVTFLDAHCEANTGWL 172
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PLL IY+DR + P ID I + + ++ Y P RGIF W + ++ + E K
Sbjct: 173 EPLLERIYNDRSTVVCPEIDVISDENFAYQ--YGPSGLMRGIFNWDLHFRWRAVSTEEQK 230
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+R+ +P ++PT AGGLFA++R +F E+G YD + +WGGEN E+SF+IW CGG++E V
Sbjct: 231 RRQSPIDPVRTPTMAGGLFAINRDYFKEIGTYDEEMDIWGGENLEISFRIWQCGGTLEIV 290
Query: 179 PCSRIVSLIR 188
PCS + + R
Sbjct: 291 PCSHVGHVFR 300
>gi|390364218|ref|XP_793815.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like,
partial [Strongylocentrotus purpuratus]
Length = 531
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 118/188 (62%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNG-KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + +EDYI + + ++R++R +R GLI+ R G S GE FLD+H EV + WL P
Sbjct: 142 LQEPIEDYISQIHSVRIRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEP 201
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA + SDR I+ +PV+D I+ T+ + V EP RG F W Y+ +P + KR
Sbjct: 202 LLARLASDRTIVVMPVVDEINKDTFNYNVVPEPLQ--RGGFNWRFEYRWKPIPNYD--KR 257
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
P KSP GGL MDR+FFLELGG+D G+ VWGGEN E S KIWMCGGSIE +PC
Sbjct: 258 PSKVAPIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPC 317
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 318 SRVGHVYR 325
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 112/178 (62%), Gaps = 3/178 (1%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
CK YP DLP S+I+ FHNE +S+L+RT++SII R+P + ++EIIL+DD S+ L +
Sbjct: 85 CKNISYPHDLPSTSIIICFHNEAWSTLLRTLNSIIDRSPLRLIKEIILLDDASTMEHLQE 144
Query: 319 KLEDYIQRFNG-KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+EDYI + + ++R++R +R GLI+ R G S GE FLD+H EV + WL PLLA
Sbjct: 145 PIEDYISQIHSVRIRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLA 204
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ SDR I+ +PV+D I+ T+ + V EP RG F W Y+ +P + + K
Sbjct: 205 RLASDRTIVVMPVVDEINKDTFNYNVVPEPLQ--RGGFNWRFEYRWKPIPNYDKRPSK 260
>gi|158300139|ref|XP_320141.4| AGAP012414-PA [Anopheles gambiae str. PEST]
gi|157013013|gb|EAA00190.4| AGAP012414-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 136/228 (59%), Gaps = 3/228 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GKA L ++ D + G N S+ IS +R++PD+R C+ Y
Sbjct: 82 EAKRSGIGEHGKAGQLDKSEHEMKDKLFKKNGFNAVLSDKISLNRSLPDIRHRGCRKKQY 141
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L Q+L++Y+
Sbjct: 142 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELIAEIILVDDCSTKEFLKQQLDEYV 201
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV+++R ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D +
Sbjct: 202 TENMPKVKVVRLPERSGLITARLAGAKIATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 261
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
P ID ID+ T+E+R+ D RG F+W YK L R+ +
Sbjct: 262 TCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYKRLPLLPRDLQ 306
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 86/187 (45%), Positives = 118/187 (63%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L++Y+ KV+++R ER GLI R GAK + +V++FLD+H E +NWLPPL
Sbjct: 193 LKQQLDEYVTENMPKVKVVRLPERSGLITARLAGAKIATADVLIFLDSHTEANVNWLPPL 252
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D + P ID ID+ T+E+R+ D RG F+W YK L R+ +
Sbjct: 253 LEPIAEDYRTCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYKRLPLLPRDLQNP- 308
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP++SP AGGLFA+ FF E+GGYD GL +WGGE +ELSFKIW CGG + PCS
Sbjct: 309 --TEPFESPVMAGGLFAISAKFFWEIGGYDEGLDIWGGEQYELSFKIWQCGGKMYDAPCS 366
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 367 RVGHIYR 373
>gi|339249613|ref|XP_003373794.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
spiralis]
gi|316970007|gb|EFV54023.1| polypeptide N-acetylgalactosaminyltransferase 10 [Trichinella
spiralis]
Length = 587
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 94/228 (41%), Positives = 141/228 (61%), Gaps = 7/228 (3%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASL--GEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 265
++GPGE G+A++LP + G N S++++ +R+I DLR ++C Y
Sbjct: 75 RQGPGEQGEAFYLPNVSSVDHKKGILYKSNGFNALVSDYLALNRSIKDLRPKQCIGRSYL 134
Query: 266 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 325
L K SV++ F+NE +++L+RTVHS++ R+P + L+E+IL DDFS K L Q LE Y++
Sbjct: 135 AKLEKVSVVIPFYNEHWTTLLRTVHSVVNRSPVELLQEVILADDFSDKPFLKQPLEAYVR 194
Query: 326 -RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+ G VR++R +REGLIR R G+K + V+VFLD+H E G NWLPPLL P+ + +
Sbjct: 195 DTWPGLVRIVRARKREGLIRARLLGSKAAISSVLVFLDSHSECGYNWLPPLLEPVALNYR 254
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENE-LPEREA 431
+T P +D ID+ T+ +R D RG F+W + YK LPE A
Sbjct: 255 TVTCPFVDVIDHSTFLYRL---QDQGARGSFDWELYYKRLPLLPEDAA 299
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/189 (46%), Positives = 118/189 (62%), Gaps = 9/189 (4%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + LE Y+ + + G VR++R +REGLIR R G+K + V+VFLD+H E G NWLPP
Sbjct: 185 LKQPLEAYVRDTWPGLVRIVRARKREGLIRARLLGSKAAISSVLVFLDSHSECGYNWLPP 244
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 119
LL P+ + + +T P +D ID+ T+ +R D RG F+W + YK L PE A
Sbjct: 245 LLEPVALNYRTVTCPFVDVIDHSTFLYRL---QDQGARGSFDWELYYKRLPLLPEDAA-- 299
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
Y P+ SP AGG FA+ +F ELGGYD GL +WGGE +ELSFKIW CGG++ VP
Sbjct: 300 --YPDRPFNSPVMAGGYFAISTKWFWELGGYDEGLDIWGGEQYELSFKIWQCGGTLIDVP 357
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 358 CSHVGHIYR 366
>gi|291225677|ref|XP_002732827.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Saccoglossus kowalevskii]
Length = 633
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 121/188 (64%), Gaps = 9/188 (4%)
Query: 6 LEDYIERFNGK-----VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L+DY++ F V++I N +REGLIR R GA + G+V++FLD+HCEV WL P
Sbjct: 224 LKDYLDDFIKTNLFQIVKIIHNKKREGLIRARMIGAAAATGDVVMFLDSHCEVSTQWLEP 283
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D + P+ID I+ T+E Y+ RG F WG+ +K + +P + K +
Sbjct: 284 LLERIKFDPHTVVCPIIDIINADTFE----YQQSPLVRGGFNWGLHFKWDTIPSSQFKGK 339
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P +SPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IW CGG++E +PC
Sbjct: 340 EDYIKPVRSPTMAGGLFAMDRKYFHELGEYDDGMDIWGGENLEISFRIWQCGGTLEIIPC 399
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 400 SRVGHVFR 407
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 123/208 (59%), Gaps = 5/208 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N S+ I F R +PD R C Y Y +LP SV++ F NE +S+L+RT
Sbjct: 137 DEGYQQHAFNQLISDRIGFHRGLPDTRNGLCAYQVYSNNLPSTSVVICFFNEAWSTLLRT 196
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRS 347
V+S+I R+PA L EIILVDD+SS L L+D+I+ V++I N +REGLIR R
Sbjct: 197 VYSVIDRSPANLLHEIILVDDYSSSTYLKDYLDDFIKTNLFQIVKIIHNKKREGLIRARM 256
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + G+V++FLD+HCEV WL PLL I D + P+ID I+ T+E Y+
Sbjct: 257 IGAAAATGDVVMFLDSHCEVSTQWLEPLLERIKFDPHTVVCPIIDIINADTFE----YQQ 312
Query: 408 DHHYRGIFEWGMLYKENELPEREAKKRK 435
RG F WG+ +K + +P + K ++
Sbjct: 313 SPLVRGGFNWGLHFKWDTIPSSQFKGKE 340
>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
Length = 1150
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 137/223 (61%), Gaps = 5/223 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
GPGE GKA L A + + G N S+ IS +R+I DLR CK Y
Sbjct: 222 GPGEQGKAATLSPAESDSEQRKKLYLQNGFNALLSDKISINRSIADLRHPSCKLQQYFKH 281
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP ASV++ F+ E +S+L+RTVHS++ R+P+ L+E+I+VDD S+K L +L++Y+ +
Sbjct: 282 LPTASVVVPFYEEHWSTLLRTVHSVLNRSPSHLLKEVIIVDDGSTKEFLHGQLQNYVNQN 341
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KV+LIR ER GL++ R GAK + G+V+VFLD+H E G NWLPPLL PI + K
Sbjct: 342 LPKVKLIRQGERTGLMKARLAGAKLASGDVLVFLDSHTEAGYNWLPPLLEPIAENPKTCV 401
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
P+ID ID QT+ +++ D RG+F+W YK L E +
Sbjct: 402 CPLIDVIDDQTF---NIHPQDDGGRGLFDWRFHYKRLALKESD 441
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 116/187 (62%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L++Y+ + KV+LIR ER GL++ R GAK + G+V+VFLD+H E G NWLPPL
Sbjct: 330 LHGQLQNYVNQNLPKVKLIRQGERTGLMKARLAGAKLASGDVLVFLDSHTEAGYNWLPPL 389
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI + K P+ID ID QT+ +++ D RG+F+W YK L E + R
Sbjct: 390 LEPIAENPKTCVCPLIDVIDDQTF---NIHPQDDGGRGLFDWRFHYKRLALKESD---RV 443
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ P+ SP AGGLFA+ FF ELGGYD L +WG E +ELSFKIW CGG + PCS
Sbjct: 444 SPTAPFPSPVMAGGLFAIGTNFFWELGGYDEELDIWGAEQYELSFKIWQCGGRMLDAPCS 503
Query: 182 RIVSLIR 188
R + R
Sbjct: 504 RFSHIYR 510
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 114/188 (60%), Gaps = 7/188 (3%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L ++L+ Y+ V++I R GLI R GAK ++G+V+VFLD+H E G+NWLPP
Sbjct: 732 LKEQLDHYVATHLKHLVKIIHLPTRSGLITARLAGAKIAKGDVLVFLDSHVEAGINWLPP 791
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI + + P ID I T+E D RG F+W MLYK LP R + +
Sbjct: 792 LLEPIAHNPRTCVCPFIDVIMDDTFELTP---QDQGARGAFDWNMLYKR--LPLR-PEDQ 845
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++P++SP AGGLFA+ FF ELGGYD L +WG E +ELSFKIW CGG + PC
Sbjct: 846 KDPTQPFESPVMAGGLFAISSMFFWELGGYDEMLEIWGAEQYELSFKIWQCGGRMIDAPC 905
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 906 SRVGHIYR 913
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 105/172 (61%), Gaps = 6/172 (3%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C Y LP+ SVI+ F++E +S+L+RTV+S+++R+P+ L EIILVDD S K L +
Sbjct: 675 CHNIKYLQHLPRTSVIIPFYDEHWSTLLRTVYSVMRRSPSSLLLEIILVDDGSMKNFLKE 734
Query: 319 KLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+L+ Y+ V++I R GLI R GAK ++G+V+VFLD+H E G+NWLPPLL
Sbjct: 735 QLDHYVATHLKHLVKIIHLPTRSGLITARLAGAKIAKGDVLVFLDSHVEAGINWLPPLLE 794
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
PI + + P ID I T+E D RG F+W MLYK LP R
Sbjct: 795 PIAHNPRTCVCPFIDVIMDDTFELTP---QDQGARGAFDWNMLYK--RLPLR 841
>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
Length = 518
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 132/225 (58%), Gaps = 5/225 (2%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLP 269
PGE G+ +P Y+ N S+ IS R++PD R+ EC YP+ LP
Sbjct: 14 PGENGRGVIVPPEYQEESRKLFQRNRFNQWASDRISLHRSLPDARILECSSLKYPIHKLP 73
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
+ SVI+VFHNE +S+L+RTVHS++ R+P + L EIILVDD S +L LE Y+ + +
Sbjct: 74 QTSVIIVFHNEAWSTLLRTVHSVLDRSPPELLREIILVDDSSDHEELHSTLEKYVAKLS- 132
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KV+++RN REGLIR+R G + + FLDAHCE + WL PLL I +R I+ P
Sbjct: 133 KVKIVRNKAREGLIRSRLNGFAHATSPTVTFLDAHCEANVGWLEPLLYRIMQNRTIVVCP 192
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
ID I +T+E+ Y + RG F W + ++ +PE E K+R
Sbjct: 193 EIDVISDETFEY--TYSSG-NVRGSFNWNLNFRWKAVPEYENKRR 234
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 114/188 (60%), Gaps = 4/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L LE Y+ + + KV+++RN REGLIR+R G + + FLDAHCE + WL P
Sbjct: 119 ELHSTLEKYVAKLS-KVKIVRNKAREGLIRSRLNGFAHATSPTVTFLDAHCEANVGWLEP 177
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I +R I+ P ID I +T+E+ Y + RG F W + ++ +PE E K+R
Sbjct: 178 LLYRIMQNRTIVVCPEIDVISDETFEY--TYSSGN-VRGSFNWNLNFRWKAVPEYENKRR 234
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ +SPT AGGLF + +F ++G YD + +WGGEN ELSF+IW CGG +E +PC
Sbjct: 235 AARTDGIRSPTMAGGLFTIHSQYFKDIGLYDKQMEIWGGENLELSFRIWQCGGQLEIIPC 294
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 295 SHVGHVFR 302
>gi|68342011|ref|NP_001020319.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Rattus
norvegicus]
gi|50926898|gb|AAH78995.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Rattus
norvegicus]
Length = 443
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 126/191 (65%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L YG+N+ TS + +R +PD R + C+ YP +LP ASVI+ F+NE F++L+RTV S
Sbjct: 83 LSRYGLNVITSRRLGIERQVPDSRNKICQQKHYPFNLPTASVIICFYNEEFNTLLRTVSS 142
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ +P LEEIILVDD S DL KL+ +++ F GK++L+RN +REGLIR+R GA
Sbjct: 143 VMNLSPKHLLEEIILVDDMSEFDDLKAKLDYHLEIFRGKIKLVRNKKREGLIRSRMIGAS 202
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+++VFLD+HCEV WL PLL I D K++ PVID ID T ++ V P
Sbjct: 203 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPVIDVIDELTLDY--VGSPI--V 258
Query: 412 RGIFEWGMLYK 422
RG F+W + ++
Sbjct: 259 RGAFDWNLNFR 269
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 121/183 (66%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ ++E F GK++L+RN +REGLIR+R GA + G+++VFLD+HCEV WL P
Sbjct: 166 DLKAKLDYHLEIFRGKIKLVRNKKREGLIRSRMIGASRASGDILVFLDSHCEVNRVWLEP 225
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ PVID ID T ++ V P RG F+W + ++ +++ E
Sbjct: 226 LLHAIAKDHKMVVCPVIDVIDELTLDY--VGSPI--VRGAFDWNLNFRWDDVFSYELDGP 281
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP +GG+FA++R +F ELG YD + +WGGEN ELS +IWMCGG + +PC
Sbjct: 282 EGPSTPIRSPAMSGGIFAINRHYFNELGQYDKDMDLWGGENVELSLRIWMCGGQLFILPC 341
Query: 181 SRI 183
SR+
Sbjct: 342 SRV 344
>gi|324503401|gb|ADY41481.1| N-acetylgalactosaminyltransferase 6 [Ascaris suum]
Length = 927
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 138/227 (60%), Gaps = 5/227 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLD 267
G GE G+ L E D + G N+ S+ I+ +R++PD+R +C K + P +
Sbjct: 98 GVGEDGRPVKLDELEDRLSDDTFGINQFNLIISDKIALNRSLPDVRKHQCRDKIYPAPSE 157
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+V+HNE FS+L+RTV S+I R+P + L+EIILVDDFSS++ L L++++
Sbjct: 158 LPTTSVIIVYHNEAFSTLLRTVVSVIDRSPKEVLKEIILVDDFSSRSFLKDDLDNFVVTL 217
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
++++IR R GLIR R GA E+ GEV+ FLD+HCE WL PLLA I +RK +
Sbjct: 218 GIRIKIIRAQRRVGLIRARLMGANEADGEVLTFLDSHCECTKGWLEPLLARIKENRKAVV 277
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
PVID I+ +T+ ++ E +RG F W + ++ +P K R
Sbjct: 278 CPVIDVINDRTFAYQKGIE---LFRGGFNWNLQFRWYAVPPDIVKGR 321
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L++++ ++++IR R GLIR R GA E+ GEV+ FLD+HCE WL PL
Sbjct: 206 LKDDLDNFVVTLGIRIKIIRAQRRVGLIRARLMGANEADGEVLTFLDSHCECTKGWLEPL 265
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +RK + PVID I+ +T+ ++ E +RG F W + ++ +P K R
Sbjct: 266 LARIKENRKAVVCPVIDVINDRTFAYQKGIE---LFRGGFNWNLQFRWYAVPPDIVKGRA 322
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SPT AGGLF++D+ +F ELG YDPG+ +WGGEN E+SF+IW CGG IE +PC
Sbjct: 323 NDPTMPIQSPTMAGGLFSIDKRYFEELGAYDPGMEIWGGENIEISFRIWQCGGRIEILPC 382
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 383 SHVGHIFR 390
>gi|344276552|ref|XP_003410072.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Loxodonta africana]
Length = 527
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 88/185 (47%), Positives = 121/185 (65%), Gaps = 5/185 (2%)
Query: 5 KLEDYIERF-NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
+L++Y++++ GK ++IRN +REGLIR R GA ++ GEV+VFLD+HCEV WL PLLA
Sbjct: 201 ELDEYVQKYLPGKTKVIRNKKREGLIRGRMIGAAQATGEVLVFLDSHCEVNEMWLQPLLA 260
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 123
+ D + PVID I T +Y RG F WG+ +K + +P E +
Sbjct: 261 AVREDPHTVVCPVIDIISADTL----LYSSSPIVRGGFNWGLHFKWDLVPFDELGGPEGA 316
Query: 124 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PCSR+
Sbjct: 317 TAPIKSPTMAGGLFAMNRHYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRV 376
Query: 184 VSLIR 188
+ R
Sbjct: 377 GHIFR 381
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/200 (47%), Positives = 126/200 (63%), Gaps = 5/200 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ NM SN + + R +PD R CK YPLDLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNMLISNRLGYHRDVPDTRNAACKEKSYPLDLPAASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF-NGKVRLIRNTEREGLIRTRS 347
VHS+ RTPA L EIILVDD S DL +L++Y+Q++ GK ++IRN +REGLIR R
Sbjct: 171 VHSVTDRTPAHLLHEIILVDDDSDLDDLKGELDEYVQKYLPGKTKVIRNKKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA ++ GEV+VFLD+HCEV WL PLLA + D + PVID I T +Y
Sbjct: 231 IGAAQATGEVLVFLDSHCEVNEMWLQPLLAAVREDPHTVVCPVIDIISADTL----LYSS 286
Query: 408 DHHYRGIFEWGMLYKENELP 427
RG F WG+ +K + +P
Sbjct: 287 SPIVRGGFNWGLHFKWDLVP 306
>gi|443704818|gb|ELU01679.1| hypothetical protein CAPTEDRAFT_140956 [Capitella teleta]
Length = 550
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/186 (45%), Positives = 120/186 (64%), Gaps = 3/186 (1%)
Query: 6 LEDYIERF---NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 62
L D ++ F KVRL+R+ +REGLIR R GA+ ++G+V+VFLD+HCE WL P+L
Sbjct: 146 LHDQLDEFVATQQKVRLVRSEKREGLIRARLIGAEAAKGQVLVFLDSHCECTPGWLEPML 205
Query: 63 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 122
I D + P+ID ID +T + G F+W M + + LP E ++RK
Sbjct: 206 DRIGQDWSHVVTPIIDVIDDKTLMYNFNPLSRGFSVGGFDWAMGFTWHALPNHEKERRKK 265
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
S+P +SPT AGGLFA+DR +F +G YDPG+ +WGGEN E+SF+IWMCGG++E +PCS
Sbjct: 266 ISDPARSPTMAGGLFAIDREYFYHIGSYDPGMEIWGGENLEMSFRIWMCGGTLETLPCSH 325
Query: 183 IVSLIR 188
+ + R
Sbjct: 326 VGHIFR 331
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 97/244 (39%), Positives = 148/244 (60%), Gaps = 7/244 (2%)
Query: 197 LGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGM-----NMETSNHISFDRTI 251
LG +E P E + PGE G A+ + E ++ + ++G N S+ IS RT+
Sbjct: 23 LGKVESP-EHNADDPGEMGVAFQVDEKKLSSAEKEEYDFGFKRNAFNQYASDRISVHRTL 81
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
PD R EC+ + +PKASVI++FHNE +S L+RTV+SI++R+P ++LEE+ILVDD+S
Sbjct: 82 PDYRDVECRAILHSSKMPKASVIVIFHNEAWSVLLRTVYSILERSPPRFLEEVILVDDYS 141
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
+ L +L++++ KVRL+R+ +REGLIR R GA+ ++G+V+VFLD+HCE W
Sbjct: 142 DQEHLHDQLDEFVAT-QQKVRLVRSEKREGLIRARLIGAEAAKGQVLVFLDSHCECTPGW 200
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREA 431
L P+L I D + P+ID ID +T + G F+W M + + LP E
Sbjct: 201 LEPMLDRIGQDWSHVVTPIIDVIDDKTLMYNFNPLSRGFSVGGFDWAMGFTWHALPNHEK 260
Query: 432 KKRK 435
++RK
Sbjct: 261 ERRK 264
>gi|432097046|gb|ELK27544.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
5, partial [Myotis davidii]
Length = 363
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 88/187 (47%), Positives = 120/187 (64%), Gaps = 4/187 (2%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
YG+N S + R +PD R + C YP LP AS+++ FHNE F++L RTV S++
Sbjct: 15 YGLNTIISKSLGNQRPVPDTRDKMCLKKRYPTRLPSASIVICFHNEEFNTLFRTVSSVMN 74
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP Q LEEIILVDD S DL +KL+ +++ F GK+++IR T+REGLIR R GA +
Sbjct: 75 LTPHQILEEIILVDDMSEFDDLKEKLDYHLEMFRGKIKVIRTTKREGLIRARLIGAAHAS 134
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VFLD+HCEV WL PLLA I DRK++ P++D ID+ T Y P RG
Sbjct: 135 GDVLVFLDSHCEVNRVWLEPLLAAIAKDRKMVVCPMVDSIDHLTLN----YYPAPIVRGA 190
Query: 415 FEWGMLY 421
F+W + +
Sbjct: 191 FDWHLRF 197
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 123/192 (64%), Gaps = 4/192 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IR T+REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 95 DLKEKLDYHLEMFRGKIKVIRTTKREGLIRARLIGAAHASGDVLVFLDSHCEVNRVWLEP 154
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I DRK++ P++D ID+ T Y P RG F+W + + + + E
Sbjct: 155 LLAAIAKDRKMVVCPMVDSIDHLTLN----YYPAPIVRGAFDWHLRFVWDTVFSYEMDGP 210
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP +GG+FA++R +F ELG YD + +WG EN ELS +IWMCGG + +PC
Sbjct: 211 EGPTTPIRSPAMSGGIFAINRHYFNELGQYDKDMNLWGAENLELSLRIWMCGGQLFILPC 270
Query: 181 SRIVSLIRPVFK 192
SR+ + R + +
Sbjct: 271 SRVGHVDRHIVQ 282
>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
Length = 622
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 106/249 (42%), Positives = 144/249 (57%), Gaps = 15/249 (6%)
Query: 197 LGNLEPPLEPYKEG----PGEGGKAY-----HLPEAYRAAGDASLGEYGMNMETSNHISF 247
L N + P+ YK G PGEGGKA L + + D + N S+ IS
Sbjct: 97 LFNRDSPI--YKSGDEHQPGEGGKAVIIDRNKLAFSEKRIYDDGFNKNAFNQYVSDMISI 154
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
R++P EECK Y DLP SVI+ FHNE +S L+RTVHS+++RTP L EIILV
Sbjct: 155 HRSLPSYIDEECKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILV 214
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DDFS A L LE Y+++F KVR++R +REGLIR R +GA S+G VI +LD+HCE
Sbjct: 215 DDFSDMAHLKASLEIYMRQF-PKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCEC 273
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENEL 426
W+ PLL I + K + PVID ID T+E+ Y + G F+W + + + +
Sbjct: 274 LEGWMEPLLDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAI 331
Query: 427 PEREAKKRK 435
PE++ K R+
Sbjct: 332 PEKDRKGRR 340
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 123/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE Y+ +F KVR++R +REGLIR R +GA S+G VI +LD+HCE W+ PL
Sbjct: 223 LKASLEIYMRQF-PKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWMEPL 281
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREAKKR 120
L I + K + PVID ID T+E+ Y + G F+W + + + +PE++ K R
Sbjct: 282 LDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAIPEKDRKGR 339
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + +P KSPT AGGLF++DR FF +LG YDPGL +WGGEN ELSFK WMCGG +E VPC
Sbjct: 340 R-DIDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEIVPC 398
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 399 SHVGHIFR 406
>gi|345781283|ref|XP_853759.2| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Canis lupus
familiaris]
Length = 559
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/197 (43%), Positives = 124/197 (62%), Gaps = 4/197 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGL+R+R GA + G+V+VFLD+HCEV WL P
Sbjct: 194 DLKEKLDHHLEIFRGKIKVIRNKKREGLVRSRLIGASRASGDVLVFLDSHCEVNTAWLQP 253
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID T E Y+ RG F W + +K + + E
Sbjct: 254 LLHAIAKDSKMVVCPLIDVIDSMTLE----YQSSPVVRGAFNWHLDFKWDSVYSYEMDGP 309
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P +SP AGG+FA++R +F E+G YD G+ +WG EN ELS +IWMCGG + +PC
Sbjct: 310 EGPTRPIRSPAMAGGIFAINRHYFNEIGQYDKGMDLWGAENLELSLRIWMCGGQLFIIPC 369
Query: 181 SRIVSLIRPVFKADGKL 197
SR+ + + F +L
Sbjct: 370 SRVGHISKQRFSNQPEL 386
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/214 (43%), Positives = 126/214 (58%), Gaps = 7/214 (3%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
E G+ GK ++ G L +YG N S + D +PD R + C YP L
Sbjct: 91 ETAGKLGKDFNYSNPEFIDG---LLKYGFNTILSKSLGSDSKVPDTRNKMCLQKRYPAKL 147
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P ASVI+ FHNE F++L RT+ S+ TP LEEIILVDD S DL +KL+ +++ F
Sbjct: 148 PTASVIICFHNEEFNALFRTLSSVGNLTPHYILEEIILVDDMSDFDDLKEKLDHHLEIFR 207
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
GK+++IRN +REGL+R+R GA + G+V+VFLD+HCEV WL PLL I D K++
Sbjct: 208 GKIKVIRNKKREGLVRSRLIGASRASGDVLVFLDSHCEVNTAWLQPLLHAIAKDSKMVVC 267
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P+ID ID T E Y+ RG F W + +K
Sbjct: 268 PLIDVIDSMTLE----YQSSPVVRGAFNWHLDFK 297
>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Meleagris gallopavo]
Length = 556
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 98/231 (42%), Positives = 146/231 (63%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+EGPGE GKA +P+ + N+ S+ I+ +R++PD+R++ CK YP +
Sbjct: 54 QEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRSLPDVRLDGCKTKVYPEE 113
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SV++VFHNE +S+L+RTVHS++ R+P + L EIILVDD S + L LE+Y+++
Sbjct: 114 LPNTSVVIVFHNEAWSTLLRTVHSVLARSPRRLLAEIILVDDASEREFLKASLENYVKKL 173
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V+++R +R GLIR R RGA +RG+V+ FLDAHCE WL PLLA I DR+ +
Sbjct: 174 EVPVKILRMEQRSGLIRARLRGAAAARGQVVTFLDAHCECTRGWLEPLLARIREDRRTVV 233
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 234 CPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 281
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y+++ V+++R +R GLIR R RGA +RG+V+ FLDAHCE WL PL
Sbjct: 162 LKASLENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVVTFLDAHCECTRGWLEPL 221
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 222 LARIREDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 278
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF++W CGGS+E V C
Sbjct: 279 GDRTLPVRTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGSLEIVTC 338
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 339 SHVGHVFR 346
>gi|348568069|ref|XP_003469821.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Cavia porcellus]
Length = 608
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 89/188 (47%), Positives = 119/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L++Y+++ K+++IRN +REGLIR R GA + GEV+VFLD+HCEV WL P
Sbjct: 198 LKGELDEYVQKSLPTKIKVIRNAKREGLIRGRMIGAAHATGEVLVFLDSHCEVNEMWLQP 257
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA I D + PVID I T + S RG F WG+ +K + +P E
Sbjct: 258 LLATIRGDPHTVVCPVIDIISADTLAYSS----SPVVRGGFNWGLHFKWDLVPLSELGGE 313
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P KSPT AGGLFAM+R +F ELG YD G+ +WGGEN E+SF+IWMCGG + +PC
Sbjct: 314 DGATAPIKSPTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPC 373
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 374 SRVGHIFR 381
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/203 (46%), Positives = 125/203 (61%), Gaps = 5/203 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D ++ N+ SN + + R +PD R CK YP DLP ASV++ F+NE FS+L+RT
Sbjct: 111 DLGYQKHAFNVLISNRLGYHRDVPDTRNAACKEQSYPADLPVASVVICFYNEAFSALLRT 170
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRS 347
VHS++ RTPA L EIILVDD S DL +L++Y+Q+ K+++IRN +REGLIR R
Sbjct: 171 VHSVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYVQKSLPTKIKVIRNAKREGLIRGRM 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA + GEV+VFLD+HCEV WL PLLA I D + PVID I T + S
Sbjct: 231 IGAAHATGEVLVFLDSHCEVNEMWLQPLLATIRGDPHTVVCPVIDIISADTLAYSS---- 286
Query: 408 DHHYRGIFEWGMLYKENELPERE 430
RG F WG+ +K + +P E
Sbjct: 287 SPVVRGGFNWGLHFKWDLVPLSE 309
>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 605
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KKLE+Y++ V ++R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 205 LGKKLENYVKTLPVPVTVLRTPKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 264
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I +T+E+ + D + G F W + ++ +P+RE +R
Sbjct: 265 LARITEDRKTVVCPIIDVISDETFEY--ITASDTTWGG-FNWRLNFRWYRVPKREMDRRN 321
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E VPC
Sbjct: 322 NDKTVPIRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEIVPC 381
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 382 SHVGHVFR 389
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/202 (43%), Positives = 127/202 (62%), Gaps = 3/202 (1%)
Query: 237 MNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRT 296
N+ S IS +R++PD+R + CK Y LP SV++VFHNE +S+L+RTV S+I R+
Sbjct: 126 FNLLASERISLNRSLPDVRAKGCKTKKYFELLPTTSVVIVFHNEAWSTLLRTVWSVINRS 185
Query: 297 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 356
P ++EIILVDD S + L +KLE+Y++ V ++R +R GLIR R GAK +G+
Sbjct: 186 PKPLIKEIILVDDASVQPHLGKKLENYVKTLPVPVTVLRTPKRSGLIRARLLGAKHVKGQ 245
Query: 357 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 416
VI FLDAHCE WL PLLA I DRK + P+ID I +T+E+ + D + G F
Sbjct: 246 VITFLDAHCECTEGWLEPLLARITEDRKTVVCPIIDVISDETFEY--ITASDTTWGG-FN 302
Query: 417 WGMLYKENELPEREAKKRKYNR 438
W + ++ +P+RE +R ++
Sbjct: 303 WRLNFRWYRVPKREMDRRNNDK 324
>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
caballus]
Length = 453
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PL
Sbjct: 57 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPL 116
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 117 LARIKHDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 173
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 174 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 233
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 234 SHVGHVFR 241
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 83/175 (47%), Positives = 115/175 (65%), Gaps = 3/175 (1%)
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
YP +LP SV++VFHNE +S+L+RTVHS+I R+P LEEI+LVDD S + L + LE Y
Sbjct: 5 YPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESY 64
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+++ V +IR +R GLIR R +GA S+G+VI FLDAHCE + WL PLLA I DR
Sbjct: 65 VKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARIKHDR 124
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
K + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 125 KTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 176
>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/226 (41%), Positives = 134/226 (59%), Gaps = 6/226 (2%)
Query: 212 GEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK--YWDYPLDL 268
G GGK A+ E + + + N S+ IS DRT+ D+R E CK + YP L
Sbjct: 8 GGGGKPAFLESEENKKLAEKYFANHSFNWLLSDKISLDRTLDDVRSERCKAKHNTYPAKL 67
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ FH E S L+RTVHS+I RTP + L E+I+VDDFS A L + L+D++ +F
Sbjct: 68 PTTSVIICFHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHVAQFT 127
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KV+++R +REGL+R R +GA ++G+V+ FLD+HCE W PLLA I +DR+ +
Sbjct: 128 -KVKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLARIAADRRNVVC 186
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
P I+ I+ T+ ++ D RG F W + +K +P E K R
Sbjct: 187 PAIEVINADTFAYQGSTNADQ--RGGFSWDLFFKWKGIPPEEQKLR 230
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 80/187 (42%), Positives = 122/187 (65%), Gaps = 3/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+D++ +F KV+++R +REGL+R R +GA ++G+V+ FLD+HCE W PL
Sbjct: 115 LGKPLDDHVAQFT-KVKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPL 173
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +DR+ + P I+ I+ T+ ++ D RG F W + +K +P E K R
Sbjct: 174 LARIAADRRNVVCPAIEVINADTFAYQGSTNADQ--RGGFSWDLFFKWKGIPPEEQKLRN 231
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+S+P ++PT AGGLF++ R +F ++G YD + +WGGEN ELSF++WMCGG +E V CS
Sbjct: 232 DDSDPIRTPTMAGGLFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCS 291
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 292 RVGHVFR 298
>gi|195020976|ref|XP_001985304.1| GH16989 [Drosophila grimshawi]
gi|193898786|gb|EDV97652.1| GH16989 [Drosophila grimshawi]
Length = 682
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 92/190 (48%), Positives = 120/190 (63%), Gaps = 10/190 (5%)
Query: 2 LDKKLEDYIERF--NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L+K+LEDYI G VR++R +R GLI RS GA+ + +V++FLD+H E NWLP
Sbjct: 262 LNKELEDYIVNHFAVGLVRVVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLP 321
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAK 118
PLL PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 322 PLLEPIAINKRAAVCPFIDVIDHSNFNYRA---QDEGARGGFDWQFFYKRLPLLPE---- 374
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
K+ +EP+KSP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG +
Sbjct: 375 DLKHPTEPFKSPVMAGGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDA 434
Query: 179 PCSRIVSLIR 188
PCSR+ + R
Sbjct: 435 PCSRVGHIYR 444
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/216 (43%), Positives = 132/216 (61%), Gaps = 6/216 (2%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GE GK L E+ R E G N S+ IS +R++PD+R +C+ Y L
Sbjct: 155 GIGEQGKIAKLDDESVRENEQKVSIENGFNGLLSDSISVNRSLPDIRHIDCRKKLYLRKL 214
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRF 327
P SV+++F +E S LMR+VHS+I R+P + L+EIILVDDFS +A L+++LEDYI F
Sbjct: 215 PTVSVVIIFFDEYLSVLMRSVHSLINRSPPELLKEIILVDDFSDRAYLNKELEDYIVNHF 274
Query: 328 N-GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
G VR++R +R GLI RS GA+ + +V++FLD+H E NWLPPLL PI +++
Sbjct: 275 AVGLVRVVRLPQRTGLIGARSAGARNATADVLIFLDSHVEANYNWLPPLLEPIAINKRAA 334
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 335 VCPFIDVIDHSNFNYRA---QDEGARGGFDWQFFYK 367
>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
Length = 495
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 141/225 (62%), Gaps = 4/225 (1%)
Query: 215 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASV 273
G+A +PE+ A N+ S IS +RT+ D+RM+ CK YP++ LP SV
Sbjct: 2 GQAVIIPESQHAEMKEKFKVNQFNLMASELISVNRTLRDVRMDSCKSKTYPVESLPTTSV 61
Query: 274 ILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRL 333
++VFHNE +S+L+RTVHS+I R+P L+EIILVDD S K L ++L++Y+ + + V +
Sbjct: 62 VIVFHNEAWSTLLRTVHSVINRSPPPLLKEIILVDDASEKDFLGRQLDEYLSKLSVHVYV 121
Query: 334 IRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDG 393
+R +R GLIR R +GA + G+VI FLDAHCE WL PLL I+ +RK + P+ID
Sbjct: 122 LRMEKRTGLIRARLKGAARAEGKVITFLDAHCECTEGWLEPLLFEIHKNRKSVVCPIIDV 181
Query: 394 IDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
I +T+E+ + D + G F W + ++ +P+RE ++R +R
Sbjct: 182 ISDETFEY--ITGSDMTWGG-FNWKLNFRWYPVPQREVERRGGDR 223
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 123/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L++Y+ + + V ++R +R GLIR R +GA + G+VI FLDAHCE WL PL
Sbjct: 104 LGRQLDEYLSKLSVHVYVLRMEKRTGLIRARLKGAARAEGKVITFLDAHCECTEGWLEPL 163
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I+ +RK + P+ID I +T+E+ + D + G F W + ++ +P+RE ++R
Sbjct: 164 LFEIHKNRKSVVCPIIDVISDETFEY--ITGSDMTWGG-FNWKLNFRWYPVPQREVERRG 220
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SPT AGGL A++R +F E+G YD G+ +WGGEN E+SF+IWMCGG++ V C
Sbjct: 221 GDRSLPLRSPTMAGGLLAIERDYFYEIGSYDDGMDIWGGENLEMSFRIWMCGGTLLIVTC 280
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 281 SHVGHVFR 288
>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
Length = 693
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 90/194 (46%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY + V+++R +REGLIR R GAK ++ VI +LD+HCE WL PL
Sbjct: 266 LKKQLEDYFASY-PMVQIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPL 324
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID ID T EF HYR G F+W + + + +PE
Sbjct: 325 LDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGFDWNLQFSWHAVPE 376
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE ++ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 377 REKRRHNNTAEPVYSPTMAGGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 436
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 437 LEIVPCSHVGHIFR 450
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 99/250 (39%), Positives = 139/250 (55%), Gaps = 23/250 (9%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAY----RAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
L+ P+ + PGE GK LP+ + A D + N S+ IS R++PD R
Sbjct: 145 LDAPVANLNDNPGELGKPVILPKDMPIDMKKAVDDGWTKNAFNQYVSDLISVHRSLPDPR 204
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LPK VI+ FHNE +S L+RTVHS++ R+P + + +IILVDD+S
Sbjct: 205 DAWCKDSARYLSNLPKTDVIICFHNEAWSVLLRTVHSVLDRSPPELIGQIILVDDYSDMP 264
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + V+++R +REGLIR R GAK ++ VI +LD+HCE WL P
Sbjct: 265 HLKKQLEDYFASY-PMVQIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEP 323
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID ID T EF HYR G F+W + + + +P
Sbjct: 324 LLDRIARNSTTVVCPVIDVIDDTTLEF--------HYRDSSGVNVGGFDWNLQFSWHAVP 375
Query: 428 EREAKKRKYN 437
ERE KR++N
Sbjct: 376 ERE--KRRHN 383
>gi|242020557|ref|XP_002430719.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515909|gb|EEB17981.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 511
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 88/188 (46%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + L++Y+ R V++IR EREGLIR R GAKE++G+V+ FLDAHCE WL PL
Sbjct: 114 LKEDLDEYVARLPVPVKVIRTKEREGLIRARMIGAKEAKGQVLTFLDAHCECTKGWLEPL 173
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + DRK + PVID I+ T+ + +E H+ G F W + ++ L E KKRK
Sbjct: 174 LVRVSEDRKKVVCPVIDIINDDTFAYVRSFE--LHW-GAFNWNLHFRWYTLGTTEIKKRK 230
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +EP+ +P AGGLFA+ R +F E+G YD + +WGGEN E+SF+ W CGGS+E VPC
Sbjct: 231 NDVTEPFPTPAMAGGLFAIRRDYFYEIGAYDEQMKIWGGENLEMSFRGWQCGGSVEIVPC 290
Query: 181 SRIVSLIR 188
S + L R
Sbjct: 291 SHVGHLFR 298
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 92/201 (45%), Positives = 129/201 (64%), Gaps = 6/201 (2%)
Query: 237 MNMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N+ S+ I +RT+PD+R + C KY + P LP SV++VFHNE +S+L+RTV S+I
Sbjct: 34 FNLLASDRIPLNRTLPDVRKKRCLTKYQNLPELLP-TSVVIVFHNEAWSTLLRTVQSVID 92
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
R+P + L EIILVDD S++ L + L++Y+ R V++IR EREGLIR R GAKE++
Sbjct: 93 RSPRELLTEIILVDDGSTRKFLKEDLDEYVARLPVPVKVIRTKEREGLIRARMIGAKEAK 152
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+ FLDAHCE WL PLL + DRK + PVID I+ T+ + +E H+ G
Sbjct: 153 GQVLTFLDAHCECTKGWLEPLLVRVSEDRKKVVCPVIDIINDDTFAYVRSFE--LHW-GA 209
Query: 415 FEWGMLYKENELPEREAKKRK 435
F W + ++ L E KKRK
Sbjct: 210 FNWNLHFRWYTLGTTEIKKRK 230
>gi|427794265|gb|JAA62584.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 591
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 118/189 (62%), Gaps = 7/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L +L Y+ VRLIR REGLIR R GA + G+V+VFLD+HCEV + WL
Sbjct: 172 ELGLQLSRYVASELPSHVRLIRTPAREGLIRARMYGAHNASGQVLVFLDSHCEVNVGWLE 231
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P+LA I ++R +T PVID I+ T+E Y RG F WG+ +K P +
Sbjct: 232 PMLARIGANRTTVTCPVIDIINADTFE----YSASPIVRGGFNWGLHFKWESPPRLRGPQ 287
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ +P SPT AGGLFAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG +E +P
Sbjct: 288 QAI--DPIPSPTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISFRIWMCGGRLEILP 345
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 346 CSRVGHVFR 354
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 120/191 (62%), Gaps = 6/191 (3%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYP-LDLPKASVILVFHNEGFSSLMRTVHSI 292
++ N+ SN + R++PD R C+ ++ LP ASV++ F+NE +S+L+RTVHSI
Sbjct: 90 QHAFNVLISNRLGKVRSLPDTRNPLCRQQEFQEQSLPTASVVVCFYNEAWSALVRTVHSI 149
Query: 293 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAK 351
++RTPA L E+ILVDD S+ +L +L Y+ VRLIR REGLIR R GA
Sbjct: 150 LERTPAALLHELILVDDNSTLPELGLQLSRYVASELPSHVRLIRTPAREGLIRARMYGAH 209
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV + WL P+LA I ++R +T PVID I+ T+E Y
Sbjct: 210 NASGQVLVFLDSHCEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFE----YSASPIV 265
Query: 412 RGIFEWGMLYK 422
RG F WG+ +K
Sbjct: 266 RGGFNWGLHFK 276
>gi|195030214|ref|XP_001987963.1| GH10909 [Drosophila grimshawi]
gi|193903963|gb|EDW02830.1| GH10909 [Drosophila grimshawi]
Length = 668
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 100/229 (43%), Positives = 138/229 (60%), Gaps = 9/229 (3%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDYPL 266
G GE G + +A A + EY G N S+ IS +R++PD+R EECK Y
Sbjct: 145 GFGEHGLPVQIEDA--AEKELEQKEYRRNGFNGFISDRISVNRSVPDVRREECKTRKYLA 202
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-Q 325
LP+ SV+++F+NE F +L+RTV+SII RTP + L++I+LVDD S L Q+L+DY+ Q
Sbjct: 203 KLPRVSVVIIFYNEHFQTLLRTVYSIINRTPTELLQQIVLVDDGSEWETLKQQLDDYVAQ 262
Query: 326 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
+ V ++ + ER+GLI R GAK S GE IVF D+H EV NWLPPLL PI + KI
Sbjct: 263 HWPHLVDVVHSPERQGLIGARLAGAKVSMGEAIVFFDSHIEVNYNWLPPLLEPIAINNKI 322
Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 433
T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 323 ATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK 369
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 119/189 (62%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L ++L+DY+ + + V ++ + ER+GLI R GAK S GE IVF D+H EV NWLPP
Sbjct: 252 LKQQLDDYVAQHWPHLVDVVHSPERQGLIGARLAGAKVSMGEAIVFFDSHIEVNYNWLPP 311
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI + KI T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 312 LLEPIAINNKIATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK 369
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY++P GGLFA+ FF +LGGYD GL +WGGE +ELSFKIWMCGG + VP
Sbjct: 370 ----SLPYRNPVMMGGLFAIASEFFWDLGGYDDGLQIWGGEQYELSFKIWMCGGMLLDVP 425
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 426 CSRVAHIFR 434
>gi|403276501|ref|XP_003929936.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Saimiri boliviensis boliviensis]
Length = 455
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 120/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 192 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 251
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ PVID ID +T + Y+P RG F+W + +K + + E
Sbjct: 252 LLHAIAKDPKMVVCPVIDVIDDRTLK----YKPSPVVRGAFDWNLQFKWDNVFSYEMDGP 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP AGG+FA+ R +F E+G YD + WGGEN ELS +IWMCGG + +PC
Sbjct: 308 EGPTKPIRSPAMAGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSLRIWMCGGQLFIIPC 367
Query: 181 SRI 183
SR+
Sbjct: 368 SRV 370
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/189 (46%), Positives = 121/189 (64%), Gaps = 4/189 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
+YG N+ S + R +PD R + C YP+ LP AS+++ F+NE F++L RTV SI
Sbjct: 111 KYGFNIIISRSLGIKREVPDTRSKMCLQKRYPVRLPTASIVICFYNEEFNALFRTVSSIW 170
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
TP LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA +
Sbjct: 171 NLTPHHCLEEIILVDDMSKVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHA 230
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
G+V+VFLD+HCEV WL PLL I D K++ PVID ID +T + Y+P RG
Sbjct: 231 SGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPVIDVIDDRTLK----YKPSPVVRG 286
Query: 414 IFEWGMLYK 422
F+W + +K
Sbjct: 287 AFDWNLQFK 295
>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
aries]
Length = 582
Score = 174 bits (441), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 117/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+ + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKTQLEAYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I+ D ++ PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 LERIHKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 301
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K EP++SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 302 KSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 361
Query: 181 SRI 183
S +
Sbjct: 362 SHV 364
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 102/261 (39%), Positives = 149/261 (57%), Gaps = 18/261 (6%)
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYG 236
P + V L RP+++ +PP + GE GKA L E+ + + Y
Sbjct: 56 PGKKAVDLSRPLYE--------KPPAD--SRALGEWGKASKLQLSESELKQQEELIERYA 105
Query: 237 MNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKR 295
+N+ S+ IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++
Sbjct: 106 INIYLSDRISLHRHIEDKRMYECKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLET 165
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 355
+PA L+EIILVDD S + L +LE Y+ + +VRLIR +REGL+R R GA + G
Sbjct: 166 SPAVLLKEIILVDDLSDRVYLKTQLEAYVSNLD-RVRLIRTNKREGLVRARLIGATFATG 224
Query: 356 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGI 414
+V+ FLD HCE WL PLL I+ D ++ PVID ID+ T+EF EP G
Sbjct: 225 DVLTFLDCHCECNTGWLEPLLERIHKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGG 281
Query: 415 FEWGMLYKENELPEREAKKRK 435
F+W + ++ + +P+ E +RK
Sbjct: 282 FDWRLTFQWHSVPKHERDRRK 302
>gi|29612635|gb|AAH49907.1| Galnt7 protein, partial [Mus musculus]
Length = 365
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 85/166 (51%), Positives = 115/166 (69%), Gaps = 7/166 (4%)
Query: 41 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYR 98
G+V+++LDAHCEV +NW PL+API DR I TVP+ID I T+E + + D + R
Sbjct: 2 GQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYAR 61
Query: 99 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 158
G ++W ML+K L RE + RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WG
Sbjct: 62 GAWDWSMLWKRVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWG 121
Query: 159 GENFELSFKIWMCGGSIEWVPCSRIVSLIRPVFKADGKLGNLEPPL 204
GENFE+S+KIW CGG + +VPCSR+ +++ +G GN PPL
Sbjct: 122 GENFEISYKIWQCGGKLLFVPCSRVGH----IYRLEGWQGN-PPPL 162
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/83 (46%), Positives = 54/83 (65%), Gaps = 2/83 (2%)
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYR 412
G+V+++LDAHCEV +NW PL+API DR I TVP+ID I T+E + + D + R
Sbjct: 2 GQVLIYLDAHCEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYAR 61
Query: 413 GIFEWGMLYKENELPEREAKKRK 435
G ++W ML+K L RE + RK
Sbjct: 62 GAWDWSMLWKRVPLTSREKRLRK 84
>gi|198474477|ref|XP_001356707.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
gi|198138408|gb|EAL33772.2| GA16586 [Drosophila pseudoobscura pseudoobscura]
Length = 646
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 129/202 (63%), Gaps = 4/202 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PD+R+E+CK Y LP SVI +F+NE FS+L+R+++S+I R
Sbjct: 149 GFNGLLSDMISVNRSVPDVRLEQCKTRKYLSKLPNISVIFIFYNEHFSALLRSIYSVINR 208
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q+L+DY+ F V ++RN ER+GLI R GAK +
Sbjct: 209 TPVELLKQIVLVDDGSDWDTLKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVAT 268
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
GEV+VF D+H EV NWLPPLL PI + KI T P++D ID+ + + Y+ RG
Sbjct: 269 GEVLVFFDSHIEVNYNWLPPLLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGG 326
Query: 415 FEWGMLYKE-NELPEREAKKRK 435
F+W YK+ LPE K +
Sbjct: 327 FDWRFFYKQLPVLPEDSVDKSQ 348
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/189 (47%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L ++L+DY+ F V ++RN ER+GLI R GAK + GEV+VF D+H EV NWLPP
Sbjct: 229 LKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVATGEVLVFFDSHIEVNYNWLPP 288
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI + KI T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 289 LLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGGFDWRFFYKQLPVLPEDSVDK 346
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S+P+++P GGLFA+ FF +LGGYD L +WGGE +ELSFKIWMCGG + +P
Sbjct: 347 ----SQPFRNPVMMGGLFAIRTDFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDIP 402
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 403 CSRVAHIFR 411
>gi|427778457|gb|JAA54680.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 568
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/172 (50%), Positives = 112/172 (65%), Gaps = 6/172 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
VRLIR REGLIR R GA + G+V+VFLD+HCEV + WL P+LA I ++R +T PV
Sbjct: 171 VRLIRTPAREGLIRARMYGAHNASGQVLVFLDSHCEVNVGWLEPMLARIGANRTTVTCPV 230
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y RG F WG+ +K P ++ +P SPT AGGL
Sbjct: 231 IDIINADTFE----YSASPIVRGGFNWGLHFKWESPPRLRGPQQAI--DPIPSPTMAGGL 284
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
FAMDR +F ELG YD G+ +WGGEN E+SF+IWMCGG +E +PCSR+ + R
Sbjct: 285 FAMDRQYFHELGEYDDGMDIWGGENLEISFRIWMCGGRLEILPCSRVGHVFR 336
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/196 (39%), Positives = 109/196 (55%), Gaps = 22/196 (11%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYP-LDLPKASVILVFHNEGFSSLMRTVHSI 292
++ N+ SN + R++PD R C+ ++ LP ASV++ F+NE +S+L+RTVHSI
Sbjct: 88 QHAFNVLISNRLGKVRSLPDTRNPLCRQQEFQEQSLPTASVVVCFYNEAWSALVRTVHSI 147
Query: 293 IKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAK 351
++RTPA L L Y+ VRLIR REGLIR R GA
Sbjct: 148 LERTPAALL----------------HXLSRYVASELPSHVRLIRTPAREGLIRARMYGAH 191
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV + WL P+LA I ++R +T PVID I+ T+E Y
Sbjct: 192 NASGQVLVFLDSHCEVNVGWLEPMLARIGANRTTVTCPVIDIINADTFE----YSASPIV 247
Query: 412 RGIFEWGMLYKENELP 427
RG F WG+ +K P
Sbjct: 248 RGGFNWGLHFKWESPP 263
>gi|426337441|ref|XP_004032714.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gorilla
gorilla gorilla]
Length = 940
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 126/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P+ + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPQGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|195148068|ref|XP_002014996.1| GL18655 [Drosophila persimilis]
gi|194106949|gb|EDW28992.1| GL18655 [Drosophila persimilis]
Length = 646
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/202 (45%), Positives = 129/202 (63%), Gaps = 4/202 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PD+R+E+CK Y LP SVI +F+NE FS+L+R+++S+I R
Sbjct: 149 GFNGLLSDMISVNRSVPDVRLEQCKTRKYLSKLPNISVIFIFYNEHFSALLRSIYSVINR 208
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q+L+DY+ F V ++RN ER+GLI R GAK +
Sbjct: 209 TPVELLKQIVLVDDGSDWDTLKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVAT 268
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
GEV+VF D+H EV NWLPPLL PI + KI T P++D ID+ + + Y+ RG
Sbjct: 269 GEVLVFFDSHIEVNYNWLPPLLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGG 326
Query: 415 FEWGMLYKE-NELPEREAKKRK 435
F+W YK+ LPE K +
Sbjct: 327 FDWRFFYKQLPVLPEDSVDKSQ 348
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L ++L+DY+ F V ++RN ER+GLI R GAK + GEV+VF D+H EV NWLPP
Sbjct: 229 LKQQLDDYVSLHFPHVVTVVRNVERKGLIGARLEGAKVATGEVLVFFDSHIEVNYNWLPP 288
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI + KI T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 289 LLEPIAINPKISTCPIVDIIDHSNFAYNGGYQ--EGSRGGFDWRFFYKQLPVLPEDSVDK 346
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S+P+++P GGLFA+ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 347 ----SQPFRNPVMMGGLFAIRTDFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVP 402
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 403 CSRVAHIFR 411
>gi|32698686|ref|NP_055383.1| polypeptide N-acetylgalactosaminyltransferase 5 [Homo sapiens]
gi|51315940|sp|Q7Z7M9.1|GALT5_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|30841528|gb|AAP34404.1| GalNAc-T5 [Homo sapiens]
gi|119631854|gb|EAX11449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|148745655|gb|AAI42677.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|158257740|dbj|BAF84843.1| unnamed protein product [Homo sapiens]
Length = 940
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 86/225 (38%), Positives = 128/225 (56%), Gaps = 6/225 (2%)
Query: 203 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 489
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
+LP SVI+ F +E +S+L+R+VHS+I R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDK 549
Query: 323 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|391345232|ref|XP_003746894.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Metaseiulus occidentalis]
Length = 585
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/173 (51%), Positives = 112/173 (64%), Gaps = 4/173 (2%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVR+IR EREGLIR R GA S G+V+VFLD+H EV WL PLL PI ++ +T P
Sbjct: 188 KVRVIRTREREGLIRARMIGAHNSTGDVLVFLDSHVEVNERWLQPLLVPIQQNQTTVTCP 247
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+E Y P +G F WGM ++ + LP+ K K P SPT AGG
Sbjct: 248 VIDIINADTFE----YSPSPLVKGGFNWGMHFRWDNLPKGYFKSEKERIAPLPSPTMAGG 303
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LFA+ + F LG YD G+ VWGGEN ELSF+IWMCGGS++ +PCSR+ + R
Sbjct: 304 LFAIHKDEFRRLGEYDWGMDVWGGENLELSFRIWMCGGSLKIMPCSRVGHVFR 356
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 127/202 (62%), Gaps = 6/202 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
++ N S I R +PD R CK Y DLP+ASVI+ F+NE +S+L+RTV+S++
Sbjct: 94 QHAFNTLVSERIGLRRRVPDTRDALCKQQKYSKDLPRASVIICFYNEAWSTLIRTVNSVL 153
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
R+P+ L+EIILVDD S A+L + L ++Q+ + KVR+IR EREGLIR R GA S
Sbjct: 154 DRSPSALLQEIILVDDLSDIAEL-EPLAGFVQK-HEKVRVIRTREREGLIRARMIGAHNS 211
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
G+V+VFLD+H EV WL PLL PI ++ +T PVID I+ T+E Y P +G
Sbjct: 212 TGDVLVFLDSHVEVNERWLQPLLVPIQQNQTTVTCPVIDIINADTFE----YSPSPLVKG 267
Query: 414 IFEWGMLYKENELPEREAKKRK 435
F WGM ++ + LP+ K K
Sbjct: 268 GFNWGMHFRWDNLPKGYFKSEK 289
>gi|328792011|ref|XP_624873.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Apis mellifera]
Length = 637
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 123/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +K++ YI FNGKV+ + +REGLIR R GA+++ GE+++FLD+H EV W+ P
Sbjct: 215 LHEKIKIYIANNFNGKVKYFKTEKREGLIRARIFGARKATGEILIFLDSHIEVNRQWIEP 274
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I + I +PVID I+ T++ Y RG F WG+ +K + +P
Sbjct: 275 LLSRIVYSKTITAMPVIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVPIGTFVHD 330
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P KSPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGGSIE +PC
Sbjct: 331 EDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPC 390
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 391 SRVGHVFR 398
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 121/200 (60%), Gaps = 5/200 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D Y N+ S++I R +PD R + C+ Y L AS+++ F+NE + +L+R+
Sbjct: 128 DEGYKNYSFNILVSDNIGLHRELPDTRHKLCELQKYSSKLSNASIVICFYNEHYMTLLRS 187
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRS 347
+HSII RTP L EIILV+D+S L +K++ YI FNGKV+ + +REGLIR R
Sbjct: 188 LHSIIDRTPTNLLHEIILVNDWSDSKILHEKIKIYIANNFNGKVKYFKTEKREGLIRARI 247
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA+++ GE+++FLD+H EV W+ PLL+ I + I +PVID I+ T++ Y
Sbjct: 248 FGARKATGEILIFLDSHIEVNRQWIEPLLSRIVYSKTITAMPVIDIINPDTFQ----YTG 303
Query: 408 DHHYRGIFEWGMLYKENELP 427
RG F WG+ +K + +P
Sbjct: 304 SPLVRGGFNWGLHFKWDNVP 323
>gi|114581297|ref|XP_525944.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Pan troglodytes]
gi|410296312|gb|JAA26756.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
gi|410333399|gb|JAA35646.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
Length = 940
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|395732382|ref|XP_002812541.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Pongo abelii]
Length = 967
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 570 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 628
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 629 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 685
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 686 IKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 745
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 746 SRVGHIFR 753
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 83/246 (33%), Positives = 126/246 (51%), Gaps = 31/246 (12%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR------------- 255
+ PG+ G+ +P + E N+ S+ I DR I D R
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGGQLFLPLFPY 495
Query: 256 --------------MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
+ C +LP SVI+ F +E +S+L+R+VHS++ R+P +
Sbjct: 496 SHMTLAEIKTPLFLIHGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLI 555
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
+EI+LVDDFS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FL
Sbjct: 556 KEILLVDDFSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFL 614
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 421
D+H E + WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M +
Sbjct: 615 DSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNF 671
Query: 422 KENELP 427
+P
Sbjct: 672 GWRTIP 677
>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 654
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/271 (40%), Positives = 151/271 (55%), Gaps = 18/271 (6%)
Query: 173 GSIEWVPCSRIVSLIRPVFKADGKLGNLEPPLEP-----YKEGPGEGGKAY-----HLPE 222
GS E V SR L A KLG L P P Y GPGEGGKAY L
Sbjct: 150 GSKEPVESSRSHKL-----SAIAKLG-LSPSTPPPRSDEYSTGPGEGGKAYTINREDLSP 203
Query: 223 AYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGF 282
A + D + N S+ IS R +PD R CK Y +LP AS+I+ FHNE +
Sbjct: 204 AEQIIFDKGWEDNAYNQYASDRISVRRYLPDYREGTCKVNQYGSNLPSASIIICFHNEAW 263
Query: 283 SSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGL 342
S L+R+VHS+I R+P L+EIILVDDFS + L + LE+Y+ N V+++R +REGL
Sbjct: 264 SVLLRSVHSVIDRSPPNLLQEIILVDDFSDRPHLKEALEEYMGMLN-IVKIVRTKQREGL 322
Query: 343 IRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR 402
IR R GA+ S G+V+VFLD+H E WL PLL I + I+ VPVI I+ +T +
Sbjct: 323 IRARMIGAELSTGKVLVFLDSHIECTTGWLEPLLDRIAYNSSIVVVPVISTINDKTLKM- 381
Query: 403 SVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+ + D+ G F+W + ++ +E ER+ +
Sbjct: 382 NFLKADNVQVGGFDWSLTFRWHEQTERDRNR 412
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 117/187 (62%), Gaps = 2/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+ N V+++R +REGLIR R GA+ S G+V+VFLD+H E WL PL
Sbjct: 297 LKEALEEYMGMLN-IVKIVRTKQREGLIRARMIGAELSTGKVLVFLDSHIECTTGWLEPL 355
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + I+ VPVI I+ +T + + + D+ G F+W + ++ +E ER+ +
Sbjct: 356 LDRIAYNSSIVVVPVISTINDKTLKM-NFLKADNVQVGGFDWSLTFRWHEQTERDRNRSG 414
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SPT AGGLFA+ R +F LG YD G+ +WGGEN ELSFK+WMCGG +E V CS
Sbjct: 415 APYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKVWMCGGILETVVCS 474
Query: 182 RIVSLIR 188
+ + R
Sbjct: 475 LVGHIFR 481
>gi|194759472|ref|XP_001961971.1| GF15238 [Drosophila ananassae]
gi|190615668|gb|EDV31192.1| GF15238 [Drosophila ananassae]
Length = 663
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 122/189 (64%), Gaps = 3/189 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PD+R+EECK Y LP SVI +F NE S+L+R++HS++ R
Sbjct: 164 GFNGLLSDRISVNRSVPDVRLEECKTRKYLAKLPNVSVIFIFFNEYLSTLLRSIHSVVNR 223
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L +L+DY+ F G V ++RN ER GLI R GAK +
Sbjct: 224 TPPELLKQIVLVDDGSDWESLKHQLDDYVSIHFPGLVDIVRNPERRGLIGARIAGAKVAV 283
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VF D+H E NWLPPLL PI + KI T P+ID ID+ T+ + ++ RG
Sbjct: 284 GDVMVFFDSHIEANYNWLPPLLEPIAINNKICTCPMIDSIDHATFSYHGGHQ--EGARGG 341
Query: 415 FEWGMLYKE 423
F+W M YK+
Sbjct: 342 FDWKMYYKQ 350
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 119/188 (63%), Gaps = 6/188 (3%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L+DY+ F G V ++RN ER GLI R GAK + G+V+VF D+H E NWLPP
Sbjct: 244 LKHQLDDYVSIHFPGLVDIVRNPERRGLIGARIAGAKVAVGDVMVFFDSHIEANYNWLPP 303
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL PI + KI T P+ID ID+ T+ + ++ RG F+W M YK+ + ++ +
Sbjct: 304 LLEPIAINNKICTCPMIDSIDHATFSYHGGHQ--EGARGGFDWKMYYKQLPVLAEDSIDK 361
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
S P++SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VPC
Sbjct: 362 ---SLPFRSPVMMGGLFAINTDFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVPC 418
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 419 SHVAHIFR 426
>gi|380016857|ref|XP_003692388.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like,
partial [Apis florea]
Length = 556
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 123/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +K++ YI FNGKV+ + +REGLIR R GA+++ GE+++FLD+H EV W+ P
Sbjct: 134 LHEKIKIYIANNFNGKVKYFKTEKREGLIRARIFGARKATGEILIFLDSHIEVNKQWIEP 193
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I + I +PVID I+ T++ Y RG F WG+ +K + +P
Sbjct: 194 LLSRIVYSKTITAMPVIDIINPDTFQ----YTGSPLVRGGFNWGLHFKWDNVPIGTFVHD 249
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +P KSPT AGGLFAM+R +F +LG YD G+ +WGGEN E+SF+IWMCGGSIE +PC
Sbjct: 250 EDFVKPIKSPTMAGGLFAMNREYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPC 309
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 310 SRVGHVFR 317
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 121/200 (60%), Gaps = 5/200 (2%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
D Y N+ S++I R +PD R + C+ Y L AS+++ F+NE + +L+R+
Sbjct: 47 DEGYKNYSFNILVSDNIGLHRELPDTRHKLCEIQKYSSKLSNASIVICFYNEHYMTLLRS 106
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRS 347
+HSII RTP L EIILV+D+S L +K++ YI FNGKV+ + +REGLIR R
Sbjct: 107 LHSIIDRTPTYLLHEIILVNDWSDSKILHEKIKIYIANNFNGKVKYFKTEKREGLIRARI 166
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
GA+++ GE+++FLD+H EV W+ PLL+ I + I +PVID I+ T++ Y
Sbjct: 167 FGARKATGEILIFLDSHIEVNKQWIEPLLSRIVYSKTITAMPVIDIINPDTFQ----YTG 222
Query: 408 DHHYRGIFEWGMLYKENELP 427
RG F WG+ +K + +P
Sbjct: 223 SPLVRGGFNWGLHFKWDNVP 242
>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
Length = 688
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 90/194 (46%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 258 LKKQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSAVLTYLDSHCECTEGWLEPL 316
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I T E+ HYR G F+W + + + +PE
Sbjct: 317 LDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGFDWNLQFSWHSVPE 368
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ ++EP SPT AGGLFA+DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 369 RERKRHNNSAEPVYSPTMAGGLFAIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 428
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 429 LEIVPCSHVGHIFR 442
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/246 (37%), Positives = 135/246 (54%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP + + A D + N S+ +S R++PD R
Sbjct: 138 IDPPGN-FEENPGEMGKPVRLPKEMPDDMKKAVDDGWTKNAFNQYVSDLVSVHRSLPDPR 196
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LP VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 197 DAWCKDSTQYLTNLPTTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 256
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 257 HLKKQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSAVLTYLDSHCECTEGWLEP 315
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I T E+ HYR G F+W + + + +P
Sbjct: 316 LLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGFDWNLQFSWHSVP 367
Query: 428 EREAKK 433
ERE K+
Sbjct: 368 ERERKR 373
>gi|195359229|ref|XP_002045319.1| GM11142 [Drosophila sechellia]
gi|194122575|gb|EDW44618.1| GM11142 [Drosophila sechellia]
Length = 658
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L+DY+++ F V ++RN ER+GLI R GAK + G+V+VF D+H EV NWLPP
Sbjct: 240 LKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHIEVNYNWLPP 299
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
L+ PI + KI T P++D I ++ + + S + RG F+W MLYK+ LPE K
Sbjct: 300 LIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGGFDWKMLYKQLPVLPEDALDK 357
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 358 ----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVP 413
Query: 180 CSRIVSLIRPVFKADGK 196
CSR+ + R K G
Sbjct: 414 CSRVAHIFRGPMKPRGN 430
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 86/189 (45%), Positives = 125/189 (66%), Gaps = 3/189 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PD+R+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 219
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 220 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 279
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 280 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGG 337
Query: 415 FEWGMLYKE 423
F+W MLYK+
Sbjct: 338 FDWKMLYKQ 346
>gi|148664577|gb|EDK96993.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
gi|148664578|gb|EDK96994.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
Length = 400
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 122/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PL
Sbjct: 4 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPL 63
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 64 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 120
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR +F E+G YD G+ +WGGEN E+SF+IW CGG++E V C
Sbjct: 121 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 180
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 181 SHVGHVFR 188
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/123 (43%), Positives = 74/123 (60%), Gaps = 3/123 (2%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L + LE Y+++ V +IR +R GLIR R +GA SRG+VI FLDAHCE WL PL
Sbjct: 4 LKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPL 63
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
LA I DR+ + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 64 LARIKHDRRTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 120
Query: 436 YNR 438
+R
Sbjct: 121 GDR 123
>gi|195576344|ref|XP_002078036.1| GD23236 [Drosophila simulans]
gi|194190045|gb|EDX03621.1| GD23236 [Drosophila simulans]
Length = 674
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L+DY+++ F V ++RN ER+GLI R GAK + G+V+VF D+H EV NWLPP
Sbjct: 255 LKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHIEVNYNWLPP 314
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
L+ PI + KI T P++D I ++ + + S + RG F+W MLYK+ LPE K
Sbjct: 315 LIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGGFDWKMLYKQLPVLPEDALDK 372
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 373 ----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVP 428
Query: 180 CSRIVSLIRPVFKADGK 196
CSR+ + R K G
Sbjct: 429 CSRVAHIFRGPMKPRGN 445
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 86/189 (45%), Positives = 125/189 (66%), Gaps = 3/189 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PD+R+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 175 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 234
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 235 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 294
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 295 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPIVDTISHEDFSYFSGNK--DGARGG 352
Query: 415 FEWGMLYKE 423
F+W MLYK+
Sbjct: 353 FDWKMLYKQ 361
>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 697
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/271 (40%), Positives = 151/271 (55%), Gaps = 18/271 (6%)
Query: 173 GSIEWVPCSRIVSLIRPVFKADGKLGNLEPPLEP-----YKEGPGEGGKAY-----HLPE 222
GS E V SR L A KLG L P P Y GPGEGGKAY L
Sbjct: 150 GSKEPVESSRSHKL-----SAIAKLG-LSPSTPPPRSDEYSTGPGEGGKAYTINREDLSP 203
Query: 223 AYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGF 282
A + D + N S+ IS R +PD R CK Y +LP AS+I+ FHNE +
Sbjct: 204 AEQIIFDKGWEDNAYNQYASDRISVRRYLPDYREGTCKVNQYGSNLPSASIIICFHNEAW 263
Query: 283 SSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGL 342
S L+R+VHS+I R+P L+EIILVDDFS + L + LE+Y+ N V+++R +REGL
Sbjct: 264 SVLLRSVHSVIDRSPPNLLQEIILVDDFSDRPHLKEALEEYMGMLN-IVKIVRTKQREGL 322
Query: 343 IRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR 402
IR R GA+ S G+V+VFLD+H E WL PLL I + I+ VPVI I+ +T +
Sbjct: 323 IRARMIGAELSTGKVLVFLDSHIECTTGWLEPLLDRIAYNSSIVVVPVISTINDKTLKM- 381
Query: 403 SVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+ + D+ G F+W + ++ +E ER+ +
Sbjct: 382 NFLKADNVQVGGFDWSLTFRWHEQTERDRNR 412
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 117/187 (62%), Gaps = 2/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+ N V+++R +REGLIR R GA+ S G+V+VFLD+H E WL PL
Sbjct: 297 LKEALEEYMGMLN-IVKIVRTKQREGLIRARMIGAELSTGKVLVFLDSHIECTTGWLEPL 355
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + I+ VPVI I+ +T + + + D+ G F+W + ++ +E ER+ +
Sbjct: 356 LDRIAYNSSIVVVPVISTINDKTLKM-NFLKADNVQVGGFDWSLTFRWHEQTERDRNRSG 414
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SPT AGGLFA+ R +F LG YD G+ +WGGEN ELSFK+WMCGG +E V CS
Sbjct: 415 APYSPVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKVWMCGGILETVVCS 474
Query: 182 RIVSLIR 188
+ + R
Sbjct: 475 LVGHIFR 481
>gi|312082212|ref|XP_003143351.1| glycosyl transferase [Loa loa]
Length = 580
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 123/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE Y+ +F KVR++R +REGLIR R +GA S+G VI +LD+HCE W+ PL
Sbjct: 204 LKASLEIYMRQFP-KVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWMEPL 262
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREAKKR 120
L I + K + PVID ID T+E+ Y + G F+W + + + +PE++ K R
Sbjct: 263 LDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAIPEKDRKGR 320
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + +P KSPT AGGLF++DR FF +LG YDPGL +WGGEN ELSFK WMCGG +E VPC
Sbjct: 321 R-DIDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEIVPC 379
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 380 SHVGHIFR 387
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/244 (40%), Positives = 137/244 (56%), Gaps = 24/244 (9%)
Query: 197 LGNLEPPLEPYKEG----PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
L N + P+ YK G PGEGGKA + A + + + G N N
Sbjct: 97 LFNRDSPI--YKSGDEHQPGEGGKAVIIDRNKLAFSEKRIYDDGFNKNAFN--------- 145
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
+CK Y DLP SVI+ FHNE +S L+RTVHS+++RTP L EIILVDDFS
Sbjct: 146 -----QCKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFSD 200
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
A L LE Y+++F KVR++R +REGLIR R +GA S+G VI +LD+HCE W+
Sbjct: 201 MAHLKASLEIYMRQF-PKVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWM 259
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-GIFEWGMLYKENELPEREA 431
PLL I + K + PVID ID T+E+ Y + G F+W + + + +PE++
Sbjct: 260 EPLLDRIKKNPKTVVCPVIDVIDDNTFEYH--YSKAYFTNVGGFDWSLQFNWHAIPEKDR 317
Query: 432 KKRK 435
K R+
Sbjct: 318 KGRR 321
>gi|307198758|gb|EFN79561.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Harpegnathos
saltator]
Length = 606
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 124/188 (65%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +K++ YI F+ KV+ + +REGLIR R GA+++ G+V++FLD+H EV W+ P
Sbjct: 184 LHEKIKVYITNNFDAKVQFFKTDKREGLIRARVFGARKATGDVLIFLDSHIEVNEVWIEP 243
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I + I+ +PVID I+ T++ Y RG F WG+ +K + LP K+
Sbjct: 244 LLSRIAHSKTIVAMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTLKQE 299
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P KSPT AGGLFA+DR +F ++G YD G+ VWGGEN E+SF+IWMCGG+IE +PC
Sbjct: 300 DDFVKPIKSPTMAGGLFAIDREYFTKIGEYDTGMDVWGGENLEISFRIWMCGGNIELIPC 359
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 360 SRVGHVFR 367
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 89/233 (38%), Positives = 140/233 (60%), Gaps = 11/233 (4%)
Query: 200 LEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
L+P P++P +G E G +L + + D Y N+ S+++ RT+PD R
Sbjct: 66 LQPVPVKPAVTLDQGLDELGMVKNLDDQRKR--DEGYKNYSFNVLISDNLGVLRTLPDTR 123
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
+ C+ YP +LP AS+I+ F+NE + +L+R++HSII +TP L EIILV+D+S
Sbjct: 124 HKLCRARKYPTNLPNASIIICFYNEHYMTLLRSLHSIIDKTPTSLLHEIILVNDYSDSNI 183
Query: 316 LDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +K++ YI F+ KV+ + +REGLIR R GA+++ G+V++FLD+H EV W+ P
Sbjct: 184 LHEKIKVYITNNFDAKVQFFKTDKREGLIRARVFGARKATGDVLIFLDSHIEVNEVWIEP 243
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
LL+ I + I+ +PVID I+ T++ Y RG F WG+ +K + LP
Sbjct: 244 LLSRIAHSKTIVAMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLP 292
>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
grunniens mutus]
Length = 566
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 116/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+ + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 170 LKTQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 228
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D ++ PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 229 LERIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 285
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K EP++SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 286 KSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 345
Query: 181 SRI 183
S +
Sbjct: 346 SHV 348
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 98/259 (37%), Positives = 145/259 (55%), Gaps = 14/259 (5%)
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMN 238
P + V L RP+++ +PP + + G L E+ + + Y +N
Sbjct: 40 PGKKAVDLSRPLYE--------KPPADSHALGEWGKASKLQLSESELKQQEELIERYAIN 91
Query: 239 METSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
+ S+ IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +P
Sbjct: 92 IYLSDRISLHRHIEDKRMYECKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSP 151
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
A L+EIILVDD S + L +LE Y+ + +VRLIR +REGL+R R GA + G+V
Sbjct: 152 AVLLKEIILVDDLSDRVYLKTQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDV 210
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFE 416
+ FLD HCE WL PLL I D ++ PVID ID+ T+EF EP G F+
Sbjct: 211 LTFLDCHCECNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFD 267
Query: 417 WGMLYKENELPEREAKKRK 435
W + ++ + +P+ E +RK
Sbjct: 268 WRLTFQWHSVPKHERDRRK 286
>gi|344265184|ref|XP_003404666.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 10-like [Loxodonta
africana]
Length = 602
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/185 (48%), Positives = 118/185 (63%), Gaps = 6/185 (3%)
Query: 6 LEDYIERFNGKVRLIRNT--EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
L + R +G +R + E EGLIRTR GA + +VI FLD+HCE +NWLPPLL
Sbjct: 192 LHKPLXRLHGPFPSVRISVPETEGLIRTRMLGASAAIXDVITFLDSHCEANVNWLPPLLD 251
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 123
I +RK + P+ID ID+ +FR + RG F+W M YK +P K
Sbjct: 252 RIARNRKTIVCPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIPPELQKADP-- 307
Query: 124 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
S+P++SP AGGLFA+DR +F ELGGYDPGL +WGGE +E+SFK+WMCGG +E +PCSR+
Sbjct: 308 SDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRV 367
Query: 184 VSLIR 188
+ R
Sbjct: 368 GHIYR 372
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 132/220 (60%), Gaps = 10/220 (4%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ Y + +A R D + E G N+ S+ IS +R++PD+R C Y LP
Sbjct: 88 GHGEQGRPYPMTDAERV--DQAYRENGFNIYISDKISLNRSLPDIRHPNCNSKRYLEMLP 145
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ FHNEG+SSL+RTVHS++ R+P + + EI+LVDDFS + L + L R +G
Sbjct: 146 NTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLHKPL----XRLHG 201
Query: 330 KVRLIRNT--EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+R + E EGLIRTR GA + +VI FLD+HCE +NWLPPLL I +RK +
Sbjct: 202 PFPSVRISVPETEGLIRTRMLGASAAIXDVITFLDSHCEANVNWLPPLLDRIARNRKTIV 261
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
P+ID ID+ +FR + RG F+W M YK +P
Sbjct: 262 CPMIDVIDHD--DFRYETQAGDAMRGAFDWEMYYKRIPIP 299
>gi|332233960|ref|XP_003266176.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Nomascus
leucogenys]
Length = 940
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDIIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIP 650
>gi|195030212|ref|XP_001987962.1| GH10908 [Drosophila grimshawi]
gi|193903962|gb|EDW02829.1| GH10908 [Drosophila grimshawi]
Length = 684
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 99/229 (43%), Positives = 138/229 (60%), Gaps = 9/229 (3%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEY---GMNMETSNHISFDRTIPDLRMEECKYWDYPL 266
G GE G + +A A + EY G N S+ IS +R++PD+R EECK Y
Sbjct: 161 GFGEHGLPVQIEDA--AEKELEQKEYRRNGFNGFISDRISVNRSVPDVRREECKTRKYLA 218
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI-Q 325
LP+ SV+++F+NE F +L+RTV+SII RTP + L++I+LVDD S L Q+L+DY+ Q
Sbjct: 219 KLPRVSVVIIFYNEHFQTLLRTVYSIINRTPTELLQQIVLVDDGSEWETLKQQLDDYVAQ 278
Query: 326 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
+ V ++ + ER+GLI R GAK S GE +VF D+H EV NWLPPLL PI + KI
Sbjct: 279 HWPHLVDVVHSPERQGLIGARLAGAKVSMGEAMVFFDSHIEVNYNWLPPLLEPIAINNKI 338
Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 433
T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 339 ATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK 385
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 89/189 (47%), Positives = 119/189 (62%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L ++L+DY+ + + V ++ + ER+GLI R GAK S GE +VF D+H EV NWLPP
Sbjct: 268 LKQQLDDYVAQHWPHLVDVVHSPERQGLIGARLAGAKVSMGEAMVFFDSHIEVNYNWLPP 327
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI + KI T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 328 LLEPIAINNKIATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLAVLPEDSVDK 385
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY++P GGLFA+ FF +LGGYD GL +WGGE +ELSFKIWMCGG + VP
Sbjct: 386 ----SLPYRNPVMIGGLFAIASEFFWDLGGYDDGLQIWGGEQYELSFKIWMCGGMLLDVP 441
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 442 CSRVAHIFR 450
>gi|6688167|emb|CAB65104.1| GalNAc-T5 [Homo sapiens]
Length = 668
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 271 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 329
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 330 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 386
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 387 IKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 446
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 447 SRVGHIFR 454
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 164 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 223
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS+I R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 224 PTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 282
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 283 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 342
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 343 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 378
>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
Length = 578
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 116/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+ + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D ++ PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K EP++SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 KSRIEPFRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 98/259 (37%), Positives = 145/259 (55%), Gaps = 14/259 (5%)
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMN 238
P + V L RP+++ +PP + + G L E+ + + Y +N
Sbjct: 52 PGKKAVDLSRPLYE--------KPPADSHALGEWGKASKLQLSESELKQQEELIERYAIN 103
Query: 239 METSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
+ S+ IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +P
Sbjct: 104 IYLSDRISLHRHIEDKRMYECKSKKFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSP 163
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
A L+EIILVDD S + L +LE Y+ + +VRLIR +REGL+R R GA + G+V
Sbjct: 164 AVLLKEIILVDDLSDRVYLKTQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDV 222
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFE 416
+ FLD HCE WL PLL I D ++ PVID ID+ T+EF EP G F+
Sbjct: 223 LTFLDCHCECNTGWLEPLLERIRKDETVVICPVIDTIDWNTFEFYMQTGEP---MIGGFD 279
Query: 417 WGMLYKENELPEREAKKRK 435
W + ++ + +P+ E +RK
Sbjct: 280 WRLTFQWHSVPKHERDRRK 298
>gi|170589103|ref|XP_001899313.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158593526|gb|EDP32121.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 636
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LE+++++ N V++IR +REGLIR R RGA+ +VIVFLDAH EV NWLP
Sbjct: 223 LKEPLEEFLKKAGLNHIVKVIRTEKREGLIRARQRGARHITADVIVFLDAHSEVNYNWLP 282
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+ PI D K++ P ID ID T+E+R+ D RG F+W YK L E
Sbjct: 283 PLVEPIALDYKMVVCPFIDVIDCNTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DN 336
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+K + P+ SP AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ P
Sbjct: 337 KKNPTRPFHSPVMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAP 396
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 397 CSRVGHIYR 405
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 97/234 (41%), Positives = 138/234 (58%), Gaps = 8/234 (3%)
Query: 208 KEGPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 266
++G GEGGK + ++ D G + S+ I+ +R++ D+R CK Y
Sbjct: 114 RQGLGEGGKPVVVAISEFKKLRDDLYRINGYDAYISDLIALNRSVKDIRHSGCKNMVYLE 173
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LP V+ F+NE S+L+R+V+S+I R+P + EIILVDD S+KA L + LE+++++
Sbjct: 174 KLPTVGVVFPFYNEHNSTLLRSVYSVINRSPKDIMREIILVDDGSTKAFLKEPLEEFLKK 233
Query: 327 --FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
N V++IR +REGLIR R RGA+ +VIVFLDAH EV NWLPPL+ PI D K
Sbjct: 234 AGLNHIVKVIRTEKREGLIRARQRGARHITADVIVFLDAHSEVNYNWLPPLVEPIALDYK 293
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
++ P ID ID T+E+R+ D RG F+W YK LP E K+ R
Sbjct: 294 MVVCPFIDVIDCNTYEYRA---QDEGGRGSFDWEFNYK--RLPLTEDNKKNPTR 342
>gi|6525067|gb|AAF15313.1|AF154107_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 5 [Homo
sapiens]
Length = 610
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 213 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 271
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 272 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 328
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 329 IKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 388
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 389 SRVGHIFR 396
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 106 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVXNNL 165
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS+I R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 166 PTTSVIMCFVDEVWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 224
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 225 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 284
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 285 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 320
>gi|51316006|sp|Q8IA42.2|GALT4_DROME RecName: Full=N-acetylgalactosaminyltransferase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4; Short=pp-GaNTase 4
gi|34042946|gb|AAQ56701.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 659
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L+DY+++ F V ++RN ER+GLI R GAK + G+V+VF D+H EV NWLPP
Sbjct: 240 LKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHIEVNYNWLPP 299
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
L+ PI + KI T P++D I ++ + + S + RG F+W MLYK+ LPE K
Sbjct: 300 LIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGGFDWKMLYKQLPVLPEDALDK 357
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 358 ----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVP 413
Query: 180 CSRIVSLIRPVFKADGK 196
CSR+ + R K G
Sbjct: 414 CSRVAHIFRGPMKPRGN 430
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 125/189 (66%), Gaps = 3/189 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PDLR+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 219
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 220 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 279
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 280 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 337
Query: 415 FEWGMLYKE 423
F+W MLYK+
Sbjct: 338 FDWKMLYKQ 346
>gi|390350617|ref|XP_784979.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 647
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 134/229 (58%), Gaps = 5/229 (2%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
GE GK + DA + N+ S+ I+F+R++PD+R ++CK YP LP
Sbjct: 235 GEMGKPVIFEGDMKTHADALYHKNAFNLLASDMIAFNRSLPDVRPQQCKSLVYPEVLPTT 294
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF---N 328
SVI++FHNE FS+L+RTVHS+I R+P L+EIILVDD S++ L KL+DYI R +
Sbjct: 295 SVIIIFHNEAFSALLRTVHSVINRSPRHLLKEIILVDDASTQEHLKVKLDDYISRHFHSS 354
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
+VR+ R R GLIR R GA + G+++ FLD+HCEV + WL PLLA I DR+ +
Sbjct: 355 ARVRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEPLLAVIDKDRRNVVT 414
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
P ID ID ++ + G F W M ++ + + ++ K N
Sbjct: 415 PTIDVIDDNDLAYKGSDQLPQ--VGSFGWTMAFRWTAIQTMDLEEAKRN 461
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 82/191 (42%), Positives = 115/191 (60%), Gaps = 6/191 (3%)
Query: 2 LDKKLEDYIERF---NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
L KL+DYI R + +VR+ R R GLIR R GA + G+++ FLD+HCEV + WL
Sbjct: 339 LKVKLDDYISRHFHSSARVRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWL 398
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PLLA I DR+ + P ID ID ++ + G F W M ++ + + +
Sbjct: 399 EPLLAVIDKDRRNVVTPTIDVIDDNDLAYKGSDQLPQ--VGSFGWTMAFRWTAIQTMDLE 456
Query: 119 KRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEW 177
+ K N + P +SPT AGGLF++D+ +F+ELG YDPG +WG EN ELSFK WMCGGS+
Sbjct: 457 EAKRNPTLPIRSPTMAGGLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYT 516
Query: 178 VPCSRIVSLIR 188
+ CS + + R
Sbjct: 517 MACSHVGHIFR 527
>gi|51316066|sp|Q95JX4.2|GLTL5_MACFA RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
gi|15207881|dbj|BAB62965.1| hypothetical protein [Macaca fascicularis]
Length = 443
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/191 (46%), Positives = 122/191 (63%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ FHNE F +L RTV S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRNKMCLQKHYPARLPTASIVICFHNEEFHALFRTVSS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F GK+++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGAS 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+V LD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVILDSHCEVNRVWLEPLLHAIAKDPKMVVRPLIDVIDDRTLE----YKPSPVV 272
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 273 RGAFDWNLQFK 283
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 119/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GK+++IRN +REGLIR R GA + G+V+V LD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFRGKIKIIRNKKREGLIRARLIGASHASGDVLVILDSHCEVNRVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 240 LLHAIAKDPKMVVRPLIDVIDDRTLE----YKPSPVVRGAFDWNLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP +GG+FA+ R +F E+G YD + WGGEN ELS +IWMCGG + +PC
Sbjct: 296 EGPTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSLRIWMCGGQLFIIPC 355
Query: 181 SRI 183
SR+
Sbjct: 356 SRV 358
>gi|432882423|ref|XP_004074023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryzias latipes]
Length = 584
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 105/251 (41%), Positives = 149/251 (59%), Gaps = 17/251 (6%)
Query: 194 DGKLGN---LEPPLEPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNMETSNHISF 247
DG L ++PP P PGE G+A L PE + + S+ Y +N+ S+ IS
Sbjct: 62 DGPLARALYIKPP--PDSSAPGEWGRATRLNLSPEE-KKLEEESVESYAINIFVSDKISL 118
Query: 248 DRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
R I D RMEEC K +DY LP SVI+ F+NE +S+L+RT+HS+++ TPA L+EII
Sbjct: 119 HRHIQDNRMEECRNKKFDY-RHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEII 177
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 365
L+DD+S + L +L +YI +VRLIR +REGL+R R GA + G+V+ FLD HC
Sbjct: 178 LIDDYSDRGYLKSQLAEYISNLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHC 236
Query: 366 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKEN 424
E W+ PLL I + + PVID ID+ ++EF EP G F+W + ++ +
Sbjct: 237 ECVPGWIEPLLERIAENASTIVCPVIDTIDWNSFEFYMQTGEP---MIGGFDWRLTFQWH 293
Query: 425 ELPEREAKKRK 435
+PE E K+RK
Sbjct: 294 SVPESERKRRK 304
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 117/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L +YI +VRLIR +REGL+R R GA + G+V+ FLD HCE W+ PL
Sbjct: 188 LKSQLAEYISNLQ-RVRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPL 246
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + + PVID ID+ ++EF EP G F+W + ++ + +PE E K+R
Sbjct: 247 LERIAENASTIVCPVIDTIDWNSFEFYMQTGEP---MIGGFDWRLTFQWHSVPESERKRR 303
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++P++SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGGS+E PC
Sbjct: 304 KSRTDPFRSPTMAGGLFAVSKVYFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPC 363
Query: 181 SRI 183
S +
Sbjct: 364 SHV 366
>gi|109099754|ref|XP_001087663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Macaca mulatta]
Length = 940
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 124/219 (56%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P +EEI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIEEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|328712307|ref|XP_001942933.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
10-like [Acyrthosiphon pisum]
Length = 592
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/220 (40%), Positives = 132/220 (60%), Gaps = 4/220 (1%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
+E + G GE G + L R D G N S+ IS +R+IPD+R + C++
Sbjct: 77 IEKQRTGIGEQGVSASLSSHNRHKYDELYKVNGFNALLSDSISVNRSIPDIRHKLCRFKK 136
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y LP SV++ FHNE FS+L+RTV+S++ R+P L+EIILVDD S+K L + L+++
Sbjct: 137 YNSKLPTVSVVIPFHNEHFSTLLRTVYSVLNRSPKILLKEIILVDDSSTKTSLKRPLDNF 196
Query: 324 I-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
+ V++I +R+GLIR R GA+++ E+++FLD+H E NWLPPLL PI D
Sbjct: 197 LSNNLADTVQIIHLKKRQGLIRARLAGARKATSEILIFLDSHTEANANWLPPLLEPITED 256
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+ P ID I ++T+E+R+ D RG F+W YK
Sbjct: 257 YRTCVCPFIDVIAFETFEYRA---QDEGARGAFDWEFFYK 293
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 119/189 (62%), Gaps = 9/189 (4%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L++++ V++I +R+GLIR R GA+++ E+++FLD+H E NWLPP
Sbjct: 189 LKRPLDNFLSNNLADTVQIIHLKKRQGLIRARLAGARKATSEILIFLDSHTEANANWLPP 248
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 119
LL PI D + P ID I ++T+E+R+ D RG F+W YK L PE
Sbjct: 249 LLEPITEDYRTCVCPFIDVIAFETFEYRA---QDEGARGAFDWEFFYKRLPLLPEDLL-- 303
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
Y ++P++SP AGGLFA+ +F ELGGYDPGL +WGGE +ELSFKIW CGG+I P
Sbjct: 304 --YPTKPFRSPVMAGGLFAISAKWFWELGGYDPGLDIWGGEQYELSFKIWQCGGTILDAP 361
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 362 CSRVGHIYR 370
>gi|355750550|gb|EHH54877.1| hypothetical protein EGM_03977 [Macaca fascicularis]
Length = 940
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/219 (37%), Positives = 124/219 (56%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|149730635|ref|XP_001491185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Equus
caballus]
Length = 940
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPDIVAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKDTDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 87/238 (36%), Positives = 131/238 (55%), Gaps = 10/238 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 423 VLKIDVTLSPRDP------KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDR 476
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 595
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
WL PLL +Y RK + PVI+ I+ + + +V D+ RG+F W M + +P
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIP 650
>gi|355564907|gb|EHH21396.1| hypothetical protein EGK_04452 [Macaca mulatta]
Length = 940
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/219 (37%), Positives = 124/219 (56%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|403258971|ref|XP_003922013.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Saimiri boliviensis boliviensis]
Length = 967
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 570 LKDNLDKYMSQF-PKVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 628
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 629 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 685
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 686 IKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 745
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 746 SRVGHIFR 753
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/245 (33%), Positives = 125/245 (51%), Gaps = 31/245 (12%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR-------------- 255
PG+ G+ +P + E N+ S+ I DR I D R
Sbjct: 437 APGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGEQLLLPLFPCS 496
Query: 256 -------------MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
+ C +LP SVI+ F +E +S+L+R+VHS++ R+P ++
Sbjct: 497 HMTLAEIKTSLFLIHGCTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIK 556
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EI+LVDDFS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD
Sbjct: 557 EILLVDDFSTKDYLKDNLDKYMSQF-PKVRILRLRERHGLIRARLAGAQNATGDVLTFLD 615
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+H E + WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M +
Sbjct: 616 SHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFG 672
Query: 423 ENELP 427
+P
Sbjct: 673 WRTIP 677
>gi|402888383|ref|XP_003907542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Papio
anubis]
Length = 940
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ K P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 127/225 (56%), Gaps = 6/225 (2%)
Query: 203 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 432 PRDP--KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQ 489
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 490 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDK 549
Query: 323 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 550 YMSQF-PKVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS 608
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 609 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|167515504|ref|XP_001742093.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778717|gb|EDQ92331.1| predicted protein [Monosiga brevicollis MX1]
Length = 548
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/189 (46%), Positives = 121/189 (64%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LED + K +L+R +R GLIR + GA + G+VI+FLD+HCEV +L PL
Sbjct: 258 LGQPLEDEVAA-TPKTKLLRLDKRSGLIRAKVHGALNAVGDVILFLDSHCEVNDGYLEPL 316
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENEL-PEREAKK 119
L IY +RK + +P+ID ID++TWE R+ + E RG+F+W + + L P +EA++
Sbjct: 317 LDRIYRNRKTVAMPIIDAIDFETWEHRTGLLE-----RGVFDWTLTFSWKMLSPMQEAER 371
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
P+ SP AGGLFAMDR FF E+G YD G+ WGGEN E+S +IW CGG IE VP
Sbjct: 372 ADDPLAPFTSPAMAGGLFAMDRKFFFEIGAYDMGMDTWGGENIEMSVRIWTCGGVIEAVP 431
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 432 CSHVGHVFR 440
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 87/200 (43%), Positives = 123/200 (61%), Gaps = 10/200 (5%)
Query: 238 NMETSNHISFDRTIPDLRMEEC-KYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKR 295
N S+ + R D R +C K Y +D LP SVI++F+NE S+LMRTV S++ R
Sbjct: 178 NQWISDQLPLQRRAWDTRQVDCLKREHYNIDSLPTVSVIIIFYNEARSTLMRTVWSVLDR 237
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 355
T L+EIILVDD SS L Q LED + K +L+R +R GLIR + GA + G
Sbjct: 238 THPTLLKEIILVDDHSSMEHLGQPLEDEVAA-TPKTKLLRLDKRSGLIRAKVHGALNAVG 296
Query: 356 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGI 414
+VI+FLD+HCEV +L PLL IY +RK + +P+ID ID++TWE R+ + E RG+
Sbjct: 297 DVILFLDSHCEVNDGYLEPLLDRIYRNRKTVAMPIIDAIDFETWEHRTGLLE-----RGV 351
Query: 415 FEWGMLYKENEL-PEREAKK 433
F+W + + L P +EA++
Sbjct: 352 FDWTLTFSWKMLSPMQEAER 371
>gi|221330664|ref|NP_001137779.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
melanogaster]
gi|442625712|ref|NP_722910.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
melanogaster]
gi|25987157|gb|AAN75751.1|AF324752_1 N-acetylgalactosaminyltransferase [Drosophila melanogaster]
gi|220901927|gb|ACL82986.1| polypeptide GalNAc transferase 4, isoform B [Drosophila
melanogaster]
gi|440213268|gb|AAN10370.2| polypeptide GalNAc transferase 4, isoform C [Drosophila
melanogaster]
Length = 644
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L+DY+++ F V ++RN ER+GLI R GAK + G+V+VF D+H EV NWLPP
Sbjct: 225 LKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHIEVNYNWLPP 284
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
L+ PI + KI T P++D I ++ + + S + RG F+W MLYK+ LPE K
Sbjct: 285 LIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGGFDWKMLYKQLPVLPEDALDK 342
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 343 ----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVP 398
Query: 180 CSRIVSLIRPVFKADGK 196
CSR+ + R K G
Sbjct: 399 CSRVAHIFRGPMKPRGN 415
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 125/189 (66%), Gaps = 3/189 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PDLR+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 145 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 204
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 205 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 264
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 265 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 322
Query: 415 FEWGMLYKE 423
F+W MLYK+
Sbjct: 323 FDWKMLYKQ 331
>gi|403258969|ref|XP_003922012.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Saimiri boliviensis boliviensis]
Length = 940
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/218 (38%), Positives = 124/218 (56%), Gaps = 4/218 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
PG+ G+ +P + E N+ S+ I DR I D R C +LP
Sbjct: 437 APGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNLP 496
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 497 TTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF-P 555
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK + P
Sbjct: 556 KVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVACP 615
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
VI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 616 VIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|402594510|gb|EJW88436.1| hypothetical protein WUBG_00649 [Wuchereria bancrofti]
Length = 612
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/227 (39%), Positives = 136/227 (59%), Gaps = 5/227 (2%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLD 267
G GE G+ L + + + N+ S+ I+ +R++PD+R +C+ Y +
Sbjct: 44 GAGEDGRPVRLSKEDERLSEDTFVINQFNLVVSDRIALNRSLPDIRKHQCRTKTYLPSSE 103
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+V+HNE FS+LMRTV S+I R+P + L+EIILVDDFS++ L +LE + +
Sbjct: 104 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPRENLKEIILVDDFSTRTFLKVELEKLVAQL 163
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
++++IR ER GLIR R GA E+ G+V+ FLD+HCE W+ PLLA I +RK +
Sbjct: 164 GTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 223
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 224 CPVIDIINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSR 267
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 79/198 (39%), Positives = 117/198 (59%), Gaps = 14/198 (7%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE + + ++++IR ER GLIR R GA E+ G+V+ FLD+HCE W+ PL
Sbjct: 152 LKVELEKLVAQLGTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPL 211
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +RK + PVID I+ +T+ ++ E +RG F W + ++ LP K R
Sbjct: 212 LARIKENRKAVVCPVIDIINERTFAYQKGIEL---FRGGFNWNLQFRWYALPPEMIKSRS 268
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELS----------FKIWM 170
+ ++P SPT AGGLF++DR +F E+G YD + +WGGEN E+S F +W
Sbjct: 269 DDPTKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRLKLLKKNCFLVWQ 328
Query: 171 CGGSIEWVPCSRIVSLIR 188
CGG +E +PCS + + R
Sbjct: 329 CGGRVEILPCSHVGHVFR 346
>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
Length = 584
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/188 (46%), Positives = 126/188 (67%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+ YI+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL PL
Sbjct: 193 LRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWLEPL 252
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + +DRK + P+ID I + +E+ + D + G F W + ++ +P RE ++R
Sbjct: 253 LDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREMERRN 309
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ S P ++PT AGGLFA+DR FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PC
Sbjct: 310 HDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPC 369
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 370 SRVGHVFR 377
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 93/246 (37%), Positives = 146/246 (59%), Gaps = 6/246 (2%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+LG L L + GPGE G A + + + E ++ S+ IS +R +PD R
Sbjct: 70 ELGILLKSLNFERNGPGEMGSAVIIDPSQQEERTRKFKENQFDVMASDLISINRALPDYR 129
Query: 256 MEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
+C+ D LP S+I+VFHNE +S+L+RT+HS+I R+P ++E+IL+DD S+
Sbjct: 130 SSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSN 189
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
+ L + L+ YI+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL
Sbjct: 190 RTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWL 249
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
PLL + +DRK + P+ID I + +E+ + D + G F W + ++ +P RE +
Sbjct: 250 EPLLDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREME 306
Query: 433 KRKYNR 438
+R ++R
Sbjct: 307 RRNHDR 312
>gi|332243646|ref|XP_003270989.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5
[Nomascus leucogenys]
Length = 443
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/191 (45%), Positives = 123/191 (64%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ F+NE F++L RTV S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRSKMCLQKHYPARLPTASIVICFYNEEFNALFRTVSS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ TP +LEEIILVDD S DL +KL+ +++ F K+++IRN +REGLIR R GA
Sbjct: 157 VMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHLETFREKIKIIRNKKREGLIRARLIGAS 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKVVVCPLIDVIDDRTLE----YKPSPVV 272
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 273 RGTFDWNLQFK 283
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 118/183 (64%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F K+++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFREKIKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 240 LLHAIAKDPKVVVCPLIDVIDDRTLE----YKPSPVVRGTFDWNLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P SP +GG+FA+ R +F E+G YD + WGGEN ELS +IWMCGG + +PC
Sbjct: 296 EGPTKPIWSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGGENLELSLRIWMCGGQLFIIPC 355
Query: 181 SRI 183
SR+
Sbjct: 356 SRV 358
>gi|354548807|gb|AER27632.1| AT25481p1 [Drosophila melanogaster]
Length = 666
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 93/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L+DY+++ F V ++RN ER+GLI R GAK + G+V+VF D+H EV NWLPP
Sbjct: 247 LKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHIEVNYNWLPP 306
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
L+ PI + KI T P++D I ++ + + S + RG F+W MLYK+ LPE K
Sbjct: 307 LIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGGFDWKMLYKQLPVLPEDALDK 364
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 365 ----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVP 420
Query: 180 CSRIVSLIRPVFKADGK 196
CSR+ + R K G
Sbjct: 421 CSRVAHIFRGPMKPRGN 437
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 87/189 (46%), Positives = 125/189 (66%), Gaps = 3/189 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PDLR+E CK Y LP SVI +F NE F++L+R+++S+I R
Sbjct: 167 GFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFFNEHFNTLLRSIYSVINR 226
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q L+DY+Q+ F V ++RN ER+GLI R GAK +
Sbjct: 227 TPPELLKQIVLVDDGSEWDVLKQPLDDYVQQHFPHLVTIVRNPERQGLIGARIAGAKVAV 286
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G+V+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 287 GQVMVFFDSHIEVNYNWLPPLIEPIAINPKISTCPMVDTISHEDFSYFSGNK--DGARGG 344
Query: 415 FEWGMLYKE 423
F+W MLYK+
Sbjct: 345 FDWKMLYKQ 353
>gi|157107416|ref|XP_001649767.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884053|gb|EAT48278.1| AAEL000654-PA [Aedes aegypti]
Length = 607
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/187 (46%), Positives = 116/187 (62%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+E KV++I ER GLI R GAK + +V++FLD+H E +NWLPPL
Sbjct: 191 LKDQLDRYVEENMPKVKVIHLPERSGLITARLAGAKVATADVLIFLDSHTEANINWLPPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D K P ID ID+ +E+R+ D RG F+W YK L +++ +
Sbjct: 251 LEPIAEDYKTCVCPFIDVIDWDNFEYRA---QDEGARGAFDWKFFYKRLPLLQKDLENP- 306
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCS
Sbjct: 307 --TEPFESPVMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMYDAPCS 364
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 365 RVGHIYR 371
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 90/226 (39%), Positives = 133/226 (58%), Gaps = 3/226 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE G A HL + D + G N S+ IS +R++PD+R + C+ Y
Sbjct: 80 EEKRTGIGEHGIAGHLEKKDEDMKDKLFKKNGFNAVLSDLISLNRSLPDIRHKGCRKKKY 139
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L +L+ Y+
Sbjct: 140 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELITEIILVDDCSTKEFLKDQLDRYV 199
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+ KV++I ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D K
Sbjct: 200 EENMPKVKVIHLPERSGLITARLAGAKVATADVLIFLDSHTEANINWLPPLLEPIAEDYK 259
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
P ID ID+ +E+R+ D RG F+W YK L +++
Sbjct: 260 TCVCPFIDVIDWDNFEYRA---QDEGARGAFDWKFFYKRLPLLQKD 302
>gi|431894831|gb|ELK04624.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Pteropus alecto]
Length = 939
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 542 LKDNLDKYMSQF-PKVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPL 600
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 601 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNR 657
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 658 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 717
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 718 SRVGHIFR 725
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 435 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAKQLVHNNL 494
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 495 PTTSVIMCFVDEVWSTLVRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 553
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 554 PKVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLERVYLSRKKVAC 613
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 614 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 649
>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 749
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/187 (44%), Positives = 119/187 (63%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + L++Y+++ KV+++R +REGLIR R GA+ ++G V+ FLD+H E + WL PL
Sbjct: 357 LKEPLDEYMKKL-PKVKVVRLPKREGLIRARLIGAEMAQGPVLTFLDSHVECNVGWLEPL 415
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I+ D + P ID ID ++E+ G F W M + N +PE EA++R
Sbjct: 416 LQRIHDDPTNVVCPAIDAIDATSFEYAG---SGATIIGAFNWEMKFTWNGIPEYEARRRD 472
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
S P +SP AGGLF++D+ FF +G YDPG +WG EN ELSFKIWMCGGS+E +PCS
Sbjct: 473 DESWPIRSPAMAGGLFSIDKDFFYRIGTYDPGFDIWGAENLELSFKIWMCGGSLEIIPCS 532
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 533 RVAHIFR 539
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/225 (38%), Positives = 131/225 (58%), Gaps = 4/225 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
GPGE G +A N S+ IS +R++PD+R CK +Y DLP
Sbjct: 251 GPGEHGAGVRTKLEEQAKVKIGWDHAYFNEYVSDMISVERSVPDVRHNLCKTKEYSDDLP 310
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
+ SVI+ F E +S+L+RTVHS++ R+P + + E++LVDDFS + L + L++Y+++
Sbjct: 311 RTSVIICFTEESWSTLLRTVHSVLNRSPPELIAEVLLVDDFSQRDYLKEPLDEYMKKL-P 369
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KV+++R +REGLIR R GA+ ++G V+ FLD+H E + WL PLL I+ D + P
Sbjct: 370 KVKVVRLPKREGLIRARLIGAEMAQGPVLTFLDSHVECNVGWLEPLLQRIHDDPTNVVCP 429
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
ID ID ++E+ G F W M + N +PE EA++R
Sbjct: 430 AIDAIDATSFEYAG---SGATIIGAFNWEMKFTWNGIPEYEARRR 471
>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
Length = 644
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 119/194 (61%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LEDY + KV+++R +REGLIR R GA+ ++ V+ +LD+HCE WL PL
Sbjct: 250 LKKQLEDYFTAYP-KVQIVRGAKREGLIRARILGAQYAKSPVLTYLDSHCECTEGWLEPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I+ T E+ HYR G F+W + + + +PE
Sbjct: 309 LDRIARNSTTVVCPVIDVINDDTLEY--------HYRDSTGVNVGGFDWNLQFSWHAVPE 360
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ ++EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 361 REKKRHNSSAEPVYSPTMAGGLFSIDRDFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 420
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 421 LEIVPCSHVGHIFR 434
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 97/246 (39%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
L PP E +E PGE GK LP + + A + + N S+ IS RT+PD R
Sbjct: 130 LLPPSE-LEETPGEMGKPVKLPKDMPDDMKKAVEDGWTKNAFNQYASDLISVHRTLPDPR 188
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y DLPK VI+ FHNE +S L+RTVHS++ R+P + ++ILVDD+S
Sbjct: 189 DAWCKDTARYLTDLPKTDVIICFHNEAWSVLLRTVHSVLDRSPEHLIGKVILVDDYSDMP 248
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV+++R +REGLIR R GA+ ++ V+ +LD+HCE WL P
Sbjct: 249 HLKKQLEDYFTAYP-KVQIVRGAKREGLIRARILGAQYAKSPVLTYLDSHCECTEGWLEP 307
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I+ T E+ HYR G F+W + + + +P
Sbjct: 308 LLDRIARNSTTVVCPVIDVINDDTLEY--------HYRDSTGVNVGGFDWNLQFSWHAVP 359
Query: 428 EREAKK 433
ERE K+
Sbjct: 360 EREKKR 365
>gi|296204771|ref|XP_002749473.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Callithrix jacchus]
Length = 940
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDDLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDDLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Nasonia vitripennis]
Length = 610
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 121/194 (62%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KV+++R ++REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 218 LKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGWLEPL 276
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I ++ + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 277 LDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 328
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLFA+DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 329 REKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 388
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 389 LEIVPCSHVGHIFR 402
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 96/251 (38%), Positives = 138/251 (54%), Gaps = 23/251 (9%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
G L P + PGE G+ LP E + D + N S+ IS R++P
Sbjct: 95 GVLVAPRDQDTSAPGEMGRPVILPANLTTEIKKLVDDGWINN-AFNQYASDLISVHRSLP 153
Query: 253 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDD+S
Sbjct: 154 DPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYS 213
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L ++LEDY+ + KV+++R ++REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 214 DMPHLKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGW 272
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 424
L PLL I ++ + PVID ID T E+ H+R G F+W + + +
Sbjct: 273 LEPLLDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 324
Query: 425 ELPEREAKKRK 435
+PERE K+ K
Sbjct: 325 AVPEREKKRHK 335
>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
Length = 671
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/192 (45%), Positives = 117/192 (60%), Gaps = 16/192 (8%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
K+LE+Y + +V+++R +REGLIR R GA+ + V+ +LD+HCE WL PLL
Sbjct: 279 KQLEEYFLAY-PRVKIVRAAKREGLIRARLLGARHATAPVLTYLDSHCECTTGWLEPLLD 337
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPERE 116
I + + PVID ID T E+ HYR G F+W + + + +PERE
Sbjct: 338 RIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVGGFDWNLQFNWHAVPERE 389
Query: 117 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
+K K +EP SPT AGGLFA+DR FF LG YD G +WGGEN ELSFK WMCGGS+E
Sbjct: 390 KRKHKSAAEPVWSPTMAGGLFAIDRVFFERLGTYDSGFDIWGGENLELSFKTWMCGGSLE 449
Query: 177 WVPCSRIVSLIR 188
+PCS + + R
Sbjct: 450 IIPCSHVGHIFR 461
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 99/262 (37%), Positives = 141/262 (53%), Gaps = 23/262 (8%)
Query: 188 RPVFKADGKLGNLEPPL--EPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMET 241
RP + D + G P + P + GPGE GK LP+ + D + N
Sbjct: 142 RPARQPDDQGGLALPGVIAPPSEGGPGELGKPVVLPKDLSPEVKKLVDEGWAKNAFNQYV 201
Query: 242 SNHISFDRTIPDLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
++ IS RT+PD R CK Y DLP SVI+ FHNE +S L+RTVHS++ R+P
Sbjct: 202 ADMISIRRTLPDPRDAWCKEPGRYREDLPPTSVIICFHNEAWSVLLRTVHSVLDRSPEHL 261
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
++E+ILVDDFS ++LE+Y + +V+++R +REGLIR R GA+ + V+ +
Sbjct: 262 VKEVILVDDFSDMPHTQKQLEEYFLAY-PRVKIVRAAKREGLIRARLLGARHATAPVLTY 320
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------G 413
LD+HCE WL PLL I + + PVID ID T E+ HYR G
Sbjct: 321 LDSHCECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVG 372
Query: 414 IFEWGMLYKENELPEREAKKRK 435
F+W + + + +PERE +K K
Sbjct: 373 GFDWNLQFNWHAVPEREKRKHK 394
>gi|410968689|ref|XP_003990834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Felis catus]
Length = 939
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 542 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 600
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 601 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNR 657
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 658 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 717
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 718 SRVGHIFR 725
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 435 KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 494
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 495 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 553
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 554 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 613
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 614 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 649
>gi|321455342|gb|EFX66478.1| hypothetical protein DAPPUDRAFT_302681 [Daphnia pulex]
Length = 613
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 90/185 (48%), Positives = 120/185 (64%), Gaps = 6/185 (3%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
K L+DY+ + +VR+I ER GLIR R GA+ ++G+VI+FLD+H E +NWLPPLL
Sbjct: 201 KPLDDYVTQHMPRVRVIHLAERSGLIRARMAGARRAKGDVIIFLDSHSEANVNWLPPLLD 260
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 123
PI D + + P ID ID++T+ +R+ D RG F+W YK L + K +
Sbjct: 261 PIAEDYRTVVCPFIDVIDFETFAYRA---QDEGARGAFDWEFFYKRLPLLPDDLK---HP 314
Query: 124 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
+ P+KSP AGGLFA+ + FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSRI
Sbjct: 315 ARPFKSPVMAGGLFAISKKFFFELGGYDEGLEIWGGEQYELSFKIWQCGGQMFDAPCSRI 374
Query: 184 VSLIR 188
+ R
Sbjct: 375 GHIYR 379
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 134/218 (61%), Gaps = 4/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + GPGE G A++L D+ G N S+ I+ +RT+ D+R +CK +Y
Sbjct: 89 ESKQTGPGEQGLAFYLSPEDEKIKDSLYKVNGFNALVSDRINLNRTLKDIRHPDCKAQNY 148
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
DLP AS+++ FHNE FS L+RT +S + R PA LE +ILVDD S+K + L+DY+
Sbjct: 149 LEDLPTASIVVPFHNEHFSVLLRTAYSALNRAPANLLE-VILVDDASTKEHSKKPLDDYV 207
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+ +VR+I ER GLIR R GA+ ++G+VI+FLD+H E +NWLPPLL PI D +
Sbjct: 208 TQHMPRVRVIHLAERSGLIRARMAGARRAKGDVIIFLDSHSEANVNWLPPLLDPIAEDYR 267
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+ P ID ID++T+ +R+ D RG F+W YK
Sbjct: 268 TVVCPFIDVIDFETFAYRA---QDEGARGAFDWEFFYK 302
>gi|344268422|ref|XP_003406059.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Loxodonta africana]
Length = 939
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 542 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPL 600
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK R
Sbjct: 601 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIPPDVVAKNR 657
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 658 IKETDVIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 717
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 718 SRVGHIFR 725
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/225 (37%), Positives = 127/225 (56%), Gaps = 6/225 (2%)
Query: 203 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
P +P + PG+ G+ +P E N+ S+ I DR I D R C
Sbjct: 431 PRDP--KAPGQFGRPVIVPHGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPTGCAEQ 488
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 489 LVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 548
Query: 323 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y
Sbjct: 549 YMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLERVYLS 607
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
RK + PVI+ I+ + + +V D+ RG+F W M + +P
Sbjct: 608 RKKVACPVIEVINDKDMSYMTV---DNFQRGVFVWPMNFGWRTIP 649
>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
Length = 647
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/192 (45%), Positives = 118/192 (61%), Gaps = 16/192 (8%)
Query: 4 KKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 63
K+LEDY E + +V++IR +REGLIR R GA+ + V+ +LD+HCE WL PLL
Sbjct: 255 KQLEDYFEAYP-RVKIIRAPKREGLIRARLLGARYATAPVLTYLDSHCECTTGWLEPLLD 313
Query: 64 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPERE 116
I + + PVID ID T E+ HYR G F+W + + + +P+RE
Sbjct: 314 RIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVGGFDWNLQFNWHAVPDRE 365
Query: 117 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
K+ K +EP SPT AGGLF++D+ FF LG YD G +WGGEN ELSFK WMCGG++E
Sbjct: 366 KKRHKSTAEPVFSPTMAGGLFSIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLE 425
Query: 177 WVPCSRIVSLIR 188
VPCS + + R
Sbjct: 426 IVPCSHVGHIFR 437
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/250 (38%), Positives = 137/250 (54%), Gaps = 21/250 (8%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPE----AYRAAGDASLGEYGMNMETSNHISFDRTIPD 253
G + PP E + PG GK LP+ + A D + N ++ IS R++PD
Sbjct: 130 GVIAPPHEDSPDSPGAMGKPVVLPKDMSPEMKKAVDDGWSKNAFNQYAADLISIRRSLPD 189
Query: 254 LRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
R CK Y DLP SVI+ FHNE +S L+RTVHS++ R+P ++E+ILVDDFS
Sbjct: 190 PRDPWCKEPGRYGTDLPATSVIICFHNEAWSVLLRTVHSVLDRSPEHLVKEVILVDDFSD 249
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
++LEDY + + +V++IR +REGLIR R GA+ + V+ +LD+HCE WL
Sbjct: 250 MPHTQKQLEDYFEAYP-RVKIIRAPKREGLIRARLLGARYATAPVLTYLDSHCECTTGWL 308
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENE 425
PLL I + + PVID ID T E+ HYR G F+W + + +
Sbjct: 309 EPLLDRIARNSTTVVCPVIDVIDDNTMEY--------HYRDSGGVNVGGFDWNLQFNWHA 360
Query: 426 LPEREAKKRK 435
+P+RE K+ K
Sbjct: 361 VPDREKKRHK 370
>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Nasonia vitripennis]
Length = 646
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 121/194 (62%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KV+++R ++REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 217 LKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGWLEPL 275
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I ++ + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 276 LDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 327
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLFA+DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 328 REKKRHKNPAEPVWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 387
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 388 LEIVPCSHVGHIFR 401
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 96/251 (38%), Positives = 138/251 (54%), Gaps = 23/251 (9%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
G L P + PGE G+ LP E + D + N S+ IS R++P
Sbjct: 94 GVLVAPRDQDTSAPGEMGRPVILPANLTTEIKKLVDDGWINN-AFNQYASDLISVHRSLP 152
Query: 253 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDD+S
Sbjct: 153 DPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYS 212
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L ++LEDY+ + KV+++R ++REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 213 DMPHLKRQLEDYMMNYP-KVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGW 271
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 424
L PLL I ++ + PVID ID T E+ H+R G F+W + + +
Sbjct: 272 LEPLLDRIARNQTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 323
Query: 425 ELPEREAKKRK 435
+PERE K+ K
Sbjct: 324 AVPEREKKRHK 334
>gi|194749276|ref|XP_001957065.1| GF24250 [Drosophila ananassae]
gi|190624347|gb|EDV39871.1| GF24250 [Drosophila ananassae]
Length = 662
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 90/188 (47%), Positives = 115/188 (61%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE YI VR++R R GLI RS GA+ + EV++FLD+H E NWLPPL
Sbjct: 245 LGHDLEAYIANHFKIVRVVRLPRRTGLIGARSEGARNATAEVLIFLDSHVEANYNWLPPL 304
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKKR 120
L PI +++ P ID ID+ +++R+ D RG F+W YK LPE
Sbjct: 305 LEPIALNKRTAVCPFIDVIDHSNFQYRA---QDEGARGAFDWEFYYKRLRLLPE----DL 357
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ ++P+KSP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 358 KHPADPFKSPVMAGGLFAISAEFFWELGGYDEGLDIWGGEQYELSFKIWMCGGQMYDAPC 417
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 418 SRIGHIYR 425
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 128/214 (59%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLG-EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GE G+A L + + + + E G N S+ IS +R++ D+R ++C+ +Y L
Sbjct: 138 GLGEQGQAASLDDESQIETEKRMSLENGFNALLSDSISVNRSLNDIRHKQCRKKEYLTQL 197
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F NE S LMR+VHS+I R+P + L+EIILVDD+S + L LE YI
Sbjct: 198 PTVSVIIIFWNEYLSVLMRSVHSLINRSPPELLKEIILVDDYSDREYLGHDLEAYIANHF 257
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R R GLI RS GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 258 KIVRVVRLPRRTGLIGARSEGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 317
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ +++R+ D RG F+W YK
Sbjct: 318 PFIDVIDHSNFQYRA---QDEGARGAFDWEFYYK 348
>gi|268574330|ref|XP_002642142.1| C. briggsae CBR-GLY-6 protein [Caenorhabditis briggsae]
Length = 617
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 94/237 (39%), Positives = 141/237 (59%), Gaps = 7/237 (2%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYSPHDDWGEG---GTGVSHLTPEQQKRADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
C+ YP DLP SVI+V+HNE +S+L+RTV S+I R+P L+EIILVDDFS + L
Sbjct: 146 SCRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLKEIILVDDFSDREFLR 205
Query: 318 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
KL++ I+ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPKLDESIKPIPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKQ 319
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 82/185 (44%), Positives = 122/185 (65%), Gaps = 4/185 (2%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
KL++ I+ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 208 KLDESIKPIPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
Query: 65 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN- 123
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+ +
Sbjct: 268 IKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKQHLLDP 324
Query: 124 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
+ P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS +
Sbjct: 325 TGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHV 384
Query: 184 VSLIR 188
+ R
Sbjct: 385 GHVFR 389
>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
malayi]
Length = 584
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 126/188 (67%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+ YI+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL PL
Sbjct: 193 LRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWLEPL 252
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + +DRK + P+ID I + +E+ + D + G F W + ++ +P RE ++R
Sbjct: 253 LDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREMERRN 309
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ S P ++PT AGGLFA+DR FF ++G YD G+ +WGGEN E+SF++WMCGGS+E PC
Sbjct: 310 HDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEIWGGENLEISFRVWMCGGSLEIHPC 369
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 370 SRVGHVFR 377
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 93/246 (37%), Positives = 146/246 (59%), Gaps = 6/246 (2%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+LG L L + GPGE G A + + + E ++ S+ IS +R +PD R
Sbjct: 70 ELGILLKSLNFERNGPGEMGSAVIIDPSQQEERARKFKENQFDVMASDLISINRALPDYR 129
Query: 256 MEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
+C+ D LP S+I+VFHNE +S+L+RT+HS+I R+P ++E+IL+DD S+
Sbjct: 130 SSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRTLHSVINRSPLHLIKEVILIDDLSN 189
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
+ L + L+ YI+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL
Sbjct: 190 RTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWL 249
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
PLL + +DRK + P+ID I + +E+ + D + G F W + ++ +P RE +
Sbjct: 250 EPLLDRVSTDRKRVVAPIIDVISDENFEY--ITASDVTWGG-FNWHLNFRWYPVPMREME 306
Query: 433 KRKYNR 438
+R ++R
Sbjct: 307 RRNHDR 312
>gi|18314429|gb|AAH22021.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Homo sapiens]
gi|51105933|gb|EAL24517.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 15 [Homo sapiens]
gi|119574364|gb|EAW53979.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5, isoform CRA_c
[Homo sapiens]
gi|123979772|gb|ABM81715.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [synthetic
construct]
gi|123994539|gb|ABM84871.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [synthetic
construct]
Length = 443
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 120/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GKV++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 240 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLVRGTFDWNLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +++P +SP +GG+FA+ R +F E+G YD + WG EN ELS +IWMCGG + +PC
Sbjct: 296 EGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGRENLELSLRIWMCGGQLFIIPC 355
Query: 181 SRI 183
SR+
Sbjct: 356 SRV 358
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 121/191 (63%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + YP LP AS+++ F+NE ++L +T+ S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRSKMRLQKHYPARLPTASIVICFYNEECNALFQTMSS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
+ TP +LEEIILVDD S DL +KL+ +++ F GKV++IRN +REGLIR R GA
Sbjct: 157 VTNLTPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGAS 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLV 272
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 273 RGTFDWNLQFK 283
>gi|281485547|ref|NP_660335.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Homo sapiens]
gi|322510123|sp|Q7Z4T8.3|GLTL5_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
Length = 443
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 120/183 (65%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL +KL+ ++E F GKV++IRN +REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 180 DLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEP 239
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID ID +T E Y+P RG F+W + +K + + E
Sbjct: 240 LLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLVRGTFDWNLQFKWDNVFSYEMDGP 295
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +++P +SP +GG+FA+ R +F E+G YD + WG EN ELS +IWMCGG + +PC
Sbjct: 296 EGSTKPIRSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGRENLELSLRIWMCGGQLFIIPC 355
Query: 181 SRI 183
SR+
Sbjct: 356 SRV 358
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/191 (45%), Positives = 122/191 (63%), Gaps = 4/191 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L +YG N+ S + +R +PD R + C YP LP AS+++ F+NE ++L +T+ S
Sbjct: 97 LLKYGFNVIISRSLGIEREVPDTRSKMCLQKHYPARLPTASIVICFYNEECNALFQTMSS 156
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
+ TP +LEEIILVDD S DL +KL+ +++ F GKV++IRN +REGLIR R GA
Sbjct: 157 VTNLTPHYFLEEIILVDDMSKVDDLKEKLDYHLETFRGKVKIIRNKKREGLIRARLIGAS 216
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+V+VFLD+HCEV WL PLL I D K++ P+ID ID +T E Y+P
Sbjct: 217 HASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVIDDRTLE----YKPSPLV 272
Query: 412 RGIFEWGMLYK 422
RG F+W + +K
Sbjct: 273 RGTFDWNLQFK 283
>gi|113677422|ref|NP_001038460.1| polypeptide N-acetylgalactosaminyltransferase 14 [Danio rerio]
Length = 554
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 79/173 (45%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + +++ FLD+HCEV +WLPPLL + D + P
Sbjct: 169 KVKCLRNKHREGLIRSRVRGADAAGAQILTFLDSHCEVNKDWLPPLLQRVKEDPTSVASP 228
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L + KR +EP K+P AGG
Sbjct: 229 VIDIINMDTFAYVAA---SSDLRGGFDWSLHFKWEQLSAEKRAKRADPTEPIKTPIIAGG 285
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +DR++F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 286 LFVIDRSWFNRLGKYDTAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 338
Score = 145 bits (365), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 79/203 (38%), Positives = 111/203 (54%), Gaps = 14/203 (6%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
Y N S I +R + D R C Y DLP ++++ FHNE S+L+RTV S++
Sbjct: 79 YAFNQRESERIPSNRALRDTRHYRCTTLHYDPDLPSTTIVITFHNEARSTLLRTVRSVLN 138
Query: 295 RTPAQYLEEIILVDDFSSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
RTP + EIILVDDFS + L KL KV+ +RN REGLIR+R RGA
Sbjct: 139 RTPVHLIHEIILVDDFSEDPNDCLLLTKLP--------KVKCLRNKHREGLIRSRVRGAD 190
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ +++ FLD+HCEV +WLPPLL + D + PVID I+ T+ + +
Sbjct: 191 AAGAQILTFLDSHCEVNKDWLPPLLQRVKEDPTSVASPVIDIINMDTFAYVAA---SSDL 247
Query: 412 RGIFEWGMLYKENELPEREAKKR 434
RG F+W + +K +L + KR
Sbjct: 248 RGGFDWSLHFKWEQLSAEKRAKR 270
>gi|281341921|gb|EFB17505.1| hypothetical protein PANDA_013078 [Ailuropoda melanoleuca]
Length = 936
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 540 LKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 598
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 599 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNR 655
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 656 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 715
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 716 SRVGHIFR 723
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 88/238 (36%), Positives = 131/238 (55%), Gaps = 10/238 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDR 473
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 533
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 534 FSTKDYLKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 647
>gi|301776863|ref|XP_002923851.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Ailuropoda melanoleuca]
Length = 937
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 540 LKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 598
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 599 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNR 655
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 656 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 715
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 716 SRVGHIFR 723
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 88/238 (36%), Positives = 131/238 (55%), Gaps = 10/238 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 420 VLKIDVTLSPRDP------KAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDR 473
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 474 AIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 533
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 534 FSTKDYLKGNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 592
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 593 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 647
>gi|158289989|ref|XP_311577.4| AGAP010367-PA [Anopheles gambiae str. PEST]
gi|157018424|gb|EAA07231.4| AGAP010367-PA [Anopheles gambiae str. PEST]
Length = 587
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 117/187 (62%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLEDY+++ KV+L+R ER GLI+ R GAK + G+V++FLD+H E G NWLPPL
Sbjct: 169 LHNKLEDYVKQNLPKVKLVRQPERTGLIKARLAGAKIASGDVLIFLDSHTEAGYNWLPPL 228
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI + K P+ID ID QT++ V+ D RG+F+W YK + + R
Sbjct: 229 LEPIAENPKTCVCPLIDVIDDQTFD---VHPQDEGGRGLFDWTFHYKRVVIKNED---RI 282
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP+ SP AGGLFA+ FF ELGGYD L +WG E +E+SFKIW CGG + PCS
Sbjct: 283 SPTEPFPSPVMAGGLFAIGADFFWELGGYDEELDIWGAEQYEISFKIWQCGGRMLDAPCS 342
Query: 182 RIVSLIR 188
R + R
Sbjct: 343 RFGHIYR 349
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/215 (42%), Positives = 127/215 (59%), Gaps = 19/215 (8%)
Query: 210 GPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
GPGE GK L EA + G N S+ IS +R+I DLR
Sbjct: 75 GPGEQGKPATLSPEEATSELRKELYYKNGFNALLSDKISINRSIADLR------------ 122
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
SV++ F+ E +S+L+RT++S++ R+P L+EII+VDD S+K L KLEDY+++
Sbjct: 123 --HPSVVVPFYEEHWSTLLRTIYSVLNRSPPHLLKEIIIVDDGSTKEFLHNKLEDYVKQN 180
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KV+L+R ER GLI+ R GAK + G+V++FLD+H E G NWLPPLL PI + K
Sbjct: 181 LPKVKLVRQPERTGLIKARLAGAKIASGDVLIFLDSHTEAGYNWLPPLLEPIAENPKTCV 240
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P+ID ID QT++ V+ D RG+F+W YK
Sbjct: 241 CPLIDVIDDQTFD---VHPQDEGGRGLFDWTFHYK 272
>gi|440896773|gb|ELR48609.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Bos grunniens
mutus]
Length = 940
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK +
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNK 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE VPC
Sbjct: 659 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIVPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 88/246 (35%), Positives = 134/246 (54%), Gaps = 10/246 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V + D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 423 VLRIDATLSPRDP------KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDR 476
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R C +LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNI 595
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 652
Query: 430 EAKKRK 435
K K
Sbjct: 653 VVAKNK 658
>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
Length = 684
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 118/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 256 LKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPL 314
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I+ +T E+ HYR G F+W + + + +PE
Sbjct: 315 LDRIARNSSTVVCPVIDVINDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVPE 366
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 367 RERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 427 LEIVPCSHVGHIFR 440
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/246 (39%), Positives = 138/246 (56%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP E + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I+ +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSSTVVCPVIDVINDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 428 EREAKK 433
ERE K+
Sbjct: 366 ERERKR 371
>gi|219804492|ref|NP_001137331.1| polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
gi|296490560|tpg|DAA32673.1| TPA: polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
Length = 940
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK +
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNK 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE VPC
Sbjct: 659 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIVPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 88/246 (35%), Positives = 134/246 (54%), Gaps = 10/246 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V + D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 423 VLRIDATLSPRDP------KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDR 476
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R C +LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 477 AIEDTRPAGCAEQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 536
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E +
Sbjct: 537 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNI 595
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 596 GWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPD 652
Query: 430 EAKKRK 435
K K
Sbjct: 653 VVAKNK 658
>gi|312372346|gb|EFR20327.1| hypothetical protein AND_20267 [Anopheles darlingi]
Length = 616
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 90/218 (41%), Positives = 131/218 (60%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GKA L E D + G N S+ IS +R++PD+R C+ Y
Sbjct: 86 EAKRTGIGEQGKAGRLSEKEAEMKDKLFKKNGFNAVLSDLISLNRSLPDIRHRGCRKKKY 145
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+LP SV++ F+NE +S+L+RT S++ R+P++ + E+ILVDD S+K L +LE Y+
Sbjct: 146 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPSELIAEVILVDDCSTKDFLKGQLELYV 205
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV+++R ER GLI R GAK + +V++FLD+H E +NWLPPLL PI +D +
Sbjct: 206 GENMPKVKIVRLPERSGLIAARLAGAKVATADVLIFLDSHTEANVNWLPPLLDPIAADYR 265
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ T+E+R+ D RG F+W YK
Sbjct: 266 TCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYK 300
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 86/187 (45%), Positives = 116/187 (62%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+ KV+++R ER GLI R GAK + +V++FLD+H E +NWLPPL
Sbjct: 197 LKGQLELYVGENMPKVKIVRLPERSGLIAARLAGAKVATADVLIFLDSHTEANVNWLPPL 256
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI +D + P ID ID+ T+E+R+ D RG F+W YK L ++
Sbjct: 257 LDPIAADYRTCVCPFIDVIDWDTFEYRA---QDEGARGAFDWKFFYKRLPLLPKDLAN-- 311
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP++SP AGGLFA+ FF E+GGYD GL +WGGE +ELSFKIW CGG + PCS
Sbjct: 312 -PTEPFESPVMAGGLFAISAKFFWEIGGYDEGLDIWGGEQYELSFKIWQCGGKMYDAPCS 370
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 371 RVGHIYR 377
>gi|291391583|ref|XP_002712189.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Oryctolagus cuniculus]
Length = 941
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 544 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 602
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK +
Sbjct: 603 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFLWPMNFGWKTIPPDVVAKNK 659
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 660 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 719
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 720 SRVGHIFR 727
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 124/227 (54%), Gaps = 4/227 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G +P E N+ S+ I DR I D R C +L
Sbjct: 437 QAPGQFGLPVVVPHGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 496
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD S+K L L+ Y+ +F
Sbjct: 497 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDCSTKDYLKDNLDKYMSQF- 555
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 556 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 615
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVI+ I+ + + +V D+ RGIF W M + +P K K
Sbjct: 616 PVIEVINDKDMSYMTV---DNFQRGIFLWPMNFGWKTIPPDVVAKNK 659
>gi|426221067|ref|XP_004004733.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Ovis
aries]
Length = 938
Score = 171 bits (434), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E + WL PL
Sbjct: 541 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPL 599
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK +
Sbjct: 600 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNK 656
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 657 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 716
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 717 SRVGHIFR 724
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 87/233 (37%), Positives = 131/233 (56%), Gaps = 6/233 (2%)
Query: 203 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
P +P + PG+ G+ +P + E N+ S+ I DR I D R C
Sbjct: 430 PRDP--KAPGQFGRPVVVPHGKEKEVERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQ 487
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
+LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+
Sbjct: 488 LVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDK 547
Query: 323 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
Y+ +F KVR++R ER GLIR R GA+++ G+V+ FLD+H E + WL PLL +Y
Sbjct: 548 YMSQF-PKVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPLLERVYLS 606
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
RK + PVI+ I+ + + +V D+ RGIF W M + +P K K
Sbjct: 607 RKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNK 656
>gi|326921959|ref|XP_003207221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Meleagris gallopavo]
Length = 596
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 120/187 (64%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +YI + +G V+LIR+ +R G+IR R GA + G+V+VF+D+HCE WL PL
Sbjct: 197 LKSALSEYISKLDG-VKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPL 255
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA + S+R + P+ID ID++T+++ Y +RG+F+W + + +PE E K R+
Sbjct: 256 LARLSSNRNSVVSPIIDVIDWKTFQY---YHSVSLHRGVFDWKLDFHWEPVPEHEEKVRQ 312
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ P +SP AG + AMDR +F +G YD + +WG EN ELS + W+CGGS+E +PCS
Sbjct: 313 SPTSPIRSPAVAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSIRTWLCGGSVEIIPCS 372
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 373 RVGHVYR 379
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 77/177 (43%), Positives = 114/177 (64%), Gaps = 4/177 (2%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C +Y LP ASVI+ FH+E +S+L+RTVHSII P L++IILVDD S + L
Sbjct: 140 CLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSIINTAPKASLKDIILVDDLSQQGPLKS 199
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
L +YI + +G V+LIR+ +R G+IR R GA + G+V+VF+D+HCE WL PLLA
Sbjct: 200 ALSEYISKLDG-VKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPLLAR 258
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ S+R + P+ID ID++T+++ Y +RG+F+W + + +PE E K R+
Sbjct: 259 LSSNRNSVVSPIIDVIDWKTFQY---YHSVSLHRGVFDWKLDFHWEPVPEHEEKVRQ 312
>gi|194855550|ref|XP_001968569.1| GG24947 [Drosophila erecta]
gi|190660436|gb|EDV57628.1| GG24947 [Drosophila erecta]
Length = 659
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 92/197 (46%), Positives = 123/197 (62%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L+DY+++ F V ++ + ER+GLI R GAK + GEV+VF D+H EV NWLPP
Sbjct: 240 LKQPLDDYVQQHFPHLVTVVHSPERQGLIGARIAGAKVAVGEVMVFFDSHIEVNYNWLPP 299
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
L+ PI + KI T P++D I ++ +F RG F+W MLYK+ LPE K
Sbjct: 300 LIEPIAINPKICTCPIVDSISHE--DFSYFGGNKDGTRGGFDWKMLYKQLPVLPEDALDK 357
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S+PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 358 ----SQPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVP 413
Query: 180 CSRIVSLIRPVFKADGK 196
CSR+ + R K G
Sbjct: 414 CSRVAHIFRGPMKPRGN 430
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 89/202 (44%), Positives = 127/202 (62%), Gaps = 4/202 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PD+R+E CK Y LP SV+ VF NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISVNRSVPDVRLEACKTRKYLAKLPNISVVFVFFNEHFNTLLRSMYSVINR 219
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L++I+LVDD S L Q L+DY+Q+ F V ++ + ER+GLI R GAK +
Sbjct: 220 TPPELLKQIVLVDDGSEWDSLKQPLDDYVQQHFPHLVTVVHSPERQGLIGARIAGAKVAV 279
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
GEV+VF D+H EV NWLPPL+ PI + KI T P++D I ++ +F RG
Sbjct: 280 GEVMVFFDSHIEVNYNWLPPLIEPIAINPKICTCPIVDSISHE--DFSYFGGNKDGTRGG 337
Query: 415 FEWGMLYKE-NELPEREAKKRK 435
F+W MLYK+ LPE K +
Sbjct: 338 FDWKMLYKQLPVLPEDALDKSQ 359
>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 621
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 91/228 (39%), Positives = 134/228 (58%), Gaps = 2/228 (0%)
Query: 206 PYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP 265
P + GPGE GKA + ++ + N+ SN IS DR++ D R + C Y
Sbjct: 96 PLRVGPGEMGKAVTVAKSEEEEMEKMFKVNYFNLMISNRISNDRSLADYRPQGCFAKKYS 155
Query: 266 LDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ 325
+LPK SVILV+HNE +S LMRTVHS+I R+P LEEI+L+DD S++ L + L+DYI
Sbjct: 156 RNLPKTSVILVYHNEAWSVLMRTVHSVINRSPRHLLEEILLIDDASTREYLGRPLDDYIT 215
Query: 326 RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKI 385
+ VR+ ER GLI R +GA+ ++ V+ FLD+HCE WL PLL I ++R
Sbjct: 216 KLPVPVRVHHAKERRGLIGARLKGAELAKAPVLTFLDSHCECSKGWLEPLLDRIAANRST 275
Query: 386 MTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+ PVI+ ID +++ F + E H G F+W +++ +P+ E +
Sbjct: 276 VVCPVINQIDDRSFAFVNATEVSH--IGGFDWNIIFNWYNIPQSEKDR 321
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 3/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + L+DYI + VR+ ER GLI R +GA+ ++ V+ FLD+HCE WL PL
Sbjct: 206 LGRPLDDYITKLPVPVRVHHAKERRGLIGARLKGAELAKAPVLTFLDSHCECSKGWLEPL 265
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I ++R + PVI+ ID +++ F + E H G F+W +++ +P+ E +
Sbjct: 266 LDRIAANRSTVVCPVINQIDDRSFAFVNATEVSH--IGGFDWNIIFNWYNIPQSEKDRIG 323
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ SEP +SPT AGGLF++D+++F ELG YDP WGGEN ELS KIWMCGG +E+VPC
Sbjct: 324 GDKSEPVRSPTMAGGLFSIDKSYFEELGSYDPEFEFWGGENIELSLKIWMCGGILEFVPC 383
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 384 SHVGHVFR 391
>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 533
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 118/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L+DY+ KVR++R T+REGLIR R GAK V+ +LD+HCE WL PL
Sbjct: 98 LMQQLDDYMSSL-PKVRIVRATQREGLIRARLLGAKYVTAPVLTYLDSHCECTEGWLEPL 156
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I ++ + PVID ID T E+ HYR G F+W + + + +P
Sbjct: 157 LDRIARNKTNVVCPVIDVIDDNTLEY--------HYRDSTSVNVGGFDWNLQFNWHPVPA 208
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE + K+ +EP SPT AGGLFA+D+ FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 209 RERARHKHTAEPVWSPTMAGGLFAIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 268
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 269 LEIVPCSHVGHIFR 282
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 123/210 (58%), Gaps = 17/210 (8%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+PD R E CK Y DLP+ SV++ FHNE +S L+RTVHS+I
Sbjct: 17 AFNQYASDLISIRRTLPDPRDEWCKQPGRYLEDLPQTSVVICFHNEAWSVLLRTVHSVID 76
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
R+PA ++EIILVDDFS L Q+L+DY+ KVR++R T+REGLIR R GAK
Sbjct: 77 RSPAHLIKEIILVDDFSDMPHLMQQLDDYMSSL-PKVRIVRATQREGLIRARLLGAKYVT 135
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-- 412
V+ +LD+HCE WL PLL I ++ + PVID ID T E+ HYR
Sbjct: 136 APVLTYLDSHCECTEGWLEPLLDRIARNKTNVVCPVIDVIDDNTLEY--------HYRDS 187
Query: 413 -----GIFEWGMLYKENELPEREAKKRKYN 437
G F+W + + + +P RE + K+
Sbjct: 188 TSVNVGGFDWNLQFNWHPVPARERARHKHT 217
>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
Length = 721
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 255 LKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPL 313
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I +T E+ HYR G F+W + + + +PE
Sbjct: 314 LDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVPE 365
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 366 RERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 425
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 426 LEIVPCSHVGHIFR 439
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 135 IDPPAN-FEENPGELGKPVRLPKEMSDDMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 193
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 194 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 253
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 254 HLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 312
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 313 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 364
Query: 428 EREAKK 433
ERE K+
Sbjct: 365 ERERKR 370
>gi|26325284|dbj|BAC26396.1| unnamed protein product [Mus musculus]
Length = 930
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 533 LKADLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 591
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y +RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK
Sbjct: 592 LERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNG 648
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 649 IKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIELIPC 708
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 709 SRVGHIFR 716
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 126/218 (57%), Gaps = 4/218 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
VI+ I+ + + +V D+ RG+F W M + +P
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIP 640
>gi|449493084|ref|XP_002193936.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 2 [Taeniopygia
guttata]
Length = 599
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 87/215 (40%), Positives = 130/215 (60%), Gaps = 4/215 (1%)
Query: 221 PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNE 280
PE + G L +G N S I+ R +P++R C Y LP ASVI+ FH+E
Sbjct: 124 PEGRQMGGRIGLEAHGFNALLSERIALRRDLPEVRHPLCLQQKYDSSLPTASVIICFHDE 183
Query: 281 GFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE 340
+S+L+RTVHSI+ P +L++IILVDD S + L L +YI + +G V+LIR+ +R
Sbjct: 184 AWSTLLRTVHSIMDTAPKAFLKDIILVDDLSQQGSLKSALSEYISKLDG-VKLIRSNKRL 242
Query: 341 GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE 400
G+IR R GA + G+V++F+D+HCE WL PLLA + S R + PVID ID++T++
Sbjct: 243 GVIRGRMLGAARATGDVLIFMDSHCECQKGWLEPLLARLSSSRNSVVSPVIDVIDWKTFQ 302
Query: 401 FRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ Y +RG+F+W + + +PE E K R+
Sbjct: 303 Y---YHSVGLHRGVFDWKLDFHWEPVPEHEEKVRQ 334
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 100/191 (52%), Gaps = 22/191 (11%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L L +YI + +G V+LIR+ +R G+IR R GA + G+V++F+D+HCE WL P
Sbjct: 218 SLKSALSEYISKLDG-VKLIRSNKRLGVIRGRMLGAARATGDVLIFMDSHCECQKGWLEP 276
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LLA + S R + PVID ID++T+++ Y +RG+F+W + + +PE E K R
Sbjct: 277 LLARLSSSRNSVVSPVIDVIDWKTFQY---YHSVGLHRGVFDWKLDFHWEPVPEHEEKVR 333
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ P SP F LE G+ + W+CGG +E +PC
Sbjct: 334 Q---SPI-SPISVTNTFHTSTQ--LEYSLTRQGV------------ETWLCGGFVEIIPC 375
Query: 181 SRIVSLIRPVF 191
SR+ + R F
Sbjct: 376 SRVGHVYRSHF 386
>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
Length = 685
Score = 171 bits (433), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 124/210 (59%), Gaps = 34/210 (16%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV------------ 53
LE+Y+ +F GKV+++R +REGLIR R RGA + GEV+ +LD+HCE
Sbjct: 229 LEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECMEGKETENRVRT 288
Query: 54 -----GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR------GIFE 102
W+ PLL I D + PVID ID T+E+ HH + G F+
Sbjct: 289 RNKKCKKRWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEY-------HHSKAYFTSVGGFD 341
Query: 103 WGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 162
WG+ + + +PER+ K R +P +SPT AGGLF++D+ +F +LG YDPG +WGGEN
Sbjct: 342 WGLQFNWHSIPERDRKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENL 401
Query: 163 ELSFK----IWMCGGSIEWVPCSRIVSLIR 188
ELSFK IWMCGG++E VPCS + + R
Sbjct: 402 ELSFKVRKCIWMCGGTLEIVPCSHVGHVFR 431
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 109/283 (38%), Positives = 149/283 (52%), Gaps = 48/283 (16%)
Query: 185 SLIRPVFKADGKLGNLEPPLEP-YKEG----PGEGGKAY-----HLPEAYRAAGDASLGE 234
+L +PVF D P +P YK+G GE GKA L +A D +
Sbjct: 92 NLKKPVFMVD--------PNDPIYKKGDANQAGELGKAVVVDKTKLTSEQKAIYDKGMLN 143
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N S+ IS RT+P ECK Y +LP+ SVI+ FHNE +S L+RTVHS+++
Sbjct: 144 NAFNQYASDMISVHRTLPTNIDAECKVEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLE 203
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP LEEI+LVDDFS + LE+Y+ +F GKV+++R +REGLIR R RGA +
Sbjct: 204 RTPEHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIAT 263
Query: 355 GEVIVFLDAHCE-----------------VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 397
GEV+ +LD+HCE W+ PLL I D + PVID ID
Sbjct: 264 GEVLTYLDSHCECMEGKETENRVRTRNKKCKKRWIEPLLDRIKRDPTTVVCPVIDVIDDN 323
Query: 398 TWEFRSVYEPDHHYR------GIFEWGMLYKENELPEREAKKR 434
T+E+ HH + G F+WG+ + + +PER+ K R
Sbjct: 324 TFEY-------HHSKAYFTSVGGFDWGLQFNWHSIPERDRKNR 359
>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
Length = 647
Score = 171 bits (433), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 256 LKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPL 314
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I +T E+ HYR G F+W + + + +PE
Sbjct: 315 LDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVPE 366
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 367 RERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 427 LEIVPCSHVGHIFR 440
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 428 EREAKK 433
ERE K+
Sbjct: 366 ERERKR 371
>gi|363730187|ref|XP_418741.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 2 [Gallus gallus]
Length = 638
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 119/187 (63%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +YI + +G V+LIR+ R G+IR R GA + G+V+VF+D+HCE WL PL
Sbjct: 239 LKSALSEYISKLDG-VKLIRSNRRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPL 297
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA + S+R + P+ID ID++T+++ Y +RG+F+W + + +PE E K R+
Sbjct: 298 LARLSSNRNSVVSPIIDVIDWKTFQY---YHSVSLHRGVFDWKLDFHWEPVPEHEEKVRQ 354
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ P +SP AG + AMDR +F +G YD + +WG EN ELS + W+CGGS+E +PCS
Sbjct: 355 SPTSPIRSPAVAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSIRTWLCGGSVEIIPCS 414
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 415 RVGHVYR 421
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/246 (38%), Positives = 139/246 (56%), Gaps = 16/246 (6%)
Query: 192 KADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLG--EYGMNMETSNHISFDR 249
+ + K G+ E L G G G A G+ LG +G N S I R
Sbjct: 123 RPEAKEGDPESQLLSLPLGDGNGA----------ATGERPLGLETHGFNEALSERIPLRR 172
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
+P++R C +Y LP ASVI+ FH+E +S+L+RTVHSI+ P L++IILVDD
Sbjct: 173 ELPEVRHPLCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSILNTAPKASLKDIILVDD 232
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L L +YI + +G V+LIR+ R G+IR R GA + G+V+VF+D+HCE
Sbjct: 233 LSQQGPLKSALSEYISKLDG-VKLIRSNRRLGVIRGRMLGAARATGDVLVFMDSHCECQK 291
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLLA + S+R + P+ID ID++T+++ Y +RG+F+W + + +PE
Sbjct: 292 GWLEPLLARLSSNRNSVVSPIIDVIDWKTFQY---YHSVSLHRGVFDWKLDFHWEPVPEH 348
Query: 430 EAKKRK 435
E K R+
Sbjct: 349 EEKVRQ 354
>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Anolis carolinensis]
Length = 634
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 84/189 (44%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL+DY+++F+ V+++R ER+GLI R GA + GE + FLDAHCE WL PL
Sbjct: 233 LQDKLDDYVKQFH-IVKVVRQKERKGLITARLLGASIATGETLTFLDAHCECFYGWLEPL 291
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y H+ RG F+W + + LPE E+KK
Sbjct: 292 LARIAENNTYVVSPDISSIDLNTFEFSKPSPYGQSHN-RGNFDWSLSFGWESLPEHESKK 350
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 351 RKDETYPIKTPTFAGGLFSISKDYFYNIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIIP 410
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 411 CSVVGHVFR 419
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 103/247 (41%), Positives = 140/247 (56%), Gaps = 14/247 (5%)
Query: 202 PPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP + PG GKA+ +L + + ++ N S+ IS R + PD R
Sbjct: 115 PPQD--SNAPGASGKAFKTINLSPDEQKEKERGDEKHCFNAFASDRISLHRDLGPDTRPP 172
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +PA L+EIILVDD S
Sbjct: 173 ECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDD 231
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L KL+DY+++F+ V+++R ER+GLI R GA + GE + FLDAHCE WL P
Sbjct: 232 YLQDKLDDYVKQFH-IVKVVRQKERKGLITARLLGASIATGETLTFLDAHCECFYGWLEP 290
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAK 432
LLA I + + P I ID T+EF S Y H+ RG F+W + + LPE E+K
Sbjct: 291 LLARIAENNTYVVSPDISSIDLNTFEFSKPSPYGQSHN-RGNFDWSLSFGWESLPEHESK 349
Query: 433 KRKYNRY 439
KRK Y
Sbjct: 350 KRKDETY 356
>gi|308487864|ref|XP_003106127.1| CRE-GLY-6 protein [Caenorhabditis remanei]
gi|308254701|gb|EFO98653.1| CRE-GLY-6 protein [Caenorhabditis remanei]
Length = 693
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 92/237 (38%), Positives = 140/237 (59%), Gaps = 7/237 (2%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
NL P E + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYSPREEWGEG---GSGVTHLTPEQQKLADSTFAVNQFNLFVSDGISVRRSLPEIRKP 145
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
C+ YP DLP SVI+V+HNE +S+L+RTV S+I R+P L EI+LVDDFS + L
Sbjct: 146 SCRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLREILLVDDFSDRDFLR 205
Query: 318 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
KL++ ++ +++IR+ +R GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPKLDESLKPLPTDIKIIRSNQRVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTEMAKQ 319
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 122/185 (65%), Gaps = 4/185 (2%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
KL++ ++ +++IR+ +R GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 208 KLDESLKPLPTDIKIIRSNQRVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
Query: 65 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN- 123
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+ +
Sbjct: 268 IKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTEMAKQHLLDP 324
Query: 124 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
+ P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS +
Sbjct: 325 TGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHV 384
Query: 184 VSLIR 188
+ R
Sbjct: 385 GHVFR 389
>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
Length = 650
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 256 LKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPL 314
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I +T E+ HYR G F+W + + + +PE
Sbjct: 315 LDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVPE 366
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 367 RERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 427 LEIVPCSHVGHIFR 440
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEEDPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 428 EREAKK 433
ERE K+
Sbjct: 366 ERERKR 371
>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
Length = 650
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 256 LKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPL 314
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I +T E+ HYR G F+W + + + +PE
Sbjct: 315 LDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVPE 366
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 367 RERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 427 LEIVPCSHVGHIFR 440
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP E + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y DLPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 428 EREAKK 433
ERE K+
Sbjct: 366 ERERKR 371
>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
Length = 637
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 135/220 (61%), Gaps = 5/220 (2%)
Query: 205 EPYKEGPGEGGKAYHLPEAY-RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
E + GPGE GK Y L +A D E G + S+ I+ +R++PD+R C+
Sbjct: 89 EANRVGPGEHGKPYRLTGVEEKALNDKLFKENGYSAVVSDMIALNRSVPDIRHISCRTKA 148
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y +LP SVI++F+NE +S+L+RTV+S++ R+PA L+E+ILV+D S+K L L ++
Sbjct: 149 YLRELPTVSVIVIFYNEHWSALLRTVYSVLNRSPASLLKEVILVNDHSTKPFLWAPLREF 208
Query: 324 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
++ KVRLI ER GLI R GA+E+RG+V++ LD+H EV NWLPPLL PI D
Sbjct: 209 VESELAPKVRLIDLPERSGLILARMAGAREARGDVLIVLDSHTEVNNNWLPPLLEPIAED 268
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+ P ID I + T+++R+ D RG F+W YK
Sbjct: 269 YRTCVCPFIDVIAHDTFQYRA---QDEGKRGAFDWKFYYK 305
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 84/173 (48%), Positives = 107/173 (61%), Gaps = 6/173 (3%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVRLI ER GLI R GA+E+RG+V++ LD+H EV NWLPPLL PI D + P
Sbjct: 216 KVRLIDLPERSGLILARMAGAREARGDVLIVLDSHTEVNNNWLPPLLEPIAEDYRTCVCP 275
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
ID I + T+++R+ D RG F+W YK L + ++P+ SP AGG
Sbjct: 276 FIDVIAHDTFQYRA---QDEGKRGAFDWKFYYKRLPLLPGDLDD---PTKPFNSPVMAGG 329
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+ + R
Sbjct: 330 LFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHVYR 382
>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
Length = 650
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 256 LKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPL 314
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I +T E+ HYR G F+W + + + +PE
Sbjct: 315 LDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVPE 366
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 367 RERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 427 LEIVPCSHVGHIFR 440
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 96/246 (39%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP + + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSDEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y +LPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 428 EREAKK 433
ERE K+
Sbjct: 366 ERERKR 371
>gi|410916145|ref|XP_003971547.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Takifugu rubripes]
Length = 579
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 79/173 (45%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVR +RN +REGLIR+R RGA ++ V+ FLD+HCEV +WLPPLL I D + P
Sbjct: 194 KVRCVRNPQREGLIRSRVRGADSAKAAVLTFLDSHCEVNKDWLPPLLQRIKQDPTRVVSP 253
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L + +R ++P K+P AGG
Sbjct: 254 VIDIINMDTFAYVAA---SADLRGGFDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIAGG 310
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +DR++F LG YD + +WGGENFE+SF++W CGGS+E +PCSR+ + R
Sbjct: 311 LFVIDRSWFNHLGKYDTAMDIWGGENFEISFRVWQCGGSLEILPCSRVGHVFR 363
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 86/207 (41%), Positives = 115/207 (55%), Gaps = 9/207 (4%)
Query: 228 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 287
GD Y N S I +R + D R C Y DLP S+I+ FHNE S+L+R
Sbjct: 97 GDDPYTLYAFNQRESERIPSNRALRDTRHFRCATIRYDSDLPPTSIIITFHNEARSTLLR 156
Query: 288 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 347
TV S++ RTP + EIILVDDFS Q L KVR +RN +REGLIR+R
Sbjct: 157 TVRSVLNRTPVHLIHEIILVDDFSDDESDCQLLIKL-----PKVRCVRNPQREGLIRSRV 211
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
RGA ++ V+ FLD+HCEV +WLPPLL I D + PVID I+ T+ + +
Sbjct: 212 RGADSAKAAVLTFLDSHCEVNKDWLPPLLQRIKQDPTRVVSPVIDIINMDTFAYVAA--- 268
Query: 408 DHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + +K +L PE+ A++
Sbjct: 269 SADLRGGFDWSLHFKWEQLSPEQRARR 295
>gi|332021082|gb|EGI61469.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Acromyrmex
echinatior]
Length = 580
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + ++ YI FN +VRL + REGLIR R GA+++ G+V++FLD+H EV W+ P
Sbjct: 158 LHENIKVYIRNNFNDRVRLFKTERREGLIRARVFGARKATGKVLIFLDSHIEVNEIWIEP 217
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I R I+ +PVID I+ T++ Y RG F WG+ +K + LP
Sbjct: 218 LLSRIAYSRNIIPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLPIGTLNHD 273
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P KSPT AGGLFA+DR +F ++G YD G+ +WGGEN E+SF+IWMCGGSIE +PC
Sbjct: 274 VDFVKPIKSPTMAGGLFAIDREYFTKMGEYDIGMDIWGGENLEISFRIWMCGGSIELIPC 333
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 334 SRVGHVFR 341
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 137/226 (60%), Gaps = 7/226 (3%)
Query: 203 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
P ++G E G +L + + D +Y N+ S+++ R IPD R + CK
Sbjct: 47 PAVTLEQGLDELGMVKNLEDQRKR--DEGYKDYAFNILISDNLGVQRNIPDTRHKLCKMQ 104
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
YP +LP AS+I+ F+NE +++L+R++HSI+++TP L EIILV+D+S L + ++
Sbjct: 105 KYPANLPNASIIICFYNEHYTTLLRSLHSILEKTPTVLLHEIILVNDYSDSDTLHENIKV 164
Query: 323 YIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
YI+ FN +VRL + REGLIR R GA+++ G+V++FLD+H EV W+ PLL+ I
Sbjct: 165 YIRNNFNDRVRLFKTERREGLIRARVFGARKATGKVLIFLDSHIEVNEIWIEPLLSRIAY 224
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
R I+ +PVID I+ T++ Y RG F WG+ +K + LP
Sbjct: 225 SRNIIPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLP 266
>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
Length = 650
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 88/194 (45%), Positives = 117/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 256 LKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPL 314
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I +T E+ HYR G F+W + + + +PE
Sbjct: 315 LDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVPE 366
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 367 RERKRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 426
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 427 LEIVPCSHVGHIFR 440
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 137/246 (55%), Gaps = 22/246 (8%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
++PP ++E PGE GK LP E + A D + N S+ IS RT+PD R
Sbjct: 136 IDPPAN-FEENPGELGKPVRLPKEMSEEMKKAVDDGWTKNAFNQYVSDLISVHRTLPDPR 194
Query: 256 MEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
CK Y DLPK VI+ FHNE ++ L+RTVHS++ R+P + +IILVDD+S
Sbjct: 195 DAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMP 254
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LEDY + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL P
Sbjct: 255 HLKRQLEDYFAAY-PKVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEP 313
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELP 427
LL I + + PVID I +T E+ HYR G F+W + + + +P
Sbjct: 314 LLDRIARNSTTVVCPVIDVISDETLEY--------HYRDSGGVNVGGFDWNLQFSWHPVP 365
Query: 428 EREAKK 433
ERE K+
Sbjct: 366 ERERKR 371
>gi|397525624|ref|XP_003832760.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Pan
paniscus]
Length = 940
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++ +++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDTIRCPVMAGGLFSIHKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/219 (37%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|13929126|ref|NP_113984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Rattus norvegicus]
gi|51315691|sp|O88422.1|GALT5_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|3510639|gb|AAC69708.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T5 [Rattus
norvegicus]
gi|149047792|gb|EDM00408.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
gi|149047793|gb|EDM00409.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
Length = 930
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 533 LKANLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 591
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y +RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK
Sbjct: 592 LERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIPPDVIAKNG 648
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 649 IKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 708
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 709 SRVGHIFR 716
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 126/218 (57%), Gaps = 4/218 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPGKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKANLDKYMSQF-P 545
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
VI+ I+ + + +V D+ RG+F W M + +P
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIP 640
>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
saltator]
Length = 605
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 92/231 (39%), Positives = 132/231 (57%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
K PGE G A H+P A N+ S+ IS +R++ D+R++ CK Y
Sbjct: 100 KGSPGEMGAAVHIPPENEAKQQELFKLNQFNLMASDMISLNRSLKDIRLDGCKNKKYNKY 159
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+++VFHNE +++L+RTV S+I R+P L+E+ILVDD S + L Q LEDYI
Sbjct: 160 LPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEVILVDDASERDHLKQDLEDYIATL 219
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I +DR +
Sbjct: 220 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVV 279
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 280 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRNSDR 327
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 116/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDYI + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 208 LKQDLEDYIATLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 267
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I +DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 268 LSRIANDRHTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRN 324
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 325 SDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 384
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 385 SHVGHVFR 392
>gi|158749624|ref|NP_766443.2| polypeptide N-acetylgalactosaminyltransferase 5 [Mus musculus]
gi|341940730|sp|Q8C102.2|GALT5_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|148694985|gb|EDL26932.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 533 LKADLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 591
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y +RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK
Sbjct: 592 LERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNG 648
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 649 IKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 708
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 709 SRVGHIFR 716
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 126/218 (57%), Gaps = 4/218 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
VI+ I+ + + +V D+ RG+F W M + +P
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIP 640
>gi|327262637|ref|XP_003216130.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Anolis carolinensis]
Length = 500
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 81/173 (46%), Positives = 114/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA+ + EV+ FLD+HCEV +WL PLL I D + P
Sbjct: 143 KVKCLRNRRREGLIRSRIRGAEMAEAEVLTFLDSHCEVNKDWLLPLLQRIKEDPSHVVSP 202
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L ++ KR +EP K+P AGG
Sbjct: 203 VIDIINLDTFAYVAA---SSDLRGGFDWSLHFKWEQLSPKQKAKRTDPTEPIKTPIIAGG 259
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 260 LFVIDKAWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 312
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 111/200 (55%), Gaps = 8/200 (4%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
Y N S I DR I D R C Y DLP S+I+ FHNE S+L+RT+ S++
Sbjct: 53 YAFNQRESERIPSDRAIRDTRHHRCTTLHYRTDLPPTSIIITFHNEARSTLLRTIRSVLN 112
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP + EIILVDDFS D + L KV+ +RN REGLIR+R RGA+ +
Sbjct: 113 RTPVHLVHEIILVDDFSDDPDDCRLLIKL-----PKVKCLRNRRREGLIRSRIRGAEMAE 167
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
EV+ FLD+HCEV +WL PLL I D + PVID I+ T+ + + RG
Sbjct: 168 AEVLTFLDSHCEVNKDWLLPLLQRIKEDPSHVVSPVIDIINLDTFAYVAA---SSDLRGG 224
Query: 415 FEWGMLYKENELPEREAKKR 434
F+W + +K +L ++ KR
Sbjct: 225 FDWSLHFKWEQLSPKQKAKR 244
>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 622
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKERLEQYVQQLQ-IVRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ + P I ID T++F R V H RG F+W + + LPE E ++R
Sbjct: 283 LARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/246 (41%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ E + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTNLETKEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKERLEQYVQQLQ-IVRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ + P I ID T++F R V H RG F+W + + LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|109732606|gb|AAI16333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 533 LKADLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 591
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y +RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK
Sbjct: 592 LERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIPPDVVAKNG 648
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 649 IKETDIIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 708
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 709 SRVGHIFR 716
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 82/218 (37%), Positives = 126/218 (57%), Gaps = 4/218 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
PG+ G+ +P + + E N+ S+ I DR I D R C DLP
Sbjct: 427 APGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNDLP 486
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 487 TTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQF-P 545
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y +RK + P
Sbjct: 546 KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLNRKKVACP 605
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
VI+ I+ + + +V D+ RG+F W M + +P
Sbjct: 606 VIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWKTIP 640
>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 512
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
K++LIRN REGL+R+R GA ++GE++ FLD+HCE + WL PLL + DR I+ P
Sbjct: 143 KIKLIRNERREGLVRSRIFGADAAKGEILTFLDSHCECNIGWLEPLLHRVSQDRTIVVSP 202
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I T+++ RG F+W + +K + + KRK EP K+P AGG
Sbjct: 203 IIDVISMDTFDYIGASS---ELRGGFDWSLHFKWDGFTPAQRAKRKSPIEPIKTPMIAGG 259
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF+++R F+E G YD + +WGGENFE+SF+ WMCGGS+E +PCSR+ + R
Sbjct: 260 LFSINRQRFIETGKYDDQMDIWGGENFEISFRTWMCGGSLEIIPCSRVGHVFR 312
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 112/200 (56%), Gaps = 8/200 (4%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
N E S+ S DR +PD R C Y LP SVI+ FHNE S+L+RT+ S++ R
Sbjct: 54 AFNQEASDKTSIDRKVPDTRHSWCYNQVYHPTLPSTSVIITFHNEARSTLLRTIVSVLNR 113
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 355
+P +EEIILVDDFS + L K++LIRN REGL+R+R GA ++G
Sbjct: 114 SPPHLIEEIILVDDFSEDVNTGLLLTQM-----PKIKLIRNERREGLVRSRIFGADAAKG 168
Query: 356 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 415
E++ FLD+HCE + WL PLL + DR I+ P+ID I T+++ RG F
Sbjct: 169 EILTFLDSHCECNIGWLEPLLHRVSQDRTIVVSPIIDVISMDTFDYIGA---SSELRGGF 225
Query: 416 EWGMLYKENELPEREAKKRK 435
+W + +K + + KRK
Sbjct: 226 DWSLHFKWDGFTPAQRAKRK 245
>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 648
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 107/258 (41%), Positives = 144/258 (55%), Gaps = 21/258 (8%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYR---AAGDASLGEYGMNMETSNHIS 246
V A +G L PP P +GPGE G+ L + + A N S+ IS
Sbjct: 120 VDHAPAPVGVLAPPQNP--DGPGEMGRPVVLKDLTKEQEAKVKQGWDRNAFNQYISDMIS 177
Query: 247 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 306
R++PD+R ECK Y DLP SVI+ FHNE +S L+RTVHSII R+P + L EIIL
Sbjct: 178 LHRSLPDVRDSECKDERYLKDLPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIIL 237
Query: 307 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 366
VDD+S L QKLEDY+ F KV+++R +REGLIR R GA + V+ +LD+HCE
Sbjct: 238 VDDYSDMPHLKQKLEDYVAHFP-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCE 296
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGM 419
WL PLL I + + PVID I T+E+ HYR G F+W +
Sbjct: 297 CTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEY--------HYRDSGGVNVGGFDWNL 348
Query: 420 LYKENELPEREAKKRKYN 437
+ + +PERE ++RK++
Sbjct: 349 QFSWHAVPERERQRRKHS 366
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/194 (45%), Positives = 122/194 (62%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLEDY+ F KV+++R +REGLIR R GA + V+ +LD+HCE WL PL
Sbjct: 247 LKQKLEDYVAHFP-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCECTEGWLEPL 305
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I T+E+ HYR G F+W + + + +PE
Sbjct: 306 LDRIARNSTTVVCPVIDVISDSTFEY--------HYRDSGGVNVGGFDWNLQFSWHAVPE 357
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE ++RK++ +P SPT AGGLF++D+AFF +LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 358 RERQRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKTWMCGGT 417
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 418 LEIVPCSHVGHIFR 431
>gi|196001851|ref|XP_002110793.1| hypothetical protein TRIADDRAFT_11844 [Trichoplax adhaerens]
gi|190586744|gb|EDV26797.1| hypothetical protein TRIADDRAFT_11844, partial [Trichoplax
adhaerens]
Length = 490
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 130/227 (57%), Gaps = 6/227 (2%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
G+ G A +P + A + N S+ IS RT+PD R CK +PL LP
Sbjct: 1 GQNGTAVIVPAESKNASEQLFNRNHFNQWISDRISLHRTLPDPRHPMCKDQIFPLHLPTT 60
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS---SKADLDQKLEDYIQRFN 328
SV++VFHNE +S+L+RTVHSI+ R+P L EIIL DD+S A+L LE Y +
Sbjct: 61 SVVVVFHNEAWSTLLRTVHSILSRSPPDLLHEIILQDDYSDPIGHAELFMPLELYTSKLE 120
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KV++ RN + EGLIR+R G + V+ FLDAHCEV WL PLL IY + +
Sbjct: 121 -KVKIFRNEKHEGLIRSRLNGFSHATAPVVTFLDAHCEVTTGWLEPLLERIYLNETTVVC 179
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
P ID ID +T++++ + P RG+F W + ++ +P E K+RK
Sbjct: 180 PEIDVIDDRTFQYQ--FGPPALMRGVFNWQLYFRWALIPPEEHKRRK 224
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 112/183 (61%), Gaps = 3/183 (1%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
LE Y + KV++ RN + EGLIR+R G + V+ FLDAHCEV WL PLL I
Sbjct: 112 LELYTSKLE-KVKIFRNEKHEGLIRSRLNGFSHATAPVVTFLDAHCEVTTGWLEPLLERI 170
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSE 125
Y + + P ID ID +T++++ + P RG+F W + ++ +P E K+RK +
Sbjct: 171 YLNETTVVCPEIDVIDDRTFQYQ--FGPPALMRGVFNWQLYFRWALIPPEEHKRRKSPID 228
Query: 126 PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVS 185
P SPT AGGLFA+ + FF LG YD VWGGEN E+SFK W+CGG +E VPCSR+
Sbjct: 229 PVWSPTMAGGLFAISKKFFKRLGTYDDQFDVWGGENMEISFKAWLCGGKLEIVPCSRVGH 288
Query: 186 LIR 188
+ R
Sbjct: 289 VFR 291
>gi|148672125|gb|EDL04072.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 436
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 38 LKERLEQYVQQLQ-IVRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 96
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ + P I ID T++F R V H RG F+W + + LPE E ++R
Sbjct: 97 LARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQRR 156
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 157 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 216
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 217 SVVGHVFR 224
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/125 (42%), Positives = 72/125 (57%), Gaps = 2/125 (1%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L ++LE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 38 LKERLEQYVQQLQ-IVRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 96
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LA I D+ + P I ID T++F R V H RG F+W + + LPE E ++R
Sbjct: 97 LARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQRR 156
Query: 435 KYNRY 439
K Y
Sbjct: 157 KDETY 161
>gi|345797223|ref|XP_545481.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Canis
lupus familiaris]
Length = 602
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 205 LKDDLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 263
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 264 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVVAKNR 320
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 321 IKETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 380
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 381 SRVGHIFR 388
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 135/248 (54%), Gaps = 5/248 (2%)
Query: 189 PVFKADGKLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISF 247
P A ++ ++ L P E PG+ G+ +P + E N+ S+ I
Sbjct: 77 PAQPAVRRVSGIDATLSPRDPEAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPV 136
Query: 248 DRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILV 307
DR I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LV
Sbjct: 137 DRAIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLV 196
Query: 308 DDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
DDFS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E
Sbjct: 197 DDFSTKDYLKDDLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVEC 255
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
+ WL PLL +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 256 NVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 312
Query: 428 EREAKKRK 435
K +
Sbjct: 313 PDVVAKNR 320
>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
Length = 518
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 79/173 (45%), Positives = 113/173 (65%), Gaps = 1/173 (0%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVRL+R +REGLIR R GA ++G V+ FLD+HCE + WL PLL I ++ + P
Sbjct: 124 KVRLVRAKKREGLIRARLLGASVAQGNVLTFLDSHCECTMGWLEPLLDRISQNKSNVVTP 183
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T +++ G F+W + + + +P+ E K+RK + +P +SPT AGG
Sbjct: 184 VIDVINDDTIQYQYSSAKSTSVGG-FDWNLQFNWHGIPDHEKKRRKSDVDPVRSPTMAGG 242
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF++ R +F LG YDPG+ +WGGEN ELSF+IWMCGGS++ PCS + + R
Sbjct: 243 LFSISREYFEYLGTYDPGMDIWGGENLELSFRIWMCGGSLDIAPCSHVGHIFR 295
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 133/229 (58%), Gaps = 7/229 (3%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWDYPL 266
GE GK ++ ++ + + E G N S+ +S RT+PD+R +EC+ +Y
Sbjct: 2 GENGKGLNIDKSKLSPEELKKYEKGYQRNAFNQYASDQMSLHRTLPDVRDKECRDRNYAT 61
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
+LP S+I++FHNE +S L+RTV S + R+P ++EIILVDDFS L L+++
Sbjct: 62 ELPDTSIIVIFHNEAWSVLLRTVFSCLDRSPGHLVKEIILVDDFSDFEHLQAPLQEFADS 121
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
KVRL+R +REGLIR R GA ++G V+ FLD+HCE + WL PLL I ++ +
Sbjct: 122 -QEKVRLVRAKKREGLIRARLLGASVAQGNVLTFLDSHCECTMGWLEPLLDRISQNKSNV 180
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID I+ T +++ G F+W + + + +P+ E K+RK
Sbjct: 181 VTPVIDVINDDTIQYQYSSAKSTSVGG-FDWNLQFNWHGIPDHEKKRRK 228
>gi|189237799|ref|XP_001814012.1| PREDICTED: similar to N-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270008127|gb|EFA04575.1| PNR-like protein [Tribolium castaneum]
Length = 614
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 84/189 (44%), Positives = 119/189 (62%), Gaps = 5/189 (2%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L + L YI + F+ +V+LI+ REGLIR R GA+ ++ +VI+FLD+H EV + W+
Sbjct: 199 NLHENLSTYITKNFDDRVKLIKTERREGLIRARLFGARRTKQDVIIFLDSHIEVNVGWIE 258
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I + + +PVID I+ T+ Y RG F WG+ +K LP+
Sbjct: 259 PLLQRIKDNYTNVAMPVIDIINADTF----AYTASPLVRGGFNWGLHFKWENLPKGTLST 314
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ +P KSPT AGGLFAM R +F +LG YD G+ +WGGEN E+SF+IWMCGG +E +P
Sbjct: 315 KMDFIKPIKSPTMAGGLFAMSRKYFTDLGEYDAGMNIWGGENLEISFRIWMCGGRLELIP 374
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 375 CSRVGHVFR 383
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/235 (39%), Positives = 136/235 (57%), Gaps = 7/235 (2%)
Query: 195 GKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 254
KL + P L K + G ++ + + D ++ N+ S +S+ R +PD
Sbjct: 81 NKLQPVYPKLSTDKNELSQLGLVKNIDDQRKK--DEGYKKHAYNVLISERLSYHRDVPDT 138
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R E CK Y DLP A++I+ F+NE + +L+RTVHSII RTPA L+EI+LVDDFS
Sbjct: 139 RNELCKNISYSADLPTAAIIICFYNEHYYTLLRTVHSIIDRTPASVLKEILLVDDFSDLE 198
Query: 315 DLDQKLEDYIQR-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 373
+L + L YI + F+ +V+LI+ REGLIR R GA+ ++ +VI+FLD+H EV + W+
Sbjct: 199 NLHENLSTYITKNFDDRVKLIKTERREGLIRARLFGARRTKQDVIIFLDSHIEVNVGWIE 258
Query: 374 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPE 428
PLL I + + +PVID I+ T+ Y RG F WG+ +K LP+
Sbjct: 259 PLLQRIKDNYTNVAMPVIDIINADTF----AYTASPLVRGGFNWGLHFKWENLPK 309
>gi|395846631|ref|XP_003796006.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Otolemur garnettii]
Length = 943
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L++Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 546 LKDNLDEYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 604
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y R+ + PVI+ I+ + + +V D+ RGIF W M + +P + AK +
Sbjct: 605 LERVYLSRQKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIPPDVVAKNK 661
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 662 IKETDIIRCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 721
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 722 SRVGHIFR 729
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 88/246 (35%), Positives = 133/246 (54%), Gaps = 10/246 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V K D L +P + PG+ G+ +P + E N+ S+ I DR
Sbjct: 426 VLKIDVTLSPRDP------KAPGQFGRPVVVPLGKEKEAERRWKEGNFNVYLSDLIPVDR 479
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R C +LP SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDD
Sbjct: 480 AIEDTRPVGCAEQLVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDD 539
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S+K L L++Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 540 CSTKDYLKDNLDEYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 598
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLL +Y R+ + PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 599 GWLEPLLERVYLSRQKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWKTIPPD 655
Query: 430 EAKKRK 435
K K
Sbjct: 656 VVAKNK 661
>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 608
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 94/224 (41%), Positives = 139/224 (62%), Gaps = 13/224 (5%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLG-----EYGMNMETSNHISFDRTIPDLRMEEC 259
E + GPGE GK P R GD + E G + S+ I+ +R+IPD+R +C
Sbjct: 82 ETERHGPGEHGK----PVKLRDPGDIKMNDKLYKENGYSAVVSDLIALNRSIPDIRHPQC 137
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y +LP SVI++F+NE +S+L+RTV+S++ R+P+ L+EIILV+D S+K L +
Sbjct: 138 RKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPSHLLKEIILVNDHSTKPFLWKP 197
Query: 320 LEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
L+++++ + KV+LI ER GLI R GAK + G+V++ LD+H EV +NWLPPLL P
Sbjct: 198 LQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLEP 257
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
I D + P+ID I + T+E+RS D RG F+W YK
Sbjct: 258 IAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYK 298
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 90/186 (48%), Positives = 117/186 (62%), Gaps = 7/186 (3%)
Query: 4 KKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 62
K L++++E + KV+LI ER GLI R GAK + G+V++ LD+H EV +NWLPPLL
Sbjct: 196 KPLQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLL 255
Query: 63 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 122
PI D + P+ID I + T+E+RS D RG F+W YK LP R
Sbjct: 256 EPIAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYKR--LPLRPGDLDD- 309
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR
Sbjct: 310 PTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSR 369
Query: 183 IVSLIR 188
+ + R
Sbjct: 370 VGHVYR 375
>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 619
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/231 (42%), Positives = 141/231 (61%), Gaps = 15/231 (6%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLG-----EYGMNMETSNHISFDRTIPDLRMEEC 259
E + GPGE GK P R GD L E G + S+ I+ +R+IPD+R +C
Sbjct: 93 ETERHGPGEHGK----PLKLRDPGDIKLNDKLYKENGYSAVVSDLIALNRSIPDIRHPQC 148
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y +LP SVI++F+NE +S+L+RTV+S++ R+P L+EIILV+D S+K L +
Sbjct: 149 RKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPHLLKEIILVNDHSTKPFLWKP 208
Query: 320 LEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
L+++++ + KV+LI ER GLI R GAK + G+V++ LD+H EV +NWLPPLL P
Sbjct: 209 LQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLLEP 268
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
I D + P+ID I + T+E+RS D RG F+W YK LP R
Sbjct: 269 IAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYK--RLPLR 314
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 90/186 (48%), Positives = 117/186 (62%), Gaps = 7/186 (3%)
Query: 4 KKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 62
K L++++E + KV+LI ER GLI R GAK + G+V++ LD+H EV +NWLPPLL
Sbjct: 207 KPLQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPPLL 266
Query: 63 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKY 122
PI D + P+ID I + T+E+RS D RG F+W YK LP R
Sbjct: 267 EPIAQDYRTCVCPLIDVIVHDTFEYRS---QDEGKRGAFDWKFYYKR--LPLRPGDLDD- 320
Query: 123 NSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSR 182
+EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR
Sbjct: 321 PTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSR 380
Query: 183 IVSLIR 188
+ + R
Sbjct: 381 VGHVYR 386
>gi|291397402|ref|XP_002715124.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oryctolagus cuniculus]
Length = 439
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 101/240 (42%), Positives = 133/240 (55%), Gaps = 17/240 (7%)
Query: 201 EPPLEPYKEGPGEGGKAYHL-PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
EP E K G H PE Y + +YG+N+ S + R +PD R + C
Sbjct: 66 EPAFEHLKSYSKPIGNFNHSNPEFY-----SGFFKYGLNILISRSVGIRRDVPDTRDKIC 120
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
YP LP AS+I+ FHNE ++L+RT+ S++ TP+ LEEIILVDD S DL ++
Sbjct: 121 HQKRYPHRLPTASIIICFHNEEINALLRTLSSVVNLTPSHLLEEIILVDDMSEFDDLKEE 180
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
L+ ++ F G V+LIRN REGLIR R GA + G+V+VFLD+HCEV WL PLL+ I
Sbjct: 181 LDQKLEDFRGVVKLIRNKRREGLIRARLIGAAHASGDVLVFLDSHCEVNKVWLEPLLSVI 240
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK-------ENELPEREAK 432
D + P+ID ID T E Y+P RG F W + +K E E PE AK
Sbjct: 241 AKDPHTVVCPIIDVIDEMTLE----YKPSPIVRGTFNWMLQFKWDNVFSYEMEGPEGPAK 296
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL ++L+ +E F G V+LIRN REGLIR R GA + G+V+VFLD+HCEV WL P
Sbjct: 176 DLKEELDQKLEDFRGVVKLIRNKRREGLIRARLIGAAHASGDVLVFLDSHCEVNKVWLEP 235
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I D + P+ID ID T E Y+P RG F W + +K + + E +
Sbjct: 236 LLSVIAKDPHTVVCPIIDVIDEMTLE----YKPSPIVRGTFNWMLQFKWDNVFSYEMEGP 291
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P +SP+ AGG+FA+ R +F E+G YD + +WGGEN E+S +IWMCGG + +PC
Sbjct: 292 EGPAKPIRSPSMAGGIFAIHRHYFKEIGQYDKDMDLWGGENVEISLRIWMCGGQLFIIPC 351
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 352 SRVGHITR 359
>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
2 [Sus scrofa]
Length = 578
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERIAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 KSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 100/257 (38%), Positives = 145/257 (56%), Gaps = 16/257 (6%)
Query: 182 RIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
+ V L RP+++ +PP + + G G L E + + Y +N+
Sbjct: 55 KAVDLSRPLYE--------KPPADSHALGEWGKGSKLQLNEGELKQQEELIERYAINIYL 106
Query: 242 SNHISFDRTIPDLRMEECKY--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S+ IS R I D RM ECK +DY LP SV++ F+NE +S+L+RT+HS+++ +PA
Sbjct: 107 SDRISLHRHIEDKRMYECKSKKFDY-RRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAV 165
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+
Sbjct: 166 LLKEIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLT 224
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWG 418
FLD HCE WL PLL I D + PVID ID+ T+EF EP G F+W
Sbjct: 225 FLDCHCECNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWR 281
Query: 419 MLYKENELPEREAKKRK 435
+ ++ + +P+ E +RK
Sbjct: 282 LTFQWHSVPKHERDRRK 298
>gi|348518337|ref|XP_003446688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oreochromis niloticus]
Length = 598
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 79/173 (45%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ RN +REGLIR+R RG +R +V+ FLD+HCEV +WLPPLL I D + P
Sbjct: 213 KVKCFRNNKREGLIRSRVRGTDAARAKVLTFLDSHCEVNKDWLPPLLQRIKEDPSRVVSP 272
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L + +R ++P K+P AGG
Sbjct: 273 VIDIINMDTFAYVAA---SADLRGGFDWSLHFKWEQLSPEQRARRTDPTQPIKTPIIAGG 329
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +DRA+F LG YD + +WGGENFE+SF++W CGGS+E +PCSR+ + R
Sbjct: 330 LFVIDRAWFNHLGKYDTAMDIWGGENFEISFRVWQCGGSLEILPCSRVGHVFR 382
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 84/207 (40%), Positives = 114/207 (55%), Gaps = 9/207 (4%)
Query: 228 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 287
GD Y N S I DR + D R C Y +LP S+I+ FHNE S+L+R
Sbjct: 116 GDDPYTLYAFNQRESERIPSDRALRDTRHYRCTTLHYDSELPSTSIIITFHNEARSTLLR 175
Query: 288 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 347
T+ S++ RTP + EIILVDDFS Q L KV+ RN +REGLIR+R
Sbjct: 176 TIKSVLNRTPVHLIYEIILVDDFSDDESDCQLLTKL-----PKVKCFRNNKREGLIRSRV 230
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
RG +R +V+ FLD+HCEV +WLPPLL I D + PVID I+ T+ + +
Sbjct: 231 RGTDAARAKVLTFLDSHCEVNKDWLPPLLQRIKEDPSRVVSPVIDIINMDTFAYVAA--- 287
Query: 408 DHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + +K +L PE+ A++
Sbjct: 288 SADLRGGFDWSLHFKWEQLSPEQRARR 314
>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Sus scrofa]
gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Sus scrofa]
Length = 582
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 100/257 (38%), Positives = 145/257 (56%), Gaps = 16/257 (6%)
Query: 182 RIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMET 241
+ V L RP+++ +PP + + G G L E + + Y +N+
Sbjct: 59 KAVDLSRPLYE--------KPPADSHALGEWGKGSKLQLNEGELKQQEELIERYAINIYL 110
Query: 242 SNHISFDRTIPDLRMEECKY--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S+ IS R I D RM ECK +DY LP SV++ F+NE +S+L+RT+HS+++ +PA
Sbjct: 111 SDRISLHRHIEDKRMYECKSKKFDY-RRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAV 169
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+
Sbjct: 170 LLKEIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLT 228
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWG 418
FLD HCE WL PLL I D + PVID ID+ T+EF EP G F+W
Sbjct: 229 FLDCHCECNTGWLEPLLERIAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWR 285
Query: 419 MLYKENELPEREAKKRK 435
+ ++ + +P+ E +RK
Sbjct: 286 LTFQWHSVPKHERDRRK 302
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 LERIAEDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 301
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 302 KSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 361
Query: 181 SRI 183
S +
Sbjct: 362 SHV 364
>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
Length = 586
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/187 (46%), Positives = 115/187 (61%), Gaps = 2/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L DY KV++IR +REGLIR R GA + GEV+ +LD+HCE WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D + PVID ID T E+ ++ G F+W + + + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP SPT AGGLF++D+ FF LG YD G +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 310 NPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCS 369
Query: 182 RIVSLIR 188
+ + R
Sbjct: 370 HVGHIFR 376
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 98/240 (40%), Positives = 133/240 (55%), Gaps = 7/240 (2%)
Query: 201 EPPLEPYKEGPGEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRM 256
+P L P GE GK LP A DA + N S+ IS R++PD R
Sbjct: 72 KPVLLPPASNAGEMGKPVVLPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRD 131
Query: 257 EECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
E CK + LP+ SVI+ FHNE +S L+RTVHS++ R+P+ ++E+ILVDDFS
Sbjct: 132 EWCKAPGRFQEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDH 191
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L Q+L DY KV++IR +REGLIR R GA + GEV+ +LD+HCE WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
L I D + PVID ID T E+ ++ G F+W + + + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309
>gi|195115611|ref|XP_002002350.1| GI13183 [Drosophila mojavensis]
gi|193912925|gb|EDW11792.1| GI13183 [Drosophila mojavensis]
Length = 655
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/189 (47%), Positives = 118/189 (62%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L K L+DY+ ++ V ++ N ER GLI R GAK + GEV+VF D+H EV NWLPP
Sbjct: 236 LKKHLDDYVALQWPKLVDVVHNPERRGLIGARLAGAKVATGEVMVFFDSHIEVNYNWLPP 295
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI + KI T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 296 LLEPIVINNKIATCPIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLPVLPEDSVDK 353
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ +F +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 354 ----SLPYRSPVMMGGLFAINSKWFWDLGGYDDELEIWGGEQYELSFKIWMCGGMLLDVP 409
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 410 CSRVAHIFR 418
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 131/226 (57%), Gaps = 4/226 (1%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ + + + G N S+ IS +R++PD+R E CK Y LP
Sbjct: 130 GLGEHGQPASVDPSEKELEQQEYRRNGFNGYLSDRISVNRSVPDVRKEACKTRKYLAKLP 189
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 328
SVI +F+NE F +L+R+++SI+ RTP + L++I+LVDD S L + L+DY+ ++
Sbjct: 190 NVSVIFIFYNEHFQTLLRSIYSIVNRTPPELLKQIVLVDDGSEWDTLKKHLDDYVALQWP 249
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V ++ N ER GLI R GAK + GEV+VF D+H EV NWLPPLL PI + KI T
Sbjct: 250 KLVDVVHNPERRGLIGARLAGAKVATGEVMVFFDSHIEVNYNWLPPLLEPIVINNKIATC 309
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 433
P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 310 PIVDIIDHNNFAYNGGYQEGS--RGGFDWRFFYKQLPVLPEDSVDK 353
>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Anolis carolinensis]
Length = 592
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 86/183 (46%), Positives = 120/183 (65%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+Y+ KVRLIR +REGL+R R GA ++G+V+ FLD HCE WL PL
Sbjct: 192 LKEKLENYVANLR-KVRLIRANKREGLVRARLLGASIAKGDVLTFLDCHCECHEEWLEPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + + PVID ID+ T+E+ + EP G F+W +++ + +PERE K+R
Sbjct: 251 LERIKEEPSAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHVVPEREQKQR 307
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++ +SPT AGGLFA+++ +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 308 RSKTDVIRSPTMAGGLFAVNKNYFSYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPC 367
Query: 181 SRI 183
S +
Sbjct: 368 SHV 370
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 106/255 (41%), Positives = 155/255 (60%), Gaps = 19/255 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSN 243
L RPV++ +PPL E GE G+A L E+ + S+ + +N+ S+
Sbjct: 68 LRRPVYE--------KPPLGRETE-LGELGRAARLELSESELRRQEESVALHQINVYLSD 118
Query: 244 HISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
IS R +P+ R +C K +DY +LPK SVI+ F+NE +S+L+RTVHS+++ +P L
Sbjct: 119 RISLHRRLPERRHPQCTEKRYDY-YNLPKTSVIIAFYNEAWSTLLRTVHSVLETSPDILL 177
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
EEIILVDD+S K L +KLE+Y+ KVRLIR +REGL+R R GA ++G+V+ FL
Sbjct: 178 EEIILVDDYSDKEHLKEKLENYVANLR-KVRLIRANKREGLVRARLLGASIAKGDVLTFL 236
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGML 420
D HCE WL PLL I + + PVID ID+ T+E+ + EP G F+W ++
Sbjct: 237 DCHCECHEEWLEPLLERIKEEPSAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLV 293
Query: 421 YKENELPEREAKKRK 435
+ + +PERE K+R+
Sbjct: 294 FTWHVVPEREQKQRR 308
>gi|391347961|ref|XP_003748222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 658
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/190 (47%), Positives = 117/190 (61%), Gaps = 3/190 (1%)
Query: 1 DLDKKLEDYIER-FNGKV-RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
DL +LE YIE F V RLIR ER GLIR R GA+ +RG+V++FLD+HCE WL
Sbjct: 248 DLKDRLERYIEHHFRADVVRLIRLPERSGLIRARLVGARAARGDVLIFLDSHCETTPGWL 307
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PLL PI DR+ + PVID IDY+T ++ + E D G F W + + +P +
Sbjct: 308 EPLLEPIRRDRRAVVCPVIDVIDYRTLQYVAA-EGDRFQIGGFNWRGEFTWHNIPSAWRR 366
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
R +EP +SPT AGGLFA++R +F E G YD + WGGEN E+SF+IW CGG I
Sbjct: 367 NRVSVAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHIVIA 426
Query: 179 PCSRIVSLIR 188
PCS + + R
Sbjct: 427 PCSHVGHIFR 436
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 92/219 (42%), Positives = 131/219 (59%), Gaps = 4/219 (1%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DLPK 270
G+ G A L + D + N+ S+ + +R++PD R C+ YP+ ++P
Sbjct: 144 GKDGHAVILGRDEQLEADREFSKAAFNVYVSDRLPLNRSLPDTRHRHCRAITYPVAEMPT 203
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR-FNG 329
ASV+++F +E FS+L+RT+ S+I R+P L EIILVDDFS DL +LE YI+ F
Sbjct: 204 ASVVIIFTDEIFSTLLRTIVSVIDRSPRHLLREIILVDDFSQSEDLKDRLERYIEHHFRA 263
Query: 330 KV-RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
V RLIR ER GLIR R GA+ +RG+V++FLD+HCE WL PLL PI DR+ +
Sbjct: 264 DVVRLIRLPERSGLIRARLVGARAARGDVLIFLDSHCETTPGWLEPLLEPIRRDRRAVVC 323
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVID IDY+T ++ + E D G F W + + +P
Sbjct: 324 PVIDVIDYRTLQYVAA-EGDRFQIGGFNWRGEFTWHNIP 361
>gi|345483668|ref|XP_001601037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 587
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/224 (42%), Positives = 132/224 (58%), Gaps = 2/224 (0%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
GE G+ +L + G+ L + +N+ SN I R +PD+R CK Y LP A
Sbjct: 75 GEYGRPAYLSGEEKIKGNEVLKKKAVNIILSNKIPLQRKLPDVRDPLCKNVTYDSVLPSA 134
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 330
S+I++FHNE FS L+RTV+S+IK TP + L+EIILVDD S +L LE YIQ R K
Sbjct: 135 SIIIIFHNEAFSVLLRTVYSVIKETPPKLLKEIILVDD-KSNEELLGLLEYYIQTRLPKK 193
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V+L+R ER+GL+R R +GAK + G+V++FLDAHCEV WL PLL I + + P+
Sbjct: 194 VKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLLQRIKEKKNAVVTPI 253
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
ID I +T+E+ EP G F W + + E + K +
Sbjct: 254 IDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSK 297
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/184 (44%), Positives = 114/184 (61%), Gaps = 1/184 (0%)
Query: 6 LEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
LE YI+ R KV+L+R ER+GL+R R +GAK + G+V++FLDAHCEV WL PLL
Sbjct: 182 LEYYIQTRLPKKVKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLLQR 241
Query: 65 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 124
I + + P+ID I +T+E+ EP G F W + + E + K +
Sbjct: 242 IKEKKNAVVTPIIDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSKTSAI 301
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P KSPT AGGLFA++R +F ++G YD + WGGEN E+SF+IW CGG +E +PCSR+
Sbjct: 302 SPVKSPTMAGGLFAINRKYFWDIGSYDDKMEGWGGENLEMSFRIWQCGGVLETIPCSRVG 361
Query: 185 SLIR 188
+ R
Sbjct: 362 HVFR 365
>gi|334329895|ref|XP_003341281.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Monodelphis domestica]
Length = 929
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 532 LKDQLDKYMSQF-PKVRVLHLKERHGLIRARLAGAQIATGDVLTFLDSHVECNVGWLEPL 590
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L +Y +RK + PVI+ I+ + + +V D+ RGIF W M + ++P K+ K
Sbjct: 591 LERVYLNRKKVACPVIEIINDKDLSYMTV---DNFQRGIFVWPMNFSWKKIPPDVIKQNK 647
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 648 IKETDIIRCPVMAGGLFSIDKKYFFELGTYDPGLEVWGGENMELSFKVWMCGGEIEIIPC 707
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 708 SRVGHIFR 715
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 70/241 (29%), Positives = 111/241 (46%), Gaps = 21/241 (8%)
Query: 197 LGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
+ N++ L P + PG+ G +P E N+ S+ I DR I D R
Sbjct: 426 VQNIDVTLSPRDPKAPGQFGNPVVVPLGKEKEVKRRWKEGNFNVYLSDLIPLDRAIDDTR 485
Query: 256 MEECKYWD-YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
+ +P+ L K + + +K TP E + + F
Sbjct: 486 PTGPNIRNSHPMVLWKGQSTALLPS-------------VKVTPQDIGSEEVSL--FVRPY 530
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL P
Sbjct: 531 YLKDQLDKYMSQF-PKVRVLHLKERHGLIRARLAGAQIATGDVLTFLDSHVECNVGWLEP 589
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LL +Y +RK + PVI+ I+ + + +V D+ RGIF W M + ++P K+
Sbjct: 590 LLERVYLNRKKVACPVIEIINDKDLSYMTV---DNFQRGIFVWPMNFSWKKIPPDVIKQN 646
Query: 435 K 435
K
Sbjct: 647 K 647
>gi|449271781|gb|EMC82021.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Columba
livia]
Length = 314
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 118/183 (64%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+ KVRLIR +REGL+R R GA ++G+++ FLD HCE WL PL
Sbjct: 59 LKETLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPL 117
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I + + PVID ID+ T+E+ + EP G F+W +++ + PERE K+R
Sbjct: 118 LARIAEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHSTPEREQKRR 174
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++ +SPT AGGLF++ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 175 KSKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPC 234
Query: 181 SRI 183
S +
Sbjct: 235 SHV 237
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 82/177 (46%), Positives = 116/177 (65%), Gaps = 6/177 (3%)
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K +DY LPK SV++ F+NE +S+L+RTVHS+++ +P LEEIILVDD+S K L +
Sbjct: 4 KKYDY-YSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLKET 62
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+Y+ KVRLIR +REGL+R R GA ++G+++ FLD HCE WL PLLA I
Sbjct: 63 LENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLLARI 121
Query: 380 YSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ + PVID ID+ T+E+ + EP G F+W +++ + PERE K+RK
Sbjct: 122 AEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHSTPEREQKRRK 175
>gi|68392893|ref|XP_688194.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Danio
rerio]
Length = 578
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 116/182 (63%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+YI KVRLIR +REGL+R R GA + GEV+ FLD HCE WL PL
Sbjct: 178 LKEPLENYIADLK-KVRLIRARKREGLVRARLLGASIATGEVLTFLDCHCECHEGWLEPL 236
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + PVID ID+ T+++ + P G F+W +++ + +PE E K+R
Sbjct: 237 LQRIKEEPSAVVCPVIDVIDWNTFQY--LGNPGEPQIGGFDWRLVFTWHSIPEHEQKRRS 294
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++ +SPT AGGLFA+++ +FL LG YD G+ VWGGEN E SF+IW CGGS+E PCS
Sbjct: 295 AATDVVRSPTMAGGLFAVNKKYFLYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 354
Query: 182 RI 183
+
Sbjct: 355 HV 356
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 87/226 (38%), Positives = 132/226 (58%), Gaps = 6/226 (2%)
Query: 212 GEGGKAYHLPEA--YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYP-LDL 268
GE G+A L + + + S+ ++ +N S+ IS R + + C+ Y L L
Sbjct: 71 GEMGRAVKLDLSGEEKREEEESIKKHQINTYVSDRISLHRRLAERWNPLCRNLKYDYLSL 130
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SV++ F+NE +S+L+RTVHS+++ +P L E+ILVDD+S + L + LE+YI
Sbjct: 131 PTTSVVIAFYNEAWSTLLRTVHSVLETSPDLLLNEVILVDDYSDREHLKEPLENYIADLK 190
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVRLIR +REGL+R R GA + GEV+ FLD HCE WL PLL I + +
Sbjct: 191 -KVRLIRARKREGLVRARLLGASIATGEVLTFLDCHCECHEGWLEPLLQRIKEEPSAVVC 249
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
PVID ID+ T+++ + P G F+W +++ + +PE E K+R
Sbjct: 250 PVIDVIDWNTFQY--LGNPGEPQIGGFDWRLVFTWHSIPEHEQKRR 293
>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
Length = 621
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/187 (46%), Positives = 115/187 (61%), Gaps = 2/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L DY KV++IR +REGLIR R GA + GEV+ +LD+HCE WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D + PVID ID T E+ ++ G F+W + + + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP SPT AGGLF++D+ FF LG YD G +WGGEN ELSFK WMCGG++E VPCS
Sbjct: 310 NPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVPCS 369
Query: 182 RIVSLIR 188
+ + R
Sbjct: 370 HVGHIFR 376
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 98/240 (40%), Positives = 133/240 (55%), Gaps = 7/240 (2%)
Query: 201 EPPLEPYKEGPGEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRM 256
+P L P GE GK LP A DA + N S+ IS R++PD R
Sbjct: 72 KPVLLPPASNAGEMGKPVVLPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRD 131
Query: 257 EECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
E CK + LP+ SVI+ FHNE +S L+RTVHS++ R+P+ ++E+ILVDDFS
Sbjct: 132 EWCKAPGRFQEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDH 191
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L Q+L DY KV++IR +REGLIR R GA + GEV+ +LD+HCE WL PL
Sbjct: 192 LKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPL 250
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
L I D + PVID ID T E+ ++ G F+W + + + +PE E K+ K
Sbjct: 251 LDRIARDPTTVVCPVIDVIDDTTLEYH-FHDSGGVNVGGFDWNLQFNWHAVPEHEKKRHK 309
>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
Length = 658
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/194 (44%), Positives = 116/194 (59%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LEDY + KV++IR +REGLIR R GA+ ++ V+ +LD+HCE WL PL
Sbjct: 264 LKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLDSHCECTEGWLEPL 322
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I T E+ HYR G F+W + + + +PE
Sbjct: 323 LDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGFDWNLQFSWHAVPE 374
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR +F LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 375 REKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSFKTWMCGGT 434
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 435 LEIVPCSHVGHIFR 448
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 96/259 (37%), Positives = 137/259 (52%), Gaps = 23/259 (8%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETS 242
++P + D K ++PP + E GE GK LP + + A + N S
Sbjct: 132 LKPKLQDDTK-KVIDPPGN-FDENLGEMGKPVTLPKEMTDEMKKAVETGWTNNAFNQYVS 189
Query: 243 NHISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
+ IS RT+PD R CK Y +LP VI+ FHNE ++ L+RTVHS++ R+P +
Sbjct: 190 DLISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLI 249
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
IILVDD+S L +LEDY + KV++IR +REGLIR R GA+ ++ V+ +L
Sbjct: 250 GRIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYL 308
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GI 414
D+HCE WL PLL I + + PVID I T E+ HYR G
Sbjct: 309 DSHCECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGG 360
Query: 415 FEWGMLYKENELPEREAKK 433
F+W + + + +PERE K+
Sbjct: 361 FDWNLQFSWHAVPEREKKR 379
>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
Length = 658
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/194 (44%), Positives = 116/194 (59%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LEDY + KV++IR +REGLIR R GA+ ++ V+ +LD+HCE WL PL
Sbjct: 264 LKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLDSHCECTEGWLEPL 322
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID I T E+ HYR G F+W + + + +PE
Sbjct: 323 LDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGGFDWNLQFSWHAVPE 374
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ +EP SPT AGGLF++DR +F LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 375 REKKRHNSTAEPVYSPTMAGGLFSIDREYFNRLGTYDSGFDIWGGENLELSFKTWMCGGT 434
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 435 LEIVPCSHVGHIFR 448
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 96/259 (37%), Positives = 137/259 (52%), Gaps = 23/259 (8%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETS 242
++P + D K ++PP + E GE GK LP + + A + N S
Sbjct: 132 LKPKLQDDTK-KVIDPPGN-FDENLGEMGKPVTLPKEMTDEMKKAVETGWTNNAFNQYVS 189
Query: 243 NHISFDRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
+ IS RT+PD R CK Y +LP VI+ FHNE ++ L+RTVHS++ R+P +
Sbjct: 190 DLISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLI 249
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
IILVDD+S L +LEDY + KV++IR +REGLIR R GA+ ++ V+ +L
Sbjct: 250 GRIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYL 308
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GI 414
D+HCE WL PLL I + + PVID I T E+ HYR G
Sbjct: 309 DSHCECTEGWLEPLLDRIARNSTTVVCPVIDVISDDTLEY--------HYRDSSGVNVGG 360
Query: 415 FEWGMLYKENELPEREAKK 433
F+W + + + +PERE K+
Sbjct: 361 FDWNLQFSWHAVPEREKKR 379
>gi|313241234|emb|CBY33515.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LED + +VRLIR +REGL+R R GA+ + GEV+ FLD H E WL PL
Sbjct: 200 LGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIECNNGWLEPL 258
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D ++ VP+I I +Q + F G F+W + ++ + +P+ KRK
Sbjct: 259 LQRIAEDDSVVAVPIISTIAWQDFAFHHSSNSIEPQIGGFDWRLTFQWHSIPDEIKAKRK 318
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+++P +PT AGGLFA+ R +F +G YD G+ VWGGEN E+SF++WMCGGS+E +PCS
Sbjct: 319 ADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRVWMCGGSLEIIPCS 378
Query: 182 RIVSLIRP 189
IV + P
Sbjct: 379 -IVGHVFP 385
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 137/248 (55%), Gaps = 16/248 (6%)
Query: 202 PPLEPYKEG----PGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLR 255
PP+ P G G+GGK+ L E + + + + + +N S IS RT+ + R
Sbjct: 73 PPVLPRPLGDAITEGQGGKSVKLTEEQKKSDEYKKIVDRFMVNHLASERISLHRTVGEHR 132
Query: 256 MEEC-----KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
++C K + Y LP SVI+ F+NEG+++L+RT++SI+ +P L+EIIL+DD
Sbjct: 133 HKQCVALANKGYRYD-QLPTTSVIVTFYNEGWTTLLRTIYSILHTSPEVLLKEIILIDDD 191
Query: 311 SSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
S K + L ++LED + +VRLIR +REGL+R R GA+ + GEV+ FLD H E
Sbjct: 192 SDKVEFPRLGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIEC 250
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
WL PLL I D ++ VP+I I +Q + F G F+W + ++ + +P
Sbjct: 251 NNGWLEPLLQRIAEDDSVVAVPIISTIAWQDFAFHHSSNSIEPQIGGFDWRLTFQWHSIP 310
Query: 428 EREAKKRK 435
+ KRK
Sbjct: 311 DEIKAKRK 318
>gi|341896063|gb|EGT51998.1| CBN-GLY-6 protein [Caenorhabditis brenneri]
Length = 617
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 90/237 (37%), Positives = 142/237 (59%), Gaps = 7/237 (2%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYSPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
C+ +P +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNITFPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLK 205
Query: 318 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
KL++ ++ ++++R+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPKLDESLKPLPTDIKIVRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKE 319
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 122/185 (65%), Gaps = 4/185 (2%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
KL++ ++ ++++R+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 208 KLDESLKPLPTDIKIVRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
Query: 65 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN- 123
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+ +
Sbjct: 268 IKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPSSMAKEHLLDP 324
Query: 124 SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRI 183
+ P +SPT AGGLF++DR +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS +
Sbjct: 325 TGPIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHV 384
Query: 184 VSLIR 188
+ R
Sbjct: 385 GHVFR 389
>gi|149031395|gb|EDL86385.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Rattus
norvegicus]
Length = 300
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 121/183 (66%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ ++E F GK++L+RN +REGLIR+R GA + G+++VFLD+HCEV WL P
Sbjct: 23 DLKAKLDYHLEIFRGKIKLVRNKKREGLIRSRMIGASRASGDILVFLDSHCEVNRVWLEP 82
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ PVID ID T ++ V P RG F+W + ++ +++ E
Sbjct: 83 LLHAIAKDHKMVVCPVIDVIDELTLDY--VGSP--IVRGAFDWNLNFRWDDVFSYELDGP 138
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP +GG+FA++R +F ELG YD + +WGGEN ELS +IWMCGG + +PC
Sbjct: 139 EGPSTPIRSPAMSGGIFAINRHYFNELGQYDKDMDLWGGENVELSLRIWMCGGQLFILPC 198
Query: 181 SRI 183
SR+
Sbjct: 199 SRV 201
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 60/131 (45%), Positives = 86/131 (65%), Gaps = 4/131 (3%)
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 355
+P LEEIILVDD S DL KL+ +++ F GK++L+RN +REGLIR+R GA + G
Sbjct: 4 SPKHLLEEIILVDDMSEFDDLKAKLDYHLEIFRGKIKLVRNKKREGLIRSRMIGASRASG 63
Query: 356 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 415
+++VFLD+HCEV WL PLL I D K++ PVID ID T ++ V P RG F
Sbjct: 64 DILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPVIDVIDELTLDY--VGSP--IVRGAF 119
Query: 416 EWGMLYKENEL 426
+W + ++ +++
Sbjct: 120 DWNLNFRWDDV 130
>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 582
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 115/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+ + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKTQLETYVSNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D ++ PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 LERISEDETVIICPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 301
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 302 KSRIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 361
Query: 181 SRI 183
S +
Sbjct: 362 SHV 364
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 96/228 (42%), Positives = 135/228 (59%), Gaps = 8/228 (3%)
Query: 212 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL-DL 268
GE GKA L EA + + Y +N+ S+ IS R I D RM ECK + L
Sbjct: 79 GEWGKASRLQLNEAELKQQEELIERYAINIYLSDKISLHRHIEDKRMYECKSKTFNYRQL 138
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +LE Y+ +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYVSNLD 198
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
+VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I D ++
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISEDETVIIC 257
Query: 389 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRK 302
>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Ailuropoda melanoleuca]
Length = 578
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 KSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 101/260 (38%), Positives = 144/260 (55%), Gaps = 16/260 (6%)
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMN 238
P + L RP++K +PP + + G L E + + Y +N
Sbjct: 52 PWEKAEDLSRPLYK--------KPPADSHALGEWGKASKLQLSEGELKQQEELIERYAIN 103
Query: 239 METSNHISFDRTIPDLRMEECKY--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRT 296
+ S+ IS R I D RM ECK +DY LP SVI+ F+NE +S+L+RT+HS+++ +
Sbjct: 104 IYLSDRISLHRHIEDKRMYECKSRKFDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETS 162
Query: 297 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 356
PA L+EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+
Sbjct: 163 PAVLLKEIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGD 221
Query: 357 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIF 415
V+ FLD HCE WL PLL I D + PVID ID+ T+EF EP G F
Sbjct: 222 VLTFLDCHCECNSGWLEPLLERISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGF 278
Query: 416 EWGMLYKENELPEREAKKRK 435
+W + ++ + +P+ E +RK
Sbjct: 279 DWRLTFQWHSVPKHERDRRK 298
>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Pan paniscus]
gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pan paniscus]
Length = 622
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
gorilla gorilla]
Length = 622
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFQRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Megachile rotundata]
Length = 605
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 132/231 (57%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
K PGE G A H+ A N+ S+ IS +R++ D+R+E CK YP
Sbjct: 100 KGKPGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLRDIRLEGCKTKKYPKY 159
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 160 LPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTL 219
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLLA I +R +
Sbjct: 220 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAENRSTVV 279
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 280 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDR 327
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 116/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY++ + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 208 LKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 267
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 268 LARIAENRSTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 324
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 325 GDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 384
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 385 SHVGHVFR 392
>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
Length = 578
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 102/253 (40%), Positives = 144/253 (56%), Gaps = 18/253 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSN 243
L RP++K +PP + + PGE GKA L E + + Y +N+ S+
Sbjct: 59 LSRPLYK--------KPPADSH--APGEWGKASKLQLNEGELKQQEELIERYAINIYLSD 108
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+
Sbjct: 109 RISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLK 168
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD
Sbjct: 169 EIILVDDLSDRVYLKTQLESYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLD 227
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLY 421
HCE WL PLL I D + PVID ID+ T+EF EP G F+W + +
Sbjct: 228 CHCECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTF 284
Query: 422 KENELPEREAKKR 434
+ + +P+ E +R
Sbjct: 285 QWHSVPKHERDRR 297
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 113/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLESYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
Length = 622
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Anolis carolinensis]
Length = 583
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 102/254 (40%), Positives = 150/254 (59%), Gaps = 18/254 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEA--YRAAGDASLGEYGMNMETSN 243
L RPV++ +PP +P+ G GE GKA L + + + + Y +N+ S+
Sbjct: 64 LARPVYQ--------KPPPDPH--GLGEWGKAARLTLSPEEKKLEEELVERYAINIYLSD 113
Query: 244 HISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
IS R I D RM EC K +DY LP SVI+ F+NE +S+L+RT+HS+++ +P+ L
Sbjct: 114 KISLHRHIDDGRMPECRSKTYDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLESSPSVLL 172
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
+EIILVDD S K L +LE YI +VRLIR +REGL+R R GA + G+V+ FL
Sbjct: 173 KEIILVDDLSDKVYLKGELEKYISNLQ-RVRLIRTNKREGLVRARLIGATFATGDVLTFL 231
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 421
D HCE WL PLL + + ++ PVID ID+ T+EF +P G F+W + +
Sbjct: 232 DCHCECVPGWLEPLLQRVAENESVIICPVIDTIDWNTFEF--YMQPGEPMIGGFDWRLTF 289
Query: 422 KENELPEREAKKRK 435
+ + +P+ E ++RK
Sbjct: 290 QWHSVPDYERQRRK 303
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 115/182 (63%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 187 LKGELEKYISNLQ-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVPGWLEPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + + ++ PVID ID+ T+EF +P G F+W + ++ + +P+ E ++RK
Sbjct: 246 LQRVAENESVIICPVIDTIDWNTFEF--YMQPGEPMIGGFDWRLTFQWHSVPDYERQRRK 303
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 304 SKVDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMDVWGGENLELSFRVWQCGGILEIHPCS 363
Query: 182 RI 183
+
Sbjct: 364 HV 365
>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
Length = 622
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
Length = 622
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Nomascus leucogenys]
Length = 622
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 101/247 (40%), Positives = 140/247 (56%), Gaps = 14/247 (5%)
Query: 202 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 256
PP +P PG GKA+ P R + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETREK-EEGYKKHCFNAFASDRISLQRSLGPDTRP 162
Query: 257 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 163 PECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTE 221
Query: 314 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 373
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 222 EHLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280
Query: 374 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
PLLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E +
Sbjct: 281 PLLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ 340
Query: 433 KRKYNRY 439
+RK Y
Sbjct: 341 RRKDETY 347
>gi|71987795|ref|NP_001022646.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
gi|3047201|gb|AAC13676.1| GLY6c [Caenorhabditis elegans]
gi|14530525|emb|CAC42318.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
Length = 562
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/237 (38%), Positives = 140/237 (59%), Gaps = 7/237 (2%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
C+ YP +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205
Query: 318 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQ 319
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 79/184 (42%), Positives = 120/184 (65%), Gaps = 4/184 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL I
Sbjct: 209 LDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTRI 268
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 124
+RK + PVID I+ T++++ E +RG F W + ++ +P AK+ + +
Sbjct: 269 KLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLLDPT 325
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS +
Sbjct: 326 GPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVG 385
Query: 185 SLIR 188
+ R
Sbjct: 386 HVFR 389
>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pongo abelii]
gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
3 [Pongo abelii]
Length = 622
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
Length = 622
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Acyrthosiphon pisum]
Length = 674
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 87/194 (44%), Positives = 118/194 (60%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE+Y E + K++++R +REGLIR R GA+ + V+ +LD+HCE WL PL
Sbjct: 240 LKTQLEEYSENY-PKIKIVRAKKREGLIRARLMGARYASAPVLTYLDSHCECTEGWLEPL 298
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I + + PVID ID T EF HYR G F+W + + + +P+
Sbjct: 299 LDRIAREASTVVCPVIDVIDDSTLEF--------HYRDAGGVNVGGFDWNLQFNWHVVPD 350
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
+E K+ K +EP SPT AGGLFA+D+ FF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 351 KEKKRHKNAAEPVWSPTMAGGLFAIDKKFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 410
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 411 LEIVPCSHVGHIFR 424
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 96/236 (40%), Positives = 129/236 (54%), Gaps = 21/236 (8%)
Query: 212 GEGGKAYHLPEAYRA----AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
GE GK LP A D N S+ IS RT+PD R E CK LD
Sbjct: 131 GEMGKPVVLPANLTADVKKLVDEGWKNNAFNQYASDLISLHRTLPDPRDEWCKKPGRYLD 190
Query: 268 -LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LP+ SVI+ FHNE +S L+RTVHSI+ R+P + EIILVDDFS L +LE+Y +
Sbjct: 191 NLPQTSVIVCFHNEAWSVLLRTVHSILDRSPEHLIREIILVDDFSDMPHLKTQLEEYSEN 250
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
+ K++++R +REGLIR R GA+ + V+ +LD+HCE WL PLL I + +
Sbjct: 251 Y-PKIKIVRAKKREGLIRARLMGARYASAPVLTYLDSHCECTEGWLEPLLDRIAREASTV 309
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRK 435
PVID ID T EF HYR G F+W + + + +P++E K+ K
Sbjct: 310 VCPVIDVIDDSTLEF--------HYRDAGGVNVGGFDWNLQFNWHVVPDKEKKRHK 357
>gi|313231736|emb|CBY08849.1| unnamed protein product [Oikopleura dioica]
Length = 603
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LED + +VRLIR +REGL+R R GA+ + GEV+ FLD H E WL PL
Sbjct: 200 LGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIECNDGWLEPL 258
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D ++ VP+I I +Q + F G F+W + ++ + +P+ KRK
Sbjct: 259 LQRIAEDDSVVAVPIISTIAWQDFGFHHSSNSIEPQIGGFDWQLTFQWHSIPDEIKAKRK 318
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+++P +PT AGGLFA+ R +F +G YD G+ VWGGEN E+SF++WMCGGS+E +PCS
Sbjct: 319 ADTDPVPTPTMAGGLFAVSRQYFRSIGSYDTGMEVWGGENLEMSFRVWMCGGSLEIIPCS 378
Query: 182 RIVSLIRP 189
IV + P
Sbjct: 379 -IVGHVFP 385
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 89/248 (35%), Positives = 137/248 (55%), Gaps = 16/248 (6%)
Query: 202 PPLEPYKEG----PGEGGKAYHLPEAYRAAGDAS--LGEYGMNMETSNHISFDRTIPDLR 255
PP+ P G G+GGK+ L E + + + + + +N S IS RT+ + R
Sbjct: 73 PPVLPRPLGDAITEGQGGKSVKLTEEQKKSDEYKKIVDRFMVNHLASERISLHRTVGEHR 132
Query: 256 MEEC-----KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
++C K + Y LP SVI+ F+NEG+++L+RT++SI+ +P L+EIIL+DD
Sbjct: 133 HKQCVALANKGYRYD-QLPTTSVIVTFYNEGWTTLLRTIYSILHTSPEVLLKEIILIDDD 191
Query: 311 SSKAD---LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEV 367
S K + L ++LED + +VRLIR +REGL+R R GA+ + GEV+ FLD H E
Sbjct: 192 SDKVEFPRLGKELEDIVATM-PRVRLIRTKQREGLVRARLLGAELASGEVLTFLDCHIEC 250
Query: 368 GLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
WL PLL I D ++ VP+I I +Q + F G F+W + ++ + +P
Sbjct: 251 NDGWLEPLLQRIAEDDSVVAVPIISTIAWQDFGFHHSSNSIEPQIGGFDWQLTFQWHSIP 310
Query: 428 EREAKKRK 435
+ KRK
Sbjct: 311 DEIKAKRK 318
>gi|355689595|gb|AER98885.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mustela putorius
furo]
Length = 452
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 56 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 114
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 115 LERISYDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 171
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN E SF+IW CGG++E PC
Sbjct: 172 KSRIDPIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPC 231
Query: 181 SRI 183
S +
Sbjct: 232 SHV 234
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 77/169 (45%), Positives = 107/169 (63%), Gaps = 5/169 (2%)
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP SVI+ F+NE +S+L+RTVHS+++ +PA L+EIILVDD S + L +LE YI
Sbjct: 8 LPTTSVIIAFYNEAWSTLLRTVHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNL 67
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I D +
Sbjct: 68 D-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISYDETAVV 126
Query: 388 VPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 127 CPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRK 172
>gi|148671133|gb|EDL03080.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5, isoform CRA_a
[Mus musculus]
Length = 490
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 137 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 196
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 197 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 256
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 257 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 312
Query: 412 RGIFEWGM 419
RG F+W +
Sbjct: 313 RGAFDWNL 320
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 116/183 (63%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ Y+E F GKV+LIRN +REGLIR++ GA + G+++VFLD+HCEV WL P
Sbjct: 220 DLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEP 279
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID I+ T + Y RG F+W + + + + E
Sbjct: 280 LLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIVRGAFDWNLNLRWDNVFAYELDGP 335
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP GG+FA++R +F ELG YD G+ + GGEN ELS +IWMCGG + +PC
Sbjct: 336 EGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPC 395
Query: 181 SRI 183
SR+
Sbjct: 396 SRV 398
>gi|332839183|ref|XP_001147578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
5 [Pan troglodytes]
Length = 638
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|71987784|ref|NP_001022644.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
gi|51315809|sp|O61394.1|GALT6_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 6;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 6; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|3047197|gb|AAC13674.1| GLY6a [Caenorhabditis elegans]
gi|3878104|emb|CAA19707.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
Length = 618
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 91/237 (38%), Positives = 140/237 (59%), Gaps = 7/237 (2%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
C+ YP +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205
Query: 318 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQ 319
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/184 (42%), Positives = 120/184 (65%), Gaps = 4/184 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL I
Sbjct: 209 LDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTRI 268
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 124
+RK + PVID I+ T++++ E +RG F W + ++ +P AK+ + +
Sbjct: 269 KLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLLDPT 325
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS +
Sbjct: 326 GPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVG 385
Query: 185 SLIR 188
+ R
Sbjct: 386 HVFR 389
>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
Length = 491
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 121 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 179
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 180 LERISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 236
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 237 KSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 296
Query: 181 SRI 183
S +
Sbjct: 297 SHV 299
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 96/238 (40%), Positives = 136/238 (57%), Gaps = 8/238 (3%)
Query: 201 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 260
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 5 KPPADSHALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 64
Query: 261 Y--WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
+DY LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 65 SRKFDY-RRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKT 123
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 124 QLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 182
Query: 379 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 183 ISKDETTVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRK 237
>gi|348522865|ref|XP_003448944.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Oreochromis niloticus]
Length = 590
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 81/187 (43%), Positives = 119/187 (63%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +Y+ +G VRLIR+T+R G+ R+ GA + GEV+VF+D+HCE WL PL
Sbjct: 232 LKTGLSEYVSHLDG-VRLIRSTKRLGVGGCRTLGAARAVGEVVVFMDSHCECQKGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I DR + P++D ID+QT+ + + P RG+F+W + + +PE + K+ +
Sbjct: 291 LERIALDRTRVVSPIMDVIDWQTFRYNATQWP---VRGVFDWRLDFFWESIPELQDKEPE 347
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SP GG+ A+DR FF +G YDPG+++WG E ELS ++W CGGS+E VPCS
Sbjct: 348 MAVQPLQSPALGGGVVAIDRHFFQSVGTYDPGMVLWGAEQIELSIRVWSCGGSMEVVPCS 407
Query: 182 RIVSLIR 188
R+ LIR
Sbjct: 408 RVGHLIR 414
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 134/224 (59%), Gaps = 6/224 (2%)
Query: 212 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
GE GKA HL R +L +YG N S IS R +P+ R +C ++ LP
Sbjct: 126 GEMGKAVRLHLEGLERDMELRALQQYGFNEVVSERISLHRRLPEARHPKCLGVEHIESLP 185
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNG 329
ASV++ F++E +S+L+RTVHS++ P QYL+E++LVDD S + L L +Y+ +G
Sbjct: 186 SASVVICFNDEAWSTLLRTVHSVLDTAPKQYLQEVLLVDDLSQQGHLKTGLSEYVSHLDG 245
Query: 330 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 389
VRLIR+T+R G+ R+ GA + GEV+VF+D+HCE WL PLL I DR + P
Sbjct: 246 -VRLIRSTKRLGVGGCRTLGAARAVGEVVVFMDSHCECQKGWLEPLLERIALDRTRVVSP 304
Query: 390 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
++D ID+QT+ + + P RG+F+W + + +PE + K+
Sbjct: 305 IMDVIDWQTFRYNATQWP---VRGVFDWRLDFFWESIPELQDKE 345
>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 641
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 131/234 (55%), Gaps = 11/234 (4%)
Query: 209 EGPGEGGKAYHLPEAYRAAG----DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
PGE GK +P D N S+ IS R++PD+R CK +
Sbjct: 136 NAPGENGKGVIVPTNLTGDAKRRLDIGWQNNAFNQYASDMISLHRSLPDMRDPGCKTQKF 195
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
DLP+ SVI+ FHNE +S LMRTVHS+I R+P L+EIILVDDFS L ++LEDY
Sbjct: 196 RRDLPQTSVIICFHNEAWSVLMRTVHSVIDRSPKNLLKEIILVDDFSDMKHLKEQLEDYT 255
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
++ G V+++R ++REGLIR R GAK + V+ +LD+HCE WL PLL I
Sbjct: 256 RKL-GIVKIVRASKREGLIRARLLGAKFATAPVLTYLDSHCECSTGWLEPLLDRIAEADT 314
Query: 385 IMTVPVIDGIDYQTWEF---RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ PVID I T+E+ R+ Y + G F+W + + + LP+R+ RK
Sbjct: 315 NVVCPVIDVISDSTFEYPHRRAGYTVN---VGGFDWNLQFSWHSLPQRDKDARK 365
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 82/190 (43%), Positives = 116/190 (61%), Gaps = 7/190 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY + G V+++R ++REGLIR R GAK + V+ +LD+HCE WL PL
Sbjct: 247 LKEQLEDYTRKL-GIVKIVRASKREGLIRARLLGAKFATAPVLTYLDSHCECSTGWLEPL 305
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF---RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L I + PVID I T+E+ R+ Y + G F+W + + + LP+R+
Sbjct: 306 LDRIAEADTNVVCPVIDVISDSTFEYPHRRAGYTVN---VGGFDWNLQFSWHSLPQRDKD 362
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
RK + SPT AGGLF++ +A+F +LG YD G +WG EN ELSFK+WMCGG +E V
Sbjct: 363 ARKQSWSAVPSPTMAGGLFSISKAYFEKLGLYDSGFDIWGAENLELSFKVWMCGGRLEIV 422
Query: 179 PCSRIVSLIR 188
PCS + + R
Sbjct: 423 PCSHVGHVFR 432
>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
anubis]
Length = 578
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 102/253 (40%), Positives = 144/253 (56%), Gaps = 18/253 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSN 243
L RP++K +PP + + PGE GKA L E + + Y +N+ S+
Sbjct: 59 LSRPLYK--------KPPADSH--APGEWGKASKLQLNEGELKQQEELIERYAINIYLSD 108
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+
Sbjct: 109 RISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLK 168
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD
Sbjct: 169 EIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLD 227
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLY 421
HCE WL PLL I D + PVID ID+ T+EF EP G F+W + +
Sbjct: 228 CHCECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTF 284
Query: 422 KENELPEREAKKR 434
+ + +P+ E +R
Sbjct: 285 QWHSVPKHERDRR 297
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 113/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
>gi|71682529|gb|AAI00448.1| Galntl5 protein, partial [Mus musculus]
Length = 447
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 94 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 153
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 154 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 213
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T ++ +
Sbjct: 214 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLDYMAA----PIV 269
Query: 412 RGIFEWGM 419
RG F+W +
Sbjct: 270 RGAFDWNL 277
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/183 (44%), Positives = 117/183 (63%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ Y+E F GKV+LIRN +REGLIR++ GA + G+++VFLD+HCEV WL P
Sbjct: 177 DLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEP 236
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID I+ T ++ + RG F+W + + + + E
Sbjct: 237 LLHAIAKDHKMVVCPIIDVINELTLDYMAA----PIVRGAFDWNLNLRWDNVFAYELDGP 292
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP GG+FA++R +F ELG YD G+ + GGEN ELS +IWMCGG + +PC
Sbjct: 293 EGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPC 352
Query: 181 SRI 183
SR+
Sbjct: 353 SRV 355
>gi|148356242|ref|NP_001038243.2| polypeptide N-acetylgalactosaminyltransferase 4 precursor [Danio
rerio]
gi|60416047|gb|AAH90692.1| WD repeat domain 51B, like [Danio rerio]
gi|182890540|gb|AAI64662.1| Wdr51bl protein [Danio rerio]
Length = 582
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 99/254 (38%), Positives = 144/254 (56%), Gaps = 16/254 (6%)
Query: 185 SLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP--EAYRAAGDASLGEYGMNMETS 242
+L+RPV++ L P PGE G+A L + +AS+ +N+ S
Sbjct: 63 NLLRPVYEK----------LPPDSNAPGEYGRATRLTLTSEEKKEEEASVERCAINIFIS 112
Query: 243 NHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
+ IS R I D RM ECK Y + LP SV++ F+NE +S+L+RT+HS+++ TPA L
Sbjct: 113 DKISLHRHIQDNRMHECKAKKYNIRRLPTTSVVIAFYNEAWSTLLRTIHSVLETTPAVLL 172
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
++IILVDDFS + L +L YI +VRLIR +REGL+R R GA + G V+ FL
Sbjct: 173 KDIILVDDFSDRGYLKSQLAQYISNLE-RVRLIRTKKREGLVRARLIGATYATGSVLTFL 231
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLY 421
D HCE W+ PLL I + + PVID ID+ T+EF + + G F+W + +
Sbjct: 232 DCHCECVPGWIEPLLERIAENETTIICPVIDTIDWNTFEF--YMQTEEPMVGGFDWRLTF 289
Query: 422 KENELPEREAKKRK 435
+ + +PE + K RK
Sbjct: 290 QWHAVPEIDRKIRK 303
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/182 (45%), Positives = 113/182 (62%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L YI +VRLIR +REGL+R R GA + G V+ FLD HCE W+ PL
Sbjct: 187 LKSQLAQYISNLE-RVRLIRTKKREGLVRARLIGATYATGSVLTFLDCHCECVPGWIEPL 245
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + PVID ID+ T+EF + + G F+W + ++ + +PE + K RK
Sbjct: 246 LERIAENETTIICPVIDTIDWNTFEF--YMQTEEPMVGGFDWRLTFQWHAVPEIDRKIRK 303
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ +A+F LG YD G+ VWGGEN ELSF++W CGGS+E PCS
Sbjct: 304 SRIDPIRSPTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCS 363
Query: 182 RI 183
+
Sbjct: 364 HV 365
>gi|189240187|ref|XP_975207.2| PREDICTED: similar to AGAP008229-PA [Tribolium castaneum]
Length = 575
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ +++R+ R GLI+ R +GA ++G V+ FLDAHCE WL L
Sbjct: 170 LKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEAL 229
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + PVID I+ T+ + +E H+ G F W + ++ L RE K RK
Sbjct: 230 LSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GAFNWNLQFRWFTLGGRELKLRK 286
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P+ +PT AGGLFA+DR +F E+G YD G+ +WGGEN E+SF+IW CGG ++ PC
Sbjct: 287 NDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQCGGKVQIAPC 346
Query: 181 SRIVSLIR 188
SR+ L R
Sbjct: 347 SRVGHLFR 354
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 84/201 (41%), Positives = 123/201 (61%), Gaps = 6/201 (2%)
Query: 237 MNMETSNHISFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N+ S+ I +R++PD R ++C + DYP PK S+I+VFHNE +S+L+RTV S+I
Sbjct: 90 FNLLASDRIPLNRSLPDFRRKKCATLFGDYP-TYPKTSIIIVFHNEAWSTLLRTVWSVIN 148
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
R+P + LEEIILVDD S + L + L+DY+ +++R+ R GLI+ R +GA ++
Sbjct: 149 RSPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAK 208
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G V+ FLDAHCE WL LL+ I DR + PVID I+ T+ + +E H+ G
Sbjct: 209 GPVLTFLDAHCECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GA 265
Query: 415 FEWGMLYKENELPEREAKKRK 435
F W + ++ L RE K RK
Sbjct: 266 FNWNLQFRWFTLGGRELKLRK 286
>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
Length = 622
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLERYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ + P I ID T++F + + H RG F+W + + LPE E ++R
Sbjct: 283 LARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 100/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ E D ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTLLETQEKDEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SV++VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLERYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ + P I ID T++F + + H RG F+W + + LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|71987788|ref|NP_001022645.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
gi|3047199|gb|AAC13675.1| GLY6b [Caenorhabditis elegans]
gi|14530524|emb|CAC42317.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
Length = 617
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 91/237 (38%), Positives = 140/237 (59%), Gaps = 7/237 (2%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
NL P + + EG G HL + D++ N+ S+ IS R++P++R
Sbjct: 89 ANLYAPHDDWGEG---GAGVSHLTPEQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKP 145
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
C+ YP +LP SVI+V+HNE +S+L+RTV S+I R+P + L+EIILVDDFS + L
Sbjct: 146 SCRNMTYPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLR 205
Query: 318 -QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL
Sbjct: 206 YPTLDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLL 265
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
I +RK + PVID I+ T++++ E +RG F W + ++ +P AK+
Sbjct: 266 TRIKLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQ 319
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/184 (42%), Positives = 120/184 (65%), Gaps = 4/184 (2%)
Query: 6 LEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 65
L+ ++ +++IR+ ER GLIR R GA+E++G+V+ FLD+HCE WL PLL I
Sbjct: 209 LDTTLKPLPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTRI 268
Query: 66 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-S 124
+RK + PVID I+ T++++ E +RG F W + ++ +P AK+ + +
Sbjct: 269 KLNRKAVPCPVIDIINDNTFQYQKGIE---MFRGGFNWNLQFRWYGMPTAMAKQHLLDPT 325
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P +SPT AGGLF+++R +F ELG YDPG+ +WGGEN E+SF+IW CGG +E +PCS +
Sbjct: 326 GPIESPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVG 385
Query: 185 SLIR 188
+ R
Sbjct: 386 HVFR 389
>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Saimiri boliviensis boliviensis]
gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Saimiri boliviensis boliviensis]
Length = 622
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 100/246 (40%), Positives = 140/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P GPG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NGPGADGKAFQKRKWTPLETQEKEEGFKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|270011650|gb|EFA08098.1| hypothetical protein TcasGA2_TC005702 [Tribolium castaneum]
Length = 607
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ +++R+ R GLI+ R +GA ++G V+ FLDAHCE WL L
Sbjct: 202 LKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEAL 261
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + PVID I+ T+ + +E H+ G F W + ++ L RE K RK
Sbjct: 262 LSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GAFNWNLQFRWFTLGGRELKLRK 318
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ ++P+ +PT AGGLFA+DR +F E+G YD G+ +WGGEN E+SF+IW CGG ++ PC
Sbjct: 319 NDATQPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQCGGKVQIAPC 378
Query: 181 SRIVSLIR 188
SR+ L R
Sbjct: 379 SRVGHLFR 386
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 84/201 (41%), Positives = 123/201 (61%), Gaps = 6/201 (2%)
Query: 237 MNMETSNHISFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
N+ S+ I +R++PD R ++C + DYP PK S+I+VFHNE +S+L+RTV S+I
Sbjct: 122 FNLLASDRIPLNRSLPDFRRKKCATLFGDYP-TYPKTSIIIVFHNEAWSTLLRTVWSVIN 180
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
R+P + LEEIILVDD S + L + L+DY+ +++R+ R GLI+ R +GA ++
Sbjct: 181 RSPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAK 240
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G V+ FLDAHCE WL LL+ I DR + PVID I+ T+ + +E H+ G
Sbjct: 241 GPVLTFLDAHCECTTGWLEALLSVIKQDRTAVVCPVIDIINDDTFAYVKSFE--LHW-GA 297
Query: 415 FEWGMLYKENELPEREAKKRK 435
F W + ++ L RE K RK
Sbjct: 298 FNWNLQFRWFTLGGRELKLRK 318
>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Callithrix jacchus]
Length = 622
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASMAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + + H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPIQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ L + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKRKLTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASMAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + + H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPIQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
Length = 584
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 124/188 (65%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ YI+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL PL
Sbjct: 193 LRSPLDLYIKRFSLPFHLIHLPERSGLIRARLQGAKIAKGKVLLFLDAHVEVTEGWLEPL 252
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + DRK + P+ID I + +E+ + D + G F W + ++ +P RE ++R
Sbjct: 253 LDRVSVDRKRVVAPIIDVISDENFEY--ITASDITWGG-FNWHLNFRWYPVPMREMERRN 309
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ S P ++PT AGGLFA+DR FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PC
Sbjct: 310 HDRSVPLQTPTIAGGLFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPC 369
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 370 SRVGHVFR 377
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 139/234 (59%), Gaps = 6/234 (2%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
+ GPGE G A + + + E ++ S+ IS +R +PD R +C+ D
Sbjct: 82 RNGPGEMGSAVIIDPSQQEERKKKFNENQFDVMASDLISINRALPDYRSSKCREAARKYD 141
Query: 268 ---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
LP S+I+VFHNE +S+L+RT+HS+I R+P ++E+IL+DD S++ L L+ YI
Sbjct: 142 ITSLPTVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRSPLDLYI 201
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
+RF+ LI ER GLIR R +GAK ++G+V++FLDAH EV WL PLL + DRK
Sbjct: 202 KRFSLPFHLIHLPERSGLIRARLQGAKIAKGKVLLFLDAHVEVTEGWLEPLLDRVSVDRK 261
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+ P+ID I + +E+ + D + G F W + ++ +P RE ++R ++R
Sbjct: 262 RVVAPIIDVISDENFEY--ITASDITWGG-FNWHLNFRWYPVPMREMERRNHDR 312
>gi|149032012|gb|EDL86924.1| rCG50623 [Rattus norvegicus]
Length = 431
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 33 LKEKLERYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 91
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ + P I ID T++F + + H RG F+W + + LPE E ++R
Sbjct: 92 LARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQRR 151
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 152 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 211
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 212 SVVGHVFR 219
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/125 (41%), Positives = 72/125 (57%), Gaps = 2/125 (1%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L +KLE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 33 LKEKLERYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 91
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LA I D+ + P I ID T++F + + H RG F+W + + LPE E ++R
Sbjct: 92 LARIAEDKTAVVSPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQRR 151
Query: 435 KYNRY 439
K Y
Sbjct: 152 KDETY 156
>gi|254553456|ref|NP_080725.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
5 [Mus musculus]
gi|51316084|sp|Q9D4M9.2|GLTL5_MOUSE RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5;
AltName: Full=Polypeptide GalNAc transferase 15;
Short=GalNAc-T15; Short=pp-GaNTase 15; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 15;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 15
gi|148671134|gb|EDL03081.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5, isoform CRA_b
[Mus musculus]
gi|148877565|gb|AAI45758.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 [Mus musculus]
Length = 431
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 412 RGIFEWGM 419
RG F+W +
Sbjct: 254 RGAFDWNL 261
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 116/183 (63%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ Y+E F GKV+LIRN +REGLIR++ GA + G+++VFLD+HCEV WL P
Sbjct: 161 DLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEP 220
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID I+ T + Y RG F+W + + + + E
Sbjct: 221 LLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIVRGAFDWNLNLRWDNVFAYELDGP 276
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP GG+FA++R +F ELG YD G+ + GGEN ELS +IWMCGG + +PC
Sbjct: 277 EGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPC 336
Query: 181 SRI 183
SR+
Sbjct: 337 SRV 339
>gi|29437281|gb|AAH49554.1| Galntl5 protein, partial [Mus musculus]
Length = 434
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 412 RGIFEWGM 419
RG F+W +
Sbjct: 254 RGAFDWNL 261
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 116/183 (63%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ Y+E F GKV+LIRN +REGLIR++ GA + G+++VFLD+HCEV WL P
Sbjct: 161 DLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEP 220
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID I+ T + Y RG F+W + + + + E
Sbjct: 221 LLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIVRGAFDWNLNLRWDNVFAYELDGP 276
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP GG+FA++R +F ELG YD G+ + GGEN ELS +IWMCGG + +PC
Sbjct: 277 EGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPC 336
Query: 181 SRI 183
SR+
Sbjct: 337 SRV 339
>gi|12838270|dbj|BAB24147.1| unnamed protein product [Mus musculus]
Length = 424
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ +P LEEIILVDD S DL KL+ Y++ F GKV+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGAS 197
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 412 RGIFEWGM 419
RG F+W +
Sbjct: 254 RGAFDWNL 261
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 116/183 (63%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ Y+E F GKV+LIRN +REGLIR++ GA + G+++VFLD+HCEV WL P
Sbjct: 161 DLKDKLDYYLEIFRGKVKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEP 220
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID I+ T + Y RG F+W + + + + E
Sbjct: 221 LLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIVRGAFDWNLNLRWDNVFAYELDGP 276
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP GG+FA++R +F ELG YD G+ + GGEN ELS +IWMCGG + +PC
Sbjct: 277 EGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPC 336
Query: 181 SRI 183
SR+
Sbjct: 337 SRV 339
>gi|307173963|gb|EFN64693.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus
floridanus]
Length = 597
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 96/244 (39%), Positives = 144/244 (59%), Gaps = 4/244 (1%)
Query: 192 KADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTI 251
K + K+ LE + P G GE GK +L + G+A+L + +N+ SN IS R +
Sbjct: 69 KYEDKILKLEYNVVP---GLGENGKPAYLYGKDKFQGEAALKKKALNVILSNKISLTRKL 125
Query: 252 PDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
PD+R C Y LP ASV+++F+NE +S L+RTVHS++K +P L+EIILVDD S
Sbjct: 126 PDIRNSLCMNITYDKLLPSASVVIIFYNEPWSVLLRTVHSVLKGSPPHLLKEIILVDDHS 185
Query: 312 SKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 370
+ +L +L+ Y+ R KV+L+R + R+GLIR R GA+ ++G+V+VFLDAHCEV +
Sbjct: 186 EEEELQGQLDYYLSTRLPAKVKLLRLSHRQGLIRARLHGARNAKGDVLVFLDAHCEVIKD 245
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
WL PLL I ++ + +P+ID I +T E+ E G F W + + + E
Sbjct: 246 WLQPLLQRIKDNKNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHE 305
Query: 431 AKKR 434
+ R
Sbjct: 306 VESR 309
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 118/189 (62%), Gaps = 1/189 (0%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L +L+ Y+ R KV+L+R + R+GLIR R GA+ ++G+V+VFLDAHCEV +WL
Sbjct: 189 ELQGQLDYYLSTRLPAKVKLLRLSHRQGLIRARLHGARNAKGDVLVFLDAHCEVIKDWLQ 248
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I ++ + +P+ID I +T E+ E G F W + + + E +
Sbjct: 249 PLLQRIKDNKNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHEVES 308
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R P +SPT AGGLFA++R +F E+G YD + WGGEN E+SF+IW CGG++E +P
Sbjct: 309 RPSPISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEMSFRIWQCGGTLEIIP 368
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 369 CSRVGHIFR 377
>gi|432096894|gb|ELK27469.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Myotis davidii]
Length = 940
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 119/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 543 LKDNLDKYMSQF-PKVRILHLKERHGLIRARLAGAQIATGDVLTFLDSHVECNIGWLEPL 601
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y RK + PVI+ I+ + + +V D+ RGIF W M + +P + AK R
Sbjct: 602 LERVYLSRKKVACPVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIPPDVIAKNR 658
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 659 IKETDVIRCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 718
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 719 SRVGHIFR 726
Score = 148 bits (373), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 82/219 (37%), Positives = 125/219 (57%), Gaps = 4/219 (1%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
+ PG+ G+ +P+ + E N+ S+ I DR I D R C +L
Sbjct: 436 KAPGQFGRPVLVPQGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPVGCAKQLVHNNL 495
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L L+ Y+ +F
Sbjct: 496 PTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF- 554
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL +Y RK +
Sbjct: 555 PKVRILHLKERHGLIRARLAGAQIATGDVLTFLDSHVECNIGWLEPLLERVYLSRKKVAC 614
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
PVI+ I+ + + +V D+ RGIF W M + +P
Sbjct: 615 PVIEVINDKDMSYMTV---DNFQRGIFVWPMNFGWRTIP 650
>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
Length = 541
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE+Y+ + K ++R +R GLIR R GA+ GEVI FLDAHCE WL PL
Sbjct: 103 LGKQLEEYVAKLPVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPL 162
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I +R+ + P+ID I +T+E+ + D + G F W + ++ +P RE +R
Sbjct: 163 LARIVQNRRTVVCPIIDVISDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRN 219
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F E+G YD G+ +WGGEN E+SF+IW CGG +E +PC
Sbjct: 220 NDRTAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPC 279
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 280 SHVGHVFR 287
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 87/224 (38%), Positives = 131/224 (58%), Gaps = 3/224 (1%)
Query: 215 GKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVI 274
GK +P + E N+ S+ IS +R++ D+R E C+ Y LP S++
Sbjct: 2 GKPVKIPADMKDLMKEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASKLPTTSIV 61
Query: 275 LVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLI 334
+VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE+Y+ + K ++
Sbjct: 62 IVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVL 121
Query: 335 RNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGI 394
R +R GLIR R GA+ GEVI FLDAHCE WL PLLA I +R+ + P+ID I
Sbjct: 122 RTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVVCPIIDVI 181
Query: 395 DYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+T+E+ + D + G F W + ++ +P RE +R +R
Sbjct: 182 SDETFEY--ITASDSTWGG-FNWKLNFRWYRVPSREMARRNNDR 222
>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
catus]
Length = 582
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 83/182 (45%), Positives = 113/182 (62%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D + PVID ID+ T+EF + G F+W + ++ + +P+ E +RK
Sbjct: 245 LERIGKDETAIVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHSVPKHERDRRK 302
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 303 SRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCS 362
Query: 182 RI 183
+
Sbjct: 363 HV 364
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 132/227 (58%), Gaps = 6/227 (2%)
Query: 212 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 268
GE GKA L + + + Y +N+ S+ IS R I D RM ECK + L
Sbjct: 79 GEWGKASKLQLSQDELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRRL 138
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +LE YI +
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLD 198
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
+VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I D +
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGKDETAIVC 257
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID ID+ T+EF + G F+W + ++ + +P+ E +RK
Sbjct: 258 PVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHSVPKHERDRRK 302
>gi|410897066|ref|XP_003962020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Takifugu rubripes]
Length = 600
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 122/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR+IR ER+GLIR R GA ++GEV+ FLD+H E + WL PL
Sbjct: 205 LKAPLDKYMSQF-PKVRIIRLRERQGLIRARLAGAAAAKGEVLTFLDSHVECNVGWLEPL 263
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L IY DR+ + PVI+ I+ + + V D+ RGIF W +++ + LPE KK
Sbjct: 264 LERIYMDRRKVPCPVIEVINDKDMSYMLV---DNFQRGIFRWPLVFGWSPLPEAYIKKHN 320
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
S+P + P AGGLF++D+ +F ELG YD GL VWGGEN E+SFKIWMCGG IE +PC
Sbjct: 321 LTISDPIRCPVMAGGLFSIDKKYFYELGAYDSGLDVWGGENMEISFKIWMCGGEIEIIPC 380
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 381 SRVGHIFR 388
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/222 (44%), Positives = 133/222 (59%), Gaps = 4/222 (1%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
G+ G+A H+ + A S E N+ SN I DR IPD R E C DLP
Sbjct: 101 GQFGQAVHVSSSEDALVRKSWDEGFFNVYLSNQIPLDRAIPDTRPESCAQTLVHDDLPST 160
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
SVI F +E +S+L+R+VHS++ R+P LEEIILVDDFS+K L L+ Y+ +F KV
Sbjct: 161 SVIFCFVDEVWSTLLRSVHSVLNRSPPHLLEEIILVDDFSTKEYLKAPLDKYMSQF-PKV 219
Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 391
R+IR ER+GLIR R GA ++GEV+ FLD+H E + WL PLL IY DR+ + PVI
Sbjct: 220 RIIRLRERQGLIRARLAGAAAAKGEVLTFLDSHVECNVGWLEPLLERIYMDRRKVPCPVI 279
Query: 392 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
+ I+ + + V D+ RGIF W +++ + LPE KK
Sbjct: 280 EVINDKDMSYMLV---DNFQRGIFRWPLVFGWSPLPEAYIKK 318
>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4 [Equus caballus]
Length = 703
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 83/182 (45%), Positives = 112/182 (61%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 307 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 365
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D + PVID ID+ T+EF + G F+W + ++ + +P+ E +RK
Sbjct: 366 LERISKDETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHSVPKHERDRRK 423
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 424 SRIDPISSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCS 483
Query: 182 RI 183
+
Sbjct: 484 HV 485
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/258 (36%), Positives = 141/258 (54%), Gaps = 12/258 (4%)
Query: 179 PCSRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMN 238
P + L RP+++ +PP + + G L E + + Y +N
Sbjct: 177 PGDKAEDLSRPLYE--------KPPADSHALGEWGKASKLQLNEGELKQQEELIERYAIN 228
Query: 239 METSNHISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
+ S+ IS R I D RM ECK + LP SV++ F+NE +S+L+RT+HS+++ +P
Sbjct: 229 IYLSDRISLHRHIEDKRMYECKSQKFNYRKLPTTSVVIAFYNEAWSTLLRTIHSVLETSP 288
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
A L+EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V
Sbjct: 289 AVLLKEIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDV 347
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 417
+ FLD HCE WL PLL I D + PVID ID+ T+EF + G F+W
Sbjct: 348 LTFLDCHCECNSGWLEPLLERISKDETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDW 405
Query: 418 GMLYKENELPEREAKKRK 435
+ ++ + +P+ E +RK
Sbjct: 406 RLTFQWHSVPKHERDRRK 423
>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 6 [Homo sapiens]
Length = 622
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSIPKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 139/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|443704264|gb|ELU01402.1| hypothetical protein CAPTEDRAFT_127533 [Capitella teleta]
Length = 390
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 130/227 (57%), Gaps = 7/227 (3%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 266
GPGE G++ AA N S+ +SF+RTIPD R C K +DY
Sbjct: 6 NGPGEHGRSVPTSPKDEAAVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYS- 64
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LPK SVI+ F E +S+L+R+VHS++ RTP + LEEIILVDDFS + L KL++Y+ R
Sbjct: 65 SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTR 124
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
KV LIR R+GLIR R R + +RG V+ FLD+H E + W PLL I +R+++
Sbjct: 125 L-PKVTLIRFPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 183
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
PVID I + + + + + RG F W ML+K +P E +
Sbjct: 184 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSR 227
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 116/188 (61%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL++Y+ R KV LIR R+GLIR R R + +RG V+ FLD+H E + W PL
Sbjct: 114 LHAKLDNYLTRL-PKVTLIRFPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPL 172
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R+++ PVID I + + + + + RG F W ML+K +P E +
Sbjct: 173 LQRISHNRRVIVAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSRTG 229
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLFA+ + FF LG YDPGL +WG EN ELSFK WMCGGS+E +PC
Sbjct: 230 GDPTAPVRTPTIAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPC 289
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 290 SRVGHVYR 297
>gi|395519600|ref|XP_003763931.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Sarcophilus harrisii]
Length = 945
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 121/188 (64%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 548 LKDQLDKYMSQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPL 606
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L +Y ++K + PVI+ I+ + + +V D+ RGIF W M + ++P K+ K
Sbjct: 607 LERVYLNKKKVACPVIEIINDKDLSYMTV---DNFQRGIFVWPMNFSWKKIPPEIIKQNK 663
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 664 IKETDVIRCPVMAGGLFSIDKKYFFELGTYDPGLEVWGGENMELSFKVWMCGGEIEIIPC 723
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 724 SRVGHIFR 731
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 86/238 (36%), Positives = 133/238 (55%), Gaps = 5/238 (2%)
Query: 199 NLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRME 257
NL+ L P + PG+ G +P E N+ S+ I DR I D R
Sbjct: 430 NLDVTLSPRNPKAPGQFGNPVVVPFGKEKEVKRRWKEGNFNVYLSDLIPLDRAIDDTRPS 489
Query: 258 ECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
C +LP S+I+ F +E +S+L+R+VHS++ R+P ++EI+LVDDFS+K L
Sbjct: 490 GCADQLVHNNLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKGYLK 549
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PLL
Sbjct: 550 DQLDKYMSQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPLLE 608
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+Y ++K + PVI+ I+ + + +V D+ RGIF W M + ++P K+ K
Sbjct: 609 RVYLNKKKVACPVIEIINDKDLSYMTV---DNFQRGIFVWPMNFSWKKIPPEIIKQNK 663
>gi|194857037|ref|XP_001968882.1| GG25115 [Drosophila erecta]
gi|190660749|gb|EDV57941.1| GG25115 [Drosophila erecta]
Length = 634
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 214 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIQSENATLAVPV 273
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 274 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 329
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 330 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 389
Query: 194 DG 195
DG
Sbjct: 390 DG 391
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 119/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + LP+AS+++ F+NE +LM
Sbjct: 108 DIGYKHHAFNALVSNNIGLFREIPDTRHKVCDRQETSEAEKLPQASIVMCFYNEHKMTLM 167
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 168 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 227
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 228 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIQSENATLAVPVIDLINADTFE---- 283
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 284 YTPSPLVRGGFNWGLHFRWENLPE 307
>gi|195115752|ref|XP_002002420.1| GI12891 [Drosophila mojavensis]
gi|193912995|gb|EDW11862.1| GI12891 [Drosophila mojavensis]
Length = 622
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 80/172 (46%), Positives = 114/172 (66%), Gaps = 4/172 (2%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R +RN +REGLIR+R GA+++ G+V+VFLD+H EV WL PLL + ++ + VPV
Sbjct: 207 LRYVRNEQREGLIRSRVIGARDASGDVLVFLDSHIEVNRQWLEPLLRLVKAENSTLAVPV 266
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 267 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFKGPFRSPTMAGGL 322
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
FA++R +F +G YD + +WGGEN E+SF++W CGGSI+ VPCSR+ + R
Sbjct: 323 FAVNRLYFQHIGEYDMAMDIWGGENIEISFRVWQCGGSIKIVPCSRVGHIFR 374
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 122/208 (58%), Gaps = 8/208 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C+ + LP+AS+I+ F+NE +LM
Sbjct: 101 DIGYKHHAFNALVSNNIGLFRDIPDTRHKVCERGETLESEQLPQASIIMCFYNEHKMTLM 160
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLD-QKLEDYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTPA L+EI+LVDD S +L+ L D R +R +RN +REGLIR
Sbjct: 161 RSIKTVLERTPAHLLKEIVLVDDNSDLPELEFHLLADLHARLKYDNLRYVRNEQREGLIR 220
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+++ G+V+VFLD+H EV WL PLL + ++ + VPVID I+ T+E
Sbjct: 221 SRVIGARDASGDVLVFLDSHIEVNRQWLEPLLRLVKAENSTLAVPVIDLINADTFE---- 276
Query: 405 YEPDHHYRGIFEWGMLYKENELPEREAK 432
Y P RG F WG+ ++ LPE K
Sbjct: 277 YTPSPLVRGGFNWGLHFRWENLPEGTLK 304
>gi|443715013|gb|ELU07165.1| hypothetical protein CAPTEDRAFT_143879 [Capitella teleta]
Length = 390
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 130/227 (57%), Gaps = 7/227 (3%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 266
GPGE G++ AA N S+ +SF+RTIPD R C K +DY
Sbjct: 6 NGPGEHGRSVPTSPKDEAAVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSFDYS- 64
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LPK SVI+ F E +S+L+R+VHS++ RTP + LEEIILVDDFS + L KL++Y+ R
Sbjct: 65 SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTR 124
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
KV LIR R+GLIR R R + +RG V+ FLD+H E + W PLL I +R+++
Sbjct: 125 L-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 183
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
PVID I + + + + + RG F W ML+K +P E +
Sbjct: 184 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSR 227
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL++Y+ R KV LIR R+GLIR R R + +RG V+ FLD+H E + W PL
Sbjct: 114 LHAKLDNYLTRL-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPL 172
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R+++ PVID I + + + + + RG F W ML+K +P E +
Sbjct: 173 LQRISHNRRVIVAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPNYEKSRTG 229
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLFA+ + FF LG YDPGL +WG EN ELSFK WMCGGS+E +PC
Sbjct: 230 GDPTAPVRTPTIAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPC 289
Query: 181 SRI 183
SR+
Sbjct: 290 SRV 292
>gi|195579200|ref|XP_002079450.1| GD23962 [Drosophila simulans]
gi|194191459|gb|EDX05035.1| GD23962 [Drosophila simulans]
Length = 635
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 215 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 274
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 275 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 330
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 331 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 390
Query: 194 DG 195
DG
Sbjct: 391 DG 392
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 120/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP+AS+++ F+NE +LM
Sbjct: 109 DIGYKHHAFNALVSNNIGLFREIPDTRHKVCDRQETTEAENLPQASIVMCFYNEHKMTLM 168
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 169 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 228
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 229 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 284
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 285 YTPSPLVRGGFNWGLHFRWENLPE 308
>gi|195338421|ref|XP_002035823.1| GM15572 [Drosophila sechellia]
gi|194129703|gb|EDW51746.1| GM15572 [Drosophila sechellia]
Length = 604
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 214 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 273
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 274 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 329
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 330 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 389
Query: 194 DG 195
DG
Sbjct: 390 DG 391
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 120/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP+AS+++ F+NE +LM
Sbjct: 108 DIGYKHHAFNALVSNNIGLFREIPDTRHKVCDRQETTEAENLPQASIVMCFYNEHKMTLM 167
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 168 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 227
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 228 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 283
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 284 YTPSPLVRGGFNWGLHFRWENLPE 307
>gi|14549429|gb|AAK66862.1|AF158747_1 polypeptide N-acetylgalactosaminyltransferase [Drosophila
melanogaster]
Length = 631
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 211 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 270
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 271 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 326
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 327 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 386
Query: 194 DG 195
DG
Sbjct: 387 DG 388
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 120/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP+AS+++ F+NE +LM
Sbjct: 105 DIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQASIVMCFYNEHKMTLM 164
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 165 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 224
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 225 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 280
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 281 YTPSPLVRGGFNWGLHFRWENLPE 304
>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Oryctolagus cuniculus]
Length = 622
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFTGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D ++ P I ID T+EF + V H RG F+W + + +P E ++R
Sbjct: 283 LARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAVPAHENRRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 137/246 (55%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P + PG G+A+ E D ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--KSPGADGRAFQKSEWTPQETQEKDEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ PA L EIILVDD S++
Sbjct: 164 ECVDQKFRRCP-PLPSTSVIIVFHNEAWSTLLRTVYSVLHTAPAILLREIILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFTGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D ++ P I ID T+EF + V H RG F+W + + +P E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAVPAHENRR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|24584318|ref|NP_652069.2| polypeptide N-acetylgalactosaminyltransferase 35A [Drosophila
melanogaster]
gi|51316020|sp|Q8MVS5.2|GLT35_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 35A;
AltName: Full=Protein l(2)35Aa; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 35A;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 35A; Short=pp-GaNTase
35A; AltName: Full=dGalNAc-T1
gi|7298154|gb|AAF53391.1| polypeptide N-acetylgalactosaminyltransferase 35A [Drosophila
melanogaster]
gi|334191718|gb|AEG66944.1| LD24449p [Drosophila melanogaster]
Length = 632
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 212 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 271
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 272 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 327
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 328 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 387
Query: 194 DG 195
DG
Sbjct: 388 DG 389
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 120/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP+AS+++ F+NE +LM
Sbjct: 106 DIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQASIVMCFYNEHKMTLM 165
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 166 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 225
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 226 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 281
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 282 YTPSPLVRGGFNWGLHFRWENLPE 305
>gi|21552971|gb|AAM62405.1|AF478697_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 632
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 212 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 271
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 272 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 327
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 328 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 387
Query: 194 DG 195
DG
Sbjct: 388 DG 389
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 120/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP+AS+++ F+NE +LM
Sbjct: 106 DIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQASIVMCFYNEHKMTLM 165
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 166 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 225
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 226 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 281
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 282 YTPSPLVRGGFNWGLHFRWENLPE 305
>gi|17946358|gb|AAL49213.1| RE64279p [Drosophila melanogaster]
Length = 632
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 212 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 271
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 272 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 327
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 328 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 387
Query: 194 DG 195
DG
Sbjct: 388 DG 389
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 120/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP+AS+++ F+NE +LM
Sbjct: 106 DIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQASIVMCFYNEHKMTLM 165
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 166 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 225
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 226 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 281
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 282 YTPSPLVRGGFNWGLHFRWENLPE 305
>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 493
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 78/174 (44%), Positives = 115/174 (66%), Gaps = 4/174 (2%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
K+RL+RNT+REGL+R+R RGA+ ++ V+ FLD+HCE WLPPLL + D + + P
Sbjct: 177 KIRLLRNTQREGLVRSRVRGARAAKAPVLTFLDSHCECNQGWLPPLLRRVKEDPRRVVCP 236
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 134
VID I+ +++++ + RG F W +++K L +E ++R N + P ++P AG
Sbjct: 237 VIDVINLESFKY---FGASSDLRGGFNWNLVFKWEFLSNKEREERANNPTLPIRTPMIAG 293
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLF +DRA F LG YD + +WGGEN ELSF+ W CGGS+E +PCSR+ + R
Sbjct: 294 GLFVVDRAQFERLGAYDTAMDIWGGENLELSFRAWQCGGSLEILPCSRVGHVFR 347
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/206 (39%), Positives = 126/206 (61%), Gaps = 11/206 (5%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECK---YWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
+ N E S+ ++ +R IPD R +C +LP SV++ FHNE S+L+RT+ S
Sbjct: 84 HKFNQEASDALASNRAIPDTRHPQCAKEGLLKPQEELPATSVVITFHNEARSALLRTIVS 143
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ R+PA+ +EEIILVDDFS ++L IQ K+RL+RNT+REGL+R+R RGA+
Sbjct: 144 VLNRSPAELIEEIILVDDFSDDPSDGEELAK-IQ----KIRLLRNTQREGLVRSRVRGAR 198
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
++ V+ FLD+HCE WLPPLL + D + + PVID I+ +++++ +
Sbjct: 199 AAKAPVLTFLDSHCECNQGWLPPLLRRVKEDPRRVVCPVIDVINLESFKY---FGASSDL 255
Query: 412 RGIFEWGMLYKENELPEREAKKRKYN 437
RG F W +++K L +E ++R N
Sbjct: 256 RGGFNWNLVFKWEFLSNKEREERANN 281
>gi|410976430|ref|XP_003994623.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 9, partial [Felis
catus]
Length = 629
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 81/187 (43%), Positives = 117/187 (62%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DRTIPD R ++C+ + DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 62 EYGYNAQLSDRISLDRTIPDYRPKKCRQVTHTADLPQISVVFIFVNEALSVILRSVHSVV 121
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
RTP++ L+E+ILVDD S A+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 122 NRTPSRLLKEVILVDDNSDSAELKTSLDQYVNRRYPGLVKIVRNSRREGLIRARLQGWKA 181
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E G W P L I DR+ + +P ID I Y T+E + H Y
Sbjct: 182 ATAPVVGFFDAHVEFGTGWAEPALTRIQEDRRRIVLPAIDNIKYDTFEVQQYASAAHGY- 240
Query: 413 GIFEWGM 419
WG+
Sbjct: 241 ---NWGL 244
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/189 (37%), Positives = 104/189 (55%), Gaps = 11/189 (5%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ R + G V+++RN+ REGLIR R +G K + V+ F DAH E G W
Sbjct: 143 ELKTSLDQYVNRRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFGTGWAE 202
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L I DR+ + +P ID I Y T+E + H Y WG+ + L
Sbjct: 203 PALTRIQEDRRRIVLPAIDNIKYDTFEVQQYASAAHGY----NWGL--RMTHLXPAHVP- 255
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ P ++P G F +DR +F ++G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 256 ---SLLPTRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENIELGMRVWQCGGSMEVLP 312
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 313 CSRVAHIAR 321
>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryctolagus cuniculus]
Length = 579
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 183 LKTQLETYISNLD-RVRLIRTKKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 241
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 242 LERIERDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 298
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 299 KSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 358
Query: 181 SRI 183
S +
Sbjct: 359 SHV 361
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 99/239 (41%), Positives = 139/239 (58%), Gaps = 10/239 (4%)
Query: 201 EPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 258
+PP + + GE GKA L E + + Y +N+ S+ IS R I D RM E
Sbjct: 67 KPPAD--SQALGEWGKASKLQLSEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYE 124
Query: 259 CKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S +A L
Sbjct: 125 CKSKTFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRAYLK 184
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 185 TQLETYISNLD-RVRLIRTKKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLE 243
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 244 RIERDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRK 299
>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 497
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 139/239 (58%), Gaps = 5/239 (2%)
Query: 196 KLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLR 255
K+ ++ P+ ++GPGE GK + ++ D N+ S+ I+ +R++PD+R
Sbjct: 7 KIQDMPKPVN--RDGPGEQGKPVIIEPEFKKERDEKWKINEFNLMASDKIALNRSLPDVR 64
Query: 256 MEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKAD 315
C YP LP SVI+VFHNE +S+L+RT HSII R+P + L E+ILVDD S++
Sbjct: 65 PRGCNDKKYPGKLPTTSVIVVFHNEAWSTLLRTTHSIINRSPRELLMEVILVDDCSTQEH 124
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L + L+DY+ + V + R R GLIR+R RG ++G+V+ +LD+HCE WL PL
Sbjct: 125 LKKPLDDYVAKLPVPVHVERMEVRSGLIRSRLRGGSVAKGDVLTYLDSHCECTEGWLEPL 184
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
++ I DRK P+ID ID +++ + E + G F W + ++ +PE E +R
Sbjct: 185 VSRIGDDRKTRVQPIIDIIDDRSFAYIGASESNS---GGFTWQLQHQWVRIPEYEQNRR 240
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 78/194 (40%), Positives = 119/194 (61%), Gaps = 10/194 (5%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+DY+ + V + R R GLIR+R RG ++G+V+ +LD+HCE WL PL
Sbjct: 125 LKKPLDDYVAKLPVPVHVERMEVRSGLIRSRLRGGSVAKGDVLTYLDSHCECTEGWLEPL 184
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR- 120
++ I DRK P+ID ID +++ + E + G F W + ++ +PE E +R
Sbjct: 185 VSRIGDDRKTRVQPIIDIIDDRSFAYIGASESNS---GGFTWQLQHQWVRIPEYEQNRRV 241
Query: 121 ------KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
+ + +++PT AGGLF++++ +F ++G YD G+ VWGGEN E+SF+IWMCGG
Sbjct: 242 SEYDNIRQVTLFHRTPTMAGGLFSINKTYFEKMGAYDTGMDVWGGENIEMSFRIWMCGGK 301
Query: 175 IEWVPCSRIVSLIR 188
IE +PCSRI + R
Sbjct: 302 IEIIPCSRIGHVYR 315
>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Hydra magnipapillata]
Length = 548
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 81/192 (42%), Positives = 117/192 (60%), Gaps = 5/192 (2%)
Query: 2 LDKKLEDYIERFNGK-----VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 56
L K L +YI N K V L RN +R+GL+R+R +GA+ + G V+ FLD+HCE
Sbjct: 185 LKKPLRNYINELNRKLGEEMVILYRNAKRQGLVRSRLKGAELASGTVLTFLDSHCEATEG 244
Query: 57 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 116
W+ PLL I D++ + PVI+ ID ++ G F W + + E+ E E
Sbjct: 245 WVEPLLFRIKEDKRNVVCPVIEVIDAVDLSYKKTELDRITQVGGFTWDLFFNWKEITEDE 304
Query: 117 AKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
+ R ++P KSPT AGGLFA+D+++F E+G YD + +WGGEN E+SF+IWMCGG +E
Sbjct: 305 KRLRADGTQPLKSPTMAGGLFAIDKSYFYEIGSYDNQMEIWGGENLEMSFRIWMCGGKLE 364
Query: 177 WVPCSRIVSLIR 188
+PCSR+ + R
Sbjct: 365 IIPCSRVGHIFR 376
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 138/270 (51%), Gaps = 10/270 (3%)
Query: 174 SIEWVPCSRIVSLIR--PVFKADGKLGNLEPPLEPYKEGPGEGGKAYH--LPEAYRAAGD 229
S+ W+ + L + P+ D LG L L P P G + Y LP+ ++
Sbjct: 40 SVFWISLDFLFLLHKQAPIVDVD-VLGQLGIELYPELIDPLLGARGYPAILPDNLKSQSK 98
Query: 230 ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 289
+ + S+ IS +R + +++ + C YP +LP SVI+ FHNE S+L+RTV
Sbjct: 99 NLFKNHSFDSLLSDRISLNRRLGNVKGDLCSSKQYPAELPNTSVIICFHNEATSALLRTV 158
Query: 290 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK-----VRLIRNTEREGLIR 344
HS+I TP L I+LVDD S A L + L +YI N K V L RN +R+GL+R
Sbjct: 159 HSVINETPPNILSNIVLVDDASVGAALKKPLRNYINELNRKLGEEMVILYRNAKRQGLVR 218
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R +GA+ + G V+ FLD+HCE W+ PLL I D++ + PVI+ ID ++
Sbjct: 219 SRLKGAELASGTVLTFLDSHCEATEGWVEPLLFRIKEDKRNVVCPVIEVIDAVDLSYKKT 278
Query: 405 YEPDHHYRGIFEWGMLYKENELPEREAKKR 434
G F W + + E+ E E + R
Sbjct: 279 ELDRITQVGGFTWDLFFNWKEITEDEKRLR 308
>gi|449279337|gb|EMC86972.1| Polypeptide N-acetylgalactosaminyltransferase 9 [Columba livia]
Length = 519
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 117/187 (62%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R ++CK YP DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 32 EYGYNAQLSDRISLDRSIPDYRPKKCKQMSYPDDLPQISVVFIFVNEALSVILRSVHSVV 91
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+ L+EIILVDD S +L L+ Y+ +R+ G V+++RNT+REGLIR R +G K
Sbjct: 92 NHTPSHLLKEIILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNTKREGLIRARIQGWKA 151
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E + W+ P L I DRK + +P ID I Y T+E + H Y
Sbjct: 152 ATSPVVGFFDAHVEFNIGWVEPALTRIKDDRKRIILPAIDNIKYNTFEVQQYANAAHGY- 210
Query: 413 GIFEWGM 419
WG+
Sbjct: 211 ---NWGL 214
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/189 (39%), Positives = 111/189 (58%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RNT+REGLIR R +G K + V+ F DAH E + W+
Sbjct: 113 ELKFNLDQYVNKRYPGLVKIVRNTKREGLIRARIQGWKAATSPVVGFFDAHVEFNIGWVE 172
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L I DRK + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 173 PALTRIKDDRKRIILPAIDNIKYNTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 227
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ S P ++P G F +DR +F E+G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 228 KGDESAPIRTPAMIGCSFVVDREYFGEIGLLDPGMEVYGGENIELGMRVWQCGGSMEVLP 287
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 288 CSRVAHIER 296
>gi|170060398|ref|XP_001865784.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
gi|167878898|gb|EDS42281.1| N-acetyl galactosaminyl transferase 7 [Culex quinquefasciatus]
Length = 356
Score = 167 bits (424), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 77/102 (75%), Positives = 88/102 (86%), Gaps = 1/102 (0%)
Query: 99 GIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWG 158
GIFEWGMLYKENE+P REAK+RK++SEPYKSPTHAGGLFA++R FFL++G YDPGLLVWG
Sbjct: 51 GIFEWGMLYKENEVPRREAKRRKHDSEPYKSPTHAGGLFAINREFFLKIGAYDPGLLVWG 110
Query: 159 GENFELSFKIWMCGGSIEWVPCSRIVSLIRPVFKAD-GKLGN 199
GENFELSFKIW CGGSIEWVPCSR+ + R + GKL N
Sbjct: 111 GENFELSFKIWQCGGSIEWVPCSRVGHVYRGFMPYNFGKLAN 152
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 20/25 (80%), Positives = 24/25 (96%)
Query: 413 GIFEWGMLYKENELPEREAKKRKYN 437
GIFEWGMLYKENE+P REAK+RK++
Sbjct: 51 GIFEWGMLYKENEVPRREAKRRKHD 75
>gi|301606438|ref|XP_002932836.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Xenopus (Silurana) tropicalis]
Length = 613
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 126/206 (61%), Gaps = 4/206 (1%)
Query: 230 ASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTV 289
+ L G + E S +I R IPD R EC +Y LP ASVI+ FHNEG+S+L+RTV
Sbjct: 130 SELNTNGFDEEVSKNIPLHRIIPDGRHPECLQQNYGEKLPIASVIICFHNEGWSTLLRTV 189
Query: 290 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 349
HS++ +P +L+EIILVDD S + L L +YI R G V+LIR+ +R G+I R G
Sbjct: 190 HSVLDNSPRTFLKEIILVDDLSHQEHLKSALSEYISRIGG-VKLIRSNKRLGVIGGRMLG 248
Query: 350 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 409
A + GEV++F+D+HCE WL PLL+ I +R + PVID ID++T+E+
Sbjct: 249 AARATGEVLIFMDSHCECHPGWLEPLLSRIMHNRNRIVSPVIDFIDWKTFEYS---HSSL 305
Query: 410 HYRGIFEWGMLYKENELPEREAKKRK 435
+G+F+W + + LPE E K R+
Sbjct: 306 LQQGVFDWKLDFHWVPLPEHEEKVRQ 331
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 112/182 (61%), Gaps = 4/182 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +YI R G V+LIR+ +R G+I R GA + GEV++F+D+HCE WL PL
Sbjct: 216 LKSALSEYISRIGG-VKLIRSNKRLGVIGGRMLGAARATGEVLIFMDSHCECHPGWLEPL 274
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I +R + PVID ID++T+E+ +G+F+W + + LPE E K R+
Sbjct: 275 LSRIMHNRNRIVSPVIDFIDWKTFEYS---HSSLLQQGVFDWKLDFHWVPLPEHEEKVRQ 331
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P++SP G + A DR +F +GG+D G+ WG E ELS ++W+CGGS+E VPCS
Sbjct: 332 SPIIPFRSPVIPGYVLASDRHYFQNIGGFDTGINSWGVETTELSIRVWLCGGSVEIVPCS 391
Query: 182 RI 183
R+
Sbjct: 392 RV 393
>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
Length = 578
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 113/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKAQLEAYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 TSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/256 (37%), Positives = 141/256 (55%), Gaps = 14/256 (5%)
Query: 181 SRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNME 240
+R L +P++K +PP + + G L E + + Y +N+
Sbjct: 54 TREEDLSQPLYK--------KPPADSHALGEWGRASKLQLDEGELKQQEELIERYAINIY 105
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S+ IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA
Sbjct: 106 LSDRISLHRHIEDKRMYECKAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAV 165
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+
Sbjct: 166 LLKEIILVDDLSDRIYLKAQLEAYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLT 224
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWG 418
FLD HCE WL PLL I D + PVID ID+ T+EF EP G F+W
Sbjct: 225 FLDCHCECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWR 281
Query: 419 MLYKENELPEREAKKR 434
+ ++ + +P+ E +R
Sbjct: 282 LTFQWHSVPKHERDRR 297
>gi|324506451|gb|ADY42754.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Ascaris suum]
Length = 618
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/189 (44%), Positives = 120/189 (63%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIE--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L K L++Y++ + + V+++R +REGLIR R GA+ + +++VFLDAH E NWLP
Sbjct: 204 LKKPLDEYLKSAKLDHIVKVVRTKKREGLIRARQIGAQHATADIMVFLDAHSEPNYNWLP 263
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+ PI D + + P +D ID T+E+R+ D RG F+W YK L E + K
Sbjct: 264 PLIEPITLDYRTVVCPFVDVIDCDTFEYRA---QDEGARGSFDWEFNYKRLPLTEDDLK- 319
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ + P+KSP AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ P
Sbjct: 320 --HPTRPFKSPVMAGGYFAISRKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGNMVDAP 377
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 378 CSRVGHIYR 386
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 92/235 (39%), Positives = 136/235 (57%), Gaps = 23/235 (9%)
Query: 210 GPGEGGKAYHLP----------EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
GPGEGGK +P E YR G + S+ I +R++ D+R ++C
Sbjct: 96 GPGEGGKPVAIPTDPEIKKKQEELYRVNGYDAF--------VSDLIPLNRSVKDIRHKDC 147
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ Y LP SVI FH+E S+L+R+ +S+I RTP + L+EIILVDD S+K L +
Sbjct: 148 QNLRYLEALPSVSVIFPFHDEHNSTLLRSAYSVIARTPKEILKEIILVDDASTKPFLKKP 207
Query: 320 LEDYIQ--RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
L++Y++ + + V+++R +REGLIR R GA+ + +++VFLDAH E NWLPPL+
Sbjct: 208 LDEYLKSAKLDHIVKVVRTKKREGLIRARQIGAQHATADIMVFLDAHSEPNYNWLPPLIE 267
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
PI D + + P +D ID T+E+R+ D RG F+W YK L E + K
Sbjct: 268 PITLDYRTVVCPFVDVIDCDTFEYRA---QDEGARGSFDWEFNYKRLPLTEDDLK 319
>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
Length = 615
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/259 (37%), Positives = 145/259 (55%), Gaps = 10/259 (3%)
Query: 187 IRPVFKADGKLGN-LEPPLEPYKEGPG---EGGKAYHLPEAYRAAGDASLGEYGMNMETS 242
+ VF D + N L +E GPG +GG +PE + + E N+ S
Sbjct: 83 VEQVFPVDKETANQLRKLMETQAFGPGYHGQGGTGVTVPEDKKDIKEKRFLENQFNVVAS 142
Query: 243 NHISFDRTIPDLRMEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
IS +RT+PD R E C+ L LP S+I+VFHNE +++L+RT+HS+I R+P
Sbjct: 143 EMISINRTLPDYRSEACRTTGNSLKTEGLPTTSIIIVFHNEAWTTLLRTLHSVINRSPRH 202
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
LEEIILVDD S + L + L+ YI++F V L+ +R GLIR R G+ ++G++++
Sbjct: 203 LLEEIILVDDKSDRDYLVKPLDAYIKKFPVPVHLVHLEDRSGLIRARLTGSGMAKGKILL 262
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 419
FLDAH EV WL PL+ + DRK + P+ID I T+E+ + E G F W +
Sbjct: 263 FLDAHVEVTDGWLEPLVTRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHL 319
Query: 420 LYKENELPEREAKKRKYNR 438
++ +P+RE +R +R
Sbjct: 320 NFRWYAVPKRELNRRGADR 338
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+ YI++F V L+ +R GLIR R G+ ++G++++FLDAH EV WL PL
Sbjct: 219 LVKPLDAYIKKFPVPVHLVHLEDRSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPL 278
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
+ + DRK + P+ID I T+E+ + E G F W + ++ +P+RE +R
Sbjct: 279 VTRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYAVPKRELNRRG 335
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PC
Sbjct: 336 ADRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPC 395
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 396 SRVGHVFR 403
>gi|440905500|gb|ELR55875.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Bos grunniens
mutus]
Length = 412
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 87/188 (46%), Positives = 120/188 (63%), Gaps = 3/188 (1%)
Query: 251 IPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
+ +L+ CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD
Sbjct: 8 LKELKKRRCKTRVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDA 67
Query: 311 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 370
S + L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L
Sbjct: 68 SERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLG 127
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPERE 430
WL PLLA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE
Sbjct: 128 WLEPLLARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQRE 184
Query: 431 AKKRKYNR 438
+RK +R
Sbjct: 185 MDRRKGDR 192
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 87/240 (36%), Positives = 119/240 (49%), Gaps = 56/240 (23%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 73 LKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 132
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 133 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 189
Query: 122 -----------------------------------YNSEPYKSPTHAGG----------- 135
+ E + +GG
Sbjct: 190 GDRTLPVRLLPEKWLSLKVIEHTSPAQCLASAISLLDKEKASTSGESGGTGFKQSQLIER 249
Query: 136 -------LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF++DR +F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS + + R
Sbjct: 250 TPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFR 309
>gi|195385643|ref|XP_002051514.1| GJ11806 [Drosophila virilis]
gi|194147971|gb|EDW63669.1| GJ11806 [Drosophila virilis]
Length = 653
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/189 (46%), Positives = 117/189 (61%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +L+ Y+ ++ V ++ N ER GLI R GA+ + GEV+VF D+H EV NWLPP
Sbjct: 237 LKDQLDQYVALQWPHLVDVVHNPERRGLIGARLAGARVATGEVMVFFDSHIEVNYNWLPP 296
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
LL PI + KI T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 297 LLEPIVINNKISTCPIVDIIDHNNFAYNGGYQ--EGTRGGFDWRFFYKQLPVLPEDSVDK 354
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 355 ----SLPYRSPVMMGGLFAINSEFFWDLGGYDDELDIWGGEQYELSFKIWMCGGMLLDVP 410
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 411 CSRVAHIFR 419
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 94/233 (40%), Positives = 134/233 (57%), Gaps = 13/233 (5%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEY-----GMNMETSNHISFDRTIPDLRMEECKYW 262
+ G GE G LP + +L E G N S+ IS +R++PD+R E+CK
Sbjct: 128 RSGLGEHG----LPATIEDPAEKTLEEQEYRRNGFNGYLSDRISVNRSLPDVRHEKCKTR 183
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
Y LP SV+++F+NE F +L+RTV+SI+ RTP + L +I+LVDD S L +L+
Sbjct: 184 KYLAKLPNVSVVIIFYNEHFQTLLRTVYSIVNRTPKELLHQIVLVDDGSEWETLKDQLDQ 243
Query: 323 YIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
Y+ ++ V ++ N ER GLI R GA+ + GEV+VF D+H EV NWLPPLL PI
Sbjct: 244 YVALQWPHLVDVVHNPERRGLIGARLAGARVATGEVMVFFDSHIEVNYNWLPPLLEPIVI 303
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 433
+ KI T P++D ID+ + + Y+ RG F+W YK+ LPE K
Sbjct: 304 NNKISTCPIVDIIDHNNFAYNGGYQ--EGTRGGFDWRFFYKQLPVLPEDSVDK 354
>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
echinatior]
Length = 605
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 93/231 (40%), Positives = 131/231 (56%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
K PGE G A +P A N+ S+ IS +R++ D+R+E CK Y
Sbjct: 100 KGMPGEMGAAVAIPPENDAKQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKNKKYLKY 159
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L Q LEDY+
Sbjct: 160 LPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKQDLEDYVITL 219
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I +DR +
Sbjct: 220 PVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVV 279
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ S D + G F W + ++ + +RE +R +R
Sbjct: 280 CPIIDVISDDTFEYISA--SDMTWGG-FNWKLNFRWYRVAQREMDRRNSDR 327
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 116/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY+ + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 208 LKQDLEDYVITLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 267
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I +DR + P+ID I T+E+ S D + G F W + ++ + +RE +R
Sbjct: 268 LSRIANDRHTVVCPIIDVISDDTFEYISA--SDMTWGG-FNWKLNFRWYRVAQREMDRRN 324
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 325 SDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 384
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 385 SHVGHVFR 392
>gi|443726011|gb|ELU13353.1| hypothetical protein CAPTEDRAFT_91056 [Capitella teleta]
Length = 426
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 115/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL+DY+ R KV LIR R+GLIR R R + +RG V+ FLD+H E + W PL
Sbjct: 150 LHAKLDDYLTRL-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPL 208
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R+++ PVID I + + + + + RG F W ML+K +P+ E +
Sbjct: 209 LQRISHNRRVIVAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPDYEKSRTG 265
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLFA+ + FF LG YDPGL +WG EN ELSFK WMCGGS+E +PC
Sbjct: 266 GDPTAPVRTPTIAGGLFAIHQGFFRSLGFYDPGLHIWGSENLELSFKAWMCGGSMEMIPC 325
Query: 181 SRI 183
+R+
Sbjct: 326 ARV 328
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 92/227 (40%), Positives = 128/227 (56%), Gaps = 7/227 (3%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPL 266
PGE G++ A N S+ +SF+RTIPD R C K +DY
Sbjct: 42 NSPGEHGRSVRTSPDDEAVVKEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYS- 100
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LPK SVI+ F E +S+L+R+VHS++ RTP LEEI+LVDDFS + L KL+DY+ R
Sbjct: 101 SLPKMSVIICFTEESWSTLLRSVHSVLNRTPPDLLEEILLVDDFSQREHLHAKLDDYLTR 160
Query: 327 FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIM 386
KV LIR R+GLIR R R + +RG V+ FLD+H E + W PLL I +R+++
Sbjct: 161 L-PKVTLIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVI 219
Query: 387 TVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
PVID I + + + + + RG F W ML+K +P+ E +
Sbjct: 220 VAPVIDAISSRDFSYIPI---SANQRGGFNWAMLFKWMPVPDYEKSR 263
>gi|395820104|ref|XP_003783415.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Otolemur garnettii]
Length = 582
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 113/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKTQLETYISNLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 LERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 301
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K +P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 302 KSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 361
Query: 181 SRI 183
S +
Sbjct: 362 SHV 364
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/228 (42%), Positives = 132/228 (57%), Gaps = 8/228 (3%)
Query: 212 GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-L 268
GE GKA L E + + Y +N+ S+ IS R I D RM ECK + L
Sbjct: 79 GEWGKASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRL 138
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +LE YI
Sbjct: 139 PTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLE 198
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
+VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I D +
Sbjct: 199 -RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRDETAVVC 257
Query: 389 PVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID ID+ T+EF EP G F+W + ++ + +P+ E +RK
Sbjct: 258 PVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRRK 302
>gi|334348070|ref|XP_001368069.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Monodelphis domestica]
Length = 708
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/253 (40%), Positives = 145/253 (57%), Gaps = 19/253 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLP---EAYRAAGDASLGEYGMNMETS 242
L +P++K +PP +P GE G+A HL +A + + +Y +N+ S
Sbjct: 188 LAQPLYK--------KPPPDP--GALGEWGEASHLQLQGDAEKQQAEELTEKYAINIYLS 237
Query: 243 NHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
+ IS R I D RM EC K +DY LP SVI+ F+NE +S+L+RTVHS+++ PA
Sbjct: 238 DRISLHRHIRDDRMYECRLKSFDY-RRLPTTSVIIAFYNEAWSTLLRTVHSVLETAPAVL 296
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
L+EIILVDD S K L +LE YI +VRLIR +REGL+R R GA + GEV+ F
Sbjct: 297 LKEIILVDDLSDKVYLKAQLETYISSLQ-RVRLIRTKKREGLVRARLIGATFATGEVLTF 355
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LD HCE WL PLL I D ++ PVID ID+ T++F + G F+W +
Sbjct: 356 LDCHCECNQGWLEPLLERIGQDESVIICPVIDTIDWNTFDF--YMQEGEPVIGGFDWHLT 413
Query: 421 YKENELPEREAKK 433
++ +PE E ++
Sbjct: 414 FQWQPVPEHERRR 426
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 113/182 (62%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI +VRLIR +REGL+R R GA + GEV+ FLD HCE WL PL
Sbjct: 312 LKAQLETYISSLQ-RVRLIRTKKREGLVRARLIGATFATGEVLTFLDCHCECNQGWLEPL 370
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D ++ PVID ID+ T++F + G F+W + ++ +PE E ++ +
Sbjct: 371 LERIGQDESVIICPVIDTIDWNTFDF--YMQEGEPVIGGFDWHLTFQWQPVPEHERRRWQ 428
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++P KSP AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG++E PCS
Sbjct: 429 SRTDPIKSPVMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGALEIHPCS 488
Query: 182 RI 183
+
Sbjct: 489 HV 490
>gi|224047294|ref|XP_002195048.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Taeniopygia guttata]
Length = 552
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 80/173 (46%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA ++ V+ FLD+HCEV +WL PLL I D + P
Sbjct: 167 KVKCLRNGRREGLIRSRIRGADVAKASVLTFLDSHCEVNKDWLLPLLQRIKEDPTRVVSP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L + KR +EP K+P AGG
Sbjct: 227 VIDIINLDTFAYVAA---SSDLRGGFDWSLHFKWEQLSPEQKAKRLDPTEPIKTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 284 LFVIDKAWFNHLGKYDSAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 336
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/200 (41%), Positives = 113/200 (56%), Gaps = 9/200 (4%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
+ N S I DR + D R C Y DLP SVI+ FHNE S+L+RT+ S++
Sbjct: 77 HAFNQRESERIPSDRAVRDTRHHRCTTLHYSQDLPPTSVIITFHNEARSTLLRTIRSVLN 136
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP + EIILVDDFS D + L KV+ +RN REGLIR+R RGA ++
Sbjct: 137 RTPVHLVHEIILVDDFSDDPDDCRLLGQL-----PKVKCLRNGRREGLIRSRIRGADVAK 191
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
V+ FLD+HCEV +WL PLL I D + PVID I+ T+ + + RG
Sbjct: 192 ASVLTFLDSHCEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAYVAA---SSDLRGG 248
Query: 415 FEWGMLYKENEL-PEREAKK 433
F+W + +K +L PE++AK+
Sbjct: 249 FDWSLHFKWEQLSPEQKAKR 268
>gi|410955524|ref|XP_003984401.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Felis
catus]
Length = 552
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ IRNTER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCIRNTERQGLVRSRIRGASVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I + + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIISLDNFNY---IESAAELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF MD+++F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVMDKSWFEYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/207 (40%), Positives = 119/207 (57%), Gaps = 9/207 (4%)
Query: 228 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 287
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+R
Sbjct: 70 GDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCADLPPTSIIITFHNEARSTLLR 129
Query: 288 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 347
T+ S++ RTP ++EIILVDDFS+ D +L KV+ IRNTER+GL+R+R
Sbjct: 130 TIRSVLNRTPMNLIQEIILVDDFSNDPDDCSQLIKL-----PKVKCIRNTERQGLVRSRI 184
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
RGA ++G + FLD+HCEV +WL PLL + D + PVID I + + E
Sbjct: 185 RGASVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY---IES 241
Query: 408 DHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 242 AAELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|405959954|gb|EKC25926.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 569
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 91/236 (38%), Positives = 140/236 (59%), Gaps = 7/236 (2%)
Query: 209 EGPGEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTIPDLRMEECKYWD 263
+ PGE G Y ++ + + E G N SN IS R++ D R +EC
Sbjct: 59 KAPGELGSPYIFNKSQLTSKEKLEYETGWKKNNFNEFASNRISLQRSLKDPRDKECHNLT 118
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y +LP+ S+I+ FHNE +S L+R+V+SI+ RTP L+E+ILVDDFSS L + L+ +
Sbjct: 119 YSENLPEVSIIVTFHNEAWSVLIRSVYSILNRTPDSLLKEVILVDDFSSLEHLKEPLDQF 178
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+++F KV+++R TER+GLIR R RG +E+ G+V+VFLD+H E W PL+ PI +
Sbjct: 179 MEQFQ-KVKIVRATERQGLIRARLRGYREAVGDVLVFLDSHIECAEGWFEPLIDPIARNW 237
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNRY 439
+ PVID ID +T+++ G F+W +++ + +PE E K+R+ Y
Sbjct: 238 STVMTPVIDVIDKETFQY-GFQAASATNVGGFDWSLMFTWHFVPETEQKRRQNKHY 292
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 121/188 (64%), Gaps = 3/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + L+ ++E+F KV+++R TER+GLIR R RG +E+ G+V+VFLD+H E W PL
Sbjct: 171 LKEPLDQFMEQFQ-KVKIVRATERQGLIRARLRGYREAVGDVLVFLDSHIECAEGWFEPL 229
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
+ PI + + PVID ID +T+++ G F+W +++ + +PE E K+R+
Sbjct: 230 IDPIARNWSTVMTPVIDVIDKETFQY-GFQAASATNVGGFDWSLMFTWHFVPETEQKRRQ 288
Query: 122 YNSE-PYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
P +SPT AGGLFA+ R +F +G YD G+ +WGGEN ELSF+IWMCGG++ PC
Sbjct: 289 NKHYLPVRSPTMAGGLFAISRKYFEHIGTYDEGMDIWGGENLELSFRIWMCGGTLLTAPC 348
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 349 SHVGHVFR 356
>gi|328794283|ref|XP_001122865.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like,
partial [Apis mellifera]
Length = 372
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 141/261 (54%), Gaps = 12/261 (4%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEG---------PGEGGKAYHLPEAYRAAGDASLGEYGM 237
I+PV + K +P L ++ PGE G A H+ A
Sbjct: 71 IQPVGQQSSKKRYKQPELHLWRPARVVRDNKGIPGEMGAAVHISPEDEARQQELFKLNQF 130
Query: 238 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
N+ S+ IS +R++ D+R++ CK Y LP S+++VFHNE +S+L+RTV S+I R+P
Sbjct: 131 NLMASDMISLNRSLKDIRLDGCKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSP 190
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
L+EIILVDD S + L Q LEDY++ + R +R GLIR R GAK +G+V
Sbjct: 191 RTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQV 250
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 417
I FLDAHCE WL PLL+ I DR + P+ID I T+E+ + D + G F W
Sbjct: 251 ITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNW 307
Query: 418 GMLYKENELPEREAKKRKYNR 438
+ ++ + +RE +R +R
Sbjct: 308 KLNFRWYRVAQREMDRRLGDR 328
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/167 (41%), Positives = 102/167 (61%), Gaps = 4/167 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY++ + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 209 LKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 268
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 269 LSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 325
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 167
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+
Sbjct: 326 GDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFR 372
>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
Length = 614
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 116/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE Y++ V+++R ER+GLI R GAK ++ EV+ FLDAHCE WL PL
Sbjct: 216 LKDKLEQYVKELQ-VVKVVRQVERKGLITARLLGAKVAQAEVLTFLDAHCECFHGWLEPL 274
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 275 LARIAEDKTVVVSPDIVTIDLNTFEFSKPVQSGRVHSRGNFDWSLTFGWETLPPHEKQRH 334
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 335 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 394
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 395 SVVGHVFR 402
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/274 (37%), Positives = 147/274 (53%), Gaps = 17/274 (6%)
Query: 179 PCSRIVSLIRPVFKADGKLGNLEP-----PLEPYKEGPGEGGKAY---HLPEAYRAAGDA 230
P +VS+ +P L+P P +P + PG GKA+ + +
Sbjct: 70 PQQNLVSMNQPCLPGFYTPAELKPFWERPPQDP--KSPGADGKAFQKNNWTPLETQEKEE 127
Query: 231 SLGEYGMNMETSNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLM 286
++ N S+ IS R + PD R EC K+ P LP SVI+VFHNE +S+L+
Sbjct: 128 GYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLL 186
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RTV+S++ TPA L+EIILVDD S++ L KLE Y++ V+++R ER+GLI R
Sbjct: 187 RTVYSVLHTTPAILLKEIILVDDASTEDYLKDKLEQYVKELQ-VVKVVRQVERKGLITAR 245
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVY 405
GAK ++ EV+ FLDAHCE WL PLLA I D+ ++ P I ID T+EF + V
Sbjct: 246 LLGAKVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVVSPDIVTIDLNTFEFSKPVQ 305
Query: 406 EPDHHYRGIFEWGMLYKENELPEREAKKRKYNRY 439
H RG F+W + + LP E ++ K Y
Sbjct: 306 SGRVHSRGNFDWSLTFGWETLPPHEKQRHKDETY 339
>gi|339242863|ref|XP_003377357.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316973849|gb|EFV57398.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 383
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 83/199 (41%), Positives = 123/199 (61%), Gaps = 4/199 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L++++ V ++R+ ER GLI+ R GA+++ G+V+ FLD+HCE WL PL
Sbjct: 106 LKKALDNFVLNLPVPVLIVRSKERIGLIQARILGAEKASGDVLTFLDSHCECTEGWLEPL 165
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I DRKI PVID I+ +T++++ + YRG F W + ++ P E K+R
Sbjct: 166 LDRIAFDRKIAVAPVIDVINDETFQYQKGIDV---YRGGFNWNLQFRWYSSPPSELKRRG 222
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++DR FF E+G YD + +WGGEN E+SF+IW CGG +E +PC
Sbjct: 223 NDVTHPVRTPTIAGGLFSIDRQFFFEIGAYDKEMKIWGGENLEMSFRIWQCGGQLEIIPC 282
Query: 181 SRIVSLIRPVFKADGKLGN 199
S + + R D GN
Sbjct: 283 SHVGHVFRKKSPHDFPRGN 301
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/223 (39%), Positives = 131/223 (58%), Gaps = 3/223 (1%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
GE G++ +L + N+ S+ I +RT+ D R C+ Y LP
Sbjct: 2 GELGRSVNLNDNDSKLAKHLFQINQFNIVASDRIPLNRTLIDARRAACRNKTYSSALPTT 61
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKV 331
SVI+VFHNE +S+L+RTV S+I R+P + L+EIILVDD S +A L + L++++ V
Sbjct: 62 SVIIVFHNEAWSTLLRTVFSVINRSPKKLLKEIILVDDCSQRAFLKKALDNFVLNLPVPV 121
Query: 332 RLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVI 391
++R+ ER GLI+ R GA+++ G+V+ FLD+HCE WL PLL I DRKI PVI
Sbjct: 122 LIVRSKERIGLIQARILGAEKASGDVLTFLDSHCECTEGWLEPLLDRIAFDRKIAVAPVI 181
Query: 392 DGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
D I+ +T++++ + YRG F W + ++ P E K+R
Sbjct: 182 DVINDETFQYQKGIDV---YRGGFNWNLQFRWYSSPPSELKRR 221
>gi|297264099|ref|XP_002798960.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Macaca mulatta]
Length = 375
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 83/166 (50%), Positives = 111/166 (66%), Gaps = 4/166 (2%)
Query: 24 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 83
ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK + P+ID I
Sbjct: 3 ERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVISDD 62
Query: 84 TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAGGLFAMDRA 142
T+E+ + D Y G F W + ++ +P+RE +RK + + P ++PT AGGLF++DR
Sbjct: 63 TFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRN 119
Query: 143 FFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F E+G YD G+ +WGGEN E+SF+IW CGGS+E V CS + + R
Sbjct: 120 YFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFR 165
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 63/101 (62%), Gaps = 3/101 (2%)
Query: 338 EREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQ 397
ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA I DRK + P+ID I
Sbjct: 3 ERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVISDD 62
Query: 398 TWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 63 TFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 100
>gi|195471079|ref|XP_002087833.1| GE18238 [Drosophila yakuba]
gi|194173934|gb|EDW87545.1| GE18238 [Drosophila yakuba]
Length = 659
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/197 (46%), Positives = 123/197 (62%), Gaps = 8/197 (4%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L + L+DY+ + F V ++ + ER+GLI R GAK + GEV+VF D+H EV NWLPP
Sbjct: 240 LKQPLDDYVAQHFPHLVTVVHSPERQGLIGARLAGAKVAVGEVMVFFDSHIEVNYNWLPP 299
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKE-NELPEREAKK 119
L+ PI + KI T P++D I ++ + + S + RG F+W MLYK+ LPE K
Sbjct: 300 LIEPIAINPKIATCPMVDTIAHEDFSYFSGNKDG--ARGGFDWKMLYKQLPVLPEDALDK 357
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY+SP GGLFA++ FF +LGGYD L +WGGE +ELSFKIWMCGG + VP
Sbjct: 358 ----SMPYRSPVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGLLLDVP 413
Query: 180 CSRIVSLIRPVFKADGK 196
CSR+ + R K G
Sbjct: 414 CSRVGHIFRGPMKPRGN 430
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 84/189 (44%), Positives = 122/189 (64%), Gaps = 3/189 (1%)
Query: 236 GMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
G N S+ IS +R++PD+R+E CK Y LP SV+ +F NE F++L+R+++S+I R
Sbjct: 160 GFNGLISDRISLNRSVPDIRLEACKTRKYLAKLPNISVVFIFFNEHFNTLLRSIYSVINR 219
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
TP + L +I+LVDD S L Q L+DY+ Q F V ++ + ER+GLI R GAK +
Sbjct: 220 TPPELLRQIVLVDDGSEWDSLKQPLDDYVAQHFPHLVTVVHSPERQGLIGARLAGAKVAV 279
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
GEV+VF D+H EV NWLPPL+ PI + KI T P++D I ++ + + S + RG
Sbjct: 280 GEVMVFFDSHIEVNYNWLPPLIEPIAINPKIATCPMVDTIAHEDFSYFSGNKDG--ARGG 337
Query: 415 FEWGMLYKE 423
F+W MLYK+
Sbjct: 338 FDWKMLYKQ 346
>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
Length = 575
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 115/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 179 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 237
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF + EP G F+W + ++ + +P++E +R
Sbjct: 238 LERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQWHSVPKQERDRR 294
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 295 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 354
Query: 181 SRI 183
S +
Sbjct: 355 SHV 357
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 144/253 (56%), Gaps = 18/253 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSN 243
L RP++K +PP + GE GKA L E + + Y +N+ S+
Sbjct: 56 LSRPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSD 105
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+
Sbjct: 106 RISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLK 165
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD
Sbjct: 166 EIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLD 224
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLY 421
HCE WL PLL I D + PVID ID+ T+EF + EP G F+W + +
Sbjct: 225 CHCECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTF 281
Query: 422 KENELPEREAKKR 434
+ + +P++E +R
Sbjct: 282 QWHSVPKQERDRR 294
>gi|51316092|sp|Q9GM01.1|GALT9_MACFA RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 9;
AltName: Full=Polypeptide GalNAc transferase 9;
Short=GalNAc-T9; Short=pp-GaNTase 9; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 9;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|11041469|dbj|BAB17277.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Macaca
fascicularis]
Length = 606
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/217 (39%), Positives = 127/217 (58%), Gaps = 6/217 (2%)
Query: 204 LEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
+ P E +GG A L + + A + EYG N + S+ IS DR+IPD R +C++
Sbjct: 90 MVPEAEAEAQGGLAATLRDDGQEA-EGKYEEYGYNAQLSDRISLDRSIPDYRPRKCRHMS 148
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y DLP+ SV+ +F NE S ++R+VHS++ TP+Q L+E+ILVDD S +L L+ Y
Sbjct: 149 YAQDLPQVSVVFIFVNEALSVILRSVHSVVNHTPSQLLKEVILVDDNSDNVELKFNLDQY 208
Query: 324 I-QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W P L+ I D
Sbjct: 209 VNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAEPALSRIRED 268
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 419
R+ + +P ID I Y T+E + H Y WG+
Sbjct: 269 RRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWGL 301
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 109/189 (57%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 200 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 259
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 260 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 314
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R S P ++P G F +DR +F ++G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 315 RGDESAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVWQCGGSMEVLP 374
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 375 CSRVAHIER 383
>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
Length = 622
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 116/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R ER GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKERLEQYVQQLQ-IVRVVRQRERGGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ + P I ID T++F R V H RG F+W + + LPE E ++R
Sbjct: 283 LARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG + +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLGIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 100/246 (40%), Positives = 137/246 (55%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ E + ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKKEWTNLETKEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L EIIL+DD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLNEIILMDDASTDE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LE Y+Q+ VR++R ER GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 HLKERLEQYVQQLQ-IVRVVRQRERGGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ + P I ID T++F R V H RG F+W + + LPE E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|12855129|dbj|BAB30220.1| unnamed protein product [Mus musculus]
Length = 431
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 119/188 (63%), Gaps = 4/188 (2%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
L YG+N S + +R +PD R + C+ YP +LP AS+I+ F+NE F++L+R V S
Sbjct: 78 LRRYGLNAIMSRRLGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSS 137
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ +P LEEIILVDD S DL KL+ Y++ F G+V+LIRN +REGLIR++ GA
Sbjct: 138 VVNLSPQHLLEEIILVDDMSEFDDLKDKLDYYLEIFRGEVKLIRNKKREGLIRSKMIGAS 197
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ G+++VFLD+HCEV WL PLL I D K++ P+ID I+ T + Y
Sbjct: 198 RASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIV 253
Query: 412 RGIFEWGM 419
RG F+W +
Sbjct: 254 RGAFDWNL 261
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 82/183 (44%), Positives = 116/183 (63%), Gaps = 4/183 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL KL+ Y+E F G+V+LIRN +REGLIR++ GA + G+++VFLD+HCEV WL P
Sbjct: 161 DLKDKLDYYLEIFRGEVKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEP 220
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL I D K++ P+ID I+ T + Y RG F+W + + + + E
Sbjct: 221 LLHAIAKDHKMVVCPIIDVINELTLD----YMAAPIVRGAFDWNLNLRWDNVFAYELDGP 276
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P +SP GG+FA++R +F ELG YD G+ + GGEN ELS +IWMCGG + +PC
Sbjct: 277 EGPSTPIRSPAMTGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPC 336
Query: 181 SRI 183
SR+
Sbjct: 337 SRV 339
>gi|195474017|ref|XP_002089288.1| GE19032 [Drosophila yakuba]
gi|194175389|gb|EDW89000.1| GE19032 [Drosophila yakuba]
Length = 635
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/182 (47%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 215 LRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 274
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 275 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 330
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 331 FAVNRKYFQYLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 390
Query: 194 DG 195
DG
Sbjct: 391 DG 392
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 119/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R C + +LP+AS+++ F+NE +LM
Sbjct: 109 DIGYKHHAFNALVSNNIGLFRQIPDTRHRVCDRQETLEAENLPQASIVMCFYNEHKMTLM 168
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 169 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 228
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 229 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 284
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 285 YTPSPLVRGGFNWGLHFRWENLPE 308
>gi|194758571|ref|XP_001961535.1| GF14884 [Drosophila ananassae]
gi|190615232|gb|EDV30756.1| GF14884 [Drosophila ananassae]
Length = 627
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 82/172 (47%), Positives = 113/172 (65%), Gaps = 4/172 (2%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N EREGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I ++ + VPV
Sbjct: 211 LRYIKNEEREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKAENSTLAVPV 270
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 271 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 326
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPC+R+ + R
Sbjct: 327 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCARVGHIFR 378
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 119/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP AS+++ F+NE +LM
Sbjct: 105 DIGYKHHAFNALVSNNIGLFRDIPDTRHKVCDRSETFEAENLPHASIVMCFYNEHKMTLM 164
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N EREGLIR
Sbjct: 165 RSIKTVLERTPSYLLREIILVDDNSDLPELEFHLHADLRARLKYPNLRYIKNEEREGLIR 224
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I ++ + VPVID I+ T+E
Sbjct: 225 SRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKAENSTLAVPVIDLINADTFE---- 280
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 281 YTPSPLVRGGFNWGLHFRWENLPE 304
>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
Length = 578
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 112/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKAQLETYISNLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 TSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 93/237 (39%), Positives = 133/237 (56%), Gaps = 6/237 (2%)
Query: 200 LEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC 259
++PP + + G L E + + Y +N+ S+ IS R I D RM EC
Sbjct: 65 IKPPADSHALGEWGRASKLQLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYEC 124
Query: 260 KYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
K + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L
Sbjct: 125 KAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKA 184
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL
Sbjct: 185 QLETYISNLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 243
Query: 379 IYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 244 ISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
Length = 578
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 115/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF + EP G F+W + ++ + +P++E +R
Sbjct: 241 LERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQWHSVPKQERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 144/253 (56%), Gaps = 18/253 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSN 243
L RP++K +PP + GE GKA L E + + Y +N+ S+
Sbjct: 59 LSRPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSD 108
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+
Sbjct: 109 RISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLK 168
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD
Sbjct: 169 EIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLD 227
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLY 421
HCE WL PLL I D + PVID ID+ T+EF + EP G F+W + +
Sbjct: 228 CHCECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTF 284
Query: 422 KENELPEREAKKR 434
+ + +P++E +R
Sbjct: 285 QWHSVPKQERDRR 297
>gi|195397828|ref|XP_002057530.1| GJ18184 [Drosophila virilis]
gi|194141184|gb|EDW57603.1| GJ18184 [Drosophila virilis]
Length = 625
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R +RN REGLIR+R GA+++ G+V+VFLD+H EV WL PLL ++++ + VPV
Sbjct: 208 LRYVRNERREGLIRSRVIGARDANGDVLVFLDSHIEVNRQWLEPLLRLVHAENATLAVPV 267
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 268 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFKGPFRSPTMAGGL 323
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA+ R +F +G YD + +WGGEN E+SF++W CGG+I+ VPCSR+ + RP
Sbjct: 324 FAVSRLYFQHIGEYDMAMDIWGGENIEISFRVWQCGGAIKIVPCSRVGHIFRKRRPYTAP 383
Query: 194 DG 195
DG
Sbjct: 384 DG 385
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/208 (41%), Positives = 122/208 (58%), Gaps = 8/208 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD--YPLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + CK + P LP+ASVI+ F+NE +LM
Sbjct: 102 DIGYKHHAFNALVSNNIGLYRDIPDTRHKVCKREEPLEPEQLPQASVIMCFYNEHKMTLM 161
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLD-QKLEDYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP L+EIILVDD S +L+ L D R +R +RN REGLIR
Sbjct: 162 RSIKTVLERTPGHLLKEIILVDDNSDLPELEFHLLADLHARLKYDNLRYVRNERREGLIR 221
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+++ G+V+VFLD+H EV WL PLL ++++ + VPVID I+ T+E
Sbjct: 222 SRVIGARDANGDVLVFLDSHIEVNRQWLEPLLRLVHAENATLAVPVIDLINADTFE---- 277
Query: 405 YEPDHHYRGIFEWGMLYKENELPEREAK 432
Y P RG F WG+ ++ LPE K
Sbjct: 278 YTPSPLVRGGFNWGLHFRWENLPEGTLK 305
>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
Length = 504
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 118/189 (62%), Gaps = 7/189 (3%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVRLIRN++REGL+R+R GA + +V+ FLD+HCE +NWL PLLA + D + P
Sbjct: 125 KVRLIRNSKREGLVRSRVTGAAAATAKVLTFLDSHCECNVNWLEPLLARVAEDPTRVVCP 184
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 134
VID I T+++ RG F+W +++K L E K R+ + + P ++P AG
Sbjct: 185 VIDVISMDTFQYIGA---SADLRGGFDWNLVFKWEYLSNAERKARQRDPTAPIRTPMIAG 241
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR---PVF 191
GLF +D+A+F LG YD + +WGGEN E+SF++W CGGS+E +PCSR+ + R P
Sbjct: 242 GLFVIDKAYFERLGTYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYT 301
Query: 192 KADGKLGNL 200
G GN+
Sbjct: 302 FPGGGSGNI 310
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 118/204 (57%), Gaps = 13/204 (6%)
Query: 237 MNMETSNHISFDRTIPDLRMEECKYWDYP-----LDLPKASVILVFHNEGFSSLMRTVHS 291
N + S+ + +R +PD R C+ + LP SVI+ FHNE S+L+RTV S
Sbjct: 32 FNQQASDGLKSNRELPDTRNAMCRRSSWSDLSTIAHLPATSVIITFHNEARSTLLRTVVS 91
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ R+P + + EIILVDD+S + Q+L IQ KVRLIRN++REGL+R+R GA
Sbjct: 92 VLNRSPERLIHEIILVDDYSDFPEDGQELAK-IQ----KVRLIRNSKREGLVRSRVTGAA 146
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ +V+ FLD+HCE +NWL PLLA + D + PVID I T+++
Sbjct: 147 AATAKVLTFLDSHCECNVNWLEPLLARVAEDPTRVVCPVIDVISMDTFQYIGA---SADL 203
Query: 412 RGIFEWGMLYKENELPEREAKKRK 435
RG F+W +++K L E K R+
Sbjct: 204 RGGFDWNLVFKWEYLSNAERKARQ 227
>gi|13938655|gb|AAH07484.1| Galnt7 protein, partial [Mus musculus]
Length = 362
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/160 (51%), Positives = 109/160 (68%), Gaps = 7/160 (4%)
Query: 47 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWG 104
LDAHCEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W
Sbjct: 5 LDAHCEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWS 64
Query: 105 MLYKENELPEREAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFEL 164
ML+K L RE + RK +EPY+SP AGGLFA+++ FF ELG YDPGL +WGGENFE+
Sbjct: 65 MLWKRVPLTSREKRLRKTKTEPYRSPAMAGGLFAIEKDFFFELGLYDPGLQIWGGENFEI 124
Query: 165 SFKIWMCGGSIEWVPCSRIVSLIRPVFKADGKLGNLEPPL 204
S+KIW CGG + +VPCSR+ +++ +G GN PPL
Sbjct: 125 SYKIWQCGGKLLFVPCSRVGH----IYRLEGWQGN-PPPL 159
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 37/77 (48%), Positives = 48/77 (62%), Gaps = 2/77 (2%)
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF--RSVYEPDHHYRGIFEWG 418
LDAHCEV +NW PL+API DR I TVP+ID I T+E + + D + RG ++W
Sbjct: 5 LDAHCEVAVNWYAPLVAPISKDRTICTVPIIDVISGNTYEIIPQGGGDEDGYARGAWDWS 64
Query: 419 MLYKENELPEREAKKRK 435
ML+K L RE + RK
Sbjct: 65 MLWKRVPLTSREKRLRK 81
>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
Length = 578
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 115/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF + EP G F+W + ++ + +P++E +R
Sbjct: 241 LERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTFQWHSVPKQERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 101/253 (39%), Positives = 144/253 (56%), Gaps = 18/253 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSN 243
L RP++K +PP + GE GKA L E + + Y +N+ S+
Sbjct: 59 LSRPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSD 108
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+
Sbjct: 109 RISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLK 168
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD
Sbjct: 169 EIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLD 227
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLY 421
HCE WL PLL I D + PVID ID+ T+EF + EP G F+W + +
Sbjct: 228 CHCECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQIGEP---MIGGFDWRLTF 284
Query: 422 KENELPEREAKKR 434
+ + +P++E +R
Sbjct: 285 QWHSVPKQERDRR 297
>gi|350582569|ref|XP_003481303.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Sus scrofa]
Length = 552
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADAAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I T+++ E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIIHLDTFDY---IESATELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF MD+++F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVMDKSWFDYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 121/209 (57%), Gaps = 11/209 (5%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S ++ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERVASNRVVPDTRLFRCTLLVYCADLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 345
RTV SI+ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 129 RTVRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 182
Query: 346 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 405
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+++
Sbjct: 183 RIRGADAAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFDY---I 239
Query: 406 EPDHHYRGIFEWGMLYKENEL-PEREAKK 433
E RG F+W + ++ +L PE++A++
Sbjct: 240 ESATELRGGFDWSLHFQWEQLTPEQKARR 268
>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis florea]
Length = 605
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 130/228 (57%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 103 PGEMGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYSKYLPD 162
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LE Y++R
Sbjct: 163 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEHYVKRLPVP 222
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 223 TYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 282
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 283 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDR 327
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 116/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y++R + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 208 LKQDLEHYVKRLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 267
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 268 LSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 324
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 325 GDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 384
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 385 SHVGHVFR 392
>gi|432096766|gb|ELK27344.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Myotis
davidii]
Length = 507
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN +REGL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 122 KVKCLRNDQREGLVRSRIRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 181
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R SEP ++P AGG
Sbjct: 182 VIDIINLDTFSY---IESATELRGGFDWSLHFQWEQLSPEQKAQRLDPSEPIRTPIIAGG 238
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF MD+++F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 239 LFVMDKSWFNFLGKYDMDMDIWGGENFEMSFRVWMCGGSLEIVPCSRVGHVFR 291
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 121/208 (58%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD + N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 24 VGDDPYKLHAFNQRESERISSNRAIPDTRHLRCTLLMYCRDLPPTSIIITFHNEARSTLL 83
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP ++EIILVDDFS+ E+ I+ KV+ +RN +REGL+R+R
Sbjct: 84 RTIRSVLNRTPMNLIKEIILVDDFSNDPG---DCEELIKL--PKVKCLRNDQREGLVRSR 138
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ S E
Sbjct: 139 IRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTF---SYIE 195
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 196 SATELRGGFDWSLHFQWEQLSPEQKAQR 223
>gi|440907821|gb|ELR57918.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Bos
grunniens mutus]
Length = 509
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 124 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 183
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 184 VIDIIHLDTFNY---IESASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGG 240
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF MD+++F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 241 LFVMDKSWFYYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 293
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 120/209 (57%), Gaps = 11/209 (5%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S I+ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 26 VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 85
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 345
RT+ SI+ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 86 RTIRSILNRTPMNLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 139
Query: 346 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 405
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+ +
Sbjct: 140 RIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---I 196
Query: 406 EPDHHYRGIFEWGMLYKENEL-PEREAKK 433
E RG F+W + ++ +L PE++A++
Sbjct: 197 ESASELRGGFDWSLHFQWEQLTPEQKARR 225
>gi|198473174|ref|XP_001356196.2| GA20382 [Drosophila pseudoobscura pseudoobscura]
gi|198139336|gb|EAL33256.2| GA20382 [Drosophila pseudoobscura pseudoobscura]
Length = 617
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/182 (45%), Positives = 118/182 (64%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R ++N +REGLIR+R GA+E+ G+V+VFLD+H EV WL PLL I ++ + VPV
Sbjct: 206 LRYVKNNKREGLIRSRVIGAREAVGDVLVFLDSHIEVNRQWLEPLLRLIKAENASLAVPV 265
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 266 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 321
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F ++G YD + +WGGEN E+SF++W CGG+I+ VPCSR+ + RP
Sbjct: 322 FAVNRLYFQDIGEYDMAMDIWGGENIEISFRVWQCGGAIKIVPCSRVGHIFRKRRPYTAP 381
Query: 194 DG 195
DG
Sbjct: 382 DG 383
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 120/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + LP+ASVI+ F+NE +LM
Sbjct: 100 DIGYKHHAFNALVSNNIGLYREIPDTRHKVCDRSETSEAEQLPQASVIMCFYNEHKMTLM 159
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R+V ++++RTP+ L+EI+LVDD S +L+ L D R +R ++N +REGLIR
Sbjct: 160 RSVRTVLERTPSYLLKEIVLVDDHSDLPELEFHLHGDLRARLKYDNLRYVKNNKREGLIR 219
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV WL PLL I ++ + VPVID I+ T+E
Sbjct: 220 SRVIGAREAVGDVLVFLDSHIEVNRQWLEPLLRLIKAENASLAVPVIDLINADTFE---- 275
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 276 YTPSPLVRGGFNWGLHFRWENLPE 299
>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis mellifera]
Length = 606
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 97/261 (37%), Positives = 141/261 (54%), Gaps = 12/261 (4%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEG---------PGEGGKAYHLPEAYRAAGDASLGEYGM 237
I+PV + K +P L ++ PGE G A H+ A
Sbjct: 71 IQPVGQQSSKKRYKQPELHLWRPARVVRDNKGIPGEMGAAVHISPEDEARQQELFKLNQF 130
Query: 238 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
N+ S+ IS +R++ D+R++ CK Y LP S+++VFHNE +S+L+RTV S+I R+P
Sbjct: 131 NLMASDMISLNRSLKDIRLDGCKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSP 190
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
L+EIILVDD S + L Q LEDY++ + R +R GLIR R GAK +G+V
Sbjct: 191 RTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQV 250
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 417
I FLDAHCE WL PLL+ I DR + P+ID I T+E+ + D + G F W
Sbjct: 251 ITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNW 307
Query: 418 GMLYKENELPEREAKKRKYNR 438
+ ++ + +RE +R +R
Sbjct: 308 KLNFRWYRVAQREMDRRLGDR 328
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 116/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY++ + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 209 LKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 268
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 269 LSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 325
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 326 GDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 385
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 386 SHVGHVFR 393
>gi|170056941|ref|XP_001864259.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876546|gb|EDS39929.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 606
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/187 (44%), Positives = 113/187 (60%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+ KV++I ER GLI R GAK + +V++FLD+H E +NWLPPL
Sbjct: 190 LKDQLDRYVAENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPL 249
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D + P ID + + T+E+R+ D RG F+W YK L ++
Sbjct: 250 LEPIAEDYRTCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYKRLPLLPKDLAN-- 304
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCS
Sbjct: 305 -PTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGQMYDAPCS 363
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 364 RVGHIYR 370
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 88/218 (40%), Positives = 127/218 (58%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK HL + D + G N S+ IS +R++PD+R CK Y
Sbjct: 79 ERSRSGVGEHGKPGHLEKKDEEMQDKLFKKNGFNAVLSDLISLNRSLPDIRHPGCKKKKY 138
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L +L+ Y+
Sbjct: 139 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELISEIILVDDCSTKEFLKDQLDRYV 198
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV++I ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D +
Sbjct: 199 AENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 258
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID + + T+E+R+ D RG F+W YK
Sbjct: 259 TCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYK 293
>gi|195492881|ref|XP_002094181.1| GE20340 [Drosophila yakuba]
gi|194180282|gb|EDW93893.1| GE20340 [Drosophila yakuba]
Length = 666
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 90/187 (48%), Positives = 117/187 (62%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE YI VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPL
Sbjct: 249 LGKELETYIAEHFKWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI +++ P ID ID+ + +R+ D RG F+W YK L E + K
Sbjct: 309 LEPIALNKRTAVCPFIDVIDHTNFNYRAQ---DEGARGAFDWEFFYKRLPLLEEDLK--- 362
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCS
Sbjct: 363 HPADPFKSPVMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCS 422
Query: 182 RIVSLIR 188
RI + R
Sbjct: 423 RIGHIYR 429
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 94/224 (41%), Positives = 131/224 (58%), Gaps = 4/224 (1%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GE GKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEKGKAATLDDESQRDLEKQKSLENGFNALLSDSISVNRSLPDIRHPLCRKKEYVAKL 201
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
P ID ID+ + +R+ D RG F+W YK L E + K
Sbjct: 322 PFIDVIDHTNFNYRA---QDEGARGAFDWEFFYKRLPLLEEDLK 362
>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
Length = 545
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 126/192 (65%), Gaps = 4/192 (2%)
Query: 1 DLDKKLEDYIERFNG--KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 58
+L L+DY+ R G KV+++ +REGLIR R RGA+++ G V+ FLDAH E + W+
Sbjct: 144 ELKGHLDDYVRRQVGWDKVKVVHLEKREGLIRCRLRGAEKAVGPVLTFLDAHIECNVGWV 203
Query: 59 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPEREA 117
PLL I+ +R + +P+I+ ID +T+E+ V + RG F W + + +PE E
Sbjct: 204 EPLLHRIWENRSNVVMPIIEAIDDKTFEYHGGVQSSRYAQRGGFSWELHFDWRVIPEYEI 263
Query: 118 KKRKYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIE 176
K+ K + + P +SPT AGGLF++D+++F ELG YD + WGGEN ELSFKIWMCGG++E
Sbjct: 264 KRWKGDETTPIRSPTMAGGLFSIDKSYFYELGTYDDKMDTWGGENLELSFKIWMCGGTLE 323
Query: 177 WVPCSRIVSLIR 188
PCS++ + R
Sbjct: 324 QPPCSKVGHVFR 335
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 81/185 (43%), Positives = 117/185 (63%), Gaps = 3/185 (1%)
Query: 254 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
R CK YP LP SVI+ F +E FS++MR+VHSII RTP L E+ILVDD S++
Sbjct: 83 CRQVRCKTKKYPEYLPPTSVIMCFTDEAFSAVMRSVHSIINRTPPHLLAEVILVDDNSTR 142
Query: 314 ADLDQKLEDYIQRFNG--KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
A+L L+DY++R G KV+++ +REGLIR R RGA+++ G V+ FLDAH E + W
Sbjct: 143 AELKGHLDDYVRRQVGWDKVKVVHLEKREGLIRCRLRGAEKAVGPVLTFLDAHIECNVGW 202
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPERE 430
+ PLL I+ +R + +P+I+ ID +T+E+ V + RG F W + + +PE E
Sbjct: 203 VEPLLHRIWENRSNVVMPIIEAIDDKTFEYHGGVQSSRYAQRGGFSWELHFDWRVIPEYE 262
Query: 431 AKKRK 435
K+ K
Sbjct: 263 IKRWK 267
>gi|345782166|ref|XP_540140.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Canis
lupus familiaris]
Length = 552
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ IRN+ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCIRNSERQGLVRSRIRGANVAKGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I + + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIISLDNFNY---IESAAELRGGFDWSLHFQWEQLSPEQKARRLDPAEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF MD+++F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVMDKSWFNYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 119/209 (56%), Gaps = 11/209 (5%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSSRAVPDTRHLRCTMLVYCADLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 345
RT+ S++ RTP ++EIILVDDFS+ D D +Q KV+ IRN+ER+GL+R+
Sbjct: 129 RTIRSVLNRTPMNLIQEIILVDDFSNDPD------DCLQLIKLPKVKCIRNSERQGLVRS 182
Query: 346 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 405
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I + +
Sbjct: 183 RIRGANVAKGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNY---I 239
Query: 406 EPDHHYRGIFEWGMLYKENEL-PEREAKK 433
E RG F+W + ++ +L PE++A++
Sbjct: 240 ESAAELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|312065523|ref|XP_003135832.1| glycosyl transferase [Loa loa]
gi|307769015|gb|EFO28249.1| glycosyl transferase [Loa loa]
Length = 614
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 118/189 (62%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LE+++++ N V+++R +REGLIR R GA+ +VIVFLDAH E NWLP
Sbjct: 202 LKQPLEEFLKKAGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLP 261
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+ PI D + + P+ID ID T+E+R+ D RG F+W YK L E
Sbjct: 262 PLVEPIALDYRTVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DN 315
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+K + P+ +P AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ P
Sbjct: 316 KKNPTRPFHNPVMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAP 375
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 376 CSRVGHIYR 384
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 93/234 (39%), Positives = 136/234 (58%), Gaps = 8/234 (3%)
Query: 208 KEGPGEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 266
++G GEGG+ A E ++ D G N S+ I+ +R+I D+R C+ Y
Sbjct: 93 RQGLGEGGQPAVVAVEEFKKLRDGLYRSNGYNAYISDFIALNRSIKDIRHSGCRNMVYLE 152
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LP V+ HNE S+L+R+++S+I R+P ++E+ILVDD S+K L Q LE+++++
Sbjct: 153 KLPTVGVVFPIHNEHNSTLLRSIYSVINRSPKDIMKEVILVDDGSTKPFLKQPLEEFLKK 212
Query: 327 --FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
N V+++R +REGLIR R GA+ +VIVFLDAH E NWLPPL+ PI D +
Sbjct: 213 AGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLPPLVEPIALDYR 272
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+ P+ID ID T+E+R+ D RG F+W YK LP E K+ R
Sbjct: 273 TVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYK--RLPLTEDNKKNPTR 321
>gi|351702714|gb|EHB05633.1| Polypeptide N-acetylgalactosaminyltransferase 14 [Heterocephalus
glaber]
Length = 553
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 114/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN+ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 168 KVKCLRNSERQGLVRSRMRGADIAQGATLTFLDSHCEVNRDWLEPLLHRVKEDYTRVVCP 227
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 228 VIDIINLDTFTY---IESASELRGGFDWSLHFRWEQLSPEQKARRLDPTEPIRTPIIAGG 284
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 285 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 337
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS R +PD R C Y LP S+I+ FHNE S+L+
Sbjct: 70 VGDDPYKLYAFNQRESERISSHRAVPDTRHPRCMLLVYHTALPPTSIIITFHNEARSTLL 129
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP ++EIILVDDFS+ D ++L KV+ +RN+ER+GL+R+R
Sbjct: 130 RTIRSVLNRTPMHLIQEIILVDDFSNDPDDCKQLVRL-----PKVKCLRNSERQGLVRSR 184
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 185 MRGADIAQGATLTFLDSHCEVNRDWLEPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 241
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 242 SASELRGGFDWSLHFRWEQLSPEQKARR 269
>gi|327279823|ref|XP_003224655.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Anolis carolinensis]
Length = 941
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/188 (45%), Positives = 120/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL+ Y+ +F KVR++ ER GLIR R GA+ ++G+V+ FLD+H E + WL PL
Sbjct: 544 LKDKLDKYMAQF-PKVRILHLKERYGLIRARLAGAEIAKGDVLTFLDSHVECNVGWLEPL 602
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I+ +RK + PVI+ I + + +V D+ RGIF W M + +P +K K
Sbjct: 603 LERIHLNRKKVPCPVIEVISDKDMSYMTV---DNFQRGIFNWPMNFGWKPIPPDVIEKNK 659
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG IE +PC
Sbjct: 660 IKETDVIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIIPC 719
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 720 SRVGHIFR 727
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 91/249 (36%), Positives = 135/249 (54%), Gaps = 10/249 (4%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHIS 246
+ VF D G +P G+ G+ +P + E N+ S+ I
Sbjct: 421 VHKVFSIDKTFGPRDP------NAAGQFGRPAVVPNEKQEEAKRRWNEGNFNVYLSDMIP 474
Query: 247 FDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 306
DR I D R C DLP S+I+ F +E +S+L+R+VHS++ R+P Q ++EIIL
Sbjct: 475 IDRAIDDTRPIGCSDILVHNDLPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPQLIKEIIL 534
Query: 307 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 366
VDDFS+K L KL+ Y+ +F KVR++ ER GLIR R GA+ ++G+V+ FLD+H E
Sbjct: 535 VDDFSTKEYLKDKLDKYMAQF-PKVRILHLKERYGLIRARLAGAEIAKGDVLTFLDSHVE 593
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 426
+ WL PLL I+ +RK + PVI+ I + + +V D+ RGIF W M + +
Sbjct: 594 CNVGWLEPLLERIHLNRKKVPCPVIEVISDKDMSYMTV---DNFQRGIFNWPMNFGWKPI 650
Query: 427 PEREAKKRK 435
P +K K
Sbjct: 651 PPDVIEKNK 659
>gi|327283203|ref|XP_003226331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Anolis carolinensis]
Length = 644
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/187 (44%), Positives = 115/187 (61%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R ++CK YP DL + SV+ +F NE S ++R+VHS++
Sbjct: 119 EYGYNAQLSDRISLDRSIPDYRPKKCKQMSYPSDLSQISVVFIFVNEALSVILRSVHSVV 178
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+ L+EIILVDD S +L L+ Y+Q R+ G V+++RN +REGLIR R +G K
Sbjct: 179 NHTPSHLLKEIILVDDNSDNVELKFNLDQYVQKRYPGLVKIVRNNKREGLIRARIQGWKA 238
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E + WL P L I DRK + +P ID I Y T+E + H Y
Sbjct: 239 ATSPVVGFFDAHVEFNIGWLEPTLMRIREDRKRIILPAIDNIKYNTFEVQQYANAAHGY- 297
Query: 413 GIFEWGM 419
WG+
Sbjct: 298 ---NWGL 301
Score = 144 bits (363), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/189 (39%), Positives = 110/189 (58%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y++ R+ G V+++RN +REGLIR R +G K + V+ F DAH E + WL
Sbjct: 200 ELKFNLDQYVQKRYPGLVKIVRNNKREGLIRARIQGWKAATSPVVGFFDAHVEFNIGWLE 259
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L I DRK + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 260 PTLMRIREDRKRIILPAIDNIKYNTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 314
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ S P ++P G F +DR +F E+G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 315 KGDESAPIRTPAMIGCSFVVDREYFGEIGLLDPGMEVYGGENIELGMRVWQCGGSMEVLP 374
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 375 CSRVAHIER 383
>gi|339239855|ref|XP_003375853.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316975462|gb|EFV58902.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 625
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/184 (45%), Positives = 117/184 (63%), Gaps = 2/184 (1%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
+L+ Y++R G V+++R +REGLIR R GA + G V+ FLD+HCE WLPPLL+
Sbjct: 253 QLDVYMDRL-GIVKIVRLQKREGLIRARLSGAAVAIGPVLTFLDSHCECLEGWLPPLLSR 311
Query: 65 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 124
I + + PVID ID T+++ + G F+W + + + +PER K R +
Sbjct: 312 IKENWSNVVCPVIDVIDDDTFKYHCGKSWMTNVGG-FDWNLQFNWHPIPERVRKSRSDPT 370
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P +SPT AGGLF++D+ +F LG YDPG +WGGEN ELSFK+WMCGG +E VPCS +
Sbjct: 371 APVESPTMAGGLFSIDKQYFQHLGTYDPGFDIWGGENLELSFKVWMCGGKLEIVPCSHVG 430
Query: 185 SLIR 188
+ R
Sbjct: 431 HIFR 434
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 84/176 (47%), Gaps = 7/176 (3%)
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL----EEIILVDDFSSKADLD-Q 318
Y +D+ K++ + F S T + +I F S L
Sbjct: 193 YDVDVTKSNTMTTFQRRRLSFAFTTKRGVFCFALCTAFCPGRRQICFKKLFLSTISLTLN 252
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+L+ Y+ R G V+++R +REGLIR R GA + G V+ FLD+HCE WLPPLL+
Sbjct: 253 QLDVYMDRL-GIVKIVRLQKREGLIRARLSGAAVAIGPVLTFLDSHCECLEGWLPPLLSR 311
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
I + + PVID ID T+++ + G F+W + + + +PER K R
Sbjct: 312 IKENWSNVVCPVIDVIDDDTFKYHCGKSWMTNVGG-FDWNLQFNWHPIPERVRKSR 366
>gi|170039452|ref|XP_001847548.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863025|gb|EDS26408.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 606
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/187 (44%), Positives = 113/187 (60%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L+ Y+ KV++I ER GLI R GAK + +V++FLD+H E +NWLPPL
Sbjct: 190 LKDQLDRYVAENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPL 249
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI D + P ID + + T+E+R+ D RG F+W YK L ++
Sbjct: 250 LEPIAEDYRTCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYKRLPLLPKDLAN-- 304
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCS
Sbjct: 305 -PTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGQMYDAPCS 363
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 364 RVGHIYR 370
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 88/218 (40%), Positives = 127/218 (58%), Gaps = 3/218 (1%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E + G GE GK HL + D + G N S+ IS +R++PD+R CK Y
Sbjct: 79 ERSRSGVGEHGKPGHLEKKDEEMQDKLFKKNGFNAVLSDLISLNRSLPDIRHPGCKKKKY 138
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+LP SV++ F+NE +S+L+RT S++ R+P + + EIILVDD S+K L +L+ Y+
Sbjct: 139 LSELPTVSVVVPFYNEHWSTLLRTASSVLLRSPPELISEIILVDDCSTKEFLKDQLDRYV 198
Query: 325 QRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
KV++I ER GLI R GAK + +V++FLD+H E +NWLPPLL PI D +
Sbjct: 199 AENMPKVKVIHLPERSGLITARLAGAKAATADVLIFLDSHTEANVNWLPPLLEPIAEDYR 258
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID + + T+E+R+ D RG F+W YK
Sbjct: 259 TCVCPFIDVVAWDTFEYRA---QDEGARGAFDWKFYYK 293
>gi|332030446|gb|EGI70134.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Acromyrmex
echinatior]
Length = 595
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 118/188 (62%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+DY++ + R++R+ ER GLI+ R GA +++GEV+ FLDAHCE + WL PL
Sbjct: 211 LKNSLDDYVKNLSVSTRVLRSNERIGLIKARLLGANDAKGEVLTFLDAHCECTIGWLEPL 270
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + + + PVID I+ T+ + +E H+ G F W + ++ L R K+R+
Sbjct: 271 LEAVGKNATRIVAPVIDIINDNTFSYTRSFE--LHW-GAFNWDLHFRWLTLNGRLLKERR 327
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
N EP+++P AGGLF+M+R +F +LG YD + +WGGEN ELSF+ W CGGSIE PC
Sbjct: 328 DNIVEPFRTPAMAGGLFSMNRDYFFKLGSYDDQMRIWGGENLELSFRAWQCGGSIEIAPC 387
Query: 181 SRIVSLIR 188
S + L R
Sbjct: 388 SHVGHLFR 395
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 97/255 (38%), Positives = 149/255 (58%), Gaps = 10/255 (3%)
Query: 185 SLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNH 244
SL P D NL PL P++ GE H + Y+ + + N+ S+
Sbjct: 83 SLKIPNITKDVINQNLFNPL-PFEGKNGEP-VVIHPKDFYKMQQLYQINRF--NLMASDK 138
Query: 245 ISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
I +R++PD+R ++C +Y + +LPK S+I+VFHNE +S+L+RTVHS+I R+P + LE
Sbjct: 139 IPLNRSLPDVRKKKCISRYTNLG-NLPKTSIIIVFHNEAWSTLLRTVHSVINRSPKELLE 197
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDD S + L L+DY++ + R++R+ ER GLI+ R GA +++GEV+ FLD
Sbjct: 198 EIILVDDNSEREFLKNSLDDYVKNLSVSTRVLRSNERIGLIKARLLGANDAKGEVLTFLD 257
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
AHCE + WL PLL + + + PVID I+ T+ + +E H+ G F W + ++
Sbjct: 258 AHCECTIGWLEPLLEAVGKNATRIVAPVIDIINDNTFSYTRSFE--LHW-GAFNWDLHFR 314
Query: 423 ENELPEREAKKRKYN 437
L R K+R+ N
Sbjct: 315 WLTLNGRLLKERRDN 329
>gi|345321967|ref|XP_001514624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2 [Ornithorhynchus anatinus]
Length = 484
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 81/203 (39%), Positives = 127/203 (62%), Gaps = 6/203 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L DY+ + +G V+L+R+ +R G+IR R GA + G+V+VF+D+HCE WL PL
Sbjct: 66 LKSALSDYVSKLDG-VKLLRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECHRGWLEPL 124
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I S+R + P++D ID++T+++ + + +G+F+W + + LPE++ K R+
Sbjct: 125 LSRIASNRNRVVTPILDVIDWKTFQY---FHSEDLQQGVFDWKLDFHWELLPEQKRKVRQ 181
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SP GG+ AMDR +F G YD + +WGGEN ELS ++W+CGGS+E +PCS
Sbjct: 182 SPISPIRSPVVPGGVMAMDRHYFQNTGAYDSLMTLWGGENLELSIRVWLCGGSVEVLPCS 241
Query: 182 RIVSLIRPVFKADGKLGNLEPPL 204
R+ + R +A L N E L
Sbjct: 242 RVGHVYRN--QASDTLPNQEAIL 262
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/177 (41%), Positives = 117/177 (66%), Gaps = 4/177 (2%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C+ Y +LP AS+I+ FHNE +S+L+RTVHSI+ P +L+EIILVDD S + L
Sbjct: 9 CRQQHYGDNLPSASIIICFHNEAWSTLLRTVHSILDTVPRPHLKEIILVDDLSQQDHLKS 68
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
L DY+ + +G V+L+R+ +R G+IR R GA + G+V+VF+D+HCE WL PLL+
Sbjct: 69 ALSDYVSKLDG-VKLLRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECHRGWLEPLLSR 127
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
I S+R + P++D ID++T+++ + + +G+F+W + + LPE++ K R+
Sbjct: 128 IASNRNRVVTPILDVIDWKTFQY---FHSEDLQQGVFDWKLDFHWELLPEQKRKVRQ 181
>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
norvegicus]
Length = 578
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 112/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRL R +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKAQLEAYISNLD-RVRLTRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 TSRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 96/256 (37%), Positives = 140/256 (54%), Gaps = 14/256 (5%)
Query: 181 SRIVSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNME 240
+R L +P++K +PP + + G L E + + Y +N+
Sbjct: 54 TREEDLSQPLYK--------KPPADSHALGEWGRASKLQLDEGELKQQEELIERYAINIY 105
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S+ IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA
Sbjct: 106 LSDRISLHRHIEDKRMYECKAKKFHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAV 165
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S + L +LE YI + +VRL R +REGL+R R GA + G+V+
Sbjct: 166 LLKEIILVDDLSDRIYLKAQLEAYISNLD-RVRLTRTNKREGLVRARLIGATFATGDVLT 224
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWG 418
FLD HCE WL PLL I D + PVID ID+ T+EF EP G F+W
Sbjct: 225 FLDCHCECNTGWLEPLLERISRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWR 281
Query: 419 MLYKENELPEREAKKR 434
+ ++ + +P+ E +R
Sbjct: 282 LTFQWHSVPKHERDRR 297
>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
troglodytes]
gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
paniscus]
gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
Length = 578
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P++E +R
Sbjct: 241 LERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKQERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 140/251 (55%), Gaps = 14/251 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
L RP++K +PP + + G L E + + Y +N+ S+ I
Sbjct: 59 LSRPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110
Query: 246 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 423
CE WL PLL I D + PVID ID+ T+EF EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286
Query: 424 NELPEREAKKR 434
+ +P++E +R
Sbjct: 287 HSVPKQERDRR 297
>gi|393910975|gb|EJD76111.1| glycosyl transferase, variant [Loa loa]
Length = 549
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 85/189 (44%), Positives = 118/189 (62%), Gaps = 8/189 (4%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LE+++++ N V+++R +REGLIR R GA+ +VIVFLDAH E NWLP
Sbjct: 202 LKQPLEEFLKKAGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLP 261
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+ PI D + + P+ID ID T+E+R+ D RG F+W YK L E
Sbjct: 262 PLVEPIALDYRTVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DN 315
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+K + P+ +P AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+W C G++ P
Sbjct: 316 KKNPTRPFHNPVMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKVWQCHGTMVDAP 375
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 376 CSRVGHIYR 384
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 93/234 (39%), Positives = 136/234 (58%), Gaps = 8/234 (3%)
Query: 208 KEGPGEGGK-AYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPL 266
++G GEGG+ A E ++ D G N S+ I+ +R+I D+R C+ Y
Sbjct: 93 RQGLGEGGQPAVVAVEEFKKLRDGLYRSNGYNAYISDFIALNRSIKDIRHSGCRNMVYLE 152
Query: 267 DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQR 326
LP V+ HNE S+L+R+++S+I R+P ++E+ILVDD S+K L Q LE+++++
Sbjct: 153 KLPTVGVVFPIHNEHNSTLLRSIYSVINRSPKDIMKEVILVDDGSTKPFLKQPLEEFLKK 212
Query: 327 --FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRK 384
N V+++R +REGLIR R GA+ +VIVFLDAH E NWLPPL+ PI D +
Sbjct: 213 AGLNHIVKVVRTQKREGLIRARQIGARHVTADVIVFLDAHSETNYNWLPPLVEPIALDYR 272
Query: 385 IMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
+ P+ID ID T+E+R+ D RG F+W YK LP E K+ R
Sbjct: 273 TVVCPLIDVIDCDTYEYRA---QDEGGRGSFDWEFNYK--RLPLTEDNKKNPTR 321
>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Nomascus leucogenys]
Length = 578
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P++E +R
Sbjct: 241 LERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKQERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 140/251 (55%), Gaps = 14/251 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
L RP++K +PP + + G L E + + Y +N+ S+ I
Sbjct: 59 LSRPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110
Query: 246 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 423
CE WL PLL I D + PVID ID+ T+EF EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286
Query: 424 NELPEREAKKR 434
+ +P++E +R
Sbjct: 287 HSVPKQERDRR 297
>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
Length = 611
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+ YI++F V L+ ER GLIR R G+ ++G++++FLDAH EV WL PL
Sbjct: 218 LVKPLDAYIKKFPIPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPL 277
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
+ + DRK + P+ID I T+E+ + E G F W + ++ +P+RE +R
Sbjct: 278 VHRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYAVPKRELNRRG 334
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PC
Sbjct: 335 SDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPC 394
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 395 SRVGHVFR 402
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 135/231 (58%), Gaps = 7/231 (3%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW----DYPLD 267
G+GG +PE ++ + E N+ S IS +RT+PD R E C+ +
Sbjct: 110 GQGGTGVTVPEDQKSIKEKRFLENQFNVVASEMISVNRTLPDYRSEACRNAAGNEKTTVG 169
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+I+VFHNE +++L+RT+HS+I R+P LEEII++DD S + L + L+ YI++F
Sbjct: 170 LPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKKF 229
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
V L+ ER GLIR R G+ ++G++++FLDAH EV WL PL+ + DRK +
Sbjct: 230 PIPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPLVHRVAEDRKRVV 289
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + E G F W + ++ +P+RE +R +R
Sbjct: 290 APIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYAVPKRELNRRGSDR 337
>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Canis lupus familiaris]
Length = 622
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R ER+GLI R GA ++ +V+ FLDAHCE WL PL
Sbjct: 224 LKEQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAQVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D ++ P I ID T+EF + V H RG F+W + + +P E ++R
Sbjct: 283 LARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAIPAHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 97/246 (39%), Positives = 140/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE-AYRAAGDASLG--EYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + ++ + G ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKDKWTHQETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SV++VFHNE +S+L+RTV+S++ TPA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LE Y+++ VR++R ER+GLI R GA ++ +V+ FLDAHCE WL P
Sbjct: 223 YLKEQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAQVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D ++ P I ID T+EF + V H RG F+W + + +P E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAIPAHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|118098416|ref|XP_415088.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Gallus gallus]
Length = 604
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 116/187 (62%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R ++CK YP DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 117 EYGYNAQLSDRISLDRSIPDYRPKKCKQMTYPDDLPQISVVFIFVNEALSVILRSVHSVV 176
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+ L+EIILVDD S +L L+ Y+ +R+ G V+++RN +REGLIR R +G K
Sbjct: 177 NHTPSHLLKEIILVDDNSDNVELKFNLDQYVHKRYPGLVKIVRNNKREGLIRARIQGWKA 236
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E + W+ P L I DRK + +P ID I Y T+E + H Y
Sbjct: 237 ATSPVVGFFDAHVEFNIGWVEPALTRIKEDRKRIILPAIDNIKYNTFEVQQYANAAHGY- 295
Query: 413 GIFEWGM 419
WG+
Sbjct: 296 ---NWGL 299
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 110/189 (58%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN +REGLIR R +G K + V+ F DAH E + W+
Sbjct: 198 ELKFNLDQYVHKRYPGLVKIVRNNKREGLIRARIQGWKAATSPVVGFFDAHVEFNIGWVE 257
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L I DRK + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 258 PALTRIKEDRKRIILPAIDNIKYNTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ S P ++P G F +DR +F E+G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 313 KGDESAPIRTPAMIGCSFVVDREYFGEIGLLDPGMEVYGGENIELGMRVWQCGGSMEVLP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVAHIER 381
>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
gorilla gorilla]
Length = 578
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P++E +R
Sbjct: 241 LERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKQERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 140/251 (55%), Gaps = 14/251 (5%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHI 245
L RP++K +PP + + G L E + + Y +N+ S+ I
Sbjct: 59 LSRPLYK--------KPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRI 110
Query: 246 SFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEI 304
S R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EI
Sbjct: 111 SLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEI 170
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
ILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD H
Sbjct: 171 ILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCH 229
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKE 423
CE WL PLL I D + PVID ID+ T+EF EP G F+W + ++
Sbjct: 230 CECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQW 286
Query: 424 NELPEREAKKR 434
+ +P++E +R
Sbjct: 287 HSVPKQERDRR 297
>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
Length = 578
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 113/182 (62%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D + PVID ID+ T+EF + G F+W + ++ + +P++E +R
Sbjct: 241 LERIGRDETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHSVPKQERDRRI 298
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 299 SRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCS 358
Query: 182 RI 183
+
Sbjct: 359 HV 360
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 99/252 (39%), Positives = 142/252 (56%), Gaps = 16/252 (6%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMETSN 243
L RP++K +PP + GE GKA L E + + Y +N+ S+
Sbjct: 59 LSRPLYK--------KPPAD--SRALGEWGKASKLQLNEDELKQQEELIERYAINIYLSD 108
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
IS R I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+
Sbjct: 109 RISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLK 168
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
EIILVDD S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD
Sbjct: 169 EIILVDDLSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLD 227
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
HCE WL PLL I D + PVID ID+ T+EF + G F+W + ++
Sbjct: 228 CHCECNSGWLEPLLERIGRDETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQ 285
Query: 423 ENELPEREAKKR 434
+ +P++E +R
Sbjct: 286 WHSVPKQERDRR 297
>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
floridanus]
Length = 608
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 132/231 (57%), Gaps = 3/231 (1%)
Query: 208 KEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLD 267
K PGE G A H+ A N+ S+ IS +R++ D+R+E CK YP
Sbjct: 103 KGSPGEMGAAVHIAPENEAKQQELFKLNQFNLMASDLISLNRSLKDIRLEGCKNKKYPKY 162
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LP S+++VFHNE +++L+RTV S+I R+P L+EIILVDD S + L ++LE +I
Sbjct: 163 LPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKKELEKHITEL 222
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
+ R +R GLIR R GAK +G+VI FLDAHCE WL PLL+ I +DR +
Sbjct: 223 PVPTYVYRTEKRSGLIRARLLGAKYVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVV 282
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
P+ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 283 CPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRNGDR 330
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 115/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE +I + R +R GLIR R GAK +G+VI FLDAHCE WL PL
Sbjct: 211 LKKELEKHITELPVPTYVYRTEKRSGLIRARLLGAKYVKGQVITFLDAHCECTEGWLEPL 270
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I +DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 271 LSRIANDRHTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRN 327
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E C
Sbjct: 328 GDRTAPLRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISSC 387
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 388 SHVGHVFR 395
>gi|426374742|ref|XP_004054222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Gorilla gorilla gorilla]
Length = 473
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 114/187 (60%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R +C+ Y DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 116 EYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLPQVSVVFIFVNEALSVILRSVHSVV 175
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+Q L+E+ILVDD S +L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 176 NHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKA 235
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 236 ATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY- 294
Query: 413 GIFEWGM 419
WG+
Sbjct: 295 ---NWGL 298
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/182 (35%), Positives = 98/182 (53%), Gaps = 6/182 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 197 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 257 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 311
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R S P ++P G F +DR +F ++G DPG+ V+GGEN EL ++ S + V
Sbjct: 312 RGDESAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVSAATVSRDRVR 371
Query: 180 CS 181
S
Sbjct: 372 AS 373
>gi|449268007|gb|EMC78887.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Columba
livia]
Length = 514
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 80/173 (46%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA ++ V+ FLD+HCEV +WL PLL I D + P
Sbjct: 128 KVKCLRNGRREGLIRSRIRGADVAQAGVLTFLDSHCEVNKDWLLPLLQRIKEDPTRVVSP 187
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L + KR +EP K+P AGG
Sbjct: 188 VIDIINLDTFAYVAA---SSDLRGGFDWSLHFKWEQLSPEQKAKRLDPTEPIKTPIIAGG 244
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 245 LFMIDKAWFNHLGKYDSAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 297
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 111/205 (54%), Gaps = 14/205 (6%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
+ N S I DR + D R C Y DLP SVI+ FHNE S+L+RT+ S +
Sbjct: 33 HAFNQRESERIPSDRAVRDTRHHRCTTLHYRQDLPPTSVIITFHNEARSTLLRTIRSTVM 92
Query: 295 RTPAQY-----LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 349
+ + + EIILVDDFS D + L KV+ +RN REGLIR+R RG
Sbjct: 93 HFLSSFFTVHLVHEIILVDDFSDDPDDCRLLGKL-----PKVKCLRNGRREGLIRSRIRG 147
Query: 350 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDH 409
A ++ V+ FLD+HCEV +WL PLL I D + PVID I+ T+ + +
Sbjct: 148 ADVAQAGVLTFLDSHCEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAYVAA---SS 204
Query: 410 HYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + +K +L PE++AK+
Sbjct: 205 DLRGGFDWSLHFKWEQLSPEQKAKR 229
>gi|332227141|ref|XP_003262749.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Nomascus leucogenys]
Length = 532
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 147 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 206
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 207 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 263
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 264 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 316
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 132/242 (54%), Gaps = 9/242 (3%)
Query: 193 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
AD L + +P + + + + +L GD Y N S IS +R +P
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLVKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREA 431
PLL + D + PVID I+ T+ + E RG F+W + ++ +L PE++A
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 432 KK 433
++
Sbjct: 247 RR 248
>gi|195436945|ref|XP_002066406.1| GK18112 [Drosophila willistoni]
gi|194162491|gb|EDW77392.1| GK18112 [Drosophila willistoni]
Length = 588
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/182 (46%), Positives = 116/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R IRN +REGLIR+R GA+E+ G+V+VFLD+H EV W+ PLL I ++ + VPV
Sbjct: 180 LRYIRNEKREGLIRSRVIGAREALGDVLVFLDSHIEVNRQWVEPLLRLIKAENSTLAVPV 239
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+ Y P RG F WG+ ++ LPE K+ + P++SPT AGGL
Sbjct: 240 IDLINADTFG----YTPSPLVRGGFNWGLHFRWENLPEGTLKQPEDFRGPFRSPTMAGGL 295
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F +G YD + +WGGEN E+SF+ W CGGSI+ VPCSR+ + RP
Sbjct: 296 FAVNRLYFQHIGEYDMAMDIWGGENIEISFRAWQCGGSIKIVPCSRVGHIFRKRRPYTSP 355
Query: 194 DG 195
DG
Sbjct: 356 DG 357
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 120/209 (57%), Gaps = 8/209 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + LP+AS+++ F+NE +LM
Sbjct: 74 DIGYKHHAFNALVSNNIGLYRDIPDTRHKVCNRSETLESEQLPQASIVMCFYNEHKMTLM 133
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLD-QKLEDYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP L+EIILVDD S +L+ L D R +R IRN +REGLIR
Sbjct: 134 RSIKTVLERTPQYLLKEIILVDDNSDLPELEFHLLADLHNRLKYDNLRYIRNEKREGLIR 193
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R GA+E+ G+V+VFLD+H EV W+ PLL I ++ + VPVID I+ T+
Sbjct: 194 SRVIGAREALGDVLVFLDSHIEVNRQWVEPLLRLIKAENSTLAVPVIDLINADTFG---- 249
Query: 405 YEPDHHYRGIFEWGMLYKENELPEREAKK 433
Y P RG F WG+ ++ LPE K+
Sbjct: 250 YTPSPLVRGGFNWGLHFRWENLPEGTLKQ 278
>gi|359465585|ref|NP_001240756.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 3 [Homo
sapiens]
gi|119620894|gb|EAX00489.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_d [Homo sapiens]
gi|193783719|dbj|BAG53701.1| unnamed protein product [Homo sapiens]
Length = 532
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 147 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 206
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 207 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 263
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 264 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 316
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 132/242 (54%), Gaps = 9/242 (3%)
Query: 193 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
AD L + +P + + + + +L GD Y N S IS +R IP
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIP 74
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREA 431
PLL + D + PVID I+ T+ + E RG F+W + ++ +L PE++A
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 432 KK 433
++
Sbjct: 247 RR 248
>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
Length = 522
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 115/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 126 LKTDLEKYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPL 184
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + ++ PVID ID++T+E+ EP G F+W + ++ + +P+ E +R
Sbjct: 185 LERIAENETVIVCPVIDTIDWKTFEYYMQTAEP---MIGGFDWRLTFQWHSVPKHERLRR 241
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 242 KSETDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPC 301
Query: 181 SRI 183
S +
Sbjct: 302 SHV 304
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 99/256 (38%), Positives = 144/256 (56%), Gaps = 18/256 (7%)
Query: 184 VSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKA--YHLPEAYRAAGDASLGEYGMNMET 241
+ L RPV+ + P +PY PGE GK L + + + +Y +N+
Sbjct: 1 MDLARPVYD--------KSPPDPY--SPGEWGKPSRLQLSSEEKKQEEELIEKYAINIYL 50
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S+ IS R I D R+ CK Y LP SVI+ F+NE +S+L+RT+HS+++ +P+
Sbjct: 51 SDKISLHRHIEDNRLSGCKAKSYNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPSVL 110
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
L+EIILVDD S K L LE YI +VRLIR +REGL+R R GA + G+V+ F
Sbjct: 111 LKEIILVDDLSDKVYLKTDLEKYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLTF 169
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWE-FRSVYEPDHHYRGIFEWGM 419
LD HCE WL PLL I + ++ PVID ID++T+E + EP G F+W +
Sbjct: 170 LDCHCECVSGWLEPLLERIAENETVIVCPVIDTIDWKTFEYYMQTAEP---MIGGFDWRL 226
Query: 420 LYKENELPEREAKKRK 435
++ + +P+ E +RK
Sbjct: 227 TFQWHSVPKHERLRRK 242
>gi|354482531|ref|XP_003503451.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Cricetulus griseus]
Length = 929
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/188 (44%), Positives = 119/188 (63%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 532 LKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPL 590
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP-EREAKKR 120
L +Y +RK + PVI+ I+ + + +V D+ RG+F W M + +P + AK
Sbjct: 591 LERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIPPDVVAKSG 647
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P GLF++D+++F ELG YDPGL VWGGEN ELSFK+WMCGG IE +PC
Sbjct: 648 IKETDIIRCPVMGCGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPC 707
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 708 SRVGHIFR 715
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/238 (36%), Positives = 133/238 (55%), Gaps = 10/238 (4%)
Query: 190 VFKADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
V + D L +P PG+ G+ +P + + E N+ S+ I DR
Sbjct: 412 VLRIDESLSPRDP------NAPGQFGRPVVVPPGKKEEAERRWKEGNFNVYLSDLIPVDR 465
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D R EC DLP S+I+ F +E +S+L+R+VHSI+ R+P ++EI+LVDD
Sbjct: 466 AIEDTRPAECAEQLVHNDLPTTSIIMCFVDEVWSALLRSVHSILNRSPPHLIKEILLVDD 525
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
FS+K L L+ Y+ +F KVR++R ER GLIR R GA+ + G+V+ FLD+H E +
Sbjct: 526 FSTKDYLKDNLDKYMSQF-PKVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNV 584
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
WL PLL +Y +RK + PVI+ I+ + + +V D+ RG+F W M + +P
Sbjct: 585 GWLEPLLERVYLNRKKVACPVIEVINDKDMSYMTV---DNFQRGVFTWPMNFGWRTIP 639
>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
Length = 575
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 128/210 (60%), Gaps = 13/210 (6%)
Query: 228 GDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMR 287
GD + + N++ S+ + DR +PD+R ++CK +P DLP ++I+ FHNEG S+L+R
Sbjct: 99 GDDAYAKNAYNIKKSDQLPVDREVPDVRDQQCKSQVWPHDLPTTTIIICFHNEGRSALLR 158
Query: 288 TVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRS 347
TV S + R+P L+EIILVDDFSS ++L KV+LIRNT+REGLIR+R
Sbjct: 159 TVISALNRSPPHLLKEIILVDDFSSDPKDGRRLLKL-----PKVKLIRNTKREGLIRSRV 213
Query: 348 RGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEP 407
+GA +RGEV+ FLD+HCE NWL PLL I K + P+ID I+ T+++
Sbjct: 214 KGANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTIVSPIIDVINLDTFDYLG---S 270
Query: 408 DHHYRGIFEWGMLYKENELP-----EREAK 432
RG F W + +K + LP ER+ K
Sbjct: 271 SADLRGGFGWNLNFKWDFLPPHILAERQGK 300
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 112/174 (64%), Gaps = 4/174 (2%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+LIRNT+REGLIR+R +GA +RGEV+ FLD+HCE NWL PLL I K + P
Sbjct: 196 KVKLIRNTKREGLIRSRVKGANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTIVSP 255
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER-EAKKRKYNSEPYKSPTHAG 134
+ID I+ T+++ RG F W + +K + LP A+++ + P KSP AG
Sbjct: 256 IIDVINLDTFDYLG---SSADLRGGFGWNLNFKWDFLPPHILAERQGKPTLPIKSPVIAG 312
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLF++ + +F LG YD + VWGGEN E+SF+ W CGG++E +PCSR+ + R
Sbjct: 313 GLFSVAKKWFETLGKYDMQMDVWGGENLEISFRTWQCGGAMEIIPCSRVGHVFR 366
>gi|426223372|ref|XP_004005849.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Ovis
aries]
Length = 552
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 76/173 (43%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIIHLDTFNY---IESASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF MD+++F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 284 LFVMDKSWFYYLGKYDTDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 336
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 121/208 (58%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S I+ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ SI+ RTP ++EIILVDDFS+ + ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSILNRTPMNLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLTPEQKARR 268
>gi|397513815|ref|XP_003827203.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Pan paniscus]
Length = 532
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 147 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 206
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 207 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 263
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 264 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 316
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 132/242 (54%), Gaps = 9/242 (3%)
Query: 193 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
AD L + +P + + + + +L GD Y N S IS +R +P
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
D R C Y DLP S+I+ FHNE S+L+RT+ S+I RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSN 134
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREA 431
PLL + D + PVID I+ T+ + E RG F+W + ++ +L PE++A
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 432 KK 433
++
Sbjct: 247 RR 248
>gi|345323153|ref|XP_001510349.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Ornithorhynchus anatinus]
Length = 479
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN +REGLIR+R RGA ++ V+ FLD+HCEV +WL PLL I D + P
Sbjct: 94 KVKCLRNGQREGLIRSRIRGADLAKAGVLTFLDSHCEVNKDWLLPLLQRIKEDPTRVVSP 153
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L + KR ++P K+P AGG
Sbjct: 154 VIDIINLDTFAYVAA---SSDLRGGFDWSLHFKWEQLSPEQKAKRTDPTQPIKTPIIAGG 210
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+++F LG YD + +WGGENFE+SF++WMCGG++E VPCSR+ + R
Sbjct: 211 LFVIDKSWFNHLGKYDTAMDIWGGENFEISFRVWMCGGTLEIVPCSRVGHVFR 263
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 76/178 (42%), Positives = 106/178 (59%), Gaps = 9/178 (5%)
Query: 257 EECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
+ C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS D
Sbjct: 26 DRCTASRYRSDLPSTSIIITFHNEARSTLLRTIRSVLNRTPMHLVHEIILVDDFSDDPDD 85
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
Q L KV+ +RN +REGLIR+R RGA ++ V+ FLD+HCEV +WL PLL
Sbjct: 86 CQLLGPL-----PKVKCLRNGQREGLIRSRIRGADLAKAGVLTFLDSHCEVNKDWLLPLL 140
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
I D + PVID I+ T+ + + RG F+W + +K +L PE++AK+
Sbjct: 141 QRIKEDPTRVVSPVIDIINLDTFAYVAA---SSDLRGGFDWSLHFKWEQLSPEQKAKR 195
>gi|426335179|ref|XP_004029110.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Gorilla gorilla gorilla]
Length = 532
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 147 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 206
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 207 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 263
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 264 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 316
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/242 (36%), Positives = 132/242 (54%), Gaps = 9/242 (3%)
Query: 193 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
AD L + +P + + + + +L GD Y N S IS +R +P
Sbjct: 15 ADSGLSSSQPSDADWDDVWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAVP 74
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 134
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREA 431
PLL + D + PVID I+ T+ + E RG F+W + ++ +L PE++A
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 432 KK 433
++
Sbjct: 247 RR 248
>gi|296212534|ref|XP_002752871.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Callithrix jacchus]
Length = 578
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 113/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/236 (39%), Positives = 133/236 (56%), Gaps = 6/236 (2%)
Query: 201 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 260
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 66 KPPADSHALGEWGKASKLRLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125
Query: 261 YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 126 SKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQ 185
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 186 LETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 244
Query: 380 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 GRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKHERDRR 297
>gi|402581838|gb|EJW75785.1| hypothetical protein WUBG_13308, partial [Wuchereria bancrofti]
Length = 279
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 96/237 (40%), Positives = 141/237 (59%), Gaps = 8/237 (3%)
Query: 205 EPYKEGPGEGGKAYHLPEA-YRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
+ ++G GEGGK + + ++ D G + S+ I+ +R++ D+R CK
Sbjct: 4 DALRQGLGEGGKPVIVAVSEFKKLRDDLYRINGYDAYISDLIALNRSVKDIRHSGCKDMV 63
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y LP V+ F+NE S+L+R+V+S+I R+P ++EIILVDD S+KA L + LE++
Sbjct: 64 YLEKLPTVGVVFPFYNEHNSTLLRSVYSVINRSPKDIMKEIILVDDGSTKAFLKEPLEEF 123
Query: 324 IQR--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYS 381
+++ N V++IR +REGLIR R RGA+ +VIVFLD+H EV NWLPPL+ PI
Sbjct: 124 LKKAGLNHIVKVIRTEKREGLIRARQRGARHITADVIVFLDSHSEVNYNWLPPLVEPIAL 183
Query: 382 DRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
D K++ P ID ID T+E+R+ D RG F+W YK LP E K+ R
Sbjct: 184 DYKMVVCPFIDVIDCNTYEYRA---QDEGGRGSFDWEFNYK--RLPLTEDNKKNPTR 235
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 81/169 (47%), Positives = 108/169 (63%), Gaps = 8/169 (4%)
Query: 2 LDKKLEDYIER--FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
L + LE+++++ N V++IR +REGLIR R RGA+ +VIVFLD+H EV NWLP
Sbjct: 116 LKEPLEEFLKKAGLNHIVKVIRTEKREGLIRARQRGARHITADVIVFLDSHSEVNYNWLP 175
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PL+ PI D K++ P ID ID T+E+R+ D RG F+W YK L E
Sbjct: 176 PLVEPIALDYKMVVCPFIDVIDCNTYEYRA---QDEGGRGSFDWEFNYKRLPLTE---DN 229
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 168
+K + P+ SP AGG FA+ R +F ELGGYD GL +WGGE +ELSFK+
Sbjct: 230 KKNPTRPFHSPVMAGGYFAISRKWFWELGGYDEGLEIWGGEQYELSFKV 278
>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
gallus]
Length = 590
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 85/183 (46%), Positives = 116/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+ KVRLIR +REGL+R R GA +RG+++ FLD HCE WL PL
Sbjct: 190 LKEPLENYVAGLR-KVRLIRANKREGLVRARLLGASIARGDILTFLDCHCECHEGWLEPL 248
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + + PVID ID+ T+E+ + EP G F+W +++ + PERE K+R
Sbjct: 249 LERIAEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHTTPEREQKRR 305
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + +SPT AGGLF++ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 306 KSKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPC 365
Query: 181 SRI 183
S +
Sbjct: 366 SHV 368
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 99/240 (41%), Positives = 144/240 (60%), Gaps = 12/240 (5%)
Query: 201 EPPLEPYKEGPGEGGKAYHL--PEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 258
+P LEP GE G+A L A + + S+ + +N+ S+ IS R +P+
Sbjct: 74 KPDLEP--GALGELGRAVRLELSPAEKRLQEESIRRHQINIYLSDRISLHRRLPERWHPL 131
Query: 259 CK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADL 316
CK +DY LPK SV++ F+NE +S+L+RTVHS+++ +P LEE+ILVDD+S K L
Sbjct: 132 CKGKKYDY-YSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKDHL 190
Query: 317 DQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLL 376
+ LE+Y+ KVRLIR +REGL+R R GA +RG+++ FLD HCE WL PLL
Sbjct: 191 KEPLENYVAGLR-KVRLIRANKREGLVRARLLGASIARGDILTFLDCHCECHEGWLEPLL 249
Query: 377 APIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
I + + PVID ID+ T+E+ + EP G F+W +++ + PERE K+RK
Sbjct: 250 ERIAEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHTTPEREQKRRK 306
>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus terrestris]
gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus terrestris]
Length = 606
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 129/228 (56%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDR 328
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 115/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY++ + R +R GLIR R GAK G+VI FLDAHCE WL PL
Sbjct: 209 LKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPL 268
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 269 LSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 325
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 326 GDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 385
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 386 SHVGHVFR 393
>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
Length = 621
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 82/190 (43%), Positives = 118/190 (62%), Gaps = 4/190 (2%)
Query: 3 DKKLEDYIERFNGK---VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
D+ L+D ++R+ + VR++R ER+GLI R GA + GEV+ FLDAHCE WL
Sbjct: 220 DEYLKDELDRYVKQLQIVRVVRQAERKGLITARLLGASVASGEVLTFLDAHCECFHGWLE 279
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
PLL+ I + + P I ID T+EF + V H RG F+W + + +P RE +
Sbjct: 280 PLLSRIAEEPTAVVSPDITTIDLNTFEFSKPVQYGKQHSRGNFDWSLTFGWEVVPPRERQ 339
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWV 178
+RK + P KSPT AGGLFA+ R++F +G YD + +WGGEN E+SF++W CGG +E +
Sbjct: 340 RRKDETVPIKSPTFAGGLFAISRSYFEHIGSYDDQMEIWGGENVEMSFRVWQCGGQLEII 399
Query: 179 PCSRIVSLIR 188
PCS + + R
Sbjct: 400 PCSVVGHVFR 409
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 98/244 (40%), Positives = 137/244 (56%), Gaps = 16/244 (6%)
Query: 202 PPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYG-----MNMETSNHISFDRTI-PDLR 255
PP +P GPG GKA+ + A ++ E G N S+ IS R + PD R
Sbjct: 105 PPQDP--SGPGADGKAFK--KEQWTAEESKEKERGYEKHCFNAFASDRISLQRALGPDSR 160
Query: 256 MEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
EC K+ P LP SV++VFHNE +S+L+RTV+S++ +PA L EIILVDD S+
Sbjct: 161 PPECIDQKFKRCP-PLPTTSVVIVFHNEAWSTLLRTVYSVLHASPAALLREIILVDDAST 219
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
L +L+ Y+++ VR++R ER+GLI R GA + GEV+ FLDAHCE WL
Sbjct: 220 DEYLKDELDRYVKQLQ-IVRVVRQAERKGLITARLLGASVASGEVLTFLDAHCECFHGWL 278
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREA 431
PLL+ I + + P I ID T+EF + V H RG F+W + + +P RE
Sbjct: 279 EPLLSRIAEEPTAVVSPDITTIDLNTFEFSKPVQYGKQHSRGNFDWSLTFGWEVVPPRER 338
Query: 432 KKRK 435
++RK
Sbjct: 339 QRRK 342
>gi|300794826|ref|NP_001179661.1| polypeptide N-acetylgalactosaminyltransferase 14 [Bos taurus]
gi|296482443|tpg|DAA24558.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Bos
taurus]
Length = 552
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIIHLDTFNY---IESASELRGGFDWSLHFQWEQLTPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF MD+++F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVMDKSWFYYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 121/208 (58%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S I+ +R +PD R+ C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ SI+ RTP ++EIILVDDFS+ + ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSILNRTPMNLIQEIILVDDFSNDPEDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLTPEQKARR 268
>gi|426335181|ref|XP_004029111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Gorilla gorilla gorilla]
Length = 517
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 132 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 191
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 192 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 248
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 249 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 301
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARR 233
>gi|221042368|dbj|BAH12861.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 132 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 191
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 192 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 248
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 249 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 301
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARR 233
>gi|397513817|ref|XP_003827204.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Pan paniscus]
Length = 517
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 132 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 191
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 192 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 248
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 249 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 301
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S+I RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVINRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 148
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARR 233
>gi|297265736|ref|XP_002799240.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Macaca
mulatta]
Length = 517
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 132 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 191
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 192 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 248
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 249 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 301
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 34 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 93
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 94 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIRL-----PKVKCLRNNERQGLVRSR 148
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 149 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 205
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 206 SASELRGGFDWSLHFQWEQLSPEQKARR 233
>gi|297265738|ref|XP_001104879.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Macaca mulatta]
Length = 532
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 147 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 206
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 207 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 263
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 264 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 316
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 132/242 (54%), Gaps = 9/242 (3%)
Query: 193 ADGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
AD L + +P + + + + +L GD Y N S IS +R IP
Sbjct: 15 ADSGLSSSQPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIP 74
Query: 253 DLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSS 312
D R C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+
Sbjct: 75 DTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSN 134
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
D ++L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL
Sbjct: 135 DPDDCKQLIRL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWL 189
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREA 431
PLL + D + PVID I+ T+ + E RG F+W + ++ +L PE++A
Sbjct: 190 QPLLHRVKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKA 246
Query: 432 KK 433
++
Sbjct: 247 RR 248
>gi|62630154|gb|AAX88899.1| unknown [Homo sapiens]
Length = 452
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 67 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 126
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 127 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 183
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 184 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 236
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 72/176 (40%), Positives = 107/176 (60%), Gaps = 9/176 (5%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+ D +
Sbjct: 1 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 60
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL
Sbjct: 61 QLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR 115
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
+ D + PVID I+ T+ + E RG F+W + ++ +L PE++A++
Sbjct: 116 VKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARR 168
>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus impatiens]
Length = 606
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 129/228 (56%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDR 328
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 115/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY++ + R +R GLIR R GAK G+VI FLDAHCE WL PL
Sbjct: 209 LKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPL 268
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 269 LSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 325
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF+IWMCGG++E C
Sbjct: 326 GDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRIWMCGGTLEIATC 385
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 386 SHVGHVFR 393
>gi|328723398|ref|XP_001946977.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 1 [Acyrthosiphon pisum]
gi|328723400|ref|XP_003247833.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 2 [Acyrthosiphon pisum]
Length = 624
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 124/189 (65%), Gaps = 6/189 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L+K+L+DY+ + R+IR+ +R GLI+ R GA++++G+++VFLDAHCE L WL L
Sbjct: 209 LEKELDDYVAKLPVLTRIIRSPKRIGLIKARLMGARQAKGKILVFLDAHCECTLGWLEAL 268
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK--ENELPEREAKK 119
++ + DRK + PVID I +T+ + +E H+ G F W + ++ P+ +
Sbjct: 269 VSRVAEDRKRVVCPVIDIISDETFAYVRSFE--LHW-GAFNWDLHFRWYTRTTPDIMKGQ 325
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R ++ +++P AGGLFAMD+++F ELGGYD + +WGGEN ELSF++W CGGSIE P
Sbjct: 326 RDI-TQAFRTPAMAGGLFAMDKSYFFELGGYDERMEIWGGENLELSFRVWQCGGSIEIAP 384
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 385 CSHVGHVFR 393
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 78/186 (41%), Positives = 125/186 (67%), Gaps = 4/186 (2%)
Query: 238 NMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRT 296
N+ S+ I +R++PD+R + C+ +D LP ++VI+VFHNE +S+LMRTV S+I R+
Sbjct: 130 NLMASDRIPLNRSLPDVRKKSCRLKKIDIDKLPSSTVIIVFHNEAWSTLMRTVQSVIDRS 189
Query: 297 PAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGE 356
P L EIILVDD S++ L+++L+DY+ + R+IR+ +R GLI+ R GA++++G+
Sbjct: 190 PKYLLNEIILVDDASTRKFLEKELDDYVAKLPVLTRIIRSPKRIGLIKARLMGARQAKGK 249
Query: 357 VIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFE 416
++VFLDAHCE L WL L++ + DRK + PVID I +T+ + +E H+ G F
Sbjct: 250 ILVFLDAHCECTLGWLEALVSRVAEDRKRVVCPVIDIISDETFAYVRSFE--LHW-GAFN 306
Query: 417 WGMLYK 422
W + ++
Sbjct: 307 WDLHFR 312
>gi|281340802|gb|EFB16386.1| hypothetical protein PANDA_012334 [Ailuropoda melanoleuca]
Length = 600
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 80/187 (42%), Positives = 117/187 (62%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DRTIPD R ++C+ + DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 117 EYGYNAQLSDRISLDRTIPDYRPKKCRQMTHAADLPQISVVFIFVNEALSVILRSVHSVV 176
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
RTP++ L+E+ILVDD S +L L+ Y+ +R+ G V+++RN REGLIR R +G K
Sbjct: 177 NRTPSRLLKEVILVDDNSDSVELKASLDQYVNRRYPGLVKIVRNHRREGLIRARLQGWKV 236
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E G+ W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 237 ATAPVVGFFDAHVEFGMGWAEPALSRIQEDRRRIVLPAIDNIKYDTFEVQQYASAAHGY- 295
Query: 413 GIFEWGM 419
WG+
Sbjct: 296 ---NWGL 299
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 109/189 (57%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ R + G V+++RN REGLIR R +G K + V+ F DAH E G+ W
Sbjct: 198 ELKASLDQYVNRRYPGLVKIVRNHRREGLIRARLQGWKVATAPVVGFFDAHVEFGMGWAE 257
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 258 PALSRIQEDRRRIVLPAIDNIKYDTFEVQQYASAAHGY----NWG-LWCMYIIPPQDWLD 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R + P ++P G F +DR +F ++G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 313 RGDEAAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENIELGMRVWQCGGSMEVLP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVAHIER 381
>gi|345497732|ref|XP_001601595.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Nasonia vitripennis]
Length = 610
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 118/188 (62%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L++Y+ + N R++R+ +R GL+ R GA E++GEV+ FLDAHCE WL PL
Sbjct: 205 LRKPLDEYVAQLNVPTRVLRSDKRVGLVNARLMGANEAKGEVLTFLDAHCECTAGWLEPL 264
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R + PVID I+ T+ + +E H+ G F W + ++ L ++R+
Sbjct: 265 LEAISKNRTRVVSPVIDIINDDTFSYTRSFE--LHW-GAFNWDLHFRWLMLNGALLRERR 321
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
N +P+K+P AGGLF+MDR +F ELG YD + +WGGEN ELSF++W CGGS+E PC
Sbjct: 322 ENIVDPFKTPAMAGGLFSMDREYFFELGSYDEHMRIWGGENLELSFRVWQCGGSVEIAPC 381
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 382 SHVGHIFR 389
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 122/187 (65%), Gaps = 6/187 (3%)
Query: 238 NMETSNHISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKR 295
N+ S+ I +RT+PD+R ++C +Y + DLP SVI+VFHNE +S+L+RTVHS+I R
Sbjct: 126 NLLASDRIPLNRTLPDVRKKKCITRYANLG-DLPSTSVIIVFHNEAWSTLLRTVHSVINR 184
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 355
+P + LEEIILVDD S + L + L++Y+ + N R++R+ +R GL+ R GA E++G
Sbjct: 185 SPRKLLEEIILVDDNSDRDFLRKPLDEYVAQLNVPTRVLRSDKRVGLVNARLMGANEAKG 244
Query: 356 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 415
EV+ FLDAHCE WL PLL I +R + PVID I+ T+ + +E H+ G F
Sbjct: 245 EVLTFLDAHCECTAGWLEPLLEAISKNRTRVVSPVIDIINDDTFSYTRSFE--LHW-GAF 301
Query: 416 EWGMLYK 422
W + ++
Sbjct: 302 NWDLHFR 308
>gi|344288103|ref|XP_003415790.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2 [Loxodonta africana]
Length = 640
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 83/203 (40%), Positives = 120/203 (59%), Gaps = 9/203 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +Y+ R G V+L+R+ +R G I+ R GA + G+V+VF+D+HCE WL PL
Sbjct: 240 LKSALSEYVARLEG-VKLLRSNKRLGPIQGRMLGATRATGDVLVFMDSHCECHQGWLEPL 298
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + PVID ID++T+++ Y + RG+ +W + + LPE E K +
Sbjct: 299 LSRIAGDRSRVVSPVIDVIDWKTFQY---YPSEALQRGVLDWNLDFHWEPLPEHEKKALQ 355
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SP GG+ A+DR +F G YDP + +WGGEN ELS K W+CGGS+E +PCS
Sbjct: 356 SPISPIRSPVVPGGVVAIDRHYFQNTGAYDPLMSLWGGENLELSLKTWLCGGSVEILPCS 415
Query: 182 RIVSLIR-----PVFKADGKLGN 199
R+ + R PV + L N
Sbjct: 416 RVGHVYRNQDAHPVLDQEATLQN 438
Score = 155 bits (391), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 82/202 (40%), Positives = 119/202 (58%), Gaps = 4/202 (1%)
Query: 231 SLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVH 290
SL G+ S + R +P++R C LP ASVIL FH+E +S+L+RTVH
Sbjct: 155 SLDPQGLTGALSARMPLRRALPEVRHPLCLQQHAGDRLPMASVILCFHDEAWSTLLRTVH 214
Query: 291 SIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGA 350
SI+ P +L+EIILVDD S + L L +Y+ R G V+L+R+ +R G I+ R GA
Sbjct: 215 SILDTAPRAFLKEIILVDDLSQQGQLKSALSEYVARLEG-VKLLRSNKRLGPIQGRMLGA 273
Query: 351 KESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHH 410
+ G+V+VF+D+HCE WL PLL+ I DR + PVID ID++T+++ Y +
Sbjct: 274 TRATGDVLVFMDSHCECHQGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQY---YPSEAL 330
Query: 411 YRGIFEWGMLYKENELPEREAK 432
RG+ +W + + LPE E K
Sbjct: 331 QRGVLDWNLDFHWEPLPEHEKK 352
>gi|301775621|ref|XP_002923231.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9-like
[Ailuropoda melanoleuca]
Length = 601
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 80/187 (42%), Positives = 117/187 (62%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DRTIPD R ++C+ + DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 117 EYGYNAQLSDRISLDRTIPDYRPKKCRQMTHAADLPQISVVFIFVNEALSVILRSVHSVV 176
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
RTP++ L+E+ILVDD S +L L+ Y+ +R+ G V+++RN REGLIR R +G K
Sbjct: 177 NRTPSRLLKEVILVDDNSDSVELKASLDQYVNRRYPGLVKIVRNHRREGLIRARLQGWKV 236
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E G+ W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 237 ATAPVVGFFDAHVEFGMGWAEPALSRIQEDRRRIVLPAIDNIKYDTFEVQQYASAAHGY- 295
Query: 413 GIFEWGM 419
WG+
Sbjct: 296 ---NWGL 299
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 109/189 (57%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYIER-FNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ R + G V+++RN REGLIR R +G K + V+ F DAH E G+ W
Sbjct: 198 ELKASLDQYVNRRYPGLVKIVRNHRREGLIRARLQGWKVATAPVVGFFDAHVEFGMGWAE 257
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 258 PALSRIQEDRRRIVLPAIDNIKYDTFEVQQYASAAHGY----NWG-LWCMYIIPPQDWLD 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R + P ++P G F +DR +F ++G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 313 RGDEAAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENIELGMRVWQCGGSMEVLP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVAHIER 381
>gi|402890489|ref|XP_003908519.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Papio anubis]
Length = 551
Score = 165 bits (418), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 166 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 225
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 226 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 282
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 283 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 335
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 79/208 (37%), Positives = 116/208 (55%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R IPD R DL + + + FHNE S+L+
Sbjct: 68 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRMSTSVGCTDLSRTARLXXFHNEARSTLL 127
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 128 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 182
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 183 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 239
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 240 SASELRGGFDWSLHFQWEQLSPEQKARR 267
>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
glaber]
Length = 582
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 112/182 (61%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 186 LKAQLETYISSLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 244
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I D + PVID ID+ T+EF + G F+W + ++ + +P++E +R
Sbjct: 245 LERIGRDETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHSVPKQERDRRT 302
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 303 SRIDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCS 362
Query: 182 RI 183
+
Sbjct: 363 HV 364
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/235 (38%), Positives = 132/235 (56%), Gaps = 4/235 (1%)
Query: 201 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 260
+PP + + G L E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 70 KPPADSHALGEWGRASKLELGEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMSECK 129
Query: 261 YWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
Y LP SV++ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 130 SKTYDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKAQ 189
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 190 LETYISSLE-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 248
Query: 380 YSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
D + PVID ID+ T+EF + G F+W + ++ + +P++E +R
Sbjct: 249 GRDETAVVCPVIDTIDWNTFEF--YMQTGEPMIGGFDWRLTFQWHSVPKQERDRR 301
>gi|118403595|ref|NP_001072369.1| polypeptide N-acetylgalactosaminyltransferase 14 [Xenopus
(Silurana) tropicalis]
gi|111305707|gb|AAI21473.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Xenopus (Silurana)
tropicalis]
Length = 555
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 80/174 (45%), Positives = 114/174 (65%), Gaps = 4/174 (2%)
Query: 16 KVRLIRNTERE-GLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 74
KVR +RN +RE GLIR+R RGA ++ V+ FLD+HCEV +WLPPLL I D +
Sbjct: 167 KVRCLRNEQREAGLIRSRVRGAGVAQAAVLTFLDSHCEVNKDWLPPLLHRIKEDPTRVVS 226
Query: 75 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAG 134
PVID I+ T+ + + RG F+W + +K +L + KR +EP K+P AG
Sbjct: 227 PVIDIINLDTFAYIAA---SSDLRGGFDWSLHFKWEQLSAEQKAKRLDPTEPIKTPVIAG 283
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLF +++++F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 284 GLFVIEKSWFNHLGKYDTAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 337
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 85/201 (42%), Positives = 113/201 (56%), Gaps = 9/201 (4%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
Y N S I DR I D R C Y DLP SVI+ FHNE S+L+RT+ S++
Sbjct: 77 YAFNQRESERIPSDRAIKDTRHYRCTELHYQSDLPPTSVIITFHNEARSTLLRTIRSVLN 136
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTERE-GLIRTRSRGAKES 353
RTP + EI+LVDDFS D D +L + KVR +RN +RE GLIR+R RGA +
Sbjct: 137 RTPMHLIHEILLVDDFSDNLD-DCRLLSKLP----KVRCLRNEQREAGLIRSRVRGAGVA 191
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
+ V+ FLD+HCEV +WLPPLL I D + PVID I+ T+ + + RG
Sbjct: 192 QAAVLTFLDSHCEVNKDWLPPLLHRIKEDPTRVVSPVIDIINLDTFAYIAA---SSDLRG 248
Query: 414 IFEWGMLYKENELPEREAKKR 434
F+W + +K +L + KR
Sbjct: 249 GFDWSLHFKWEQLSAEQKAKR 269
>gi|443729524|gb|ELU15389.1| hypothetical protein CAPTEDRAFT_73412, partial [Capitella teleta]
Length = 363
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 82/190 (43%), Positives = 114/190 (60%), Gaps = 3/190 (1%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
DL + LE Y++ KV+LIRN ER+GLIR+R G + SR VIVFLDAH EV + WL P
Sbjct: 82 DLGEPLEQYVDELGEKVKLIRNFERQGLIRSRLHGMQLSRAPVIVFLDAHIEVNVQWLEP 141
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
+LA + + P +DG+D + E+ + P+ YRG W + Y L E +
Sbjct: 142 ILAQLQEKPNSIIQPFVDGVDPDSLEYSA---PNILYRGGLTWDLRYVWIRLAEHIQEAA 198
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ +PY +PT G AMD+A+F +G +D L+WGGEN ELS + WMCGGS+ +PC
Sbjct: 199 FRSGQPYWTPTLIGCALAMDKAYFKSIGSFDADQLIWGGENVELSLRTWMCGGSLVTLPC 258
Query: 181 SRIVSLIRPV 190
SR+ + RP
Sbjct: 259 SRVGHVFRPT 268
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 86/188 (45%), Positives = 114/188 (60%), Gaps = 4/188 (2%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSII 293
Y +N+ S I RTI D R E CK + + P ASVI+ +NEG S ++RTVHS++
Sbjct: 1 YNINISRSEEIPLRRTIKDTRPETCKDTSFNISSFPSASVIIPLYNEGLSLVLRTVHSVL 60
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
RTP L+EIILVDD SS DL + LE Y+ KV+LIRN ER+GLIR+R G + S
Sbjct: 61 DRTPDVLLKEIILVDDASSHEDLGEPLEQYVDELGEKVKLIRNFERQGLIRSRLHGMQLS 120
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
R VIVFLDAH EV + WL P+LA + + P +DG+D + E+ + P+ YRG
Sbjct: 121 RAPVIVFLDAHIEVNVQWLEPILAQLQEKPNSIIQPFVDGVDPDSLEYSA---PNILYRG 177
Query: 414 IFEWGMLY 421
W + Y
Sbjct: 178 GLTWDLRY 185
>gi|441661684|ref|XP_004091530.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Nomascus leucogenys]
Length = 535
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 150 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 209
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 210 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 266
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 267 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 319
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 52 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 111
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 112 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLVKL-----PKVKCLRNNERQGLVRSR 166
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 167 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 223
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 224 SASELRGGFDWSLHFQWEQLSPEQKARR 251
>gi|395828928|ref|XP_003787614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Otolemur garnettii]
Length = 678
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL I D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRIKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 80/200 (40%), Positives = 117/200 (58%), Gaps = 9/200 (4%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
Y N S +R +PD R C Y DLP S+I+ FHNE S+L+RT+ S++
Sbjct: 77 YAFNQRESERTPSNRAVPDTRHSRCTLLVYYTDLPPTSIIITFHNEARSTLLRTIRSVLN 136
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP ++EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R RGA ++
Sbjct: 137 RTPMHLIQEIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSRIRGADVAQ 191
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
G + FLD+HCEV +WL PLL I D + PVID I+ T+ + E RG
Sbjct: 192 GTTLTFLDSHCEVNRDWLQPLLHRIKEDYTRVVCPVIDIINLDTFTY---IESASELRGG 248
Query: 415 FEWGMLYKENEL-PEREAKK 433
F+W + ++ +L PE++A++
Sbjct: 249 FDWSLHFQWEQLSPEQKARR 268
>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
catus]
Length = 622
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L+ Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEQLDQYVKKLQ-IVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDETVVVSPDIVTIDLNTFEFSKPVPRGRVHSRGNFDWSLTFGWEALPAHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/246 (39%), Positives = 136/246 (55%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFQKDKWTSLETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++L+ Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKEQLDQYVKKLQ-IVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDETVVVSPDIVTIDLNTFEFSKPVPRGRVHSRGNFDWSLTFGWEALPAHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|403272081|ref|XP_003927917.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Saimiri
boliviensis boliviensis]
Length = 578
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/236 (39%), Positives = 134/236 (56%), Gaps = 6/236 (2%)
Query: 201 EPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECK 260
+PP + + G HL E + + Y +N+ S+ IS R I D RM ECK
Sbjct: 66 KPPADSHALGEWGKASKLHLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECK 125
Query: 261 YWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
+ LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD S + L +
Sbjct: 126 SKKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQ 185
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 186 LETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERI 244
Query: 380 YSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 245 GRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKYERDRR 297
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 113/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+ E +R
Sbjct: 241 LERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKYERDRR 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 3 [Bombus impatiens]
Length = 607
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 129/228 (56%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDR 328
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 115/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY++ + R +R GLIR R GAK G+VI FLDAHCE WL PL
Sbjct: 209 LKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPL 268
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 269 LSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 325
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 326 GDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 385
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 386 SHVGHVFR 393
>gi|296224175|ref|XP_002757934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Callithrix jacchus]
Length = 552
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D Q+L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCQQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|197099330|ref|NP_001124852.1| polypeptide N-acetylgalactosaminyltransferase 14 [Pongo abelii]
gi|55726129|emb|CAH89838.1| hypothetical protein [Pongo abelii]
Length = 552
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|426335177|ref|XP_004029109.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Gorilla gorilla gorilla]
Length = 552
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|187957310|gb|AAI57846.1| GALNT9 protein [Homo sapiens]
gi|219521197|gb|AAI71844.1| GALNT9 protein [Homo sapiens]
Length = 473
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 114/187 (60%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R +C+ Y DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 116 EYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLPQVSVVFIFVNEALSVILRSVHSVV 175
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+Q L+E+ILVDD S +L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 176 NHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKA 235
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 236 ATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY- 294
Query: 413 GIFEWGM 419
WG+
Sbjct: 295 ---NWGL 298
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 94/169 (55%), Gaps = 6/169 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 197 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 257 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 311
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKI 168
R S P ++P G F +DR +F ++G DPG+ V+GGEN EL ++
Sbjct: 312 RGDESAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRV 360
>gi|449476915|ref|XP_004176604.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 9 [Taeniopygia
guttata]
Length = 604
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 82/187 (43%), Positives = 115/187 (61%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R ++CK YP DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 117 EYGYNAQLSDRISLDRSIPDYRPKKCKQMTYPEDLPQISVVFIFVNEALSVILRSVHSVV 176
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+ L+EIILVDD S +L L+ Y+ +R+ G V+++RN +REGLIR R +G K
Sbjct: 177 NHTPSHLLKEIILVDDNSDNVELKFNLDQYVHKRYPGLVKIVRNNKREGLIRARIQGWKA 236
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E + W P L I DRK + +P ID I Y T+E + H Y
Sbjct: 237 ATSPVVGFFDAHVEFNIGWAEPALTRIKEDRKRIILPAIDNIKYNTFEVQQYANAAHGY- 295
Query: 413 GIFEWGM 419
WG+
Sbjct: 296 ---NWGL 299
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 109/189 (57%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN +REGLIR R +G K + V+ F DAH E + W
Sbjct: 198 ELKFNLDQYVHKRYPGLVKIVRNNKREGLIRARIQGWKAATSPVVGFFDAHVEFNIGWAE 257
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L I DRK + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 258 PALTRIKEDRKRIILPAIDNIKYNTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 312
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+ S P ++P G F +DR +F E+G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 313 KXDESAPIRTPAMIGCSFVVDREYFGEIGLLDPGMEVYGGENIELGMRVWQCGGSMEVLP 372
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 373 CSRVAHIER 381
>gi|194380546|dbj|BAG58426.1| unnamed protein product [Homo sapiens]
Length = 238
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 81/164 (49%), Positives = 115/164 (70%), Gaps = 3/164 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESR-GEVIVFLDAHCEVGLNWLPP 60
L +KL++YI+ +NG V++ RN REGLI+ RS GA++++ G+V+++LDAHCEV +NW P
Sbjct: 31 LKEKLDEYIKLWNGLVKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAP 90
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTW--EFRSVYEPDHHYRGIFEWGMLYKENELPEREAK 118
L+API DR TVP+ID ID + E + + D RG ++W +L+K L +E
Sbjct: 91 LVAPISKDRTTCTVPLIDYIDGNDYSIEPQQGGDEDGFARGAWDWSLLWKRIPLSHKEKA 150
Query: 119 KRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENF 162
KRK+ +EPY+SP AGGLFA++R FF ELG YDPGL +WGG+
Sbjct: 151 KRKHKTEPYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGKTL 194
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 77/155 (49%), Positives = 110/155 (70%), Gaps = 3/155 (1%)
Query: 286 MRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRT 345
MRTVHS+IKRTP +YL EI+L+DDFS+K L +KL++YI+ +NG V++ RN REGLI+
Sbjct: 1 MRTVHSVIKRTPRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVKVFRNERREGLIQA 60
Query: 346 RSRGAKESR-GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTW--EFR 402
RS GA++++ G+V+++LDAHCEV +NW PL+API DR TVP+ID ID + E +
Sbjct: 61 RSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKDRTTCTVPLIDYIDGNDYSIEPQ 120
Query: 403 SVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN 437
+ D RG ++W +L+K L +E KRK+
Sbjct: 121 QGGDEDGFARGAWDWSLLWKRIPLSHKEKAKRKHK 155
>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus impatiens]
Length = 606
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 92/228 (40%), Positives = 129/228 (56%), Gaps = 3/228 (1%)
Query: 211 PGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPK 270
PGE G A H+ A N+ S+ IS +R++ D+R+E CK Y LP
Sbjct: 104 PGEVGAAVHISPEDEARQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNKYLPD 163
Query: 271 ASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGK 330
S+++VFHNE +S+L+RTV S+I R+P L+EIILVDD S + L Q LEDY++
Sbjct: 164 TSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVP 223
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
+ R +R GLIR R GAK G+VI FLDAHCE WL PLL+ I DR + P+
Sbjct: 224 TYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSRIAEDRTTVVCPI 283
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
ID I T+E+ + D + G F W + ++ + +RE +R +R
Sbjct: 284 IDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRLGDR 328
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 115/188 (61%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LEDY++ + R +R GLIR R GAK G+VI FLDAHCE WL PL
Sbjct: 209 LKQDLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPL 268
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID I T+E+ + D + G F W + ++ + +RE +R
Sbjct: 269 LSRIAEDRTTVVCPIIDVISDDTFEY--IPASDMTWGG-FNWKLNFRWYRVAQREMDRRL 325
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + P ++PT AGGLF++D+ +F ELG YD G+ +WGGEN E+SF++W CGG++E PC
Sbjct: 326 GDRTAPLRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPC 385
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 386 SHVGHVFR 393
>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
Length = 669
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/174 (44%), Positives = 114/174 (65%), Gaps = 4/174 (2%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVRLIRN +REGL+R+R GA + +V+ FLD+HCE ++WL PLLA + D + P
Sbjct: 288 KVRLIRNAKREGLVRSRVTGAAAATAKVLTFLDSHCECNVHWLEPLLARVAEDPTRVVCP 347
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 134
VID I T+++ RG F+W +++K L E K+R+ + + P ++P AG
Sbjct: 348 VIDVISMDTFQY---IGASADLRGGFDWNLVFKWEYLSGAERKERQRDPTAPIRTPMIAG 404
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLF +DR++F +LG YD + +WGGEN E+SF++W CGGS+E +PCSR+ + R
Sbjct: 405 GLFVIDRSYFEKLGTYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFR 458
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 82/204 (40%), Positives = 118/204 (57%), Gaps = 13/204 (6%)
Query: 237 MNMETSNHISFDRTIPDLRMEECK--YWDYPLD---LPKASVILVFHNEGFSSLMRTVHS 291
N + S+ + +R +PD R C+ W LP SVI+ FHNE S+L+RTV S
Sbjct: 195 FNQQASDGLKSNRELPDTRNAMCRRTSWSSATSIESLPATSVIITFHNEARSTLLRTVVS 254
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
++ R+P + + EIILVDDFS + Q+L IQ KVRLIRN +REGL+R+R GA
Sbjct: 255 VLNRSPERLIHEIILVDDFSDFPEDGQELAK-IQ----KVRLIRNAKREGLVRSRVTGAA 309
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
+ +V+ FLD+HCE ++WL PLLA + D + PVID I T+++
Sbjct: 310 AATAKVLTFLDSHCECNVHWLEPLLARVAEDPTRVVCPVIDVISMDTFQY---IGASADL 366
Query: 412 RGIFEWGMLYKENELPEREAKKRK 435
RG F+W +++K L E K+R+
Sbjct: 367 RGGFDWNLVFKWEYLSGAERKERQ 390
>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12 [Taeniopygia
guttata]
Length = 594
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 117/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+ KVRLIR +REGL+R R GA ++G+++ FLD HCE WL PL
Sbjct: 194 LKETLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPL 252
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I + + PVID ID+ T+E+ + EP G F+ +++ + PERE K+R
Sbjct: 253 LARIAEEETAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDXRLVFTWHSTPEREQKRR 309
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++ +SPT AGGLF++ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 310 KSKTDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPC 369
Query: 181 SRI 183
S +
Sbjct: 370 SHV 372
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 150/255 (58%), Gaps = 20/255 (7%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAY--HLPEAYRAAGDASLGEYGMNMETSN 243
L RPV+ +P L+P GE G+A L A + + S+ + +N+ S+
Sbjct: 71 LRRPVYA--------KPALDP--GALGELGRAVRLELSPAEKRRQEESIRRHQINIYLSD 120
Query: 244 HISFDRTIPDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYL 301
IS R +P+ C K +DY +LPK SV++ F+NE +S+L+RTVHS+++ +P L
Sbjct: 121 RISLHRRLPERWHPLCREKKYDY-YNLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILL 179
Query: 302 EEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFL 361
EEIILVDD+S K L + LE+Y+ KVRLIR +REGL+R R GA ++G+++ FL
Sbjct: 180 EEIILVDDYSDKEHLKETLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFL 238
Query: 362 DAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGML 420
D HCE WL PLLA I + + PVID ID+ T+E+ + EP G F+ ++
Sbjct: 239 DCHCECHEGWLEPLLARIAEEETAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDXRLV 295
Query: 421 YKENELPEREAKKRK 435
+ + PERE K+RK
Sbjct: 296 FTWHSTPEREQKRRK 310
>gi|332227139|ref|XP_003262748.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Nomascus leucogenys]
Length = 552
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLVKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|47226381|emb|CAG09349.1| unnamed protein product [Tetraodon nigroviridis]
Length = 631
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L++Y++R + VR++R ER+GLI R GA + G+ + FLDAHCE WL PL
Sbjct: 207 LKDGLDEYLKRLS-IVRVVRQRERKGLITARLLGASVATGDTLTFLDAHCECFNGWLEPL 265
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I +R + P I ID T+EF S Y +H+ RG F+W + + LP+ E K+
Sbjct: 266 LARIAKNRTAVVSPDITTIDLNTFEFMKPSPYGQNHN-RGNFDWSLAFGWESLPDHEKKR 324
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ + +F ++G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 325 RKDETYPIKTPTFAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSFRVWQCGGQLEIIP 384
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 385 CSIVGHVFR 393
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 100/256 (39%), Positives = 143/256 (55%), Gaps = 19/256 (7%)
Query: 198 GNLEPPLE-PYKE--GPGEGGKAYHL----PEAYRAAGDASLGE--YGMNMETSNHISFD 248
L P L+ P+++ PG GK + PE + + GE + N+ S+ IS
Sbjct: 80 AELRPALDRPHQDPLAPGAAGKPFQTNSLSPEEQK---EKERGEEKHCFNLYASDRISLS 136
Query: 249 RTI-PDLRMEEC--KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEII 305
R + D R EC + + LP SVI+VFHNE +S+L+RTV+S++ +PA L+EII
Sbjct: 137 RDLGADTRPPECIEQTFKRCPPLPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEII 196
Query: 306 LVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHC 365
LVDD S L L++Y++R + VR++R ER+GLI R GA + G+ + FLDAHC
Sbjct: 197 LVDDASEDEALKDGLDEYLKRLS-IVRVVRQRERKGLITARLLGASVATGDTLTFLDAHC 255
Query: 366 EVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKE 423
E WL PLLA I +R + P I ID T+EF S Y +H+ RG F+W + +
Sbjct: 256 ECFNGWLEPLLARIAKNRTAVVSPDITTIDLNTFEFMKPSPYGQNHN-RGNFDWSLAFGW 314
Query: 424 NELPEREAKKRKYNRY 439
LP+ E K+RK Y
Sbjct: 315 ESLPDHEKKRRKDETY 330
>gi|195402751|ref|XP_002059968.1| GJ14949 [Drosophila virilis]
gi|194140834|gb|EDW57305.1| GJ14949 [Drosophila virilis]
Length = 666
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 91/229 (39%), Positives = 128/229 (55%), Gaps = 8/229 (3%)
Query: 194 DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPD 253
D + EP K+G G G+ +P R N+ S+ I +RT+ D
Sbjct: 80 DYNINQFEP-----KQGEGADGRPVVIPPRDRFRMQRFFKLNSFNILASDRIPLNRTLKD 134
Query: 254 LRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
R EC+ Y LP SVI+VFHNE +S L+RT+ S+I R+P Q L+EIILVDD S +
Sbjct: 135 YRTNECRDKRYAHGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRQLLKEIILVDDASDR 194
Query: 314 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 373
+ L ++LE YI+ N RL R ER GL+ R GA+ +RG+V+ FLDAHCE WL
Sbjct: 195 SFLKRQLEAYIKVLNVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLE 254
Query: 374 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
PLLA I R+++ PVID I + + +E +H+ G F W + ++
Sbjct: 255 PLLARIKESREVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFR 300
Score = 162 bits (409), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 80/189 (42%), Positives = 115/189 (60%), Gaps = 5/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE YI+ N RL R ER GL+ R GA+ +RG+V+ FLDAHCE WL PL
Sbjct: 197 LKRQLEAYIKVLNVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLEPL 256
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK--ENELPEREAKK 119
LA I R+++ PVID I + + +E +H+ G F W + ++ ++ + + K
Sbjct: 257 LARIKESREVVICPVIDIISDDNFSYTKTFE--NHW-GAFNWQLSFRWFSSDRKRQTSVK 313
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
K ++ P +P AGGLFA+DR +F E+G YD + +WGGEN E+SF+IW CGG IE P
Sbjct: 314 PKDSTAPIATPGMAGGLFAIDRKYFYEMGAYDSEMRIWGGENVEMSFRIWQCGGRIEISP 373
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 374 CSHVGHIFR 382
>gi|109102562|ref|XP_001105195.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
5 [Macaca mulatta]
Length = 552
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCKQLIRL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|357606408|gb|EHJ65055.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 389
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 90/187 (48%), Positives = 116/187 (62%), Gaps = 6/187 (3%)
Query: 5 KLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
+LE+Y +F V LIR ++REGLIR R GAK++ V+VFLD+HCE WL PLL
Sbjct: 104 RLENYTRKF-PNVILIRTSQREGLIRARIVGAKKASAPVLVFLDSHCECTEGWLEPLLER 162
Query: 65 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 124
+ + KI+ PVID ID T+E+ S P Y G F W + + + KR+
Sbjct: 163 LVENPKIVASPVIDHIDPNTFEYIS-QNPKDIYIGGFNWNLKFIWRSI----EYKRENFL 217
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
P K+PT AGGLFA+D+ FF +G YD G VWGGEN ELSFK+WMCGGS+E VPCS +
Sbjct: 218 LPIKTPTIAGGLFAIDKEFFYSIGYYDEGFDVWGGENLELSFKVWMCGGSLEIVPCSHVG 277
Query: 185 SLIRPVF 191
+ R F
Sbjct: 278 HIFRENF 284
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 114/188 (60%), Gaps = 2/188 (1%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
E N S+ I DR + DLR + C LP+ASV++ F NE +S+L+RT+HS++
Sbjct: 19 ENAFNQFVSDLIPIDRPLLDLRDKWCLERYSSKLLPQASVVICFFNEAWSTLLRTLHSVL 78
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKES 353
R+P L E++L+DDFS + +LE+Y ++F V LIR ++REGLIR R GAK++
Sbjct: 79 NRSPPHLLREVLLIDDFSDMDHIKVRLENYTRKF-PNVILIRTSQREGLIRARIVGAKKA 137
Query: 354 RGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRG 413
V+VFLD+HCE WL PLL + + KI+ PVID ID T+E+ S P Y G
Sbjct: 138 SAPVLVFLDSHCECTEGWLEPLLERLVENPKIVASPVIDHIDPNTFEYIS-QNPKDIYIG 196
Query: 414 IFEWGMLY 421
F W + +
Sbjct: 197 GFNWNLKF 204
>gi|119620893|gb|EAX00488.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_c [Homo sapiens]
Length = 519
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 134 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 193
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 194 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 250
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 251 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 303
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 100/208 (48%), Gaps = 42/208 (20%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R IPD R
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAIPDTRH------------------------------ 98
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 99 ---LSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 150
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 151 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 207
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 208 SASELRGGFDWSLHFQWEQLSPEQKARR 235
>gi|196006600|ref|XP_002113166.1| hypothetical protein TRIADDRAFT_27135 [Trichoplax adhaerens]
gi|190583570|gb|EDV23640.1| hypothetical protein TRIADDRAFT_27135, partial [Trichoplax
adhaerens]
Length = 491
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+LIRN +REGLIR+R G K S+G+ I+FLD+HCEV W PLL I + K + P
Sbjct: 111 KVKLIRNKKREGLIRSRVFGVKSSQGKAIIFLDSHCEVNQQWAEPLLEQIVLNPKAIVSP 170
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
V+D ID T+E++ E RG F+W + ++ + + E +R + P K+PT AGG
Sbjct: 171 VLDNIDMNTFEYQ---EGTEDVRGGFDWSLTFRWDYMTEAMINQRIDPTSPIKTPTIAGG 227
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
++A+ + +F +LG YD G +WGGEN ELSF+ WMCGG ++ +PCSR+ + R
Sbjct: 228 IYAVSKQWFNDLGEYDMGQKIWGGENLELSFRAWMCGGFMKIIPCSRVGHVFR 280
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/210 (40%), Positives = 122/210 (58%), Gaps = 8/210 (3%)
Query: 225 RAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSS 284
R + D ++ N S+ I R +PD R CK Y L++P SV+++FHNE S+
Sbjct: 11 RGSKDEGYEKHQFNQFESDIIGAYRRVPDTRNPLCKNKIYRLNMPSVSVVIIFHNEARST 70
Query: 285 LMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIR 344
L+RTV S++ RTP L EI+LVDD S A L Q+L KV+LIRN +REGLIR
Sbjct: 71 LLRTVQSVLDRTPPHLLSEIVLVDDNSDDATLGQELLTL-----PKVKLIRNKKREGLIR 125
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+R G K S+G+ I+FLD+HCEV W PLL I + K + PV+D ID T+E++
Sbjct: 126 SRVFGVKSSQGKAIIFLDSHCEVNQQWAEPLLEQIVLNPKAIVSPVLDNIDMNTFEYQ-- 183
Query: 405 YEPDHHYRGIFEWGMLYKENELPEREAKKR 434
E RG F+W + ++ + + E +R
Sbjct: 184 -EGTEDVRGGFDWSLTFRWDYMTEAMINQR 212
>gi|60498976|ref|NP_078848.2| polypeptide N-acetylgalactosaminyltransferase 14 isoform 1 [Homo
sapiens]
gi|51316071|sp|Q96FL9.1|GLT14_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
gi|14714999|gb|AAH10659.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Homo
sapiens]
gi|21749654|dbj|BAC03634.1| unnamed protein product [Homo sapiens]
gi|28268674|dbj|BAC56889.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Homo sapiens]
gi|37182635|gb|AAQ89118.1| RRLT2434 [Homo sapiens]
gi|119620891|gb|EAX00486.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_a [Homo sapiens]
gi|325463357|gb|ADZ15449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[synthetic construct]
gi|345500006|emb|CAA70505.4| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 14 [Homo
sapiens]
Length = 552
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R IPD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|402888188|ref|XP_003907455.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Papio
anubis]
Length = 448
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 114/187 (60%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R +C+ Y DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 116 EYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLPQVSVVFIFVNEALSVILRSVHSVV 175
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+Q L+E+ILVDD S +L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 176 NHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKA 235
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 236 ATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY- 294
Query: 413 GIFEWGM 419
WG+
Sbjct: 295 ---NWGL 298
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 66/186 (35%), Positives = 101/186 (54%), Gaps = 6/186 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 197 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 257 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 311
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R S P ++P G F +DR +F ++G DPG+ V+GGEN EL ++ + S + V
Sbjct: 312 RGDESAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVSVVTVSGDRVR 371
Query: 180 CSRIVS 185
S V+
Sbjct: 372 LSEAVA 377
>gi|397513813|ref|XP_003827202.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Pan paniscus]
Length = 552
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S+I RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVINRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
Length = 613
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 86/184 (46%), Positives = 115/184 (62%), Gaps = 7/184 (3%)
Query: 6 LEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 64
L+D++ KV+LI R GLI R GAK + G+V++ LD+H EV +NWLPPL+ P
Sbjct: 203 LQDFVRTELAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEP 262
Query: 65 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNS 124
I D + P ID I + T+++R+ D RG F+W LYK LP R A+ +
Sbjct: 263 IAEDYRTCVCPFIDVIAHDTFQYRA---QDEGKRGAFDWKFLYKR--LPLR-AQDMVDPT 316
Query: 125 EPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIV 184
EP++SP AGGLFA+ FF ELGGYD GL +WGGE +ELSFK+W CGG + PCSR+
Sbjct: 317 EPFESPIMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVG 376
Query: 185 SLIR 188
+ R
Sbjct: 377 HVYR 380
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 127/226 (56%), Gaps = 21/226 (9%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY 264
E +EGPGE GK P D L E + N S + R Y
Sbjct: 103 EAEREGPGEHGK----PLKLEKLEDIKLNE---KLFKENGYSALSGVGKKR--------Y 147
Query: 265 PLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYI 324
+LP SVI++F+NE +S+L+RTV+S++ R+P+ L+EI+LV+D S+K L + L+D++
Sbjct: 148 LQELPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPLQDFV 207
Query: 325 Q-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+ KV+LI R GLI R GAK + G+V++ LD+H EV +NWLPPL+ PI D
Sbjct: 208 RTELAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEDY 267
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
+ P ID I + T+++R+ D RG F+W LYK LP R
Sbjct: 268 RTCVCPFIDVIAHDTFQYRA---QDEGKRGAFDWKFLYK--RLPLR 308
>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
norvegicus]
gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
Length = 633
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PL
Sbjct: 232 LHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y +H+ RG F+W + + LP+ E ++
Sbjct: 291 LARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQR 349
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ R +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 350 RKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMP 409
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 410 CSVVGHVFR 418
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 104/264 (39%), Positives = 147/264 (55%), Gaps = 15/264 (5%)
Query: 188 RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 241
RP + L+P L+ P ++ PG GK + HL + + ++ N
Sbjct: 95 RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKPFKITHLSPEEQKEKERGETKHCFNAFA 154
Query: 242 SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
S+ IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +P
Sbjct: 155 SDRISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
A L+EIILVDD S L +KLE+YI++F+ V+++R ER+GLI R GA + E
Sbjct: 214 AILLKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAET 272
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 415
+ FLDAHCE WL PLLA I + + P I ID T+EF S Y +H+ RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNF 331
Query: 416 EWGMLYKENELPEREAKKRKYNRY 439
+W + + LP+ E ++RK Y
Sbjct: 332 DWSLSFGWESLPDHEKQRRKDETY 355
>gi|62148926|dbj|BAD93347.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-3 [Rattus
norvegicus]
Length = 633
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PL
Sbjct: 232 LHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y +H+ RG F+W + + LP+ E ++
Sbjct: 291 LARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQR 349
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ R +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 350 RKDETYPIKTPTFAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMP 409
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 410 CSVVGHVFR 418
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 103/264 (39%), Positives = 146/264 (55%), Gaps = 15/264 (5%)
Query: 188 RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 241
RP + L+P L+ P ++ PG GK + HL + + ++ N
Sbjct: 95 RPCLQGYYTAAELKPVLDRPPQDSNAPGASGKPFKITHLSPEEQKEKERGETKHCFNAFA 154
Query: 242 SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
S+ IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +P
Sbjct: 155 SDRISLHRDLGPDTRPPECIEQKFNGCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
A +EIILVDD S L +KLE+YI++F+ V+++R ER+GLI R GA + E
Sbjct: 214 AILPKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAET 272
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 415
+ FLDAHCE WL PLLA I + + P I ID T+EF S Y +H+ RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNF 331
Query: 416 EWGMLYKENELPEREAKKRKYNRY 439
+W + + LP+ E ++RK Y
Sbjct: 332 DWSLSFGWESLPDHEKQRRKDETY 355
>gi|410035021|ref|XP_001163664.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Pan troglodytes]
Length = 491
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 134 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 193
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 194 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 250
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 251 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 303
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 100/208 (48%), Gaps = 42/208 (20%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRH------------------------------ 98
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
S+I RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 99 ---LSVINRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 150
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 151 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 207
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 208 SASELRGGFDWSLHFQWEQLSPEQKARR 235
>gi|334333371|ref|XP_001365881.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Monodelphis domestica]
Length = 757
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 83/183 (45%), Positives = 119/183 (65%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE+++ KVRLIR +REGL+R R GA + GE++ FLD HCE WL PL
Sbjct: 206 LKEQLENHLSDL-PKVRLIRANKREGLVRARLLGASIATGEILTFLDCHCECHDGWLEPL 264
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I+ + + PVID ID+ T+E+ + EP G F+W +++ + +P+RE K+R
Sbjct: 265 LERIHEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHVVPQREQKRR 321
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + +SPT AGGLFA+++ +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 322 RSQIDVIRSPTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPC 381
Query: 181 SRI 183
S +
Sbjct: 382 SHV 384
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 95/257 (36%), Positives = 151/257 (58%), Gaps = 14/257 (5%)
Query: 188 RPVFKADGKLGNLEPPLEPYKEGP----GEGGKAY--HLPEAYRAAGDASLGEYGMNMET 241
RP + +L L P+ P P GE G+A L E + + S+ + +N
Sbjct: 71 RPGESEEERLQRLRRPVYPRPWLPPSALGELGRAVSLQLDETSKKLQEESIQRHQINTYL 130
Query: 242 SNHISFDRTIPDLRMEECK--YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S+ IS R +P+ CK +DY +LP+ SV++ F+NE +S+L+RTV+S+++ +P
Sbjct: 131 SDRISLHRRLPERWHPMCKEKKYDYE-NLPQTSVVIAFYNEAWSTLLRTVYSVLETSPDI 189
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+E+ILVDD+S + L ++LE+++ KVRLIR +REGL+R R GA + GE++
Sbjct: 190 LLKEVILVDDYSDREHLKEQLENHLSDL-PKVRLIRANKREGLVRARLLGASIATGEILT 248
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWG 418
FLD HCE WL PLL I+ + + PVID ID+ T+E+ + EP G F+W
Sbjct: 249 FLDCHCECHDGWLEPLLERIHEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWR 305
Query: 419 MLYKENELPEREAKKRK 435
+++ + +P+RE K+R+
Sbjct: 306 LVFTWHVVPQREQKRRR 322
>gi|326670821|ref|XP_003199296.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Danio rerio]
Length = 435
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 118/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KL++++++ + VR++R ER+GLI R GA + G+ + FLDAHCE WL PL
Sbjct: 36 LKDKLDEFLKQLH-IVRVVRQHERKGLITARLLGASVATGDTLTFLDAHCECFHGWLEPL 94
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y HH RG F+WG+ + LP+ E ++
Sbjct: 95 LARIAENSSAVVSPDITTIDLNTFEFMKPSPY-GQHHNRGNFDWGLSFGWETLPDHERRR 153
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ R +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 154 RKDETYPIKTPTFAGGLFSISRDYFYHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIIP 213
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 214 CSVVGHVFR 222
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 79/137 (57%), Gaps = 4/137 (2%)
Query: 305 ILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAH 364
IL ++ + + L KL++++++ + VR++R ER+GLI R GA + G+ + FLDAH
Sbjct: 25 ILTEESTHEKHLKDKLDEFLKQLH-IVRVVRQHERKGLITARLLGASVATGDTLTFLDAH 83
Query: 365 CEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYK 422
CE WL PLLA I + + P I ID T+EF S Y HH RG F+WG+ +
Sbjct: 84 CECFHGWLEPLLARIAENSSAVVSPDITTIDLNTFEFMKPSPY-GQHHNRGNFDWGLSFG 142
Query: 423 ENELPEREAKKRKYNRY 439
LP+ E ++RK Y
Sbjct: 143 WETLPDHERRRRKDETY 159
>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
Length = 670
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 98/261 (37%), Positives = 139/261 (53%), Gaps = 8/261 (3%)
Query: 183 IVSLIRPVFKA-----DGKLGNLEPPLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGM 237
++S + P A D L L ++ G GE G + + ++ A E
Sbjct: 40 LISRLHPRINANFSNGDSALQTLLAAMKSKSPGAGEMGSPVIIQSSLQSEVKARFKENQF 99
Query: 238 NMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
N+ S IS +RT+PD R C+ Y K SV++VFHNE +S+LMRTV S+I R+
Sbjct: 100 NVVASERISLNRTLPDYRSSACRSIKYEKISLKTSVVIVFHNEAWSTLMRTVQSVINRSS 159
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
YLEEIILVDD S K +L +E +++ LIR +R GLI R RGA+ ++G+V
Sbjct: 160 VDYLEEIILVDDASEKDELIALVESFLKTIPVAHTLIRLPQRSGLIVGRVRGAEIAKGDV 219
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEW 417
+ FLDAH EV WL PLL+ I DR + PVID I T+++ + E G F W
Sbjct: 220 LTFLDAHVEVTDGWLEPLLSRISEDRTRVVAPVIDVISDDTFQYVTAAESTW---GGFSW 276
Query: 418 GMLYKENELPEREAKKRKYNR 438
M ++ + RE K+R N+
Sbjct: 277 TMNFRWYQASAREQKRRGKNK 297
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/189 (41%), Positives = 117/189 (61%), Gaps = 4/189 (2%)
Query: 1 DLDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
+L +E +++ LIR +R GLI R RGA+ ++G+V+ FLDAH EV WL P
Sbjct: 177 ELIALVESFLKTIPVAHTLIRLPQRSGLIVGRVRGAEIAKGDVLTFLDAHVEVTDGWLEP 236
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LL+ I DR + PVID I T+++ + E G F W M ++ + RE K+R
Sbjct: 237 LLSRISEDRTRVVAPVIDVISDDTFQYVTAAESTW---GGFSWTMNFRWYQASAREQKRR 293
Query: 121 KYN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
N + P ++PT AGGLF++DR +F ++G YD G+ +WGGEN E+SF++WMCGG++E P
Sbjct: 294 GKNKTTPIRTPTIAGGLFSIDRKYFFDIGAYDEGMRIWGGENLEISFRVWMCGGTLEINP 353
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 354 CSHVGHVFR 362
>gi|390349674|ref|XP_003727260.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Strongylocentrotus purpuratus]
Length = 379
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 84/170 (49%), Positives = 106/170 (62%), Gaps = 4/170 (2%)
Query: 19 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVID 78
++R +R GLI+ R G S GE FLD+H EV + WL PLLA + SDR I+ +PV+D
Sbjct: 1 MVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLARLASDRTIVVMPVVD 60
Query: 79 GIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGLFA 138
I+ T+ + V EP RG F W Y+ +P + KR P KSP GGL
Sbjct: 61 EINKDTFNYNVVPEPLQ--RGGFNWRFEYRWKPIPNYD--KRPSKVAPIKSPAMPGGLLT 116
Query: 139 MDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
MDR+FFLELGG+D G+ VWGGEN E S KIWMCGGSIE +PCSR+ + R
Sbjct: 117 MDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPCSRVGHVYR 166
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 41/103 (39%), Positives = 58/103 (56%), Gaps = 2/103 (1%)
Query: 333 LIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVID 392
++R +R GLI+ R G S GE FLD+H EV + WL PLLA + SDR I+ +PV+D
Sbjct: 1 MVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLARLASDRTIVVMPVVD 60
Query: 393 GIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
I+ T+ + V EP RG F W Y+ +P + + K
Sbjct: 61 EINKDTFNYNVVPEPLQ--RGGFNWRFEYRWKPIPNYDKRPSK 101
>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
Length = 613
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 82/188 (43%), Positives = 120/188 (63%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+ YI+ V L+ ER GLIR R G+ ++G++++FLDAH EV WL PL
Sbjct: 218 LVKPLDAYIKALPVPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTEGWLEPL 277
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
++ + DRK + P+ID I T+E+ + E G F W + ++ +P+RE +R
Sbjct: 278 ISRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYSVPKRELNRRG 334
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PC
Sbjct: 335 SDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPC 394
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 395 SRVGHVFR 402
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 91/259 (35%), Positives = 146/259 (56%), Gaps = 10/259 (3%)
Query: 187 IRPVFKADGKLGN-LEPPLEPYKEGPG---EGGKAYHLPEAYRAAGDASLGEYGMNMETS 242
++ V+ D + + L +E GPG +GG +PE ++ + E N+ S
Sbjct: 82 VQQVYPVDKETASQLRKLMETQAFGPGFHGQGGTGVTVPEEKKSIKEKRFLENQFNVVAS 141
Query: 243 NHISFDRTIPDLRMEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
IS +RT+PD R E C+ + +P S+I+VFHNE +++L+RT+HS+I R+P
Sbjct: 142 EMISVNRTLPDYRSEACRTAGNSIKTTGMPTTSIIIVFHNEAWTTLLRTLHSVINRSPRH 201
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
LEEII++DD S + L + L+ YI+ V L+ ER GLIR R G+ ++G++++
Sbjct: 202 LLEEIIMIDDKSDRDYLVKPLDAYIKALPVPVHLVHLEERSGLIRARLTGSGMAKGKILL 261
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 419
FLDAH EV WL PL++ + DRK + P+ID I T+E+ + E G F W +
Sbjct: 262 FLDAHVEVTEGWLEPLISRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHL 318
Query: 420 LYKENELPEREAKKRKYNR 438
++ +P+RE +R +R
Sbjct: 319 NFRWYSVPKRELNRRGSDR 337
>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
Length = 612
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 121/188 (64%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K L+ YI+ F + L+ R GLIR R G++ ++G++++FLDAH EV WL PL
Sbjct: 217 LVKPLDSYIKMFPIPIHLVHLENRSGLIRARLTGSEMAKGKILLFLDAHVEVTDGWLEPL 276
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
++ + DRK + P+ID I T+E+ + E G F W + ++ +P+RE +R
Sbjct: 277 VSRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLNFRWYAVPKRELNRRG 333
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ S P ++PT AGGLFA+D+ FF ++G YD G+ VWGGEN E+SF++WMCGGS+E PC
Sbjct: 334 SDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPC 393
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 394 SRVGHVFR 401
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 94/258 (36%), Positives = 146/258 (56%), Gaps = 10/258 (3%)
Query: 188 RPVFKADGKLGN-LEPPLEPYKEGPG---EGGKAYHLPEAYRAAGDASLGEYGMNMETSN 243
+ V+ D + N L +E GPG +GG +PE + + E N+ S
Sbjct: 82 KQVYPVDKETANQLRKLMETQAFGPGYHGQGGTGVTVPEDKKTIKEKRFLENQFNVVASE 141
Query: 244 HISFDRTIPDLRMEECKYWDYPLD---LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
IS +RT+PD R + C+ L +PK S+I+VFHNE +++L+RT+HS+I R+P
Sbjct: 142 MISVNRTLPDYRSDACRTSGNNLKTAGMPKTSIIIVFHNEAWTTLLRTLHSVINRSPRHL 201
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
LEEIILVDD S + L + L+ YI+ F + L+ R GLIR R G++ ++G++++F
Sbjct: 202 LEEIILVDDKSDRDYLVKPLDSYIKMFPIPIHLVHLENRSGLIRARLTGSEMAKGKILLF 261
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
LDAH EV WL PL++ + DRK + P+ID I T+E+ + E G F W +
Sbjct: 262 LDAHVEVTDGWLEPLVSRVAEDRKRVVAPIIDVISDDTFEYVTASETTW---GGFNWHLN 318
Query: 421 YKENELPEREAKKRKYNR 438
++ +P+RE +R +R
Sbjct: 319 FRWYAVPKRELNRRGSDR 336
>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
caballus]
Length = 622
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 115/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R ER GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEQLEQYVKQLQ-VVRVVRQKERTGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D + P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDETAVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLSFGWEALPPHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 100/247 (40%), Positives = 136/247 (55%), Gaps = 14/247 (5%)
Query: 202 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 256
PP +P PG GKA+ P+ + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--SSPGADGKAFQKDKWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162
Query: 257 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ TPA L EIILVDD S+
Sbjct: 163 PECVDQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLREIILVDDASTD 221
Query: 314 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 373
L ++LE Y+++ VR++R ER GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 222 EYLKEQLEQYVKQLQ-VVRVVRQKERTGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280
Query: 374 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
PLLA I D + P I ID T+EF + V H RG F+W + + LP E +
Sbjct: 281 PLLARIAEDETAVVSPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLSFGWEALPPHEKQ 340
Query: 433 KRKYNRY 439
+RK Y
Sbjct: 341 RRKDETY 347
>gi|195033813|ref|XP_001988768.1| GH11345 [Drosophila grimshawi]
gi|193904768|gb|EDW03635.1| GH11345 [Drosophila grimshawi]
Length = 620
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 80/182 (43%), Positives = 117/182 (64%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R +RN +REGLIR+R GA+++ G+V++FLD+H EV W+ PLL + ++ + VPV
Sbjct: 206 LRYVRNEQREGLIRSRVIGARDASGDVLIFLDSHIEVNREWVEPLLRLVKAENATLAVPV 265
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 266 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFKGPFRSPTMAGGL 321
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F +G YD + +WGGEN E+SF++W CGG+I+ VPCSR+ + RP
Sbjct: 322 FAVNRLYFQHIGEYDMAMDIWGGENIEISFRVWQCGGAIKIVPCSRVGHIFRKRRPYTAP 381
Query: 194 DG 195
DG
Sbjct: 382 DG 383
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 124/209 (59%), Gaps = 9/209 (4%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWD---YPLDLPKASVILVFHNEGFSSL 285
D + N SN+I R IPD R + CK + + + LP+ASVI+ F+NE +L
Sbjct: 99 DIGYKHHAFNALVSNNIGLYRDIPDTRHKVCKQTEPDSHLVQLPQASVIMCFYNEHKMTL 158
Query: 286 MRTVHSIIKRTPAQYLEEIILVDDFSSKADLD-QKLEDYIQRFN-GKVRLIRNTEREGLI 343
MR++ ++++RTPA L+EIILVDD S +L+ L D R +R +RN +REGLI
Sbjct: 159 MRSIKTVLERTPAHLLKEIILVDDNSDLPELEFHLLADLHARLKYDNLRYVRNEQREGLI 218
Query: 344 RTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRS 403
R+R GA+++ G+V++FLD+H EV W+ PLL + ++ + VPVID I+ T+E
Sbjct: 219 RSRVIGARDASGDVLIFLDSHIEVNREWVEPLLRLVKAENATLAVPVIDLINADTFE--- 275
Query: 404 VYEPDHHYRGIFEWGMLYKENELPEREAK 432
Y P RG F WG+ ++ LPE K
Sbjct: 276 -YTPSPLVRGGFNWGLHFRWENLPEGTLK 303
>gi|326917280|ref|XP_003204928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Meleagris gallopavo]
Length = 528
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 116/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+Y+ KVRLIR +REGL+R R GA ++G+++ FLD HCE WL PL
Sbjct: 128 LKEPLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPL 186
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + + PVID ID+ T+E+ + EP G F+W +++ + PERE K+R
Sbjct: 187 LERIAEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHTTPEREQKRR 243
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + +SPT AGGLF++ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 244 KSKIDVIRSPTMAGGLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPC 303
Query: 181 SRI 183
S +
Sbjct: 304 SHV 306
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/182 (45%), Positives = 118/182 (64%), Gaps = 7/182 (3%)
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R +E KY DY LPK SV++ F+NE +S+L+RTVHS+++ +P LEE+ILVDD+S K
Sbjct: 69 RCKEKKY-DY-YSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKD 126
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L + LE+Y+ KVRLIR +REGL+R R GA ++G+++ FLD HCE WL P
Sbjct: 127 HLKEPLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEP 185
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LL I + + PVID ID+ T+E+ + EP G F+W +++ + PERE K+
Sbjct: 186 LLERIAEEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHTTPEREQKR 242
Query: 434 RK 435
RK
Sbjct: 243 RK 244
>gi|194865210|ref|XP_001971316.1| GG14889 [Drosophila erecta]
gi|190653099|gb|EDV50342.1| GG14889 [Drosophila erecta]
Length = 666
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 118/187 (63%), Gaps = 6/187 (3%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE YI VR++R +R GLI R+ GA+ + EV++FLD+H E NWLPPL
Sbjct: 249 LGKELETYIAEHFKWVRVVRLPKRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L PI +++ P ID ID+ + +R+ D RG F+W YK L + + K
Sbjct: 309 LEPIALNKRTAVCPFIDVIDHSNFNYRA---QDEGARGAFDWEFFYKRLPLLKDDLK--- 362
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PCS
Sbjct: 363 HPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCS 422
Query: 182 RIVSLIR 188
RI + R
Sbjct: 423 RIGHIYR 429
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 91/214 (42%), Positives = 127/214 (59%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GE GKA L E+ R E G N S+ IS +R++PD+R C +Y L
Sbjct: 142 GLGEKGKAASLDDESQRDLEKRMSLENGFNALLSDSISVNRSLPDIRHPLCHKKEYVTKL 201
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R +R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPKRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 322 PFIDVIDHSNFNYRA---QDEGARGAFDWEFFYK 352
>gi|301608341|ref|XP_002933751.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Xenopus (Silurana) tropicalis]
Length = 586
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+DY++ V++ R ER+GLI R GA + GEV+ FLDAHCE WL PL
Sbjct: 185 LKEKLDDYVKALQ-IVKIARQKERKGLITARLLGASIATGEVLTFLDAHCECFHGWLEPL 243
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L+ + D + P I I+Y T+EF + V + + RG F+W + + +P + K+R
Sbjct: 244 LSRVAEDHTAVVSPDITAINYNTFEFGKPVQQGKMNSRGNFDWSLAFNWEAIPAADEKQR 303
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P K+PT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 304 KDETYPIKTPTFAGGLFSISKAYFEHIGSYDEEMEIWGGENVEMSFRVWQCGGKLEIIPC 363
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 364 SVVGHVFR 371
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 78/140 (55%), Gaps = 2/140 (1%)
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
++ I+ ++ + L +KL+DY++ V++ R ER+GLI R GA + GEV+ F
Sbjct: 170 VQFIVRIESVITTKYLKEKLDDYVKALQ-IVKIARQKERKGLITARLLGASIATGEVLTF 228
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGM 419
LDAHCE WL PLL+ + D + P I I+Y T+EF + V + + RG F+W +
Sbjct: 229 LDAHCECFHGWLEPLLSRVAEDHTAVVSPDITAINYNTFEFGKPVQQGKMNSRGNFDWSL 288
Query: 420 LYKENELPEREAKKRKYNRY 439
+ +P + K+RK Y
Sbjct: 289 AFNWEAIPAADEKQRKDETY 308
>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Meleagris gallopavo]
Length = 579
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 81/182 (44%), Positives = 113/182 (62%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE YI +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 183 LKTDLEKYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPL 241
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + ++ PVID ID+ T+E+ + G F+W + ++ + +P+ E +RK
Sbjct: 242 LERIAENETVVICPVIDTIDWNTFEY--YMQSAEPMIGGFDWRLTFQWHSVPKHERLRRK 299
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PCS
Sbjct: 300 SETDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPCS 359
Query: 182 RI 183
+
Sbjct: 360 HV 361
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 100/256 (39%), Positives = 143/256 (55%), Gaps = 18/256 (7%)
Query: 184 VSLIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAYHL---PEAYRAAGDASLGEYGMNME 240
V L RPV+ + P + Y PGE GK L PE + + + +Y +N+
Sbjct: 58 VDLARPVYD--------KSPPDSY--APGEWGKPTRLQLSPEEKKQEAEL-IDKYAINIY 106
Query: 241 TSNHISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
S+ IS R I D RM CK Y LP SV++ F+NE +S+L+RTVHS+++ +P+
Sbjct: 107 LSDKISLHRHIEDNRMSGCKTKSYNYRKLPTTSVVIAFYNEAWSTLLRTVHSVLETSPSV 166
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S K L LE YI +VRLIR +REGL+R R GA + G+V+
Sbjct: 167 LLKEIILVDDLSDKVYLKTDLEKYISSLK-RVRLIRTNKREGLVRARLIGATFATGDVLT 225
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGM 419
FLD HCE WL PLL I + ++ PVID ID+ T+E+ + G F+W +
Sbjct: 226 FLDCHCECVSGWLEPLLERIAENETVVICPVIDTIDWNTFEY--YMQSAEPMIGGFDWRL 283
Query: 420 LYKENELPEREAKKRK 435
++ + +P+ E +RK
Sbjct: 284 TFQWHSVPKHERLRRK 299
>gi|195336962|ref|XP_002035102.1| GM14515 [Drosophila sechellia]
gi|194128195|gb|EDW50238.1| GM14515 [Drosophila sechellia]
Length = 456
Score = 164 bits (416), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 117/188 (62%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE YI VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPL
Sbjct: 249 LGKELETYIAEHFKWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 309 LEPIALNKRTAVCPFIDVIDHSNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DL 361
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 362 KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 421
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 422 SRIGHIYR 429
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 128/214 (59%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASSLDDESQRDLEKRMSLENGFNALLSDSISVNRSLPDIRHPLCRKKEYVTKL 201
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 322 PFIDVIDHSNFHYRAQ---DEGARGAFDWEFFYK 352
>gi|21552973|gb|AAM62406.1|AF478698_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase mutant
SF32 [Drosophila melanogaster]
Length = 632
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 85/182 (46%), Positives = 115/182 (63%), Gaps = 7/182 (3%)
Query: 17 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 76
+R I+N +REGLIR+ GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPV
Sbjct: 212 LRYIKNEQREGLIRSWVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPV 271
Query: 77 IDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGGL 136
ID I+ T+E Y P RG F WG+ ++ LPE K + P++SPT AGGL
Sbjct: 272 IDLINADTFE----YTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPTMAGGL 327
Query: 137 FAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLI---RPVFKA 193
FA++R +F LG YD + +WGGEN E+SF+ W CGG+I+ VPCSR+ + RP
Sbjct: 328 FAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRRPYTSP 387
Query: 194 DG 195
DG
Sbjct: 388 DG 389
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/204 (40%), Positives = 119/204 (58%), Gaps = 8/204 (3%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDY--PLDLPKASVILVFHNEGFSSLM 286
D + N SN+I R IPD R + C + +LP+AS+++ F+NE +LM
Sbjct: 106 DIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQASIVMCFYNEHKMTLM 165
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLE-DYIQRFN-GKVRLIRNTEREGLIR 344
R++ ++++RTP+ L EIILVDD S +L+ L D R +R I+N +REGLIR
Sbjct: 166 RSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARLKYDNLRYIKNEQREGLIR 225
Query: 345 TRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSV 404
+ GA+E+ G+V+VFLD+H EV WL PLL I S+ + VPVID I+ T+E
Sbjct: 226 SWVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSENATLAVPVIDLINADTFE---- 281
Query: 405 YEPDHHYRGIFEWGMLYKENELPE 428
Y P RG F WG+ ++ LPE
Sbjct: 282 YTPSPLVRGGFNWGLHFRWENLPE 305
>gi|432882425|ref|XP_004074024.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Oryzias latipes]
Length = 549
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 82/182 (45%), Positives = 116/182 (63%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+ F KVRLIR T+REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 149 LKEPLEKYLSGFR-KVRLIRATKREGLVRARLLGASIATGDVLTFLDCHCECHEGWLEPL 207
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + PVID ID+ T+++ + P G F+W +++ + +P+ E K+R
Sbjct: 208 LHRIKEEPTAVVCPVIDVIDWDTFQY--LGNPGEPQIGGFDWRLVFTWHSVPDNEQKRRH 265
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++ +SPT AGGLF++++ +F LG YD G+ VWGGEN E SF+IW CGGS+E PCS
Sbjct: 266 SPTDVIRSPTMAGGLFSVNKNYFYYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 325
Query: 182 RI 183
+
Sbjct: 326 HV 327
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 95/252 (37%), Positives = 149/252 (59%), Gaps = 16/252 (6%)
Query: 186 LIRPVFKADGKLGNLEPPLEPYKEGPGEGGKAY--HLPEAYRAAGDASLGEYGMNMETSN 243
L RPV++ +PPL+ PGE G+A +L E + + SL ++ +N+ S+
Sbjct: 26 LRRPVYE--------KPPLD--LNAPGELGRAVKLNLNEEEKRKEEESLKKHQINIYVSD 75
Query: 244 HISFDRTIPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLE 302
+S R +P+ C+ Y LP SV++ F+NE +S+L+RTVHS+++ +P L+
Sbjct: 76 KVSLHRRLPEKWNPLCRELKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLK 135
Query: 303 EIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLD 362
E++LVDD+S +A L + LE Y+ F KVRLIR T+REGL+R R GA + G+V+ FLD
Sbjct: 136 EVVLVDDYSDRAHLKEPLEKYLSGFR-KVRLIRATKREGLVRARLLGASIATGDVLTFLD 194
Query: 363 AHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
HCE WL PLL I + + PVID ID+ T+++ + P G F+W +++
Sbjct: 195 CHCECHEGWLEPLLHRIKEEPTAVVCPVIDVIDWDTFQY--LGNPGEPQIGGFDWRLVFT 252
Query: 423 ENELPEREAKKR 434
+ +P+ E K+R
Sbjct: 253 WHSVPDNEQKRR 264
>gi|348574564|ref|XP_003473060.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Cavia porcellus]
Length = 552
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 114/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA+ ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNGERQGLVRSRMRGAEIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFRWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 122/208 (58%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHPRCTLLGYHTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP ++EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSNDPDDCKQLVRL-----PKVKCLRNGERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA+ ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 MRGAEIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFRWEQLSPEQKARR 268
>gi|47228720|emb|CAG07452.1| unnamed protein product [Tetraodon nigroviridis]
Length = 611
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE ++ + VR++R ER+GLI R GA ++GEV+ FLDAHCE WL PL
Sbjct: 218 LKEQLEVFVHQLK-IVRVVRQPERKGLITARLLGASVAQGEVLTFLDAHCECFHGWLEPL 276
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAKKR 120
LA I + + P I ID +T++F H Y RG F+WG+ + ++PE K R
Sbjct: 277 LARIVEEPTAVVSPEITTIDLETFQFNKPVASSHAYNRGNFDWGLTFGWEQIPEAARKLR 336
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P K+PT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 337 KDETYPVKTPTFAGGLFSILKSYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEIIPC 396
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 397 SVVGHVFR 404
Score = 157 bits (398), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 138/247 (55%), Gaps = 14/247 (5%)
Query: 202 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 256
PP +P PG GKA+ PE + + + N S+ IS R++ D R
Sbjct: 100 PPQDP--GSPGADGKAFQKDQMTPEEENEKKEG-MTRHCFNQFASDRISLSRSLGDDTRP 156
Query: 257 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S+
Sbjct: 157 PECVERKFLRCPA-LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASAA 215
Query: 314 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 373
L ++LE ++ + VR++R ER+GLI R GA ++GEV+ FLDAHCE WL
Sbjct: 216 DHLKEQLEVFVHQLK-IVRVVRQPERKGLITARLLGASVAQGEVLTFLDAHCECFHGWLE 274
Query: 374 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAK 432
PLLA I + + P I ID +T++F H Y RG F+WG+ + ++PE K
Sbjct: 275 PLLARIVEEPTAVVSPEITTIDLETFQFNKPVASSHAYNRGNFDWGLTFGWEQIPEAARK 334
Query: 433 KRKYNRY 439
RK Y
Sbjct: 335 LRKDETY 341
>gi|344242314|gb|EGV98417.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Cricetulus
griseus]
Length = 205
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 86/180 (47%), Positives = 116/180 (64%), Gaps = 3/180 (1%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
CK YP +LP SV++VFHNE +S+L+RTV+S+I R+P L E+ILVDD S + L
Sbjct: 18 CKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKL 77
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PLLA
Sbjct: 78 TLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 137
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNR 438
I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK +R
Sbjct: 138 IKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRKGDR 194
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 55/120 (45%), Positives = 73/120 (60%), Gaps = 3/120 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L LE+Y++ V++IR ER GLIR R RGA S+G+VI FLDAHCE L WL PL
Sbjct: 75 LKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPL 134
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
LA I DRK + P+ID I T+E+ + D Y G F W + ++ +P+RE +RK
Sbjct: 135 LARIKEDRKTVVCPIIDVISDDTFEYMA--GSDMTYGG-FNWKLNFRWYPVPQREMDRRK 191
>gi|195425502|ref|XP_002061040.1| GK10658 [Drosophila willistoni]
gi|194157125|gb|EDW72026.1| GK10658 [Drosophila willistoni]
Length = 489
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 80/187 (42%), Positives = 118/187 (63%), Gaps = 3/187 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +L++Y + KV+L+R +REGL+R R G + + V+ FLD+HCE WL PL
Sbjct: 220 LKIQLKEYFSLY-PKVQLVRVAKREGLVRARLFGMEYADSPVVTFLDSHCECTEGWLEPL 278
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R + P ID ID +T+++ Y+ + G+F+W + + +P RE K+R
Sbjct: 279 LDRIARNRNTVASPTIDMIDPKTFQYN--YDGANDVLGVFDWNLEFYWIPIPLRELKRRN 336
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ +EP ++PT AGGLFA+D FF +G YDPG +WGG+N ELSFK WMCGG +E +PCS
Sbjct: 337 HFAEPIQTPTIAGGLFAIDLEFFRSVGTYDPGFNIWGGDNLELSFKTWMCGGILEIIPCS 396
Query: 182 RIVSLIR 188
+ + R
Sbjct: 397 HVGHIFR 403
Score = 161 bits (407), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 95/250 (38%), Positives = 141/250 (56%), Gaps = 9/250 (3%)
Query: 192 KADGKLGNLEPPLEPYKEGPGEGGKAYHLP----EAYRAAGDASLGEYGMNMETSNHISF 247
K D L PP E +E PGE GK LP +A + A + + N S+ IS
Sbjct: 92 KEDAAQKVLLPPSE-LEETPGEMGKPVELPTNMSDAMKKAVEDGWTKNAFNQYASDLISV 150
Query: 248 DRTIPDLRMEECK-YWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIIL 306
+R +PD R CK Y DLPK VI+ FHNE +S+L+RTVHS++ R+P + ++IL
Sbjct: 151 NRKLPDPRSAWCKDTARYLTDLPKTDVIICFHNEAWSTLLRTVHSVLARSPEHLIGKVIL 210
Query: 307 VDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCE 366
VDD+S L +L++Y + KV+L+R +REGL+R R G + + V+ FLD+HCE
Sbjct: 211 VDDYSDMPHLKIQLKEYFSLY-PKVQLVRVAKREGLVRARLFGMEYADSPVVTFLDSHCE 269
Query: 367 VGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL 426
WL PLL I +R + P ID ID +T+++ Y+ + G+F+W + + +
Sbjct: 270 CTEGWLEPLLDRIARNRNTVASPTIDMIDPKTFQYN--YDGANDVLGVFDWNLEFYWIPI 327
Query: 427 PEREAKKRKY 436
P RE K+R +
Sbjct: 328 PLRELKRRNH 337
>gi|194759470|ref|XP_001961970.1| GF15237 [Drosophila ananassae]
gi|190615667|gb|EDV31191.1| GF15237 [Drosophila ananassae]
Length = 632
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 86/190 (45%), Positives = 123/190 (64%), Gaps = 4/190 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
++G N S+ IS +R++PD R CK Y LPKASVI+VFHNE S+L+R++HSI+
Sbjct: 134 QHGFNGLLSDRISLNRSLPDARPVGCKDRKYSEKLPKASVIIVFHNEHLSALLRSIHSIV 193
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
R+P+ LEE+ILVDD S+ +L + L+ Y+ Q F +LIR REGL+++R RGA
Sbjct: 194 NRSPSILLEEVILVDDASTLKNLGENLDKYLGQNFPNITKLIRMKAREGLVKSRLRGAGI 253
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ G+V++F+D+H EVG NWLPPLL PI I+T P+ D I T E+ + P+ +R
Sbjct: 254 AAGQVLIFMDSHIEVGYNWLPPLLEPIVIHPYIVTSPIFDSITASTLEYG--FGPE-VFR 310
Query: 413 GIFEWGMLYK 422
F+W K
Sbjct: 311 SGFDWSFQIK 320
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 113/189 (59%), Gaps = 8/189 (4%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L + L+ Y+ + F +LIR REGL+++R RGA + G+V++F+D+H EVG NWLP
Sbjct: 215 NLGENLDKYLGQNFPNITKLIRMKAREGLVKSRLRGAGIAAGQVLIFMDSHIEVGYNWLP 274
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL PI I+T P+ D I T E+ + P+ +R F+W K + +
Sbjct: 275 PLLEPIVIHPYIVTSPIFDSITASTLEYG--FGPE-VFRSGFDWSFQIKYMKTIGTDMGD 331
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S PY++P G + A+DR FF LGG+D L +WGGE FELSFK+WMCGG VP
Sbjct: 332 ---GSNPYRTPLVVGAV-AIDRTFFWGLGGFDDQLDIWGGEEFELSFKVWMCGGMGLHVP 387
Query: 180 CSRIVSLIR 188
CSRI + R
Sbjct: 388 CSRIGHISR 396
>gi|359465583|ref|NP_001240755.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 2 [Homo
sapiens]
gi|10434341|dbj|BAB14227.1| unnamed protein product [Homo sapiens]
gi|119620892|gb|EAX00487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_b [Homo sapiens]
Length = 557
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 172 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 231
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 232 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 288
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 289 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 341
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 125/232 (53%), Gaps = 24/232 (10%)
Query: 203 PLEPYKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYW 262
P PY+ G HLP G ++ + ++ + + +R C
Sbjct: 65 PTPPYQRG--------HLP----TGGHLAVCHFPCLLQEAQFHLQTQVFLQVR---CTLL 109
Query: 263 DYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLED 322
Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+ D ++L
Sbjct: 110 VYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCKQLIK 169
Query: 323 YIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D
Sbjct: 170 L-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKED 224
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
+ PVID I+ T+ + E RG F+W + ++ +L PE++A++
Sbjct: 225 YTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARR 273
>gi|355751232|gb|EHH55487.1| hypothetical protein EGM_04701, partial [Macaca fascicularis]
Length = 516
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 131 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 190
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 191 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 247
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 248 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 300
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/176 (40%), Positives = 107/176 (60%), Gaps = 9/176 (5%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+ D +
Sbjct: 65 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 124
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL
Sbjct: 125 QLIRL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR 179
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
+ D + PVID I+ T+ + E RG F+W + ++ +L PE++A++
Sbjct: 180 VKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARR 232
>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 599
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 116/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ VR++R R+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 216 LKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 274
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D + P I ID T+EF + V + H RG F+W + + LP E ++R
Sbjct: 275 LARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQRR 334
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 335 KDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 394
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 395 SVVGHVFR 402
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 98/242 (40%), Positives = 135/242 (55%), Gaps = 12/242 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R + PD R
Sbjct: 98 PPQDP--NSPGADGKAFQKDKWTPLETQEKEEGYKKHCFNAFASDQISLQRALGPDTRPP 155
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 156 ECVNQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 214
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L + LE Y+Q+ VR++R R+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 215 YLKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 273
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D + P I ID T+EF + V + H RG F+W + + LP E ++
Sbjct: 274 LLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQR 333
Query: 434 RK 435
RK
Sbjct: 334 RK 335
>gi|397513819|ref|XP_003827205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Pan paniscus]
Length = 557
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 172 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 231
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 232 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 288
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 289 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 341
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/176 (41%), Positives = 107/176 (60%), Gaps = 9/176 (5%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C Y DLP S+I+ FHNE S+L+RT+ S+I RTP + EIILVDDFS+ D +
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFSNDPDDCK 165
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL
Sbjct: 166 QLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR 220
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
+ D + PVID I+ T+ + E RG F+W + ++ +L PE++A++
Sbjct: 221 VKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARR 273
>gi|149729705|ref|XP_001496082.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 2 [Equus
caballus]
Length = 635
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 81/187 (43%), Positives = 115/187 (61%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +Y+ R G V+L+R+ +R G IR R GA + G+V+VF+D+HCE WL PL
Sbjct: 234 LKSALSEYVARLEG-VKLLRSKKRLGAIRARMLGATRATGDVLVFMDSHCECHPGWLEPL 292
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + PVID ID++T+++ Y RG+ +W + ++ LPERE K +
Sbjct: 293 LSRIAGDRSRVVSPVIDVIDWKTFQY---YPSKDLQRGVLDWKLDFRWEPLPERERKALQ 349
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SP GG+ AMDR +F G YDP + + GGEN ELS K W+CGGS+E +PCS
Sbjct: 350 SPVSPIRSPVLPGGVVAMDRHYFQNTGAYDPLMSLRGGENLELSLKAWLCGGSVEILPCS 409
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 410 RVGHIYR 416
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 84/192 (43%), Positives = 120/192 (62%), Gaps = 6/192 (3%)
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S I R +P++R C +P D LP ASVIL FH+E +S+L+RTVHSI+ P +
Sbjct: 160 SARIPLQRALPEVRHPLC-LQQHPGDSLPTASVILCFHDEAWSTLLRTVHSILDTAPRAF 218
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
L+E+ILVDDFS + L L +Y+ R G V+L+R+ +R G IR R GA + G+V+VF
Sbjct: 219 LKEVILVDDFSQQGQLKSALSEYVARLEG-VKLLRSKKRLGAIRARMLGATRATGDVLVF 277
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
+D+HCE WL PLL+ I DR + PVID ID++T+++ Y RG+ +W +
Sbjct: 278 MDSHCECHPGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQY---YPSKDLQRGVLDWKLD 334
Query: 421 YKENELPEREAK 432
++ LPERE K
Sbjct: 335 FRWEPLPERERK 346
>gi|426335183|ref|XP_004029112.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Gorilla gorilla gorilla]
Length = 557
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 172 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 231
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 232 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 288
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 289 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 341
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 72/176 (40%), Positives = 107/176 (60%), Gaps = 9/176 (5%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+ D +
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSNDPDDCK 165
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL
Sbjct: 166 QLIKL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR 220
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
+ D + PVID I+ T+ + E RG F+W + ++ +L PE++A++
Sbjct: 221 VKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARR 273
>gi|195587296|ref|XP_002083401.1| GD13712 [Drosophila simulans]
gi|194195410|gb|EDX08986.1| GD13712 [Drosophila simulans]
Length = 631
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 117/188 (62%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE YI VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPL
Sbjct: 249 LGKELETYIAEHFKWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 309 LEPIALNKRTAVCPFIDVIDHSNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DL 361
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 362 KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 421
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 422 SRIGHIYR 429
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 128/214 (59%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASSLDDESQRDLEKRMSLENGFNALLSDSISVNRSLPDIRHPLCRKKEYVTKL 201
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 322 PFIDVIDHSNFHYRAQ---DEGARGAFDWEFFYK 352
>gi|355565588|gb|EHH22017.1| hypothetical protein EGK_05198 [Macaca mulatta]
Length = 557
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 172 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 231
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 232 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 288
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 289 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 341
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/176 (40%), Positives = 107/176 (60%), Gaps = 9/176 (5%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+ D +
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 165
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL
Sbjct: 166 QLIRL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR 220
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
+ D + PVID I+ T+ + E RG F+W + ++ +L PE++A++
Sbjct: 221 VKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARR 273
>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
Length = 621
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 115/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I + + P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 135/246 (54%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+H + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFHKDKWTPMETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDD 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I + + P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PL
Sbjct: 232 LHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y +H+ RG F+W + + LP+ E ++
Sbjct: 291 LARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQR 349
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 350 RKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMP 409
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 410 CSVVGHVFR 418
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 144/262 (54%), Gaps = 22/262 (8%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSN 243
++PVF PP + PG GK + HL + + ++ N S+
Sbjct: 107 LKPVFD--------RPPQD--SNAPGASGKPFKITHLSPEEQKEKERGETKHCFNAFASD 156
Query: 244 HISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +PA
Sbjct: 157 RISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAI 215
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S L +KLE+YI++F+ V+++R ER+GLI R GA + E +
Sbjct: 216 LLKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLT 274
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEW 417
FLDAHCE WL PLLA I + + P I ID T+EF S Y +H+ RG F+W
Sbjct: 275 FLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDW 333
Query: 418 GMLYKENELPEREAKKRKYNRY 439
+ + LP+ E ++RK Y
Sbjct: 334 SLSFGWESLPDHEKQRRKDETY 355
>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 624
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 116/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE Y+++ VR++R R+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 226 LKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 284
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D + P I ID T+EF + V + H RG F+W + + LP E ++R
Sbjct: 285 LARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQRR 344
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K ++P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 345 KDETDPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 404
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 405 SVVGHVFR 412
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/242 (40%), Positives = 135/242 (55%), Gaps = 12/242 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R + PD R
Sbjct: 108 PPQDP--NSPGADGKAFQKDKWTPLETQEKEEGYKKHCFNAFASDQISLQRALGPDTRPP 165
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 166 ECVNQKFRRCP-PLPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 224
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L + LE Y+Q+ VR++R R+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 225 YLKEPLEQYVQQLR-IVRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 283
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D + P I ID T+EF + V + H RG F+W + + LP E ++
Sbjct: 284 LLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQR 343
Query: 434 RK 435
RK
Sbjct: 344 RK 345
>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Ailuropoda melanoleuca]
Length = 622
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 115/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I + + P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 283 LARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 135/246 (54%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+H + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NSPGADGKAFHKDKWTPMETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA L EIILVDD S+
Sbjct: 164 ECVDQKFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDD 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKDQLEQYVKKLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I + + P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 282 LLARIAEEETAVVSPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
Length = 603
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 81/188 (43%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE L PL
Sbjct: 205 LKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGRLEPL 263
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++R
Sbjct: 264 LARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 323
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 324 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 383
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 384 SVVGHVFR 391
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 138/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ + + ++ N S+ IS R++ PD R
Sbjct: 87 PPQDP--NAPGADGKAFQKSKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPP 144
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P L SVI+VFHNE +S+L+RTV+S++ TPA L+EIILVDD S++
Sbjct: 145 ECVDQKFRRCP-PLATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEE 203
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L +KLE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE L P
Sbjct: 204 HLKEKLEQYVKQLQ-VVRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGRLEP 262
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ ++ P I ID T+EF + V H RG F+W + + LP E ++
Sbjct: 263 LLARIAEDKTVVVSPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQR 322
Query: 434 RKYNRY 439
RK Y
Sbjct: 323 RKDETY 328
>gi|34042986|gb|AAQ56703.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 666
Score = 164 bits (415), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 117/188 (62%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE YI VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPL
Sbjct: 249 LGKELETYIAEHFKWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 309 LEPIALNKRTAVCPFIDVIDHTNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DL 361
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 362 KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 421
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 422 SRIGHIYR 429
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 128/214 (59%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 322 PFIDVIDHTNFHYRAQ---DEGARGAFDWEFFYK 352
>gi|109102570|ref|XP_001104659.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Macaca mulatta]
Length = 557
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 172 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 231
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 232 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 288
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 289 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 341
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/176 (40%), Positives = 107/176 (60%), Gaps = 9/176 (5%)
Query: 259 CKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQ 318
C Y DLP S+I+ FHNE S+L+RT+ S++ RTP + EIILVDDFS+ D +
Sbjct: 106 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFSNDPDDCK 165
Query: 319 KLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAP 378
+L KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL
Sbjct: 166 QLIRL-----PKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR 220
Query: 379 IYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKK 433
+ D + PVID I+ T+ + E RG F+W + ++ +L PE++A++
Sbjct: 221 VKEDYTRVVCPVIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARR 273
>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
musculus]
Length = 633
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PL
Sbjct: 232 LHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y +H+ RG F+W + + LP+ E ++
Sbjct: 291 LARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDWSLSFGWESLPDHEKQR 349
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 350 RKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMP 409
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 410 CSVVGHVFR 418
Score = 157 bits (398), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 144/262 (54%), Gaps = 22/262 (8%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSN 243
++PVF PP + PG GK + HL + + ++ N S+
Sbjct: 107 LKPVFD--------RPPQD--SNAPGASGKPFKITHLSPEEQKEKERGETKHCFNAFASD 156
Query: 244 HISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +PA
Sbjct: 157 RISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAI 215
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S L +KLE+YI++F+ V+++R ER+GLI R GA + E +
Sbjct: 216 LLKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLT 274
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEW 417
FLDAHCE WL PLLA I + + P I ID T+EF S Y +H+ RG F+W
Sbjct: 275 FLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYGSNHN-RGNFDW 333
Query: 418 GMLYKENELPEREAKKRKYNRY 439
+ + LP+ E ++RK Y
Sbjct: 334 SLSFGWESLPDHEKQRRKDETY 355
>gi|432112638|gb|ELK35354.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Myotis davidii]
Length = 416
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R +R+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 18 LKEQLEQYVKQLQ-IVRVVRQEKRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 76
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D + P I ID T+EF + V + H RG F+W + + LP E ++R
Sbjct: 77 LARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPPHEKQRR 136
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 137 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 196
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 197 SVVGHVFR 204
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/128 (39%), Positives = 72/128 (56%), Gaps = 2/128 (1%)
Query: 313 KADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWL 372
K L ++LE Y+++ VR++R +R+GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 15 KQYLKEQLEQYVKQLQ-IVRVVRQEKRKGLITARLLGASVAQAEVLTFLDAHCECFHGWL 73
Query: 373 PPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREA 431
PLLA I D + P I ID T+EF + V + H RG F+W + + LP E
Sbjct: 74 EPLLARITEDETAVVSPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPPHEK 133
Query: 432 KKRKYNRY 439
++RK Y
Sbjct: 134 QRRKDETY 141
>gi|403307061|ref|XP_003944030.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Saimiri boliviensis boliviensis]
Length = 552
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + + +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFTY---IESASELRGGFDWSLHFHWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 336
Score = 154 bits (389), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 119/208 (57%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y +LP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTELPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP + EIILVDDFS+ D Q+L KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIREIILVDDFSNDPDDCQQLIKL-----PKVKCLRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + + +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFHWEQLSPEQKARR 268
>gi|16769916|gb|AAL29177.1| SD10722p [Drosophila melanogaster]
Length = 666
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 117/188 (62%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE YI VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPL
Sbjct: 249 LGKELETYIAEHFKWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 309 LEPIALNKRTAVCPFIDVIDHTNFHYRAQ---DEGARGAFDWEFFYKRLPLLPE----DL 361
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 362 KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 421
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 422 SRIGHIYR 429
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 128/214 (59%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 322 PFIDVIDHTNFHYRAQ---DEGARGAFDWEFFYK 352
>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Loxodonta africana]
Length = 622
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++L+ Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKEQLDQYVKQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D ++ P I ID T+EF + V H RG F+W + + +P E ++R
Sbjct: 283 LARIAEDETVVVSPDIITIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWETVPLHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P KSPT AGGLF++ +++F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 343 KDETYPIKSPTFAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPC 402
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 403 SVVGHVFR 410
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 141/247 (57%), Gaps = 14/247 (5%)
Query: 202 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 256
PP +P GPG GKA+ P+ + + ++ N S+ IS R + PD R
Sbjct: 106 PPQDP--NGPGADGKAFQKDKWTPQETQEK-EEGYKKHCFNAFASDRISLQRALGPDTRP 162
Query: 257 EEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSK 313
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ PA +L+EIILVDD S++
Sbjct: 163 PECLDQKFRRCP-QLPTTSVIIVFHNEAWSTLLRTVYSVLHTAPAIFLKEIILVDDASTE 221
Query: 314 ADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 373
L ++L+ Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL
Sbjct: 222 EYLKEQLDQYVKQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLE 280
Query: 374 PLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAK 432
PLLA I D ++ P I ID T+EF + V H RG F+W + + +P E +
Sbjct: 281 PLLARIAEDETVVVSPDIITIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWETVPLHEKQ 340
Query: 433 KRKYNRY 439
+RK Y
Sbjct: 341 RRKDETY 347
>gi|380813218|gb|AFE78483.1| polypeptide N-acetylgalactosaminyltransferase 9 isoform A [Macaca
mulatta]
Length = 603
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 115/187 (61%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R +C++ Y DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 116 EYGYNAQLSDRISLDRSIPDYRPRKCRHMSYAQDLPQVSVVFIFVNEALSVILRSVHSVV 175
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+Q L+E+ILVDD S +L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 176 NHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKA 235
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 236 ATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY- 294
Query: 413 GIFEWGM 419
WG+
Sbjct: 295 ---NWGL 298
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 109/189 (57%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 197 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 257 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 311
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R S P ++P G F +DR +F ++G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 312 RGDESAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVWQCGGSMEVLP 371
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 372 CSRVAHIER 380
>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oreochromis niloticus]
Length = 575
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 86/183 (46%), Positives = 116/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + L+ YI N KVRLIR T+REGL+R R GA + GEV+ FLD HCE WL P+
Sbjct: 175 LKEPLDKYISGLN-KVRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPV 233
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + K + PVID ID+ T+++ EP G F+W +++ + +P+ E K+R
Sbjct: 234 LHRIKEEPKAVVCPVIDVIDWNTFQYLGHAGEPQ---IGGFDWRLVFTWHSIPDYEQKRR 290
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + +SPT AGGLFA+ + FF LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 291 RSPVDVIRSPTMAGGLFAVRKDFFHYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEVHPC 350
Query: 181 SRI 183
S +
Sbjct: 351 SHV 353
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 95/239 (39%), Positives = 143/239 (59%), Gaps = 10/239 (4%)
Query: 201 EPPLEPYKEGPGEGGKAY--HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEE 258
+PPL+ E GE G+A +L E + + S+ + +N S+ IS R +P+
Sbjct: 59 KPPLD--LEAVGEMGRAVKLNLNEEEKRKEEESIKAHQINTYVSDKISLHRRLPERWNPL 116
Query: 259 CKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLD 317
CK Y LP SV++ F+NE +S+L+RTVHS+++ +P L+E++LVDD+S KA L
Sbjct: 117 CKELKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDKAHLK 176
Query: 318 QKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLA 377
+ L+ YI N KVRLIR T+REGL+R R GA + GEV+ FLD HCE WL P+L
Sbjct: 177 EPLDKYISGLN-KVRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPVLH 235
Query: 378 PIYSDRKIMTVPVIDGIDYQTWEFRS-VYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
I + K + PVID ID+ T+++ EP G F+W +++ + +P+ E K+R+
Sbjct: 236 RIKEEPKAVVCPVIDVIDWNTFQYLGHAGEPQ---IGGFDWRLVFTWHSIPDYEQKRRR 291
>gi|307207692|gb|EFN85329.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Harpegnathos
saltator]
Length = 598
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/189 (44%), Positives = 115/189 (60%), Gaps = 1/189 (0%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L +L+ Y+ R KV+L+R R+GLIR R GAK + G+V+VFLDAHCEV +WL
Sbjct: 189 ELQGQLDYYLSTRLPTKVKLLRLPYRQGLIRARLHGAKNATGDVLVFLDAHCEVIKDWLQ 248
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I R + +P+ID I +T E+ E G F W + + + E K
Sbjct: 249 PLLQRIKEKRNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHELKS 308
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R P +SPT AGGLFA+DR +F E+G YD + WGGEN E+SF+IW CGG++E +P
Sbjct: 309 RLSLISPTRSPTMAGGLFAIDRKYFWEVGSYDDKMDGWGGENLEMSFRIWQCGGTLEIIP 368
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 369 CSRVGHIFR 377
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 129/226 (57%), Gaps = 1/226 (0%)
Query: 210 GPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLP 269
G GE G+ +L + G+ L + +N+ SN IS R +PD+R C Y LP
Sbjct: 84 GLGENGEPAYLHGKEKVEGETVLAKKALNVVLSNKISLTRKLPDVRNPLCANLTYDTLLP 143
Query: 270 KASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFN 328
SVI++F+NE +S L+RTVHS++K + L+EIILVDD S + +L +L+ Y+ R
Sbjct: 144 SVSVIIIFYNEPWSVLLRTVHSVLKGSLPHLLKEIILVDDHSEEEELQGQLDYYLSTRLP 203
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
KV+L+R R+GLIR R GAK + G+V+VFLDAHCEV +WL PLL I R + +
Sbjct: 204 TKVKLLRLPYRQGLIRARLHGAKNATGDVLVFLDAHCEVIKDWLQPLLQRIKEKRNAVLM 263
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
P+ID I +T E+ E G F W + + + E K R
Sbjct: 264 PIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHELKSR 309
>gi|443685595|gb|ELT89149.1| hypothetical protein CAPTEDRAFT_34275, partial [Capitella teleta]
Length = 358
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 80/190 (42%), Positives = 113/190 (59%), Gaps = 3/190 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L++Y + + K+R+IR+ REGL+R R GAKE++ + +VFLDAH E WL PL
Sbjct: 87 LKAPLDEYCKVLSPKIRIIRSEHREGLMRGRMVGAKEAKADTLVFLDAHVECNEGWLDPL 146
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I + + VP +D ID QT ++ S +H G F W M Y+ LP+ K
Sbjct: 147 LQIIMDHPRAIAVPTMDNIDPQTIKYESW---NHVAYGGFTWNMEYQWKVLPDTLVNKLI 203
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++P+ SPT G AM+R +F E+GG+D G+ +WGGEN E+SFK WMCG + PCS
Sbjct: 204 SKTQPFPSPTTIGCAMAMNRDYFFEIGGFDEGMFIWGGENLEISFKTWMCGEGLYISPCS 263
Query: 182 RIVSLIRPVF 191
R+ L R +
Sbjct: 264 RVGHLFRTIL 273
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 84/205 (40%), Positives = 122/205 (59%), Gaps = 4/205 (1%)
Query: 229 DASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRT 288
+ + +Y MN++ SN + DR+I D R EC + L K S+I+ F++E +S L+R
Sbjct: 1 ETNFDQYSMNVQLSNTVPLDRSILDTRNPECHVVQFSQQL-KVSIIVPFYDESWSMLLRM 59
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 348
+HS+I RTP LEEIIL+DD SS+ L L++Y + + K+R+IR+ REGL+R R
Sbjct: 60 LHSVIDRTPDALLEEIILIDDKSSRDYLKAPLDEYCKVLSPKIRIIRSEHREGLMRGRMV 119
Query: 349 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 408
GAKE++ + +VFLDAH E WL PLL I + + VP +D ID QT ++ S +
Sbjct: 120 GAKEAKADTLVFLDAHVECNEGWLDPLLQIIMDHPRAIAVPTMDNIDPQTIKYESW---N 176
Query: 409 HHYRGIFEWGMLYKENELPEREAKK 433
H G F W M Y+ LP+ K
Sbjct: 177 HVAYGGFTWNMEYQWKVLPDTLVNK 201
>gi|355786668|gb|EHH66851.1| hypothetical protein EGM_03916, partial [Macaca fascicularis]
Length = 524
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 115/187 (61%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R +C++ Y DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 37 EYGYNAQLSDRISLDRSIPDYRPRKCRHMSYAQDLPQVSVVFIFVNEALSVILRSVHSVV 96
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+Q L+E+ILVDD S +L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 97 NHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKA 156
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 157 ATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY- 215
Query: 413 GIFEWGM 419
WG+
Sbjct: 216 ---NWGL 219
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/189 (36%), Positives = 101/189 (53%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 118 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 177
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ P
Sbjct: 178 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYISPXXXXXX 232
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
S P ++P G F +DR +F ++G DPG+ V+GGEN EL + GS+E +P
Sbjct: 233 XXXPSLPPRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRXXXXXGSMEVLP 292
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 293 CSRVAHIER 301
>gi|24656262|ref|NP_647749.2| polypeptide GalNAc transferase 6, isoform A [Drosophila
melanogaster]
gi|24656265|ref|NP_728779.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
melanogaster]
gi|442629817|ref|NP_001261342.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
melanogaster]
gi|51315873|sp|Q6WV16.2|GALT6_DROME RecName: Full=N-acetylgalactosaminyltransferase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|7292281|gb|AAF47689.1| polypeptide GalNAc transferase 6, isoform A [Drosophila
melanogaster]
gi|7292282|gb|AAF47690.1| polypeptide GalNAc transferase 6, isoform B [Drosophila
melanogaster]
gi|440215219|gb|AGB94037.1| polypeptide GalNAc transferase 6, isoform C [Drosophila
melanogaster]
Length = 666
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/188 (48%), Positives = 117/188 (62%), Gaps = 8/188 (4%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L K+LE YI VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPL
Sbjct: 249 LGKELETYIAEHFKWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPL 308
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENEL-PEREAKKR 120
L PI +++ P ID ID+ + +R+ D RG F+W YK L PE
Sbjct: 309 LEPIALNKRTAVCPFIDVIDHTNFHYRA---QDEGARGAFDWEFFYKRLPLLPE----DL 361
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K+ ++P+KSP AGGLFA+ R FF ELGGYD GL +WGGE +ELSFKIWMCGG + PC
Sbjct: 362 KHPADPFKSPIMAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPC 421
Query: 181 SRIVSLIR 188
SRI + R
Sbjct: 422 SRIGHIYR 429
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 92/214 (42%), Positives = 128/214 (59%), Gaps = 4/214 (1%)
Query: 210 GPGEGGKAYHLP-EAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDL 268
G GEGGKA L E+ R E G N S+ IS +R++PD+R C+ +Y L
Sbjct: 142 GLGEGGKASTLDDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKL 201
Query: 269 PKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN 328
P SVI++F+NE S LMR+VHS+I R+P + ++EIILVDD S + L ++LE YI
Sbjct: 202 PTVSVIIIFYNEYLSVLMRSVHSLINRSPPELMKEIILVDDHSDREYLGKELETYIAEHF 261
Query: 329 GKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTV 388
VR++R R GLI R+ GA+ + EV++FLD+H E NWLPPLL PI +++
Sbjct: 262 KWVRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALNKRTAVC 321
Query: 389 PVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
P ID ID+ + +R+ D RG F+W YK
Sbjct: 322 PFIDVIDHTNFHYRA---QDEGARGAFDWEFFYK 352
>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PL
Sbjct: 232 LHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y ++H RG F+W + + LP+ E ++
Sbjct: 291 LARIAENYTAVVSPDIASIDLNTFEFNKPSPYG-NNHNRGNFDWSLSFGWESLPDHEKQR 349
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 350 RKDETYPIKTPTFAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMP 409
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 410 CSVVGHVFR 418
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/262 (39%), Positives = 144/262 (54%), Gaps = 22/262 (8%)
Query: 187 IRPVFKADGKLGNLEPPLEPYKEGPGEGGKAY---HLPEAYRAAGDASLGEYGMNMETSN 243
++PVF PP + PG GK + HL + + ++ N S+
Sbjct: 107 LKPVFD--------RPPQD--SNAPGASGKPFKITHLSPEEQKEKERGETKHCFNAFASD 156
Query: 244 HISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQ 299
IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +PA
Sbjct: 157 RISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAI 215
Query: 300 YLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIV 359
L+EIILVDD S L +KLE+YI++F+ V+++R ER+GLI R GA + E +
Sbjct: 216 LLKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQQERKGLITARLLGAAVATAETLT 274
Query: 360 FLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEW 417
FLDAHCE WL PLLA I + + P I ID T+EF S Y ++H RG F+W
Sbjct: 275 FLDAHCECFYGWLEPLLARIAENYTAVVSPDIASIDLNTFEFNKPSPYG-NNHNRGNFDW 333
Query: 418 GMLYKENELPEREAKKRKYNRY 439
+ + LP+ E ++RK Y
Sbjct: 334 SLSFGWESLPDHEKQRRKDETY 355
>gi|397507535|ref|XP_003824250.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Pan
paniscus]
Length = 529
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 150 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 209
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 210 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 266
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 267 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 319
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 42 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 100
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 101 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 155
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 156 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 215
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 216 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 251
>gi|334348942|ref|XP_001380115.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Monodelphis domestica]
Length = 642
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 113/182 (62%), Gaps = 4/182 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L D++ + G VRL+R+ +R G IR R GA + G+V+VF+D+HCE WL PL
Sbjct: 240 LKSALSDHVAQLEG-VRLLRSNKRLGAIRGRMLGAARAAGDVLVFMDSHCECHKGWLEPL 298
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR + P+ID ID++ +++ Y RG+F+W + + LPE+E K R+
Sbjct: 299 LSRIAGDRSRLVSPIIDVIDWKNFQY---YHSMDLQRGVFDWELNFHWRPLPEQERKMRQ 355
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SP GG+ A+DR +F G YD + +WG EN ELS ++W+CGGS+E +PCS
Sbjct: 356 SPISPIRSPVLPGGVLAIDRHYFQNTGAYDSLMSIWGSENLELSIRVWLCGGSVEIIPCS 415
Query: 182 RI 183
R+
Sbjct: 416 RV 417
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 116/186 (62%), Gaps = 4/186 (2%)
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
+P +M+ C DY LP ASVI+ FH+E +S+L+RTVHSI+ P +L+EIILVDD
Sbjct: 174 ALPPSKMKLCLQEDYGDHLPTASVIICFHDEAWSTLLRTVHSILDTAPRPFLKEIILVDD 233
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L L D++ + G VRL+R+ +R G IR R GA + G+V+VF+D+HCE
Sbjct: 234 LSQQGHLKSALSDHVAQLEG-VRLLRSNKRLGAIRGRMLGAARAAGDVLVFMDSHCECHK 292
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLL+ I DR + P+ID ID++ +++ Y RG+F+W + + LPE+
Sbjct: 293 GWLEPLLSRIAGDRSRLVSPIIDVIDWKNFQY---YHSMDLQRGVFDWELNFHWRPLPEQ 349
Query: 430 EAKKRK 435
E K R+
Sbjct: 350 ERKMRQ 355
>gi|62122367|dbj|BAD93178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|119601393|gb|EAW80987.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_b
[Homo sapiens]
gi|168269696|dbj|BAG09975.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
[synthetic construct]
Length = 542
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|297695402|ref|XP_002824932.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pongo abelii]
Length = 558
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|270265820|ref|NP_065743.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|270265827|ref|NP_001161840.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|332842578|ref|XP_522885.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|51316024|sp|Q8N428.2|GLTL1_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|51490858|emb|CAD44534.1| polypeptide N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|112180422|gb|AAH36812.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|112818460|gb|AAI22546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|119601392|gb|EAW80986.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|119601394|gb|EAW80988.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|164691113|dbj|BAF98739.1| unnamed protein product [Homo sapiens]
gi|410265456|gb|JAA20694.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|297298138|ref|XP_001104403.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Macaca
mulatta]
Length = 558
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 122/219 (55%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L + +A G+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|68534728|gb|AAH98578.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|158260513|dbj|BAF82434.1| unnamed protein product [Homo sapiens]
Length = 558
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADMAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADMAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|345328051|ref|XP_003431229.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Ornithorhynchus anatinus]
Length = 863
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 117/188 (62%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 552 LKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPL 610
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + RK + PVI+ I + +++V D+ RGIF W M + +P +K K
Sbjct: 611 LERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKNK 667
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG IE VPC
Sbjct: 668 MKETDIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVPC 727
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 728 SRVGHIFR 735
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 133/241 (55%), Gaps = 5/241 (2%)
Query: 196 KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 254
K+ L+ L P + PG+ G A +P + E N+ S+ I DR I D
Sbjct: 431 KVLTLDVTLSPRDPKAPGQFGHAAVVPAEKQERAKKRWKEGNFNVYLSDLIPVDRAIEDT 490
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R + C DLP ++I+ F +E +S+L+R++HS++ R+P ++EIILVDDFS+K
Sbjct: 491 RPDGCAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKE 550
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL P
Sbjct: 551 HLKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEP 609
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LL + RK + PVI+ I + +++V D+ RGIF W M + +P +K
Sbjct: 610 LLERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKN 666
Query: 435 K 435
K
Sbjct: 667 K 667
>gi|109099288|ref|XP_001106039.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 9 [Macaca
mulatta]
Length = 597
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 115/187 (61%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R +C++ Y DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 116 EYGYNAQLSDRISLDRSIPDYRPRKCRHMSYAQDLPQVSVVFIFVNEALSVILRSVHSVV 175
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+Q L+E+ILVDD S +L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 176 NHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKA 235
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 236 ATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY- 294
Query: 413 GIFEWGM 419
WG+
Sbjct: 295 ---NWGL 298
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 109/189 (57%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 197 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 256
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 257 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 311
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R S P ++P G F +DR +F ++G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 312 RGDESAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVWQCGGSMEVLP 371
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 372 CSRVAHIER 380
>gi|355693388|gb|EHH27991.1| hypothetical protein EGK_18322, partial [Macaca mulatta]
Length = 499
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 120 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 179
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 180 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 236
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 237 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 289
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 122/219 (55%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L + +A G+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 12 KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 70
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 71 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 125
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 126 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 185
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 186 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 221
>gi|10436305|dbj|BAB14795.1| unnamed protein product [Homo sapiens]
Length = 457
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 134 KVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 193
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 194 VIDIINLDTFTY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 250
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E VPCSR+ + R
Sbjct: 251 LFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFR 303
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 100/208 (48%), Gaps = 42/208 (20%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R IPD R
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAIPDTRH------------------------------ 98
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
S++ RTP + EIILVDDFS+ D ++L KV+ +RN ER+GL+R+R
Sbjct: 99 ---LSVLNRTPTHLIREIILVDDFSNDPDDCKQLIKL-----PKVKCLRNNERQGLVRSR 150
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 151 IRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY---IE 207
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 208 SASELRGGFDWSLHFQWEQLSPEQKARR 235
>gi|307172175|gb|EFN63700.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Camponotus floridanus]
Length = 433
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/194 (46%), Positives = 121/194 (62%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KVR+IR ++REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 4 LKRQLEDYMMNYP-KVRIIRASKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 62
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 63 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 114
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLF++DRAFF +G YD G +WGGEN ELSFK WMCGG+
Sbjct: 115 REKKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSFKTWMCGGT 174
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 175 LEIVPCSHVGHIFR 188
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 72/127 (56%), Gaps = 16/127 (12%)
Query: 316 LDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 375
L ++LEDY+ + KVR+IR ++REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 4 LKRQLEDYMMNYP-KVRIIRASKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 62
Query: 376 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 428
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 63 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 114
Query: 429 REAKKRK 435
RE K+ K
Sbjct: 115 REKKRHK 121
>gi|332228990|ref|XP_003263671.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Nomascus leucogenys]
Length = 558
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|363731300|ref|XP_419370.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Gallus
gallus]
Length = 552
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN +REGLIR+R +GA ++ V+ FLD+HCEV +WL PLL I D + P
Sbjct: 167 KVKCLRNRQREGLIRSRIQGADVAQAGVLTFLDSHCEVNKDWLLPLLQRIKEDPTRVVSP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + + RG F+W + +K +L + KR ++P K+P AGG
Sbjct: 227 VIDIINLDTFAYVAA---SSDLRGGFDWSLHFKWEQLSPEQKAKRLDPTKPIKTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 284 LFVIDKAWFNHLGKYDNAMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 336
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 114/200 (57%), Gaps = 9/200 (4%)
Query: 235 YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIK 294
+ N S + DR + D R C Y DLP S+++ FHNE S+L+RT+ S++
Sbjct: 77 HAFNQRESERMPSDRAVRDTRHHRCTTLHYRQDLPPTSIVITFHNEARSTLLRTIRSVMN 136
Query: 295 RTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESR 354
RTP + EIILVDDFS D + L KV+ +RN +REGLIR+R +GA ++
Sbjct: 137 RTPVHLIHEIILVDDFSDDPDDCRLLAKL-----PKVKCLRNRQREGLIRSRIQGADVAQ 191
Query: 355 GEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGI 414
V+ FLD+HCEV +WL PLL I D + PVID I+ T+ + + RG
Sbjct: 192 AGVLTFLDSHCEVNKDWLLPLLQRIKEDPTRVVSPVIDIINLDTFAYVAA---SSDLRGG 248
Query: 415 FEWGMLYKENEL-PEREAKK 433
F+W + +K +L PE++AK+
Sbjct: 249 FDWSLHFKWEQLSPEQKAKR 268
>gi|355564838|gb|EHH21338.1| hypothetical protein EGK_04374, partial [Macaca mulatta]
Length = 524
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 79/187 (42%), Positives = 115/187 (61%), Gaps = 5/187 (2%)
Query: 234 EYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSII 293
EYG N + S+ IS DR+IPD R +C++ Y DLP+ SV+ +F NE S ++R+VHS++
Sbjct: 37 EYGYNAQLSDRISLDRSIPDYRPRKCRHMSYAQDLPQVSVVFIFVNEALSVILRSVHSVV 96
Query: 294 KRTPAQYLEEIILVDDFSSKADLDQKLEDYI-QRFNGKVRLIRNTEREGLIRTRSRGAKE 352
TP+Q L+E+ILVDD S +L L+ Y+ +R+ G V+++RN+ REGLIR R +G K
Sbjct: 97 NHTPSQLLKEVILVDDNSDNVELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKA 156
Query: 353 SRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR 412
+ V+ F DAH E W P L+ I DR+ + +P ID I Y T+E + H Y
Sbjct: 157 ATAPVVGFFDAHVEFNTGWAEPALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY- 215
Query: 413 GIFEWGM 419
WG+
Sbjct: 216 ---NWGL 219
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 109/189 (57%), Gaps = 6/189 (3%)
Query: 1 DLDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L L+ Y+ +R+ G V+++RN+ REGLIR R +G K + V+ F DAH E W
Sbjct: 118 ELKFNLDQYVNKRYPGLVKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAE 177
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
P L+ I DR+ + +P ID I Y T+E + H Y WG L+ +P ++
Sbjct: 178 PALSRIREDRRRIVLPAIDNIKYSTFEVQQYANAAHGY----NWG-LWCMYIIPPQDWLD 232
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
R S P ++P G F +DR +F ++G DPG+ V+GGEN EL ++W CGGS+E +P
Sbjct: 233 RGDESAPIRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVWQCGGSMEVLP 292
Query: 180 CSRIVSLIR 188
CSR+ + R
Sbjct: 293 CSRVAHIER 301
>gi|322799307|gb|EFZ20695.1| hypothetical protein SINV_03796 [Solenopsis invicta]
Length = 338
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 91/239 (38%), Positives = 145/239 (60%), Gaps = 11/239 (4%)
Query: 194 DGKLGNLEP-PLEP---YKEGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDR 249
D L L+P P++P +G E G +L + + D Y N+ S+++ R
Sbjct: 91 DLLLQQLQPVPVKPAVTLDQGLDELGMVKNLEDQRKR--DEGYKNYAFNVLISDNLGVRR 148
Query: 250 TIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
IPD R + C YP +LP AS+I+ F+NE +++L+R++H++++RTP L EIILV+D
Sbjct: 149 DIPDTRHKLCGAQKYPANLPNASIIICFYNEHYTTLLRSLHTVLERTPMALLHEIILVND 208
Query: 310 FSSKADLDQKLEDYIQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVG 368
+S + L +K++ YI+ F+ +VRL + +REGLIR R GA+++ G+V++FLD+H EV
Sbjct: 209 YSDSSALHEKIKVYIRNNFDDRVRLFKTEKREGLIRARVFGARKATGKVLIFLDSHIEVN 268
Query: 369 LNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 427
W+ PLL+ I R I+ +PVID I+ T++ Y RG F WG+ +K + LP
Sbjct: 269 EMWIEPLLSRIAYSRNIVPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLP 323
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 71/113 (62%), Gaps = 5/113 (4%)
Query: 2 LDKKLEDYI-ERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 60
L +K++ YI F+ +VRL + +REGLIR R GA+++ G+V++FLD+H EV W+ P
Sbjct: 215 LHEKIKVYIRNNFDDRVRLFKTEKREGLIRARVFGARKATGKVLIFLDSHIEVNEMWIEP 274
Query: 61 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELP 113
LL+ I R I+ +PVID I+ T++ Y RG F WG+ +K + LP
Sbjct: 275 LLSRIAYSRNIVPMPVIDIINADTFQ----YTGSPLVRGGFNWGLHFKWDNLP 323
>gi|149639580|ref|XP_001512277.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Ornithorhynchus anatinus]
Length = 949
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/188 (44%), Positives = 117/188 (62%), Gaps = 5/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL PL
Sbjct: 552 LKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPL 610
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L + RK + PVI+ I + +++V D+ RGIF W M + +P +K K
Sbjct: 611 LERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKNK 667
Query: 122 YN-SEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
++ + P AGGLF++D+ +F ELG YDPGL VWGGEN E+SFK+WMCGG IE VPC
Sbjct: 668 MKETDIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVPC 727
Query: 181 SRIVSLIR 188
SR+ + R
Sbjct: 728 SRVGHIFR 735
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 87/241 (36%), Positives = 133/241 (55%), Gaps = 5/241 (2%)
Query: 196 KLGNLEPPLEPYK-EGPGEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDL 254
K+ L+ L P + PG+ G A +P + E N+ S+ I DR I D
Sbjct: 431 KVLTLDVTLSPRDPKAPGQFGHAAVVPAEKQERAKKRWKEGNFNVYLSDLIPVDRAIEDT 490
Query: 255 RMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
R + C DLP ++I+ F +E +S+L+R++HS++ R+P ++EIILVDDFS+K
Sbjct: 491 RPDGCAEQLVHNDLPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKE 550
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L L+ Y+ +F KVR++ ER GLIR R GA+ + G+V+ FLD+H E + WL P
Sbjct: 551 HLKDNLDKYMAQF-PKVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEP 609
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
LL + RK + PVI+ I + +++V D+ RGIF W M + +P +K
Sbjct: 610 LLERVRLHRKKVACPVIEVISDKDLSYQTV---DNFQRGIFTWPMNFGWKSIPPEVIEKN 666
Query: 435 K 435
K
Sbjct: 667 K 667
>gi|410214072|gb|JAA04255.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410214074|gb|JAA04256.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295440|gb|JAA26320.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295442|gb|JAA26321.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410336845|gb|JAA37369.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 122/219 (55%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS DL+ L + R KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSS--DLEDCL--LLTRI-PKVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|380786811|gb|AFE65281.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Macaca mulatta]
Length = 558
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 122/219 (55%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L + +A G+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAYLLAKQLKA-GEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|301611308|ref|XP_002935181.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Xenopus (Silurana) tropicalis]
Length = 532
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 86/183 (46%), Positives = 113/183 (61%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE YI + KVRLIR +REGL+R R GA ++ EV+ FLD HCE WL PL
Sbjct: 166 LKEPLEKYISSWR-KVRLIRANKREGLVRARLLGASIAKAEVLTFLDCHCECHEGWLEPL 224
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + PVID ID+ T+E+ + EP G F+W M++ + +PE E KKR
Sbjct: 225 LERIREKESAVICPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRMVFTWHTVPETEQKKR 281
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + SPT AGGLF++ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 282 RSPIDVISSPTMAGGLFSVSKKYFEHLGSYDTGMEVWGGENLEFSFRIWQCGGSLEVHPC 341
Query: 181 SRI 183
S +
Sbjct: 342 SHV 344
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 85/186 (45%), Positives = 114/186 (61%), Gaps = 6/186 (3%)
Query: 252 PDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDF 310
P E CK Y LPK SVI+ F+NE +S+L+RTVHS+++ +P LEEIILVDD+
Sbjct: 101 PRFDPERCKTVKYKYRRLPKTSVIIAFYNEAWSTLLRTVHSVLETSPDLLLEEIILVDDY 160
Query: 311 SSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLN 370
S + L + LE YI + KVRLIR +REGL+R R GA ++ EV+ FLD HCE
Sbjct: 161 SDREHLKEPLEKYISSWR-KVRLIRANKREGLVRARLLGASIAKAEVLTFLDCHCECHEG 219
Query: 371 WLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPER 429
WL PLL I + PVID ID+ T+E+ + EP G F+W M++ + +PE
Sbjct: 220 WLEPLLERIREKESAVICPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRMVFTWHTVPET 276
Query: 430 EAKKRK 435
E KKR+
Sbjct: 277 EQKKRR 282
>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
Length = 619
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 79/188 (42%), Positives = 117/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE+Y++ V+++R ER+GLI R GA ++ GE++ FLDAHCE WL PL
Sbjct: 219 LHGKLEEYVKALK-IVKVVRQPERKGLITARLLGASKAEGEILTFLDAHCECFHGWLEPL 277
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I + + P I ID T++F + V H RG F+W + + +P+ E KR
Sbjct: 278 LARIVEEPTAVVSPEITTIDLNTFQFHKPVATARAHNRGNFDWSLTFGWEGIPDYENAKR 337
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P K+PT AGGLF++ +A+F ++G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 338 KDETYPVKTPTFAGGLFSISKAYFEKIGTYDDKMEIWGGENVEMSFRVWQCGGQLEIIPC 397
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 398 SVVGHVFR 405
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 93/246 (37%), Positives = 135/246 (54%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYH---LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP P + PG G + + + + + N S+ IS RT+ D R
Sbjct: 101 PPENP--QAPGADGVPFQYDRMTKEEEKEKQEGMTRHCFNQFASDRISLHRTLGDDTRPP 158
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC K+ P LP SVI+VFHNE +S+L+RTV+S++ +PA +L+EII+VDD S+
Sbjct: 159 ECVDRKFRRCPA-LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAAFLKEIIMVDDASTAE 217
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L KLE+Y++ V+++R ER+GLI R GA ++ GE++ FLDAHCE WL P
Sbjct: 218 HLHGKLEEYVKALK-IVKVVRQPERKGLITARLLGASKAEGEILTFLDAHCECFHGWLEP 276
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I + + P I ID T++F + V H RG F+W + + +P+ E K
Sbjct: 277 LLARIVEEPTAVVSPEITTIDLNTFQFHKPVATARAHNRGNFDWSLTFGWEGIPDYENAK 336
Query: 434 RKYNRY 439
RK Y
Sbjct: 337 RKDETY 342
>gi|291290949|ref|NP_001167507.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Xenopus
laevis]
gi|83405263|gb|AAI10707.1| Unknown (protein for MGC:130697) [Xenopus laevis]
Length = 622
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/188 (42%), Positives = 115/188 (61%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KL+DY++ V++ R ER+GLI R GA + GEV+ FLDAHCE WL PL
Sbjct: 222 LKEKLDDYVKALQ-IVKIARQKERKGLITARLLGASIATGEVLTFLDAHCECFHGWLEPL 280
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L+ I D + P I ID ++EF + V H RG F+W + + +PE E +R
Sbjct: 281 LSRIAEDYTAVVSPDITTIDLNSFEFAKPVQYGKTHSRGNFDWSLTFGWEAIPEAEKLRR 340
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P K+PT AGGLF++ +A+F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 341 KNETYPIKTPTFAGGLFSISKAYFEHIGSYDEDMEIWGGENVEMSFRVWQCGGQLEIIPC 400
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 401 SVVGHVFR 408
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 88/211 (41%), Positives = 121/211 (57%), Gaps = 7/211 (3%)
Query: 234 EYGMNMETSNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTV 289
++ N S+ IS R + PD R C K+ P LP SVI+VFHNE +S+L+RTV
Sbjct: 137 KHCFNAYASDRISLHRALGPDTRPPVCIEQKFKRCP-SLPTTSVIIVFHNEAWSTLLRTV 195
Query: 290 HSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRG 349
+S++ +PA L+EIILVDD S L +KL+DY++ V++ R ER+GLI R G
Sbjct: 196 YSVLYTSPAILLKEIILVDDASEDEYLKEKLDDYVKALQ-IVKIARQKERKGLITARLLG 254
Query: 350 AKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPD 408
A + GEV+ FLDAHCE WL PLL+ I D + P I ID ++EF + V
Sbjct: 255 ASIATGEVLTFLDAHCECFHGWLEPLLSRIAEDYTAVVSPDITTIDLNSFEFAKPVQYGK 314
Query: 409 HHYRGIFEWGMLYKENELPEREAKKRKYNRY 439
H RG F+W + + +PE E +RK Y
Sbjct: 315 THSRGNFDWSLTFGWEAIPEAEKLRRKNETY 345
>gi|402876549|ref|XP_003902024.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Papio
anubis]
Length = 558
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 179 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 238
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 239 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 295
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 296 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 348
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 71 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 129
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 130 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 184
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 185 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 244
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 245 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 280
>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus impatiens]
Length = 637
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/194 (46%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 208 LQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 266
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 267 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 318
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 319 REKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 378
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 379 LEIVPCSHVGHIFR 392
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 102/251 (40%), Positives = 136/251 (54%), Gaps = 23/251 (9%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 253 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 424
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 425 ELPEREAKKRK 435
+PERE K+ K
Sbjct: 315 AVPEREKKRHK 325
>gi|345317797|ref|XP_001520970.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Ornithorhynchus anatinus]
Length = 467
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/183 (45%), Positives = 117/183 (63%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+++ KVRLIR ++REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 70 LKEPLENHLAGL-PKVRLIRASKREGLVRARLLGASIATGQVLTFLDCHCECHEGWLEPL 128
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I + + PVID ID+ T+E+ + EP G F+W +++ + +PERE K+R
Sbjct: 129 LERIREEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHPIPEREQKRR 185
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+ + +SPT AGGLFA+ + +F LG YD G+ VWGGEN E SF+IW CGGS+E PC
Sbjct: 186 RSKVDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPC 245
Query: 181 SRI 183
S +
Sbjct: 246 SHV 248
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/177 (43%), Positives = 114/177 (64%), Gaps = 6/177 (3%)
Query: 260 KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQK 319
K +DY LP+ SV++ F+NE S+L+RTVHS+++ +P L E+ILVDD+S K L +
Sbjct: 15 KKYDY-RRLPRTSVVIAFYNEARSTLLRTVHSVLETSPDVLLNEVILVDDYSDKGHLKEP 73
Query: 320 LEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPI 379
LE+++ KVRLIR ++REGL+R R GA + G+V+ FLD HCE WL PLL I
Sbjct: 74 LENHLAGL-PKVRLIRASKREGLVRARLLGASIATGQVLTFLDCHCECHEGWLEPLLERI 132
Query: 380 YSDRKIMTVPVIDGIDYQTWEFR-SVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
+ + PVID ID+ T+E+ + EP G F+W +++ + +PERE K+R+
Sbjct: 133 REEESAVVCPVIDVIDWNTFEYLGNAGEPQ---IGGFDWRLVFTWHPIPEREQKRRR 186
>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 3 [Bombus terrestris]
Length = 637
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/194 (46%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 208 LQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 266
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 267 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 318
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 319 REKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 378
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 379 LEIVPCSHVGHIFR 392
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 102/251 (40%), Positives = 136/251 (54%), Gaps = 23/251 (9%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 253 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 424
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 425 ELPEREAKKRK 435
+PERE K+ K
Sbjct: 315 AVPEREKKRHK 325
>gi|345491789|ref|XP_001607575.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 566
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 86/207 (41%), Positives = 130/207 (62%), Gaps = 1/207 (0%)
Query: 212 GEGGKAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKA 271
G+ G+A +L ++ + G + +N+ SN I R I D+R CK Y LP
Sbjct: 63 GDFGEAAYLSDSEKQNGSLVYSKRAVNVVLSNKIPLQRRIRDMRDPLCKSVTYDTKLPTT 122
Query: 272 SVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQ-RFNGK 330
SV+++FHNE +S L+RTV+S+++ +P ++L+EIILVDD S++ +L+ L YI+ R K
Sbjct: 123 SVVIIFHNEAWSVLLRTVYSVLQESPPKFLKEIILVDDNSNEEELEDILAYYIETRLPKK 182
Query: 331 VRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPV 390
V+L+R +R+GLIR R GA+++ G+V+VFLDAHCEV WL PLL I + + +PV
Sbjct: 183 VKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLLHRIKARPNAVLIPV 242
Query: 391 IDGIDYQTWEFRSVYEPDHHYRGIFEW 417
ID ID +T E++ H G F+W
Sbjct: 243 IDVIDAKTLEYKLAARGSHMPIGGFKW 269
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 117/192 (60%), Gaps = 1/192 (0%)
Query: 1 DLDKKLEDYIE-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLP 59
+L+ L YIE R KV+L+R +R+GLIR R GA+++ G+V+VFLDAHCEV WL
Sbjct: 166 ELEDILAYYIETRLPKKVKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLS 225
Query: 60 PLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
PLL I + + +PVID ID +T E++ H G F+W + + + +
Sbjct: 226 PLLHRIKARPNAVLIPVIDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSPKRT 285
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
+P +PT AGGLFA+DR +F +G YD + WGGEN E+SF+IW CGGSIE VP
Sbjct: 286 TASPIDPINTPTMAGGLFAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVP 345
Query: 180 CSRIVSLIRPVF 191
CSR+ + R F
Sbjct: 346 CSRVGHIFRDFF 357
>gi|426377334|ref|XP_004055422.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Gorilla gorilla gorilla]
Length = 598
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 219 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 278
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 279 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 335
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 336 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 388
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 111 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 169
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 170 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 224
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 225 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 284
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 285 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 320
>gi|6329812|dbj|BAA86444.1| KIAA1130 protein [Homo sapiens]
Length = 575
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 108/173 (62%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P
Sbjct: 212 KVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSP 271
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R + P ++P AGG
Sbjct: 272 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPVIAGG 328
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 329 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 381
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 88/219 (40%), Positives = 120/219 (54%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP SVI+
Sbjct: 104 KAY-LSAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVII 162
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 163 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 217
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA + V+ FLD+HCEV WLPP+L + D + P+ID I
Sbjct: 218 NDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKEDHTRVVSPIIDVIS 277
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 278 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 313
>gi|344259060|gb|EGW15164.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Cricetulus
griseus]
Length = 400
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 140/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ E D ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--SSPGADGKAFQKKEWTPLETQEKDEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC ++ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+E+ILVDD S++
Sbjct: 164 ECVDQRFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEVILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKERLEQYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ + P I ID T++F + V + H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSKPVQKGKAHSRGNFDWSLTFGWEMLPAHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 74/174 (42%), Positives = 107/174 (61%), Gaps = 2/174 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKERLEQYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ + P I ID T++F + V + H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTAVVSPDIVTIDLNTFQFSKPVQKGKAHSRGNFDWSLTFGWEMLPAHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF++ M S
Sbjct: 343 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDDQMEIWGGENVEMSFRVRMKRAS 396
>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Megachile rotundata]
Length = 638
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/194 (46%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 209 LQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 267
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 268 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 319
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 320 REKKRHKNPAEPVWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWMCGGT 379
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 380 LEIVPCSHVGHIFR 393
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 100/251 (39%), Positives = 136/251 (54%), Gaps = 23/251 (9%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 86 GVLVAPREQDSSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 144
Query: 253 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
D R CK Y +LP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDD+S
Sbjct: 145 DPRDPWCKEPGRYLKELPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDYS 204
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 205 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 263
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 424
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 264 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 315
Query: 425 ELPEREAKKRK 435
+PERE K+ K
Sbjct: 316 AVPEREKKRHK 326
>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
Length = 585
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/173 (48%), Positives = 107/173 (61%), Gaps = 6/173 (3%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVRL+ ER GLI R GA+E+RG+V++ LD+H EV NWLPPLL PI D + P
Sbjct: 185 KVRLVDLPERSGLIVARMAGAREARGDVLIVLDSHTEVNTNWLPPLLEPIAEDYRTCVCP 244
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
ID I + T+++RS D RG F+W YK L + ++P+ SP AGG
Sbjct: 245 FIDVIAHDTFQYRS---QDEGKRGAFDWKFYYKRLPLLPGDLDD---PTKPFNSPVMAGG 298
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LFA+ FF ELGGYD GL +WGGE +ELSFKIW CGG + PCSR+ + R
Sbjct: 299 LFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHVYR 351
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/220 (40%), Positives = 129/220 (58%), Gaps = 17/220 (7%)
Query: 205 EPYKEGPGEGGKAYHLPEAYRAAGDASL-GEYGMNMETSNHISFDRTIPDLRMEECKYWD 263
E + GPGE G+ Y L A +A L E G + S+ I+ +R+ +
Sbjct: 70 ESKRTGPGEHGRPYKLSSEQDIALNAKLFKENGYSAVVSDMIALNRS------------E 117
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y +LP SVI++F+NE +S+L+RTV+S++ R+P L+EIILV+D S+K L L ++
Sbjct: 118 YLKELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPALLKEIILVNDHSTKPFLWTPLREF 177
Query: 324 IQ-RFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSD 382
++ KVRL+ ER GLI R GA+E+RG+V++ LD+H EV NWLPPLL PI D
Sbjct: 178 VESELAPKVRLVDLPERSGLIVARMAGAREARGDVLIVLDSHTEVNTNWLPPLLEPIAED 237
Query: 383 RKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYK 422
+ P ID I + T+++RS D RG F+W YK
Sbjct: 238 YRTCVCPFIDVIAHDTFQYRS---QDEGKRGAFDWKFYYK 274
>gi|344235750|gb|EGV91853.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Cricetulus griseus]
Length = 797
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 73/173 (42%), Positives = 110/173 (63%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN +REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P
Sbjct: 163 KVKCLRNDKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQPMLQRVMEDHTRVVSP 222
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R ++P ++P AGG
Sbjct: 223 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTRTDPTKPIRTPVIAGG 279
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
+F +D+++F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R
Sbjct: 280 IFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFR 332
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 121/218 (55%), Gaps = 13/218 (5%)
Query: 222 EAYRAAGDASLGE-----YGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILV 276
+AY +A GE + N S+ +S DR I D R C Y LDLP SVI+
Sbjct: 55 KAYLSAKQLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSLDLPATSVIIT 114
Query: 277 FHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRN 336
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +RN
Sbjct: 115 FHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLRN 169
Query: 337 TEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDY 396
+REGLIR+R RGA + V+ FLD+HCEV + WL P+L + D + P+ID I
Sbjct: 170 DKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQPMLQRVMEDHTRVVSPIIDVISL 229
Query: 397 QTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 230 DNFAYLAA---SADLRGGFDWSLHFKWEQIPLEQKMTR 264
>gi|354468358|ref|XP_003496633.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Cricetulus griseus]
Length = 541
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/173 (43%), Positives = 112/173 (64%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R RGA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 156 KVKCLRNNERQGLVRSRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 215
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + R +EP ++P AGG
Sbjct: 216 VIDIINLDTFNY---IESASELRGGFDWSLHFQWEQLSPEQKALRLDPTEPIRTPIIAGG 272
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 273 LFVIDKAWFDYLGKYDVDMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFR 325
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 84/207 (40%), Positives = 119/207 (57%), Gaps = 11/207 (5%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R + C Y DLP S+I+ FHNE S+L+
Sbjct: 58 VGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTLL 117
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFN-GKVRLIRNTEREGLIRT 345
RT+ S++ RTP ++EIILVDDFS+ ED Q KV+ +RN ER+GL+R+
Sbjct: 118 RTIRSVLNRTPTHLIQEIILVDDFSNDP------EDCKQLIKLPKVKCLRNNERQGLVRS 171
Query: 346 RSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVY 405
R RGA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ +
Sbjct: 172 RMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---I 228
Query: 406 EPDHHYRGIFEWGMLYKENEL-PEREA 431
E RG F+W + ++ +L PE++A
Sbjct: 229 ESASELRGGFDWSLHFQWEQLSPEQKA 255
>gi|395515411|ref|XP_003761898.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Sarcophilus harrisii]
Length = 590
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/182 (44%), Positives = 116/182 (63%), Gaps = 3/182 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + LE+++ KVRLIR +REGL+R R GA + GE++ FLD HCE WL PL
Sbjct: 193 LKEPLENHLSGLR-KVRLIRANKREGLVRARLLGASIATGEILTFLDCHCECHDGWLEPL 251
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I+ + + PVID ID+ T+E+ + G F+W +++ + +PE+E K+R+
Sbjct: 252 LERIHEEESAVVCPVIDVIDWNTFEY--LGNSGDPQIGGFDWRLVFTWHSVPEKEQKRRR 309
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
+ +SPT AGGLFA+++ +F LG YD G+ VWGGEN E SF+IW CGGS+E PCS
Sbjct: 310 SKIDVIRSPTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 369
Query: 182 RI 183
+
Sbjct: 370 HV 371
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/228 (38%), Positives = 140/228 (61%), Gaps = 8/228 (3%)
Query: 212 GEGGKAY--HLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEEC--KYWDYPLD 267
G+ G+A L EA + + S+ + +N+ S+ IS R +P+ C K +DY +
Sbjct: 86 GDLGRAVSLQLDEAGKKLQEESIQRHQINIYLSDLISLHRRLPERWHPLCREKKYDYE-N 144
Query: 268 LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRF 327
LPK SV++ F+NE +S+L+RTV+S+++ +P L+EIILVDD+S + L + LE+++
Sbjct: 145 LPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLKEIILVDDYSDREHLKEPLENHLSGL 204
Query: 328 NGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMT 387
KVRLIR +REGL+R R GA + GE++ FLD HCE WL PLL I+ + +
Sbjct: 205 R-KVRLIRANKREGLVRARLLGASIATGEILTFLDCHCECHDGWLEPLLERIHEEESAVV 263
Query: 388 VPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 435
PVID ID+ T+E+ + G F+W +++ + +PE+E K+R+
Sbjct: 264 CPVIDVIDWNTFEY--LGNSGDPQIGGFDWRLVFTWHSVPEKEQKRRR 309
>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus impatiens]
Length = 602
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/194 (46%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 208 LQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 266
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 267 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 318
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 319 REKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 378
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 379 LEIVPCSHVGHIFR 392
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 102/251 (40%), Positives = 136/251 (54%), Gaps = 23/251 (9%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 253 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 424
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 425 ELPEREAKKRK 435
+PERE K+ K
Sbjct: 315 AVPEREKKRHK 325
>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Takifugu rubripes]
Length = 618
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 78/188 (41%), Positives = 118/188 (62%), Gaps = 2/188 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE+++ +F VR++R ER+GLI R GA E++GEV+ FLDAHCE WL PL
Sbjct: 218 LKEQLEEFVLQFK-IVRVLRQPERKGLITARLLGASEAQGEVLTFLDAHCECFHGWLEPL 276
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAKKR 120
LA I + + P I ID ++++F H + RG F+W + + ++PE K R
Sbjct: 277 LARIVEEPTAVVSPEITTIDLESFQFNKPAPSSHAFNRGNFDWSLTFGWEQIPEAARKLR 336
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
K + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +PC
Sbjct: 337 KDETCPVKTPTFAGGLFSILKTYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEIIPC 396
Query: 181 SRIVSLIR 188
S + + R
Sbjct: 397 SVVGHVFR 404
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 96/242 (39%), Positives = 137/242 (56%), Gaps = 12/242 (4%)
Query: 202 PPLEPYKEGPGEGGKAYH----LPEAYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRM 256
PP +P PG GKA+ PE D + + N S+ IS R++ D R
Sbjct: 100 PPQDP--GSPGADGKAFKKDQMSPEEETEKKDG-MTRHCFNQFASDRISLSRSLGEDTRP 156
Query: 257 EECKYWDYPL--DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC +P LP SVI+VFHNE +S+L+RTV+S++ +PA L+EIILVDD S
Sbjct: 157 RECVERKFPRCPALPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEIILVDDASVAG 216
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LE+++ +F VR++R ER+GLI R GA E++GEV+ FLDAHCE WL P
Sbjct: 217 HLKEQLEEFVLQFK-IVRVLRQPERKGLITARLLGASEAQGEVLTFLDAHCECFHGWLEP 275
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY-RGIFEWGMLYKENELPEREAKK 433
LLA I + + P I ID ++++F H + RG F+W + + ++PE K
Sbjct: 276 LLARIVEEPTAVVSPEITTIDLESFQFNKPAPSSHAFNRGNFDWSLTFGWEQIPEAARKL 335
Query: 434 RK 435
RK
Sbjct: 336 RK 337
>gi|332019618|gb|EGI60096.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Acromyrmex echinatior]
Length = 566
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/194 (46%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KVR+IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 137 LKRQLEDYMMNY-PKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 195
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 196 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSSGVNVGGFDWNLQFNWHAVPE 247
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLF++DRAFF +G YD G +WGGEN ELSFK WMCGG+
Sbjct: 248 RERKRHKNPAEPVWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSFKTWMCGGT 307
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 308 LEIVPCSHVGHIFR 321
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 79/179 (44%), Positives = 108/179 (60%), Gaps = 16/179 (8%)
Query: 264 YPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDY 323
Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS L ++LEDY
Sbjct: 85 YLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLKRQLEDY 144
Query: 324 IQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDR 383
+ + KVR+IR +REGLIR R GA ++ V+ +LD+HCE WL PLL I D
Sbjct: 145 MMNY-PKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDP 203
Query: 384 KIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPEREAKKRK 435
+ PVID ID T E+ H+R G F+W + + + +PERE K+ K
Sbjct: 204 TTVVCPVIDVIDDTTLEY--------HWRDSSGVNVGGFDWNLQFNWHAVPERERKRHK 254
>gi|344288741|ref|XP_003416105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Loxodonta africana]
Length = 552
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 113/173 (65%), Gaps = 3/173 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN ER+GL+R+R +GA ++G + FLD+HCEV +WL PLL + D + P
Sbjct: 167 KVKCVRNNERQGLVRSRIQGAGIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCP 226
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
VID I+ T+ + E RG F+W + ++ +L + +R +EP ++P AGG
Sbjct: 227 VIDIINLDTFNY---IESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPIIAGG 283
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
LF +D+A+F LG YD + +WGGENFE+SF++WMCGGS+E +PCSR+ + R
Sbjct: 284 LFVIDKAWFDYLGKYDSEMDIWGGENFEMSFRVWMCGGSLEIIPCSRVGHVFR 336
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 122/208 (58%), Gaps = 9/208 (4%)
Query: 227 AGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLM 286
GD Y N S IS +R +PD R C Y DLP S+I+ FHNE S+L+
Sbjct: 69 VGDDPYKLYAFNQRESERISSNRAVPDTRHLRCNLLVYCTDLPPTSIIITFHNEARSTLL 128
Query: 287 RTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTR 346
RT+ S++ RTP ++EIILVDDFSS D D KL + KV+ +RN ER+GL+R+R
Sbjct: 129 RTIRSVLNRTPMHLIQEIILVDDFSSDPD-DCKLLIKL----PKVKCVRNNERQGLVRSR 183
Query: 347 SRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYE 406
+GA ++G + FLD+HCEV +WL PLL + D + PVID I+ T+ + E
Sbjct: 184 IQGAGIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNY---IE 240
Query: 407 PDHHYRGIFEWGMLYKENEL-PEREAKK 433
RG F+W + ++ +L PE++A++
Sbjct: 241 SASELRGGFDWSLHFQWEQLSPEQKARR 268
>gi|443684177|gb|ELT88186.1| hypothetical protein CAPTEDRAFT_64353, partial [Capitella teleta]
Length = 266
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 119/203 (58%), Gaps = 1/203 (0%)
Query: 232 LGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVILVFHNEGFSSLMRTVHS 291
+ N S+ IS RT+PD R E+C YP+++P SVI++F NE +S+L+RTV S
Sbjct: 36 FATHSFNQYLSDRISLHRTLPDFRHEDCNSVPYPINMPDTSVIIIFRNEAWSTLLRTVFS 95
Query: 292 IIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAK 351
I+ R+P L EIILVDDFS L KLE +I R K++LIR+ R GLI+ R GA
Sbjct: 96 ILDRSPPHLLREIILVDDFSDAKHLKHKLEGFIAR-QPKLKLIRSPNRVGLIKARLLGAA 154
Query: 352 ESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHY 411
++ G+V+ FLD+HCE WL PLL I +R + P+ID I T ++ +
Sbjct: 155 DAEGQVLTFLDSHCECTPGWLEPLLYRIGQNRSHVVTPIIDTISDVTLQYIGGRVVKNFM 214
Query: 412 RGIFEWGMLYKENELPEREAKKR 434
G F+W M + + LP+ E KR
Sbjct: 215 VGGFDWSMSFNWHYLPKSEQVKR 237
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 84/147 (57%), Gaps = 1/147 (0%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L KLE +I R K++LIR+ R GLI+ R GA ++ G+V+ FLD+HCE WL PL
Sbjct: 120 LKHKLEGFIAR-QPKLKLIRSPNRVGLIKARLLGAADAEGQVLTFLDSHCECTPGWLEPL 178
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +R + P+ID I T ++ + G F+W M + + LP+ E KR
Sbjct: 179 LYRIGQNRSHVVTPIIDTISDVTLQYIGGRVVKNFMVGGFDWSMSFNWHYLPKSEQVKRT 238
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELG 148
+P +SPT AGGLFA+ + +F LG
Sbjct: 239 SPVDPVESPTMAGGLFAIHKDYFNYLG 265
>gi|354506562|ref|XP_003515329.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6, partial
[Cricetulus griseus]
Length = 389
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 140/246 (56%), Gaps = 12/246 (4%)
Query: 202 PPLEPYKEGPGEGGKAYHLPE---AYRAAGDASLGEYGMNMETSNHISFDRTI-PDLRME 257
PP +P PG GKA+ E D ++ N S+ IS R++ PD R
Sbjct: 106 PPQDP--SSPGADGKAFQKKEWTPLETQEKDEGYKKHCFNAFASDRISLQRSLGPDTRPP 163
Query: 258 EC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKA 314
EC ++ P LP SVI+VFHNE +S+L+RTV+S++ +PA L+E+ILVDD S++
Sbjct: 164 ECVDQRFRRCP-PLPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEVILVDDASTEE 222
Query: 315 DLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPP 374
L ++LE Y+Q+ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL P
Sbjct: 223 YLKERLEQYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEP 281
Query: 375 LLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKK 433
LLA I D+ + P I ID T++F + V + H RG F+W + + LP E ++
Sbjct: 282 LLARIAEDKTAVVSPDIVTIDLNTFQFSKPVQKGKAHSRGNFDWSLTFGWEMLPAHEKQR 341
Query: 434 RKYNRY 439
RK Y
Sbjct: 342 RKDETY 347
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 104/167 (62%), Gaps = 2/167 (1%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LE Y+++ VR++R ER+GLI R GA ++ EV+ FLDAHCE WL PL
Sbjct: 224 LKERLEQYVQQLQ-IVRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPL 282
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
LA I D+ + P I ID T++F + V + H RG F+W + + LP E ++R
Sbjct: 283 LARIAEDKTAVVSPDIVTIDLNTFQFSKPVQKGKAHSRGNFDWSLTFGWEMLPAHEKQRR 342
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFK 167
K + P KSPT AGGLF++ +A+F +G YD + +WGGEN E+SF+
Sbjct: 343 KDETYPIKSPTFAGGLFSISKAYFEHIGTYDDQMEIWGGENVEMSFR 389
>gi|348585909|ref|XP_003478713.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Cavia porcellus]
Length = 633
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 81/189 (42%), Positives = 119/189 (62%), Gaps = 4/189 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +KLE+YI++F+ V+++R ER+GLI R GA + E + FLDAHCE WL PL
Sbjct: 232 LHEKLEEYIKQFS-IVKIVRQKERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPL 290
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIFEWGMLYKENELPEREAKK 119
LA I + + P I ID T+EF S Y +H+ RG F+W + + LP+ E ++
Sbjct: 291 LARIADNYTAVVSPDIASIDLNTFEFNKPSPYGTNHN-RGNFDWSLSFGWESLPDHEKQR 349
Query: 120 RKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVP 179
RK + P K+PT AGGLF++ + +F +G YD + +WGGEN E+SF++W CGG +E +P
Sbjct: 350 RKDETYPIKTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMP 409
Query: 180 CSRIVSLIR 188
CS + + R
Sbjct: 410 CSVVGHVFR 418
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/264 (39%), Positives = 147/264 (55%), Gaps = 15/264 (5%)
Query: 188 RPVFKADGKLGNLEPPLE-PYKE--GPGEGGKAY---HLPEAYRAAGDASLGEYGMNMET 241
RP L+P L+ P ++ PG GKA+ +L + + ++ N
Sbjct: 95 RPCLPGYYTAAELKPVLDRPPQDSNAPGASGKAFKTTNLSIEEKKEKERGEAKHCFNAFA 154
Query: 242 SNHISFDRTI-PDLRMEEC---KYWDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTP 297
S+ IS R + PD R EC K+ P LP SVI+VFHNE +S+L+RTVHS++ +P
Sbjct: 155 SDRISLHRDLGPDTRPPECIEQKFKRCP-PLPTTSVIIVFHNEAWSTLLRTVHSVLYSSP 213
Query: 298 AQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEV 357
A L+EIILVDD S L +KLE+YI++F+ V+++R ER+GLI R GA + E
Sbjct: 214 AILLKEIILVDDASVDDYLHEKLEEYIKQFS-IVKIVRQKERKGLITARLLGAAVATAET 272
Query: 358 IVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFR--SVYEPDHHYRGIF 415
+ FLDAHCE WL PLLA I + + P I ID T+EF S Y +H+ RG F
Sbjct: 273 LTFLDAHCECFYGWLEPLLARIADNYTAVVSPDIASIDLNTFEFNKPSPYGTNHN-RGNF 331
Query: 416 EWGMLYKENELPEREAKKRKYNRY 439
+W + + LP+ E ++RK Y
Sbjct: 332 DWSLSFGWESLPDHEKQRRKDETY 355
>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus terrestris]
gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus terrestris]
Length = 602
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 91/194 (46%), Positives = 120/194 (61%), Gaps = 16/194 (8%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE WL PL
Sbjct: 208 LQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPL 266
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKENELPE 114
L I D + PVID ID T E+ H+R G F+W + + + +PE
Sbjct: 267 LDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWHAVPE 318
Query: 115 REAKKRKYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGS 174
RE K+ K +EP SPT AGGLF++DRAFF LG YD G +WGGEN ELSFK WMCGG+
Sbjct: 319 REKKRHKNPAEPVWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGT 378
Query: 175 IEWVPCSRIVSLIR 188
+E VPCS + + R
Sbjct: 379 LEIVPCSHVGHIFR 392
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 102/251 (40%), Positives = 136/251 (54%), Gaps = 23/251 (9%)
Query: 198 GNLEPPLEPYKEGPGEGGKAYHLP-----EAYRAAGDASLGEYGMNMETSNHISFDRTIP 252
G L P E PGE G+ LP E + D L N S+ IS RT+P
Sbjct: 85 GVLVAPREQDPSAPGEMGRPVILPTNLTAETKKLVDDGWLNN-AFNQYVSDLISVHRTLP 143
Query: 253 DLRMEECKY-WDYPLDLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFS 311
D R CK Y DLP +VI+ FHNE +S L+RTVHS++ R+P ++EIILVDDFS
Sbjct: 144 DPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFS 203
Query: 312 SKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNW 371
L ++LEDY+ + KV++IR +REGLIR R GA ++ V+ +LD+HCE W
Sbjct: 204 DMPHLQRQLEDYMMNYP-KVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGW 262
Query: 372 LPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYR-------GIFEWGMLYKEN 424
L PLL I D + PVID ID T E+ H+R G F+W + + +
Sbjct: 263 LEPLLDRIARDPTTVVCPVIDVIDDTTLEY--------HWRDSGGVNVGGFDWNLQFNWH 314
Query: 425 ELPEREAKKRK 435
+PERE K+ K
Sbjct: 315 AVPEREKKRHK 325
>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
abelii]
Length = 578
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/183 (44%), Positives = 114/183 (62%), Gaps = 5/183 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE WL PL
Sbjct: 182 LKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPL 240
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 120
L I D + PVID ID+ T+EF EP G F+W + ++ + +P+++ ++
Sbjct: 241 LERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVPKQKRDRQ 297
Query: 121 KYNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPC 180
+P +SPT AGGLFA+ + +F LG YD G+ VWGGEN ELSF++W CGG +E PC
Sbjct: 298 ISRIDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPC 357
Query: 181 SRI 183
S +
Sbjct: 358 SHV 360
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 101/250 (40%), Positives = 143/250 (57%), Gaps = 17/250 (6%)
Query: 199 NLEPPLEPYKEGP------GEGGKA--YHLPEAYRAAGDASLGEYGMNMETSNHISFDRT 250
+L PL YK+ P GE GKA L E + + Y +N+ S+ IS R
Sbjct: 58 DLSQPL--YKKPPADSHALGEWGKASKLQLNEDELKQQEELIERYAINIYLSDRISLHRH 115
Query: 251 IPDLRMEECKYWDYPL-DLPKASVILVFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDD 309
I D RM ECK + LP SVI+ F+NE +S+L+RT+HS+++ +PA L+EIILVDD
Sbjct: 116 IEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDD 175
Query: 310 FSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGL 369
S + L +LE YI + +VRLIR +REGL+R R GA + G+V+ FLD HCE
Sbjct: 176 LSDRVYLKTQLETYISNLD-RVRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNS 234
Query: 370 NWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEF-RSVYEPDHHYRGIFEWGMLYKENELPE 428
WL PLL I D + PVID ID+ T+EF EP G F+W + ++ + +P
Sbjct: 235 GWLEPLLERIGRDETAIVCPVIDTIDWNTFEFYMQTGEP---MIGGFDWRLTFQWHSVP- 290
Query: 429 REAKKRKYNR 438
++ + R+ +R
Sbjct: 291 KQKRDRQISR 300
>gi|301759097|ref|XP_002915398.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Ailuropoda melanoleuca]
Length = 634
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/187 (42%), Positives = 113/187 (60%), Gaps = 4/187 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L L +Y+ R G V+L+R+ R G IR R GA + G+V+VF+D+HCE WL PL
Sbjct: 234 LKSSLSEYVARLEG-VKLLRSNRRLGAIRARMLGATRATGDVLVFMDSHCECHPGWLEPL 292
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L+ I DR M PVID ID++T+++ Y RG+ +W + + LPERE K +
Sbjct: 293 LSRIADDRSRMVSPVIDVIDWKTFQY---YPSKDLQRGVLDWKLDFHWEPLPERERKALQ 349
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
P +SP G+ A+DR +F G YDP + + GGEN ELS K+W+CGGS+E +PCS
Sbjct: 350 SPISPIRSPVVPSGVVAIDRHYFQNTGAYDPLMSLQGGENLELSLKVWLCGGSVEILPCS 409
Query: 182 RIVSLIR 188
R+ + R
Sbjct: 410 RVGHIYR 416
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 116/192 (60%), Gaps = 6/192 (3%)
Query: 242 SNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKRTPAQY 300
S I R +P++R C +P D LP ASVIL FH+E +S+L+RTVHSI+ P
Sbjct: 160 SARIPLQRALPEVRHPLC-LQQHPGDSLPTASVILCFHDEAWSTLLRTVHSILDTVPRAS 218
Query: 301 LEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVF 360
L+EIILVDD S + L L +Y+ R G V+L+R+ R G IR R GA + G+V+VF
Sbjct: 219 LKEIILVDDLSQQGQLKSSLSEYVARLEG-VKLLRSNRRLGAIRARMLGATRATGDVLVF 277
Query: 361 LDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGML 420
+D+HCE WL PLL+ I DR M PVID ID++T+++ Y RG+ +W +
Sbjct: 278 MDSHCECHPGWLEPLLSRIADDRSRMVSPVIDVIDWKTFQY---YPSKDLQRGVLDWKLD 334
Query: 421 YKENELPEREAK 432
+ LPERE K
Sbjct: 335 FHWEPLPERERK 346
>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
2, partial [Ciona intestinalis]
Length = 582
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 110/174 (63%), Gaps = 4/174 (2%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KVR++RN +REGL+R+R RGA + V+ FLD+H E NWL PLL I DR + P
Sbjct: 202 KVRVLRNNQREGLMRSRIRGADAAAAPVLTFLDSHVECNKNWLEPLLQRIADDRTAVVCP 261
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYN-SEPYKSPTHAG 134
+ID I+ +E+ RG F+W +++K + + E + R N + P +P AG
Sbjct: 262 IIDVINMDNFEYIGASAD---LRGGFDWNLVFKWDYMSSEERRSRAGNPTAPISTPMIAG 318
Query: 135 GLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIR 188
GLF+MD+++F +LG YD + VWGGEN E+SF++W CGG +E +PCSR+ + R
Sbjct: 319 GLFSMDKSYFNQLGKYDTAMDVWGGENLEISFRVWQCGGRLEIIPCSRVGHVFR 372
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/202 (40%), Positives = 116/202 (57%), Gaps = 9/202 (4%)
Query: 237 MNMETSNHISFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRTVHSIIKR 295
N + S+ + DR +PD R C + L LP SVI+ FHNE S+L+RTV S++ R
Sbjct: 113 FNQQASDKLKCDRPVPDTRNGLCSSNSWDLSKLPATSVIVTFHNEARSTLLRTVVSVLNR 172
Query: 296 TPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSRGAKESRG 355
+P + EIILVDDFS A+ D +L I+ KVR++RN +REGL+R+R RGA +
Sbjct: 173 SPPSLVREIILVDDFSDNAE-DGQLLAQIE----KVRVLRNNQREGLMRSRIRGADAAAA 227
Query: 356 EVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIF 415
V+ FLD+H E NWL PLL I DR + P+ID I+ +E+ RG F
Sbjct: 228 PVLTFLDSHVECNKNWLEPLLQRIADDRTAVVCPIIDVINMDNFEYIGASAD---LRGGF 284
Query: 416 EWGMLYKENELPEREAKKRKYN 437
+W +++K + + E + R N
Sbjct: 285 DWNLVFKWDYMSSEERRSRAGN 306
>gi|443687046|gb|ELT90152.1| hypothetical protein CAPTEDRAFT_141956, partial [Capitella teleta]
Length = 351
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 118/188 (62%), Gaps = 4/188 (2%)
Query: 2 LDKKLEDYIERFNGKVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPL 61
L + L+ Y++ ++R+IR +R+GLIR R+RGA+E++G+++ F DAH EVG WLPPL
Sbjct: 90 LKEPLDKYLQHL-PRIRIIRLKQRQGLIRARTRGAEEAKGDILYFADAHTEVGEGWLPPL 148
Query: 62 LAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRK 121
L I +RK++ P +D I +Q++E+ + Y G F W M +K P+ +R
Sbjct: 149 LQRIKENRKVLVFPEMDPIQHQSFEY---WRAGDEYHGAFYWHMEFKYKFAPKEILNRRS 205
Query: 122 YNSEPYKSPTHAGGLFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCS 181
++P SP G A++R +F E G YD + +WGGEN E +F++WMCGG +E +PCS
Sbjct: 206 DPTQPVPSPVMVGCAHAIEREYFFETGAYDTDMEIWGGENIEHAFRLWMCGGRVEVIPCS 265
Query: 182 RIVSLIRP 189
R+ + +P
Sbjct: 266 RVGHVFKP 273
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 130/206 (63%), Gaps = 6/206 (2%)
Query: 231 SLGEYGMNMETSNHI-SFDRTIPDLRMEECKYWDYPLD-LPKASVILVFHNEGFSSLMRT 288
+ G + N +S+ + +F +PD RME C Y L L K S+I++FHNE S+L+RT
Sbjct: 3 TTGYHSFNHSSSDLVGNFRHELPDFRMEGCHKKTYDLTTLGKTSIIIIFHNEARSTLLRT 62
Query: 289 VHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIRNTEREGLIRTRSR 348
+H++++RTP L EI++VDD S+ A L + L+ Y+Q ++R+IR +R+GLIR R+R
Sbjct: 63 IHALLERTPILLLVEILIVDDASTHAWLKEPLDKYLQHL-PRIRIIRLKQRQGLIRARTR 121
Query: 349 GAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGIDYQTWEFRSVYEPD 408
GA+E++G+++ F DAH EVG WLPPLL I +RK++ P +D I +Q++E+ +
Sbjct: 122 GAEEAKGDILYFADAHTEVGEGWLPPLLQRIKENRKVLVFPEMDPIQHQSFEY---WRAG 178
Query: 409 HHYRGIFEWGMLYKENELPEREAKKR 434
Y G F W M +K P+ +R
Sbjct: 179 DEYHGAFYWHMEFKYKFAPKEILNRR 204
>gi|334310655|ref|XP_001378662.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Monodelphis domestica]
Length = 563
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 113/184 (61%), Gaps = 3/184 (1%)
Query: 16 KVRLIRNTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVP 75
KV+ +RN REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P
Sbjct: 187 KVKCLRNDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSP 246
Query: 76 VIDGIDYQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKRKYNSEPYKSPTHAGG 135
+ID I + + + RG F+W + +K ++P + R ++P ++P AGG
Sbjct: 247 IIDVISLDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSRTDPTQPIRTPVIAGG 303
Query: 136 LFAMDRAFFLELGGYDPGLLVWGGENFELSFKIWMCGGSIEWVPCSRIVSLIRPVFKADG 195
+F +D+A+F LG YD + +WGGENFELSF++WMCGGS+E VPCSR+ + R D
Sbjct: 304 IFVIDKAWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRHPYDF 363
Query: 196 KLGN 199
GN
Sbjct: 364 PEGN 367
Score = 147 bits (372), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 121/219 (55%), Gaps = 9/219 (4%)
Query: 216 KAYHLPEAYRAAGDASLGEYGMNMETSNHISFDRTIPDLRMEECKYWDYPLDLPKASVIL 275
KAY L AG+ ++ N S+ +S DR I D R C Y DLP S+++
Sbjct: 79 KAY-LASKLLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCTSVHYASDLPTTSIVI 137
Query: 276 VFHNEGFSSLMRTVHSIIKRTPAQYLEEIILVDDFSSKADLDQKLEDYIQRFNGKVRLIR 335
FHNE S+L+RTV S++ RTPA ++EIILVDDFSS + D L I KV+ +R
Sbjct: 138 TFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFSSDPE-DCLLLTRIP----KVKCLR 192
Query: 336 NTEREGLIRTRSRGAKESRGEVIVFLDAHCEVGLNWLPPLLAPIYSDRKIMTVPVIDGID 395
N REGLIR+R RGA+ + +++ FLD+HCEV WL P+L + D + P+ID I
Sbjct: 193 NDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPMLQRVKEDYTRVVSPIIDVIS 252
Query: 396 YQTWEFRSVYEPDHHYRGIFEWGMLYKENELPEREAKKR 434
+ + + RG F+W + +K ++P + R
Sbjct: 253 LDNFAYLAA---SADLRGGFDWSLHFKWEQIPIEQKMSR 288
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.141 0.440
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,761,284,377
Number of Sequences: 23463169
Number of extensions: 356830373
Number of successful extensions: 738865
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2135
Number of HSP's successfully gapped in prelim test: 2358
Number of HSP's that attempted gapping in prelim test: 728271
Number of HSP's gapped (non-prelim): 9323
length of query: 439
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 293
effective length of database: 8,933,572,693
effective search space: 2617536799049
effective search space used: 2617536799049
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)