BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10463
(605 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|328723398|ref|XP_001946977.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 1 [Acyrthosiphon pisum]
gi|328723400|ref|XP_003247833.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
isoform 2 [Acyrthosiphon pisum]
Length = 624
Score = 615 bits (1587), Expect = e-173, Method: Compositional matrix adjust.
Identities = 285/421 (67%), Positives = 344/421 (81%), Gaps = 12/421 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S+++IVFHNEAWS L+RTV SVI RSP+ +L EI+LVDDASTR+FL+ LD+YVAKL
Sbjct: 161 LPSSTVIIVFHNEAWSTLMRTVQSVIDRSPKYLLNEIILVDDASTRKFLEKELDDYVAKL 220
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V TR+IRSP R+GLIKARL+GARQA+G+ILVFLDAHCECTLGWLE LV+RVAEDR RVV
Sbjct: 221 PVLTRIIRSPKRIGLIKARLMGARQAKGKILVFLDAHCECTLGWLEALVSRVAEDRKRVV 280
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVIDIISD TFAYVRSFELHWGAFNW+LHFRWYT + D + ++D T+ F+TPAMAGG
Sbjct: 281 CPVIDIISDETFAYVRSFELHWGAFNWDLHFRWYTRTTPDIMKGQRDITQAFRTPAMAGG 340
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFA+D++YFF +G YDE M++WGGENLE+SFRVWQCGGSIEIAPCSHV H+FRK+SPY+F
Sbjct: 341 LFAMDKSYFFELGGYDERMEIWGGENLELSFRVWQCGGSIEIAPCSHVGHVFRKSSPYTF 400
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGVS VLY NLARVALVWMDEW EFYFKFNPEAEK RD+Q +R+RLELR +LKC FKW
Sbjct: 401 PGGVSHVLYTNLARVALVWMDEWQEFYFKFNPEAEKYRDEQQIRTRLELRDRLKCKGFKW 460
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC--THLPVLT 554
YL +VWP HF+P D +FFG+I+H+ +++C+EKP +GS+NQ GP + C L+
Sbjct: 461 YLDNVWPEHFMPTDKRFFGKIKHMLSNRCLEKPSGRGSLNQPMGPVGISGCDVQGRASLS 520
Query: 555 QMFVMKLPTDL--------IATDESVCLDVPEYE--NDISPRVRILACSGFNRQRWTYDK 604
MFVM +L + TDESVCLD PE E ++I+ +VRI+AC+G RQRW YD
Sbjct: 521 LMFVMAPEEELDSVVWSGSLMTDESVCLDTPELEATDEIALKVRIVACTGQKRQRWKYDV 580
Query: 605 E 605
E
Sbjct: 581 E 581
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 124/220 (56%), Positives = 158/220 (71%), Gaps = 30/220 (13%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSD---------------------- 47
L+L R+QQ++ID RG+HVVVGHYLGD + G +NL+D
Sbjct: 52 LRLKRDQQEFIDHRGLHVVVGHYLGDVNNVGASANLTDELLNRNMFDPKPKEGRDGTPVI 111
Query: 48 --------AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
Q+LY+INRFNL+ SDRIP+NR+LPDVRK C+ K + + LP S+++IVFH
Sbjct: 112 IPPHMSLQMQKLYRINRFNLMASDRIPLNRSLPDVRKKSCRLKKIDIDKLPSSTVIIVFH 171
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS L+RTV SVI RSP+ +L EI+LVDDASTR+FL+ LD+YVAKL V TR+IRSP
Sbjct: 172 NEAWSTLMRTVQSVIDRSPKYLLNEIILVDDASTRKFLEKELDDYVAKLPVLTRIIRSPK 231
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
R+GLIKARL+GARQA+G+ILVFLDAHCECTL + E + +
Sbjct: 232 RIGLIKARLMGARQAKGKILVFLDAHCECTLGWLEALVSR 271
>gi|340711409|ref|XP_003394268.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Bombus terrestris]
Length = 604
Score = 596 bits (1537), Expect = e-168, Method: Compositional matrix adjust.
Identities = 273/419 (65%), Positives = 343/419 (81%), Gaps = 4/419 (0%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T N LPK+SI+IVFHNEAWS LLRTV+SVI+RSPR +L+EI+LVDD S R+FLK +
Sbjct: 146 TRYMNVGNLPKTSIIIVFHNEAWSTLLRTVYSVINRSPRHLLEEIILVDDNSDRDFLKDA 205
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
LDE+V L V T+V+RS R+GL+ ARLLGA +A+GE+L FLDAHCECT+GWLE L+ V
Sbjct: 206 LDEHVKNLKVSTKVLRSRKRIGLVNARLLGANRAKGEVLTFLDAHCECTVGWLEPLLEAV 265
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
A++RTRVV PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R++ EPF
Sbjct: 266 AKNRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPF 325
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TPAMAGGLF+++R YFF +G+YD++M++WGGENLE+SFRVWQCGGSIEIAPCSHV HLF
Sbjct: 326 RTPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLF 385
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RK+SPY+FPGGV E+LYGNLARVALVWMDEWAEFYFKFN EA + RDKQ VR RLELRK+
Sbjct: 386 RKSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFNTEAARLRDKQPVRGRLELRKR 445
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
L+C +F+WYL ++WP HF P DD+FFGRI H+ ++KC+ +P AKG+ +Q SG A L C
Sbjct: 446 LQCKNFEWYLNNIWPEHFFPKDDRFFGRILHISSNKCIMRPTAKGTYSQPSGYAVLETCL 505
Query: 549 HLPVLTQMFVMKLPTD-LIATDESVCLDVPEYEND-ISPRVRILACSGFNRQRWTYDKE 605
P+L+QMFVM TD +I TDESVCLD P+++ +P+V+I+ACSG +RQ+W YD++
Sbjct: 506 PRPILSQMFVMT--TDGIIMTDESVCLDAPDHDTQHKTPKVKIMACSGHSRQKWRYDEQ 562
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 121/218 (55%), Positives = 152/218 (69%), Gaps = 30/218 (13%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLS----------------------- 46
L+L NQ+DYIDR+GVHVVVGHY+GDSVD N++
Sbjct: 45 LRLLDNQRDYIDRKGVHVVVGHYIGDSVDPMKTPNITKDLINKNMFDPRPFEGKNGNPVL 104
Query: 47 -------DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
QQL+QINRFNL+ SDRIP+NR+LPDVR+ C T+ N LPK+SI+IVFH
Sbjct: 105 VPAKDFYKMQQLFQINRFNLMASDRIPLNRSLPDVRRKGCITRYMNVGNLPKTSIIIVFH 164
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS LLRTV+SVI+RSPR +L+EI+LVDD S R+FLK +LDE+V L V T+V+RS
Sbjct: 165 NEAWSTLLRTVYSVINRSPRHLLEEIILVDDNSDRDFLKDALDEHVKNLKVSTKVLRSRK 224
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
R+GL+ ARLLGA +A+GE+L FLDAHCECT+ + E L
Sbjct: 225 RIGLVNARLLGANRAKGEVLTFLDAHCECTVGWLEPLL 262
>gi|307215388|gb|EFN90069.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Harpegnathos
saltator]
Length = 493
Score = 596 bits (1536), Expect = e-167, Method: Compositional matrix adjust.
Identities = 272/410 (66%), Positives = 332/410 (80%), Gaps = 2/410 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+SI+IVFHNEAWS LLRTVHSVI RSPR +L+EI+LVDD S REFLK+ LDEYV KL
Sbjct: 42 LPKTSIIIVFHNEAWSTLLRTVHSVIDRSPRELLEEIILVDDNSEREFLKNPLDEYVKKL 101
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
SVPT+V+RS RVGLIKARLLGA A+GE+L FLDAHCECT+GWLE L+ V ++ TR++
Sbjct: 102 SVPTKVLRSTERVGLIKARLLGASDAKGEVLTFLDAHCECTVGWLEPLLEAVGKNATRII 161
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R+ EPF+TPAMAGG
Sbjct: 162 SPVIDIINDNTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRESIVEPFRTPAMAGG 221
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+++R YFF +G+YD++M++WGGENLE+SFR WQCGGSIEIAPCSHV HLFRK+SPY+F
Sbjct: 222 LFSMNRNYFFQLGSYDDQMRIWGGENLELSFRAWQCGGSIEIAPCSHVGHLFRKSSPYTF 281
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV ++LYGNL RVA VWMD+WAEFYFKFNPEA + R KQ VRSRL LR++L+C SF+W
Sbjct: 282 PGGVGDILYGNLVRVASVWMDQWAEFYFKFNPEAARLRYKQQVRSRLALREKLQCKSFEW 341
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +VWP HF P DD+FFGR+ H T++C+ +P AKGS Q SG A L C P+L+QM
Sbjct: 342 YLENVWPEHFFPTDDRFFGRVIHATTNRCLMRPTAKGSYTQPSGHAVLHSCIPRPMLSQM 401
Query: 557 FVMKLPTDLIATDESVCLDVPEYE-NDISPRVRILACSGFNRQRWTYDKE 605
FVM +I TDESVCLD PE + +P+V+I+ACSG +RQ+W YDK+
Sbjct: 402 FVMT-KNGVIMTDESVCLDAPERDTQQKTPKVKIMACSGRDRQKWQYDKQ 450
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 108/149 (72%), Positives = 125/149 (83%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
QQLYQINRFNL+ SDRIP+NR+LPDVRK KC ++ N LPK+SI+IVFHNEAWS LLR
Sbjct: 2 QQLYQINRFNLMASDRIPLNRSLPDVRKKKCISRYANLGKLPKTSIIIVFHNEAWSTLLR 61
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI RSPR +L+EI+LVDD S REFLK+ LDEYV KLSVPT+V+RS RVGLIKARL
Sbjct: 62 TVHSVIDRSPRELLEEIILVDDNSEREFLKNPLDEYVKKLSVPTKVLRSTERVGLIKARL 121
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
LGA A+GE+L FLDAHCECT+ + E L
Sbjct: 122 LGASDAKGEVLTFLDAHCECTVGWLEPLL 150
>gi|350416150|ref|XP_003490858.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Bombus impatiens]
Length = 604
Score = 595 bits (1535), Expect = e-167, Method: Compositional matrix adjust.
Identities = 271/418 (64%), Positives = 341/418 (81%), Gaps = 2/418 (0%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T N LPK+SI+IVFHNEAWS LLRTV+SVI+RSPR +L+EI+LVDD S R+FLK +
Sbjct: 146 TRYMNVGNLPKTSIIIVFHNEAWSTLLRTVYSVINRSPRHLLEEIILVDDNSDRDFLKDA 205
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
LDE+V L V T+V+RS R+GL+ ARLLGA +A+GE+L FLDAHCECT+GWLE L+ V
Sbjct: 206 LDEHVKNLKVSTKVLRSRKRIGLVNARLLGANRAKGEVLTFLDAHCECTVGWLEPLLEAV 265
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
A++RTRVV PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R++ EPF
Sbjct: 266 AKNRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPF 325
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TPAMAGGLF+++R YFF +G+YD++M++WGGENLE+SFRVWQCGGSIEIAPCSHV HLF
Sbjct: 326 RTPAMAGGLFSMNRNYFFELGSYDDQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLF 385
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RK+SPY+FPGGV E+LYGNLARVALVWMDEWAEFYFKFN EA + RDKQ VR RLELRK+
Sbjct: 386 RKSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFNTEAARLRDKQPVRGRLELRKR 445
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
L+C +F+WYL ++WP HF P DD+FFGRI H+ ++KC+ +P AKG+ +Q SG A L C
Sbjct: 446 LQCKNFEWYLNNIWPEHFFPKDDRFFGRILHISSNKCIMRPTAKGTYSQPSGYAVLETCL 505
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYEND-ISPRVRILACSGFNRQRWTYDKE 605
P+L+QMFVM +I TDESVCLD P+++ +P+V+I+ACSG +RQ+W YD++
Sbjct: 506 PRPILSQMFVMT-KDGIIMTDESVCLDAPDHDTQHKTPKVKIMACSGNDRQKWRYDEQ 562
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 121/218 (55%), Positives = 152/218 (69%), Gaps = 30/218 (13%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLS----------------------- 46
L+L NQ+DYIDR+GVHVVVGHY+GDSVD N++
Sbjct: 45 LRLLDNQRDYIDRKGVHVVVGHYIGDSVDPMKAPNITKDLINKNMFDPRPFEGKNGNPVL 104
Query: 47 -------DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
QQL+QINRFNL+ SDRIP+NR+LPDVR+ C T+ N LPK+SI+IVFH
Sbjct: 105 VPAKDFYKMQQLFQINRFNLMASDRIPLNRSLPDVRRKGCVTRYMNVGNLPKTSIIIVFH 164
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS LLRTV+SVI+RSPR +L+EI+LVDD S R+FLK +LDE+V L V T+V+RS
Sbjct: 165 NEAWSTLLRTVYSVINRSPRHLLEEIILVDDNSDRDFLKDALDEHVKNLKVSTKVLRSRK 224
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
R+GL+ ARLLGA +A+GE+L FLDAHCECT+ + E L
Sbjct: 225 RIGLVNARLLGANRAKGEVLTFLDAHCECTVGWLEPLL 262
>gi|380030377|ref|XP_003698825.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Apis florea]
Length = 595
Score = 594 bits (1531), Expect = e-167, Method: Compositional matrix adjust.
Identities = 272/415 (65%), Positives = 337/415 (81%), Gaps = 4/415 (0%)
Query: 193 NEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEY 252
N + LPK+SI+IVFHNEAWS LLRTV+SVI RSPR +L+EI+LVDD S R+FLK +LDE+
Sbjct: 141 NLDNLPKTSIIIVFHNEAWSTLLRTVYSVIDRSPRQLLEEIILVDDNSDRDFLKDTLDEH 200
Query: 253 VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDR 312
V L V T+V+RS R+GL+ ARLLGA A+GE+L FLDAHCECT+GWLE L+ VA++R
Sbjct: 201 VKNLQVSTKVLRSRKRIGLVNARLLGANNAKGEVLTFLDAHCECTVGWLEPLLEAVAKNR 260
Query: 313 TRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
TRVV PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R++ EPF+TPA
Sbjct: 261 TRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPFRTPA 320
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLF+++R YFF +G+YD +M++WGGENLE+SFRVWQCGGSIEIAPCSHV HLFRK+S
Sbjct: 321 MAGGLFSMNRDYFFELGSYDNQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLFRKSS 380
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
PY+FPGGV E+LYGNLARVALVWMDEWAEFYFKFN EA + RDKQ +RSRLELRK+L+C
Sbjct: 381 PYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFNAEAARLRDKQTIRSRLELRKKLQCK 440
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
+F+WYL ++WP HF P DD+FFGRI H+ + KC+ +P AKG+ +Q SG A L C P+
Sbjct: 441 NFEWYLDNIWPEHFFPKDDRFFGRIVHILSKKCIMRPSAKGTYSQPSGYAILHSCVPRPL 500
Query: 553 LTQMFVMKLPTD-LIATDESVCLDVPEYE-NDISPRVRILACSGFNRQRWTYDKE 605
L QMFVM TD +I TDESVCLD PE + + +P+V+I+AC+ RQ+W YDK+
Sbjct: 501 LNQMFVMT--TDGIIMTDESVCLDAPENDTHQTTPKVKIMACNNHIRQKWQYDKQ 553
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 120/218 (55%), Positives = 151/218 (69%), Gaps = 30/218 (13%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSD---------------------- 47
L+L NQ+DYIDR+G+HVVVGHY+GDSVD N++
Sbjct: 36 LRLLDNQKDYIDRKGIHVVVGHYIGDSVDPMKTPNITKDLINKNMFNPRPFEGKNGNPVI 95
Query: 48 --------AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
QQL+QINRFNL+ SDRIP+NRTLPDVR+ C ++ N + LPK+SI+IVFH
Sbjct: 96 VPAKDFYKMQQLFQINRFNLMASDRIPLNRTLPDVRRKGCISRYMNLDNLPKTSIIIVFH 155
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS LLRTV+SVI RSPR +L+EI+LVDD S R+FLK +LDE+V L V T+V+RS
Sbjct: 156 NEAWSTLLRTVYSVIDRSPRQLLEEIILVDDNSDRDFLKDTLDEHVKNLQVSTKVLRSRK 215
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
R+GL+ ARLLGA A+GE+L FLDAHCECT+ + E L
Sbjct: 216 RIGLVNARLLGANNAKGEVLTFLDAHCECTVGWLEPLL 253
>gi|307183924|gb|EFN70514.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Camponotus
floridanus]
Length = 471
Score = 590 bits (1521), Expect = e-166, Method: Compositional matrix adjust.
Identities = 273/421 (64%), Positives = 334/421 (79%), Gaps = 3/421 (0%)
Query: 187 ECTLVF-NEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
+C L + N LPK+SI+IVFHNEAWS LLRTVHSVI+RSPR +LKEI+LVDD S REFL
Sbjct: 47 KCILRYVNLSNLPKTSIIIVFHNEAWSTLLRTVHSVINRSPRELLKEIILVDDNSEREFL 106
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K+ LD+YV LSVPTRV+RS R+GLIKARLLGA A+GE+L FLDAHCECT+GWLE L+
Sbjct: 107 KNPLDDYVKTLSVPTRVLRSNARIGLIKARLLGAHNAKGEVLTFLDAHCECTVGWLEPLL 166
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
+ ++ TRVV PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R +
Sbjct: 167 EAIGKNATRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRDNII 226
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
EPF+TPAMAGGLF++++ YFF +G+YD+EM++WGGENLE+SFR WQCGGS+EIAPCSHV
Sbjct: 227 EPFRTPAMAGGLFSMNKDYFFKLGSYDDEMRIWGGENLELSFRTWQCGGSVEIAPCSHVG 286
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
HLFRK+SPY+FPGGV ++LYGNLARVALVWMD+WA+FYFKFNPEA K R KQ +RSRL L
Sbjct: 287 HLFRKSSPYTFPGGVGDILYGNLARVALVWMDQWADFYFKFNPEAAKLRYKQQIRSRLAL 346
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
R++L+C SF+WYL +VWP HF P DD+FFG+I H T +C+ +P AK Q SG A L
Sbjct: 347 REKLQCKSFEWYLENVWPEHFFPTDDRFFGKIVHAATKRCLMRPTAKSLYAQPSGNAILH 406
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYE-NDISPRVRILACSGFNRQRWTYDK 604
C P+L QMFVM +I TDESVCLD PE + +P+V+I+ACSG RQRW YD+
Sbjct: 407 SCIPRPILGQMFVMT-KNGVIMTDESVCLDAPERDMQQRTPKVKIMACSGRERQRWQYDE 465
Query: 605 E 605
+
Sbjct: 466 Q 466
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 108/149 (72%), Positives = 125/149 (83%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
QQLYQINRFNL+VSDRIP+NR+LPDVRK KC + N LPK+SI+IVFHNEAWS LLR
Sbjct: 18 QQLYQINRFNLMVSDRIPLNRSLPDVRKKKCILRYVNLSNLPKTSIIIVFHNEAWSTLLR 77
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR +LKEI+LVDD S REFLK+ LD+YV LSVPTRV+RS R+GLIKARL
Sbjct: 78 TVHSVINRSPRELLKEIILVDDNSEREFLKNPLDDYVKTLSVPTRVLRSNARIGLIKARL 137
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
LGA A+GE+L FLDAHCECT+ + E L
Sbjct: 138 LGAHNAKGEVLTFLDAHCECTVGWLEPLL 166
>gi|345497732|ref|XP_001601595.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Nasonia vitripennis]
Length = 610
Score = 589 bits (1518), Expect = e-165, Method: Compositional matrix adjust.
Identities = 264/410 (64%), Positives = 334/410 (81%), Gaps = 2/410 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSVI+RSPR +L+EI+LVDD S R+FL+ LDEYVA+L
Sbjct: 157 LPSTSVIIVFHNEAWSTLLRTVHSVINRSPRKLLEEIILVDDNSDRDFLRKPLDEYVAQL 216
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTRV+RS RVGL+ ARL+GA +A+GE+L FLDAHCECT GWLE L+ ++++RTRVV
Sbjct: 217 NVPTRVLRSDKRVGLVNARLMGANEAKGEVLTFLDAHCECTAGWLEPLLEAISKNRTRVV 276
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVIDII+D TF+Y RSFELHWGAFNW+LHFRW + +R++ +PFKTPAMAGG
Sbjct: 277 SPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLMLNGALLRERRENIVDPFKTPAMAGG 336
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF++DR YFF +G+YDE M++WGGENLE+SFRVWQCGGS+EIAPCSHV H+FRK+SPY+F
Sbjct: 337 LFSMDREYFFELGSYDEHMRIWGGENLELSFRVWQCGGSVEIAPCSHVGHIFRKSSPYTF 396
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV E+LYGNLARVALVWMDEW +FYF FNP+A++ RDKQ +RSRLELR++LKC SF+W
Sbjct: 397 PGGVDEILYGNLARVALVWMDEWGKFYFNFNPQAQRVRDKQQIRSRLELRERLKCKSFEW 456
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +VWP HF P DD+FFG I H KC+ +P++KG+ +Q SG + C P L+QM
Sbjct: 457 YLDNVWPDHFFPKDDRFFGYILHPSNKKCLMRPMSKGAYSQPSGFVAYQDCIVPPNLSQM 516
Query: 557 FVMKLPTDLIATDESVCLDVPEYEN-DISPRVRILACSGFNRQRWTYDKE 605
FVM+ +I TDESVCLD PE +N P+V+++ACSGF Q+W YD++
Sbjct: 517 FVMR-KDGVIMTDESVCLDAPEKDNRHEKPKVKLMACSGFASQKWEYDEK 565
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/218 (56%), Positives = 156/218 (71%), Gaps = 30/218 (13%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGG---------LHSNLSDA------------ 48
L+L+ Q++YID+RGVHVVVGHY+G+S+D ++ N+ D
Sbjct: 48 LQLSEGQKEYIDKRGVHVVVGHYVGNSMDSLQIPNITKEIINKNMFDPRPYEGNNGEPVQ 107
Query: 49 ---------QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
QQL+QINR+NLL SDRIP+NRTLPDVRK KC T+ N LP +S++IVFH
Sbjct: 108 IPAKDFQKMQQLFQINRYNLLASDRIPLNRTLPDVRKKKCITRYANLGDLPSTSVIIVFH 167
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS LLRTVHSVI+RSPR +L+EI+LVDD S R+FL+ LDEYVA+L+VPTRV+RS
Sbjct: 168 NEAWSTLLRTVHSVINRSPRKLLEEIILVDDNSDRDFLRKPLDEYVAQLNVPTRVLRSDK 227
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RVGL+ ARL+GA +A+GE+L FLDAHCECT + E L
Sbjct: 228 RVGLVNARLMGANEAKGEVLTFLDAHCECTAGWLEPLL 265
>gi|328783898|ref|XP_003250361.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Apis
mellifera]
Length = 603
Score = 588 bits (1517), Expect = e-165, Method: Compositional matrix adjust.
Identities = 269/418 (64%), Positives = 335/418 (80%), Gaps = 2/418 (0%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T N LPK+SI+IVFHNEAWS LLRTV+SVI RSP +L+EI+LVDD S R+FLK +
Sbjct: 145 TRYMNLGNLPKTSIIIVFHNEAWSTLLRTVYSVIDRSPIQLLEEIILVDDNSDRDFLKDA 204
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
LDE++ L V T+V+RS R+GL+ ARLLGA +A+GE+L FLDAHCECT+GWLE L+ V
Sbjct: 205 LDEHIKNLQVSTKVLRSKKRIGLVNARLLGANKAKGEVLTFLDAHCECTVGWLEPLLEAV 264
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
A++RTRVV PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R++ EPF
Sbjct: 265 AKNRTRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPF 324
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TPAMAGGLF+++R YFF +G+YD +M++WGGENLE+SFRVWQCGGSIEIAPCSHV HLF
Sbjct: 325 RTPAMAGGLFSMNRDYFFELGSYDNQMKIWGGENLELSFRVWQCGGSIEIAPCSHVGHLF 384
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RK+SPY+FPGGV E+LYGNLARVALVWMDEWAEFYFKFN EA + RDKQ +RSRLELRK+
Sbjct: 385 RKSSPYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFNAEAARLRDKQTIRSRLELRKK 444
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
L+C +F+WYL ++WP HF P DD+FFGRI H+ + KC+ +P AKG+ +Q SG A L C
Sbjct: 445 LQCKNFEWYLDNIWPEHFFPKDDRFFGRIVHILSKKCIMRPSAKGTYSQPSGYAILHSCV 504
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYE-NDISPRVRILACSGFNRQRWTYDKE 605
P+L QMFVM +I TDESVCLD PE + + +P+V+I+AC+ RQ+W YDK+
Sbjct: 505 PRPLLNQMFVMT-ADGIIMTDESVCLDAPENDTHQTTPKVKIMACNSHIRQKWQYDKQ 561
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 119/218 (54%), Positives = 150/218 (68%), Gaps = 30/218 (13%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSD---------------------- 47
L+L NQ+DYIDR+G+HVVVGHY+GDSVD N++
Sbjct: 44 LRLLDNQKDYIDRKGIHVVVGHYIGDSVDPMKTPNITKDLINKNMFNPRPFEGKNGNPVI 103
Query: 48 --------AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
QQL+QINRFNL+ SDRIP+NRTLPDVR+ C T+ N LPK+SI+IVFH
Sbjct: 104 VPAKDFYKMQQLFQINRFNLMASDRIPLNRTLPDVRRKGCITRYMNLGNLPKTSIIIVFH 163
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS LLRTV+SVI RSP +L+EI+LVDD S R+FLK +LDE++ L V T+V+RS
Sbjct: 164 NEAWSTLLRTVYSVIDRSPIQLLEEIILVDDNSDRDFLKDALDEHIKNLQVSTKVLRSKK 223
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
R+GL+ ARLLGA +A+GE+L FLDAHCECT+ + E L
Sbjct: 224 RIGLVNARLLGANKAKGEVLTFLDAHCECTVGWLEPLL 261
>gi|383848548|ref|XP_003699911.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Megachile rotundata]
Length = 604
Score = 587 bits (1513), Expect = e-165, Method: Compositional matrix adjust.
Identities = 268/414 (64%), Positives = 336/414 (81%), Gaps = 2/414 (0%)
Query: 193 NEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEY 252
N LP++SI+IVFHNEAWS LLRTV+SV++RSPR +L+EI+LVDD S REFLK +LDE+
Sbjct: 150 NLNGLPRTSIIIVFHNEAWSTLLRTVYSVVNRSPRHLLEEIILVDDDSDREFLKDALDEH 209
Query: 253 VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDR 312
V L VPT+V+RS R+GL+ ARLLGA +A+GE+L FLDAHCECT+GWLE L+ VA+++
Sbjct: 210 VKSLRVPTKVLRSKKRIGLVNARLLGANEAKGEVLTFLDAHCECTVGWLEPLLEAVAKNK 269
Query: 313 TRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
TRVV PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R++ EPF+TPA
Sbjct: 270 TRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRENIVEPFRTPA 329
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLF+++R YFF +G+YD++M++WGGENLE+SFRVWQCGGS+EIAPCSHV HLFRK+S
Sbjct: 330 MAGGLFSMNRDYFFELGSYDDQMKIWGGENLELSFRVWQCGGSVEIAPCSHVGHLFRKSS 389
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
PY+FPGGV E+LYGNLARVALVWMDEWAEFYFKFN EA + R KQ VR+RL LRK+L+C
Sbjct: 390 PYTFPGGVGEILYGNLARVALVWMDEWAEFYFKFNAEASRLRHKQPVRARLALRKRLQCK 449
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
SF+WYL +VWP HF P +D+FFGRI HV T KC+ +P AKG+ +Q SG A L C PV
Sbjct: 450 SFEWYLDNVWPEHFFPKNDRFFGRIVHVSTKKCIMRPTAKGTYSQPSGYALLESCIPRPV 509
Query: 553 LTQMFVMKLPTDLIATDESVCLDVPEYEND-ISPRVRILACSGFNRQRWTYDKE 605
L QMFVM + ++ TDES+CLD P+ + +PRV+I+ACS +RQ W YD++
Sbjct: 510 LNQMFVMT-KSGIVMTDESICLDAPDRDTQHKTPRVKIMACSSQSRQNWQYDEQ 562
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 122/218 (55%), Positives = 155/218 (71%), Gaps = 30/218 (13%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVD---------GGLHSNLSDA------------ 48
L+L NQ+DYIDRRGVHVVVGHY+GDSVD ++ N+ D
Sbjct: 45 LRLLDNQRDYIDRRGVHVVVGHYIGDSVDPLKAPNITKDLINKNMFDPRPFEGKNGSPVL 104
Query: 49 ---------QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
QQL+QINRFNL+ SDRIP+NR+LPDVR+ C ++ N LP++SI+IVFH
Sbjct: 105 VPAKDFYQMQQLFQINRFNLMASDRIPLNRSLPDVRRKGCISRYANLNGLPRTSIIIVFH 164
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS LLRTV+SV++RSPR +L+EI+LVDD S REFLK +LDE+V L VPT+V+RS
Sbjct: 165 NEAWSTLLRTVYSVVNRSPRHLLEEIILVDDDSDREFLKDALDEHVKSLRVPTKVLRSKK 224
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
R+GL+ ARLLGA +A+GE+L FLDAHCECT+ + E L
Sbjct: 225 RIGLVNARLLGANEAKGEVLTFLDAHCECTVGWLEPLL 262
>gi|242020557|ref|XP_002430719.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515909|gb|EEB17981.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 511
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 266/410 (64%), Positives = 330/410 (80%), Gaps = 5/410 (1%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S+VIVFHNEAWS LLRTV SVI RSPR +L EI+LVDD STR+FLK LDEYVA
Sbjct: 65 ELLP-TSVVIVFHNEAWSTLLRTVQSVIDRSPRELLTEIILVDDGSTRKFLKEDLDEYVA 123
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
+L VP +VIR+ R GLI+AR++GA++A+G++L FLDAHCECT GWLE L+ RV+EDR +
Sbjct: 124 RLPVPVKVIRTKEREGLIRARMIGAKEAKGQVLTFLDAHCECTKGWLEPLLVRVSEDRKK 183
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVIDII+D TFAYVRSFELHWGAFNW LHFRWYT G+++ ++ D TEPF TPAMA
Sbjct: 184 VVCPVIDIINDDTFAYVRSFELHWGAFNWNLHFRWYTLGTTEIKKRKNDVTEPFPTPAMA 243
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAI R YF+ IGAYDE+M++WGGENLEMSFR WQCGGS+EI PCSHV HLFRK+SPY
Sbjct: 244 GGLFAIRRDYFYEIGAYDEQMKIWGGENLEMSFRGWQCGGSVEIVPCSHVGHLFRKSSPY 303
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGGV E+L+ NLARVALVWMDEW EF+FKFNPEA +QRDKQ+VR+R++LR +LKC SF
Sbjct: 304 TFPGGVGEILHANLARVALVWMDEWQEFFFKFNPEAARQRDKQSVRARIQLRSRLKCKSF 363
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +VWP HF P +D+FFG I+ +KC+ +P S NQ +G +L PC L
Sbjct: 364 EWYLDNVWPQHFFPKNDRFFGLIKSASDNKCLTRPHGPPSTNQPTGVVTLTPCKE--TLE 421
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDK 604
FV +D++ TDESVCLD+ + +N++ +V+++ACSG RQ+W YDK
Sbjct: 422 HFFVYTKFSDVM-TDESVCLDLLD-KNEMKAKVKVMACSGSPRQKWMYDK 469
Score = 202 bits (515), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 100/159 (62%), Positives = 123/159 (77%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
S ++ + LY INRFNLL SDRIP+NRTLPDVRK +C TK N L +S+VIVFHNEA
Sbjct: 20 SEMTKMKALYTINRFNLLASDRIPLNRTLPDVRKKRCLTKYQNLPELLPTSVVIVFHNEA 79
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRTV SVI RSPR +L EI+LVDD STR+FLK LDEYVA+L VP +VIR+ R G
Sbjct: 80 WSTLLRTVQSVIDRSPRELLTEIILVDDGSTRKFLKEDLDEYVARLPVPVKVIRTKEREG 139
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
LI+AR++GA++A+G++L FLDAHCECT + E L + S
Sbjct: 140 LIRARMIGAKEAKGQVLTFLDAHCECTKGWLEPLLVRVS 178
>gi|332030446|gb|EGI70134.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Acromyrmex
echinatior]
Length = 595
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 261/395 (66%), Positives = 320/395 (81%), Gaps = 2/395 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+SI+IVFHNEAWS LLRTVHSVI+RSP+ +L+EI+LVDD S REFLK+SLD+YV L
Sbjct: 163 LPKTSIIIVFHNEAWSTLLRTVHSVINRSPKELLEEIILVDDNSEREFLKNSLDDYVKNL 222
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
SV TRV+RS R+GLIKARLLGA A+GE+L FLDAHCECT+GWLE L+ V ++ TR+V
Sbjct: 223 SVSTRVLRSNERIGLIKARLLGANDAKGEVLTFLDAHCECTIGWLEPLLEAVGKNATRIV 282
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVIDII+D TF+Y RSFELHWGAFNW+LHFRW T +R + EPF+TPAMAGG
Sbjct: 283 APVIDIINDNTFSYTRSFELHWGAFNWDLHFRWLTLNGRLLKERRDNIVEPFRTPAMAGG 342
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+++R YFF +G+YD++M++WGGENLE+SFR WQCGGSIEIAPCSHV HLFRK+SPY+F
Sbjct: 343 LFSMNRDYFFKLGSYDDQMRIWGGENLELSFRAWQCGGSIEIAPCSHVGHLFRKSSPYTF 402
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV ++LYGNLARVALVWMD+WAEFYFKFNPEA + R KQ VRSRL LR++L+C SF+W
Sbjct: 403 PGGVGDILYGNLARVALVWMDQWAEFYFKFNPEAARLRYKQQVRSRLALREKLQCKSFEW 462
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +VWP HF P DD+FFGR+ H T KC+ +P AKGS Q SG A L C P+L+QM
Sbjct: 463 YLENVWPEHFFPTDDRFFGRVVHAGTKKCIMRPAAKGSYGQPSGNAVLHSCIPRPMLSQM 522
Query: 557 FVMKLPTDLIATDESVCLDVPEYE-NDISPRVRIL 590
FVM +I TDESVCLD PE + +P+++++
Sbjct: 523 FVMT-KNGVIMTDESVCLDAPERDMQQKTPKMKVI 556
Score = 244 bits (624), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 125/213 (58%), Positives = 152/213 (71%), Gaps = 30/213 (14%)
Query: 15 NQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSD--------------------------- 47
NQ+DYID+RG+HVVVGHY+GDSVD N++
Sbjct: 59 NQKDYIDQRGIHVVVGHYIGDSVDSLKIPNITKDVINQNLFNPLPFEGKNGEPVVIHPKD 118
Query: 48 ---AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
QQLYQINRFNL+ SD+IP+NR+LPDVRK KC ++ N LPK+SI+IVFHNEAWS
Sbjct: 119 FYKMQQLYQINRFNLMASDKIPLNRSLPDVRKKKCISRYTNLGNLPKTSIIIVFHNEAWS 178
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTVHSVI+RSP+ +L+EI+LVDD S REFLK+SLD+YV LSV TRV+RS R+GLI
Sbjct: 179 TLLRTVHSVINRSPKELLEEIILVDDNSEREFLKNSLDDYVKNLSVSTRVLRSNERIGLI 238
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
KARLLGA A+GE+L FLDAHCECT+ + E L
Sbjct: 239 KARLLGANDAKGEVLTFLDAHCECTIGWLEPLL 271
>gi|189240187|ref|XP_975207.2| PREDICTED: similar to AGAP008229-PA [Tribolium castaneum]
Length = 575
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 255/420 (60%), Positives = 321/420 (76%), Gaps = 2/420 (0%)
Query: 187 ECTLVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
+C +F + PK+SI+IVFHNEAWS LLRTV SVI+RSP +L+EI+LVDD+S R+FL
Sbjct: 111 KCATLFGDYPTYPKTSIIIVFHNEAWSTLLRTVWSVINRSPPELLEEIILVDDSSERKFL 170
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K LD+YVA L VPT+V+RS R+GLIKARL GA A+G +L FLDAHCECT GWLE L+
Sbjct: 171 KKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEALL 230
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
+ + +DRT VVCPVIDII+D TFAYV+SFELHWGAFNW L FRW+T G + +++ D T
Sbjct: 231 SVIKQDRTAVVCPVIDIINDDTFAYVKSFELHWGAFNWNLQFRWFTLGGRELKLRKNDAT 290
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+PF TP MAGGLFAIDR YFF +GAYD+ M +WGGENLEMSFR+WQCGG ++IAPCS V
Sbjct: 291 QPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQCGGKVQIAPCSRVG 350
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
HLFRK+SPYSFPGG+++ L+ NLARVA VWMD+WA FYFKFN A++ +++Q V SR+EL
Sbjct: 351 HLFRKSSPYSFPGGINKTLFSNLARVARVWMDDWARFYFKFNEPADRIKNEQNVTSRIEL 410
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
R++ KC F+WYL +VWP HF P DD+FFGRIR++ + C+ KP K NQ G A +
Sbjct: 411 RRKHKCKGFEWYLDNVWPQHFFPKDDRFFGRIRNLGQNMCLIKPQKKVVSNQPMGIAKID 470
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
C V+ +MFVM I TD+S+CLD PE +VRI+ACSG++RQ+W YDKE
Sbjct: 471 MCLGDEVILEMFVM-TKEGFIMTDDSICLDAPEKVVIGPSKVRIMACSGYSRQKWVYDKE 529
Score = 216 bits (549), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 114/213 (53%), Positives = 141/213 (66%), Gaps = 28/213 (13%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSV--------------------DGGLHSN-------- 44
A NQ++Y+D+ GV VVVGHY+G+ V G + N
Sbjct: 18 ANNQEEYVDKNGVKVVVGHYIGNPVHAIPNATHDMINENNYDPRPGAGKNGNPVVIEPKD 77
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L QQ +QINRFNLL SDRIP+NR+LPD R+ KC T + PK+SI+IVFHNEAWS
Sbjct: 78 LLKMQQYFQINRFNLLASDRIPLNRSLPDFRRKKCATLFGDYPTYPKTSIIIVFHNEAWS 137
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV SVI+RSP +L+EI+LVDD+S R+FLK LD+YVA L VPT+V+RS R+GLI
Sbjct: 138 TLLRTVWSVINRSPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLI 197
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
KARL GA A+G +L FLDAHCECT + E L
Sbjct: 198 KARLKGALVAKGPVLTFLDAHCECTTGWLEALL 230
>gi|270011650|gb|EFA08098.1| hypothetical protein TcasGA2_TC005702 [Tribolium castaneum]
Length = 607
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 255/420 (60%), Positives = 321/420 (76%), Gaps = 2/420 (0%)
Query: 187 ECTLVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
+C +F + PK+SI+IVFHNEAWS LLRTV SVI+RSP +L+EI+LVDD+S R+FL
Sbjct: 143 KCATLFGDYPTYPKTSIIIVFHNEAWSTLLRTVWSVINRSPPELLEEIILVDDSSERKFL 202
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K LD+YVA L VPT+V+RS R+GLIKARL GA A+G +L FLDAHCECT GWLE L+
Sbjct: 203 KKPLDDYVANLPVPTKVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEALL 262
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
+ + +DRT VVCPVIDII+D TFAYV+SFELHWGAFNW L FRW+T G + +++ D T
Sbjct: 263 SVIKQDRTAVVCPVIDIINDDTFAYVKSFELHWGAFNWNLQFRWFTLGGRELKLRKNDAT 322
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+PF TP MAGGLFAIDR YFF +GAYD+ M +WGGENLEMSFR+WQCGG ++IAPCS V
Sbjct: 323 QPFNTPTMAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQCGGKVQIAPCSRVG 382
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
HLFRK+SPYSFPGG+++ L+ NLARVA VWMD+WA FYFKFN A++ +++Q V SR+EL
Sbjct: 383 HLFRKSSPYSFPGGINKTLFSNLARVARVWMDDWARFYFKFNEPADRIKNEQNVTSRIEL 442
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
R++ KC F+WYL +VWP HF P DD+FFGRIR++ + C+ KP K NQ G A +
Sbjct: 443 RRKHKCKGFEWYLDNVWPQHFFPKDDRFFGRIRNLGQNMCLIKPQKKVVSNQPMGIAKID 502
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
C V+ +MFVM I TD+S+CLD PE +VRI+ACSG++RQ+W YDKE
Sbjct: 503 MCLGDEVILEMFVMT-KEGFIMTDDSICLDAPEKVVIGPSKVRIMACSGYSRQKWVYDKE 561
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 114/213 (53%), Positives = 141/213 (66%), Gaps = 28/213 (13%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSV--------------------DGGLHSN-------- 44
A NQ++Y+D+ GV VVVGHY+G+ V G + N
Sbjct: 50 ANNQEEYVDKNGVKVVVGHYIGNPVHAIPNATHDMINENNYDPRPGAGKNGNPVVIEPKD 109
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L QQ +QINRFNLL SDRIP+NR+LPD R+ KC T + PK+SI+IVFHNEAWS
Sbjct: 110 LLKMQQYFQINRFNLLASDRIPLNRSLPDFRRKKCATLFGDYPTYPKTSIIIVFHNEAWS 169
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV SVI+RSP +L+EI+LVDD+S R+FLK LD+YVA L VPT+V+RS R+GLI
Sbjct: 170 TLLRTVWSVINRSPPELLEEIILVDDSSERKFLKKPLDDYVANLPVPTKVLRSQARIGLI 229
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
KARL GA A+G +L FLDAHCECT + E L
Sbjct: 230 KARLKGALVAKGPVLTFLDAHCECTTGWLEALL 262
>gi|391342179|ref|XP_003745400.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Metaseiulus occidentalis]
Length = 610
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 260/488 (53%), Positives = 334/488 (68%), Gaps = 8/488 (1%)
Query: 118 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 177
P+ + E +L +A++ L+ + + +P+ V+RS + + LL + + +
Sbjct: 79 PKENITEAVL--NANSYNPLEGAGNMGQPVYPLPSEVVRSKMLYSINRFNLLVSDKISVD 136
Query: 178 ILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD 237
+ C + LP +S++IVFHNEAWS LLRTVHSVI+RSPR ++KEI+LVD
Sbjct: 137 RTLADARKSVCRNISYAYDLPDTSVIIVFHNEAWSTLLRTVHSVINRSPRDLVKEIMLVD 196
Query: 238 DASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 297
DAS REFLK SLD YV L+ P +VIRSP R GLI+ARL+GAR AEG++L FLDAHCECT
Sbjct: 197 DASDREFLKRSLDAYVRSLNFPIKVIRSPKRSGLIRARLMGARAAEGKVLTFLDAHCECT 256
Query: 298 LGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDA 357
GWLE L+ R+ EDRTRVVCP+IDII D TFAYV+SFELHWGA NWE+HFRWY G
Sbjct: 257 TGWLEPLLQRIKEDRTRVVCPIIDIIHDDTFAYVKSFELHWGAINWEMHFRWYPVGPHVL 316
Query: 358 IIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIE 417
+ D +EPFKTP MAGGLF+ID+ YF+ +GAYDE+M +WGGEN+EMSFR+WQCGGS+E
Sbjct: 317 KQRHGDPSEPFKTPVMAGGLFSIDKEYFYEMGAYDEQMDIWGGENVEMSFRIWQCGGSLE 376
Query: 418 IAPCSHVAHLFRKASPYSF--PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRD 475
I PCSHV H+FR++SPY+F P GV +L+ NLARVA VWMD+WAEFYF N EA+K R
Sbjct: 377 IVPCSHVGHVFRRSSPYTFPHPKGVGGILFSNLARVAEVWMDDWAEFYFNMNTEAKKLRS 436
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSM 535
V R LR +L C F WYLT+VWP +F P +++FFG+IR+ KC +P++K S
Sbjct: 437 TMDVAKRKALRDRLHCKPFSWYLTNVWPENFFPSENRFFGKIRNRAAEKCFGRPVSK-SY 495
Query: 536 NQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF 595
+Q G L C Q FVM L+ TDESVCLD PE D + V ++AC G
Sbjct: 496 HQPIGKVKLEDCAVTHYARQYFVMTGEGYLM-TDESVCLDSPEGYEDTN--VVMIACQGI 552
Query: 596 NRQRWTYD 603
RQ+W +D
Sbjct: 553 QRQKWRFD 560
Score = 218 bits (556), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 113/223 (50%), Positives = 148/223 (66%), Gaps = 31/223 (13%)
Query: 7 VQLLKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDA------------------ 48
+Q L+ +Q++YID++G+HV+VGHY+G + N+++A
Sbjct: 45 LQWLRNDDHQKEYIDKKGIHVIVGHYMGKDLPWDPKENITEAVLNANSYNPLEGAGNMGQ 104
Query: 49 ------------QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
+ LY INRFNLLVSD+I V+RTL D RK C+ + + LP +S++I
Sbjct: 105 PVYPLPSEVVRSKMLYSINRFNLLVSDKISVDRTLADARKSVCRNISYAYD-LPDTSVII 163
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
VFHNEAWS LLRTVHSVI+RSPR ++KEI+LVDDAS REFLK SLD YV L+ P +VIR
Sbjct: 164 VFHNEAWSTLLRTVHSVINRSPRDLVKEIMLVDDASDREFLKRSLDAYVRSLNFPIKVIR 223
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
SP R GLI+ARL+GAR AEG++L FLDAHCECT + E L +
Sbjct: 224 SPKRSGLIRARLMGARAAEGKVLTFLDAHCECTTGWLEPLLQR 266
>gi|321463472|gb|EFX74488.1| hypothetical protein DAPPUDRAFT_307282 [Daphnia pulex]
Length = 612
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 234/426 (54%), Positives = 307/426 (72%), Gaps = 21/426 (4%)
Query: 190 LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
L +N LP +S++IVFHNEAWS LLRTVHSVI+RSP +L EI+LVDDAS R FLK L
Sbjct: 148 LSYNINELPTTSVIIVFHNEAWSTLLRTVHSVINRSPPKLLWEIILVDDASNRTFLKKPL 207
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
+++V+ L V+RS R+GL++ARL+GAR+A G++L FLDAHCE T GWL+ L+ R+
Sbjct: 208 EDHVSVLPTTIIVLRSEKRIGLVRARLMGAREATGDVLTFLDAHCETTDGWLQPLLYRIK 267
Query: 310 EDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAII--KRKDFTEP 367
+ VCP+IDIISD TFA +RSFELH G +W LHFRW+ G+S+ ++ +R + + P
Sbjct: 268 TNPNVAVCPIIDIISDDTFALLRSFELHHGGMSWNLHFRWF--GASETLMAERRGNMSIP 325
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
F+TP MAGGLFAI R YF IG YD++M +WGGEN+EMS R+WQCGG +E +PCSHVAH+
Sbjct: 326 FRTPVMAGGLFAIGRDYFQEIGTYDDQMDIWGGENIEMSLRIWQCGGRVETSPCSHVAHV 385
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK+SPY+FPGGV+++LY NLAR ALVWMDEW EF+FK NPEA + RD Q++RSRL LR
Sbjct: 386 FRKSSPYTFPGGVNQILYSNLARAALVWMDEWKEFFFKMNPEANRLRDAQSIRSRLSLRS 445
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+LKC+ F WYL +VWP +F P D+FFG+I H T KC+E+PL G ++Q G L+PC
Sbjct: 446 RLKCNDFNWYLKNVWPENFFPGPDRFFGKIIHRATGKCLERPLGGGGVSQPIGRLKLVPC 505
Query: 548 ------THLPVLTQMFVMK----LPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
HL V T K LP L+ +DES+C+D + E R +ACSGF++
Sbjct: 506 VQQWFDAHLWVATWNSTRKDGRTLPGYLM-SDESLCVDHLDGE------ARAMACSGFSK 558
Query: 598 QRWTYD 603
Q W ++
Sbjct: 559 QLWHHN 564
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 143/210 (68%), Gaps = 30/210 (14%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNL------------------------ 45
+K +NQ++YID RG+HVVVG Y+GDS+ G H NL
Sbjct: 46 IKFGKNQREYIDSRGIHVVVGRYVGDSLPGPSHPNLTEEVLNQNNFNPIPSQGEWGEPVA 105
Query: 46 ------SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFH 99
S +Q LY+I+RFNLL SDRIP+NRTLPD+RKP+CK +N LP +S++IVFH
Sbjct: 106 ILPHEHSKSQSLYRIHRFNLLASDRIPLNRTLPDIRKPQCKALSYNINELPTTSVIIVFH 165
Query: 100 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPG 159
NEAWS LLRTVHSVI+RSP +L EI+LVDDAS R FLK L+++V+ L V+RS
Sbjct: 166 NEAWSTLLRTVHSVINRSPPKLLWEIILVDDASNRTFLKKPLEDHVSVLPTTIIVLRSEK 225
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECT 189
R+GL++ARL+GAR+A G++L FLDAHCE T
Sbjct: 226 RIGLVRARLMGAREATGDVLTFLDAHCETT 255
>gi|157113401|ref|XP_001657811.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108877741|gb|EAT41966.1| AAEL006452-PA [Aedes aegypti]
Length = 661
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 238/470 (50%), Positives = 300/470 (63%), Gaps = 63/470 (13%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAWS LLRTV SVI RSPR ++KEILLVDDAS R FLK+ L+ YV KL
Sbjct: 142 LPTTSIIIVFHNEAWSVLLRTVWSVIIRSPRHLIKEILLVDDASDRRFLKNDLENYVQKL 201
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V ++R R GL+ ARL+GAR A G+ L FLDAHCEC+ GWLE L+ARV E+ +VV
Sbjct: 202 PVVISILRLNKREGLVAARLMGARVATGDTLTFLDAHCECSPGWLEPLLARVQENPKKVV 261
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVIDIISD F+Y++SFE HWGAFNW++HFRWYT + +RKD T PF TPAMAGG
Sbjct: 262 CPVIDIISDDNFSYIKSFEFHWGAFNWQMHFRWYTLSDEELAERRKDTTMPFHTPAMAGG 321
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF IDR YFF +GAYDE +++WGG+NLEMSFR+WQCGG IEIAPCSHV HLFRK+SPY+F
Sbjct: 322 LFTIDRKYFFDVGAYDERLKIWGGDNLEMSFRIWQCGGEIEIAPCSHVGHLFRKSSPYTF 381
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGVS +L NLARVALVWMD+WA+F+FKFN E+ + V SR+ L+K L C SF W
Sbjct: 382 PGGVSGILNENLARVALVWMDDWAKFFFKFNKGTEEFKSLN-VSSRVALKKHLSCKSFDW 440
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHV------------------------------------ 520
YL +WP +F P +KFFGRI+ +
Sbjct: 441 YLRKIWPQNFFPAPNKFFGRIQPIDLSTFDHQEYITLMKKINLIVKNLNPELKWKFLIKY 500
Query: 521 ----------------QTHKCVEKPLAKGSMNQASGPASLLPCTHL-PVLTQMFVMKLPT 563
T C++KP + +NQ G + L C+ L VL + FV+
Sbjct: 501 LSENVKKIGDSMKAAKHTSFCLQKPKSNSIINQPYGQSFLKKCSLLINVLDEQFVID-DY 559
Query: 564 DLIATDESVCLDV-------PEYENDISPRVRILAC-SGFNRQRWTYDKE 605
+ TDE VCLD E + + +++++ C S + QRW Y+ +
Sbjct: 560 GRVMTDEGVCLDSFRKTSVDGEQIIEEARKIKMVTCGSNKSAQRWVYETD 609
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 98/195 (50%), Positives = 124/195 (63%), Gaps = 27/195 (13%)
Query: 21 DRRGVHVVVGHYLGDSVDGGLHSNLSDA--------------------------QQLYQI 54
DR + +VGHY+G G + + + QQL+QI
Sbjct: 49 DRPRIPQIVGHYVGVGSVGNVSKDFLNTNNFAPVPGVGENGDPVVIQAKDLLLMQQLFQI 108
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
NR+NLL SDR+ +NR+LPDVRK KC +K + + LP +SI+IVFHNEAWS LLRTV SVI
Sbjct: 109 NRYNLLASDRVALNRSLPDVRKSKCVSKEYPSK-LPTTSIIIVFHNEAWSVLLRTVWSVI 167
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSPR ++KEILLVDDAS R FLK+ L+ YV KL V ++R R GL+ ARL+GAR A
Sbjct: 168 IRSPRHLIKEILLVDDASDRRFLKNDLENYVQKLPVVISILRLNKREGLVAARLMGARVA 227
Query: 175 EGEILVFLDAHCECT 189
G+ L FLDAHCEC+
Sbjct: 228 TGDTLTFLDAHCECS 242
>gi|158296916|ref|XP_317241.4| AGAP008229-PA [Anopheles gambiae str. PEST]
gi|157014942|gb|EAA12407.4| AGAP008229-PA [Anopheles gambiae str. PEST]
Length = 663
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 211/324 (65%), Positives = 253/324 (78%), Gaps = 1/324 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAWS LLRTV SVI+RSP+ +++EILLVDDAS R FLK LD YV KL
Sbjct: 142 LPTTSIIIVFHNEAWSVLLRTVWSVINRSPKGLVREILLVDDASDRRFLKHELDNYVQKL 201
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ ++R R GL+ ARLLGAR A G+ L FLDAHCEC+ GWLE L+ARV E+ +VV
Sbjct: 202 PLSVTILRLNKREGLVAARLLGARMATGDTLTFLDAHCECSPGWLEPLLARVQENPKKVV 261
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVIDIISD F+Y++SFE HWGAFNW LHFRWYT + +RKD T PF+TPAMAGG
Sbjct: 262 CPVIDIISDDNFSYIKSFEFHWGAFNWPLHFRWYTLSDEELAERRKDTTTPFRTPAMAGG 321
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF IDR YFF IGAYDE +++WGG+NLEMSFRVWQCGG +EIAPCSHV HLFRK+SPY+F
Sbjct: 322 LFTIDRKYFFDIGAYDERLKIWGGDNLEMSFRVWQCGGEVEIAPCSHVGHLFRKSSPYTF 381
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGVS +L NLARVALVWMD+WA+F+FKFN E+ + V +RL L++ L C SF W
Sbjct: 382 PGGVSGILNENLARVALVWMDDWAKFFFKFNKGTEEFKSLN-VSNRLALKRSLNCKSFDW 440
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHV 520
YL VWP +F P +KFFGRI+ +
Sbjct: 441 YLRQVWPQNFFPAPNKFFGRIQPI 464
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 100/197 (50%), Positives = 127/197 (64%), Gaps = 27/197 (13%)
Query: 19 YIDRRGVHVVVGHYLGDSVDGGLHSNLSDA--------------------------QQLY 52
+ DR + +VGHY+G G L + + QQL+
Sbjct: 47 FPDRPRIPQIVGHYVGVGTVGNLSKDFMNTNNFDPIPGVGEHGDPVVIQAKDLLKMQQLF 106
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
QINR+NLL SDRI +NR+LPDVRKPKC +K++ + LP +SI+IVFHNEAWS LLRTV S
Sbjct: 107 QINRYNLLASDRIALNRSLPDVRKPKCVSKLYPAK-LPTTSIIIVFHNEAWSVLLRTVWS 165
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
VI+RSP+ +++EILLVDDAS R FLK LD YV KL + ++R R GL+ ARLLGAR
Sbjct: 166 VINRSPKGLVREILLVDDASDRRFLKHELDNYVQKLPLSVTILRLNKREGLVAARLLGAR 225
Query: 173 QAEGEILVFLDAHCECT 189
A G+ L FLDAHCEC+
Sbjct: 226 MATGDTLTFLDAHCECS 242
>gi|350402581|ref|XP_003486533.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 3 [Bombus impatiens]
Length = 607
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 212/421 (50%), Positives = 289/421 (68%), Gaps = 17/421 (4%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L++YV L VPT V R+ R GLI+ARLLGA+ G+++ FLDAHCECT GWLE L++R
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+AEDRT VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P
Sbjct: 272 IAEDRTTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAP 331
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+
Sbjct: 332 LRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHV 391
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FR SPY+FPGGVS+V+ N ARVA VWMDEW +FY+ NPE + V R++LR+
Sbjct: 392 FRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAMNPEGARNVAVGDVSERIKLRE 451
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+LKC SF+WYL +++P +P+D + G +++V+T C++ +M + +G +
Sbjct: 452 RLKCKSFRWYLENIYPESPMPLDYYYLGDVQNVETQSCLD------TMGRRTGENVGISY 505
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDK 604
H Q+F +++ D +CLD SP+ V+I+ C G Q W Y++
Sbjct: 506 CHGLGGNQVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNE 557
Query: 605 E 605
E
Sbjct: 558 E 558
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK +N+ +LP +SIVIVFHNEAWS LLR
Sbjct: 122 QELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNK-YLPDTSIVIVFHNEAWSTLLR 180
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 181 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARL 240
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ G+++ FLDAHCECT + E L +
Sbjct: 241 LGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
>gi|307204529|gb|EFN83209.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Harpegnathos
saltator]
Length = 605
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 228/518 (44%), Positives = 328/518 (63%), Gaps = 55/518 (10%)
Query: 118 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR----SPGRVG----------- 162
P+++++E ++S R++ +S L +L P +V+R SPG +G
Sbjct: 64 PKNLVRETSSKQESSKRQYPQSRL-----QLWRPAKVVRENKGSPGEMGAAVHIPPENEA 118
Query: 163 ------------LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEA 210
L+ + ++ ++ +I LD C ++LP +SIVIVFHNEA
Sbjct: 119 KQQELFKLNQFNLMASDMISLNRSLKDIR--LDG---CKNKKYNKYLPDTSIVIVFHNEA 173
Query: 211 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 270
W+ LLRTV SVI+RSPRS+LKE++LVDDAS R+ LK L++Y+A L VPT V R+ R G
Sbjct: 174 WTTLLRTVWSVINRSPRSLLKEVILVDDASERDHLKQDLEDYIATLPVPTYVYRTEKRSG 233
Query: 271 LIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAY 330
LI+ARLLGA+ +G+++ FLDAHCECT GWLE L++R+A DR VVCP+ID+ISD TF Y
Sbjct: 234 LIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHTVVCPIIDVISDDTFEY 293
Query: 331 VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGA 390
+ + ++ WG FNW+L+FRWY + + D T P +TP MAGGLF+ID+ YF+ +GA
Sbjct: 294 IPASDMTWGGFNWKLNFRWYRVAQREMDRRNSDRTAPLRTPTMAGGLFSIDKEYFYELGA 353
Query: 391 YDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLAR 450
YDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+FR SPY+FPGGVS+++ N AR
Sbjct: 354 YDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPYTFPGGVSKIVLHNAAR 413
Query: 451 VALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD 510
VA VWMDEW +FY+ NP A + D V R++LR++LKC SF+WYL +++P +P+D
Sbjct: 414 VAEVWMDEWRDFYYAMNPGA-RNVDVGDVSERVKLRERLKCKSFRWYLENIYPESPMPLD 472
Query: 511 DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDE 570
+ G +++V+ C++ +M + +G + H Q+F +++ D
Sbjct: 473 YYYLGDVKNVEAQTCLD------TMGRRTGENVGISYCHGLGGNQVFAYTKRQQIMSDD- 525
Query: 571 SVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDKE 605
+CLD SP+ V+I+ C G Q W Y+ E
Sbjct: 526 -MCLDA------ASPQGPVKIVRCHGMGGNQAWVYNDE 556
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/156 (56%), Positives = 120/156 (76%), Gaps = 1/156 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
N + Q+L+++N+FNL+ SD I +NR+L D+R CK K +N+ +LP +SIVIVFHNEAW
Sbjct: 116 NEAKQQELFKLNQFNLMASDMISLNRSLKDIRLDGCKNKKYNK-YLPDTSIVIVFHNEAW 174
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
+ LLRTV SVI+RSPRS+LKE++LVDDAS R+ LK L++Y+A L VPT V R+ R GL
Sbjct: 175 TTLLRTVWSVINRSPRSLLKEVILVDDASERDHLKQDLEDYIATLPVPTYVYRTEKRSGL 234
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARLLGA+ +G+++ FLDAHCECT + E L +
Sbjct: 235 IRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 270
>gi|332025155|gb|EGI65335.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Acromyrmex
echinatior]
Length = 605
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 211/412 (51%), Positives = 286/412 (69%), Gaps = 14/412 (3%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
++LP +SIVIVFHNEAW+ LLRTV SVI+RSPRS+LKEI+LVDDAS RE LK L++YV
Sbjct: 158 KYLPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKQDLEDYVI 217
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
L VPT V R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L++R+A DR
Sbjct: 218 TLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSRIANDRHT 277
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P +TP MA
Sbjct: 278 VVCPIIDVISDDTFEYISASDMTWGGFNWKLNFRWYRVAQREMDRRNSDRTAPLRTPTMA 337
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+FR SPY
Sbjct: 338 GGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPY 397
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGGVS+++ N ARVA VWMDEW +FY+ NP A + D V R++LR++LKC SF
Sbjct: 398 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGA-RNVDVGDVSERIKLRERLKCKSF 456
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +P+D + G +++++T C++ +M + +G + H
Sbjct: 457 RWYLENIYPESPMPLDYYYLGDVKNIETQTCLD------TMGRRTGENVGISYCHGLGGN 510
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
Q+F +++ D +CLD N P V+I+ C G Q W Y E
Sbjct: 511 QVFAYTKRQQIMSDD--MCLDAA---NPQGP-VKIVRCHGMGGNQAWVYSDE 556
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/156 (57%), Positives = 118/156 (75%), Gaps = 1/156 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
N + Q+L+++N+FNL+ SD I +NR+L D+R CK K + ++LP +SIVIVFHNEAW
Sbjct: 116 NDAKQQELFKLNQFNLMASDMISLNRSLKDIRLEGCKNKKY-LKYLPDTSIVIVFHNEAW 174
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
+ LLRTV SVI+RSPRS+LKEI+LVDDAS RE LK L++YV L VPT V R+ R GL
Sbjct: 175 TTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKQDLEDYVITLPVPTYVYRTEKRSGL 234
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARLLGA+ +G+++ FLDAHCECT + E L +
Sbjct: 235 IRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 270
>gi|328723394|ref|XP_003247832.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Acyrthosiphon pisum]
Length = 615
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 214/420 (50%), Positives = 285/420 (67%), Gaps = 14/420 (3%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC +P +SIVIVFHNEAWS LLRTV SVI+RSPRS+LKEILLVDDAS R+FL
Sbjct: 159 ECKSKQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLG 218
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L++YVA L V T+V+R+ R GLI+ARLLGA+ G+++ FLDAHCEC GWLE L+A
Sbjct: 219 KKLEDYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLA 278
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+ +R VVCPVID+ISD TF YV + ++ WG FNW+L+FRWY + + +D T
Sbjct: 279 RIVLNRKTVVCPVIDVISDDTFEYVTASDMTWGGFNWKLNFRWYRVPQREMTRRNQDRTA 338
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P +TP MAGGLF+ID+ YF+ +G+YDE M +WGGENLEMSFRVWQCGG++EI PCSHV H
Sbjct: 339 PLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRVWQCGGTLEIIPCSHVGH 398
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FR SPYSFPGGVS+++ N ARVA VWMDEW +FY+ NP A + V RL LR
Sbjct: 399 VFRDKSPYSFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGASNV-EVGDVSERLALR 457
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLP 546
++LKC SF+WYL +++P +P+D + G I++V + +C++ +M++ SG +
Sbjct: 458 EKLKCKSFRWYLENIYPESQMPLDYYYLGEIKNVDSQQCLD------TMSRKSGEKVGMS 511
Query: 547 CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + +++ D CLD N + P V ++ C G Q W YD +
Sbjct: 512 YCHGLGGNQVFAYTKRSQIMSDDN--CLDA---SNIVGP-VSLIRCHGLEGNQAWVYDSK 565
Score = 176 bits (445), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/151 (56%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+Q ++ N+FN++ SD I +NR+L D+R+ +CK+K + +P +SIVIVFHNEAWS LLR
Sbjct: 130 KQKFKENQFNIIASDMISLNRSLQDIRQGECKSKQY-PTLMPTTSIVIVFHNEAWSTLLR 188
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPRS+LKEILLVDDAS R+FL L++YVA L V T+V+R+ R GLI+ARL
Sbjct: 189 TVWSVINRSPRSLLKEILLVDDASERDFLGKKLEDYVATLPVETKVLRTEKRSGLIRARL 248
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ G+++ FLDAHCEC + E L +
Sbjct: 249 LGAKHVTGQVITFLDAHCECADGWLEPLLAR 279
>gi|350402571|ref|XP_003486531.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus impatiens]
Length = 606
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 212/421 (50%), Positives = 289/421 (68%), Gaps = 18/421 (4%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L++YV L VPT V R+ R GLI+ARLLGA+ G+++ FLDAHCECT GWLE L++R
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+AEDRT VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P
Sbjct: 272 IAEDRTTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAP 331
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+
Sbjct: 332 LRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHV 391
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FR SPY+FPGGVS+V+ N ARVA VWMDEW +FY+ NP A + V R++LR+
Sbjct: 392 FRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAMNPGA-RNVAVGDVSERIKLRE 450
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+LKC SF+WYL +++P +P+D + G +++V+T C++ +M + +G +
Sbjct: 451 RLKCKSFRWYLENIYPESPMPLDYYYLGDVQNVETQSCLD------TMGRRTGENVGISY 504
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDK 604
H Q+F +++ D +CLD SP+ V+I+ C G Q W Y++
Sbjct: 505 CHGLGGNQVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNE 556
Query: 605 E 605
E
Sbjct: 557 E 557
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK +N+ +LP +SIVIVFHNEAWS LLR
Sbjct: 122 QELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNK-YLPDTSIVIVFHNEAWSTLLR 180
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 181 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARL 240
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ G+++ FLDAHCECT + E L +
Sbjct: 241 LGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
>gi|380030098|ref|XP_003698695.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis florea]
Length = 605
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 212/421 (50%), Positives = 289/421 (68%), Gaps = 18/421 (4%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK
Sbjct: 151 CKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 210
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV +L VPT V R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L++R
Sbjct: 211 DLEHYVKRLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 270
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+AEDRT VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P
Sbjct: 271 IAEDRTTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAP 330
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+
Sbjct: 331 LRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHV 390
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FR SPY+FPGGVS+V+ N ARVA VWMDEW +FY+ NP A + V R++LR+
Sbjct: 391 FRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAMNPGA-RNVAVGDVSERIKLRQ 449
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+LKC SF+WYL +++P +P+D + G +++V T C++ +M + +G +
Sbjct: 450 RLKCKSFRWYLENIYPESPMPLDYYYLGDVQNVDTQTCLD------TMGRRTGENVGISY 503
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDK 604
H Q+F +++ D +CLD SP+ V+I+ C G Q W Y++
Sbjct: 504 CHGLGGNQVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNE 555
Query: 605 E 605
E
Sbjct: 556 E 556
Score = 179 bits (453), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK +++ +LP +SIVIVFHNEAWS LLR
Sbjct: 121 QELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYSK-YLPDTSIVIVFHNEAWSTLLR 179
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L+ YV +L VPT V R+ R GLI+ARL
Sbjct: 180 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEHYVKRLPVPTYVYRTEKRSGLIRARL 239
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 240 LGAKHVKGQVITFLDAHCECTEGWLEPLLSR 270
>gi|48143331|ref|XP_397422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Apis mellifera]
Length = 606
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 212/421 (50%), Positives = 289/421 (68%), Gaps = 18/421 (4%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK
Sbjct: 152 CKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L++YV L VPT V R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L++R
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 271
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+AEDRT VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P
Sbjct: 272 IAEDRTTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAP 331
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+
Sbjct: 332 LRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHV 391
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FR SPY+FPGGVS+V+ N ARVA VWMDEW +FY+ NP A + V R++LR+
Sbjct: 392 FRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAMNPGA-RNVAVGDVSERIKLRE 450
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+LKC SF+WYL +++P +P+D + G +++V T C++ +M + +G +
Sbjct: 451 RLKCKSFRWYLENIYPESPMPLDYYYLGDVQNVDTQTCLD------TMGRRTGENVGISY 504
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDK 604
H Q+F +++ D +CLD SP+ V+I+ C G Q W Y++
Sbjct: 505 CHGLGGNQVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNE 556
Query: 605 E 605
E
Sbjct: 557 E 557
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK +++ +LP +SIVIVFHNEAWS LLR
Sbjct: 122 QELFKLNQFNLMASDMISLNRSLKDIRLDGCKTKKYSK-YLPDTSIVIVFHNEAWSTLLR 180
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 181 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARL 240
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 241 LGAKHVKGQVITFLDAHCECTEGWLEPLLSR 271
>gi|340712006|ref|XP_003394556.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Bombus terrestris]
gi|340712008|ref|XP_003394557.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus terrestris]
Length = 606
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 212/421 (50%), Positives = 289/421 (68%), Gaps = 18/421 (4%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L++YV L VPT V R+ R GLI+ARLLGA+ G+++ FLDAHCECT GWLE L++R
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+AEDRT VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P
Sbjct: 272 IAEDRTTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAP 331
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+
Sbjct: 332 LRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHV 391
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FR SPY+FPGGVS+V+ N ARVA VWMDEW +FY+ NP A + V R++LR+
Sbjct: 392 FRDKSPYTFPGGVSKVVLHNAARVAEVWMDEWRDFYYAMNPGA-RSVAVGDVSERIKLRE 450
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+LKC SF+WYL +++P +P+D + G +++V+T C++ +M + +G +
Sbjct: 451 RLKCKSFRWYLENIYPESPMPLDYFYLGDVQNVETQSCLD------TMGRRTGENVGISY 504
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDK 604
H Q+F +++ D +CLD SP+ V+I+ C G Q W Y++
Sbjct: 505 CHGLGGNQVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNE 556
Query: 605 E 605
E
Sbjct: 557 E 557
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK +N+ +LP +SIVIVFHNEAWS LLR
Sbjct: 122 QELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNK-YLPDTSIVIVFHNEAWSTLLR 180
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 181 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARL 240
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ G+++ FLDAHCECT + E L +
Sbjct: 241 LGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
>gi|242011902|ref|XP_002426682.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212510853|gb|EEB13944.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 605
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 209/409 (51%), Positives = 277/409 (67%), Gaps = 14/409 (3%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S+VIVFHNEAWS LLRTV SVI+RSP+ ++KEI+LVDDAS + L L+ YV
Sbjct: 155 ELLPTTSVVIVFHNEAWSTLLRTVWSVINRSPKPLIKEIILVDDASVQPHLGKKLENYVK 214
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
L VP V+R+P R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L+AR+ EDR
Sbjct: 215 TLPVPVTVLRTPKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARITEDRKT 274
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF Y+ + + WG FNW L+FRWY + + D T P +TP MA
Sbjct: 275 VVCPIIDVISDETFEYITASDTTWGGFNWRLNFRWYRVPKREMDRRNNDKTVPIRTPTMA 334
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI PCSHV H+FR SPY
Sbjct: 335 GGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEIVPCSHVGHVFRDKSPY 394
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGGVS+++ N RVA VWMDEW +FY+ NP A K+ + + SRL+LR+ LKC SF
Sbjct: 395 TFPGGVSQIVLHNANRVAEVWMDEWRDFYYAMNPGA-KKIEVGDITSRLKLREDLKCKSF 453
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYLT+++P +P+D F G I++V+T +C++ +M + SG + H
Sbjct: 454 RWYLTNIYPESTMPLDYYFLGDIKNVETEQCLD------TMGRKSGENVGMSYCHGYGGN 507
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTY 602
Q+F + A D CLD + V+++ C G Q W Y
Sbjct: 508 QVFSYTKRHQITADDN--CLDAAS----VRGPVKLVRCHGMGGNQEWKY 550
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNLL S+RI +NR+LPDVR CKTK + E LP +S+VIVFHNEAWS LLR
Sbjct: 118 KEMFKINQFNLLASERISLNRSLPDVRAKGCKTKKYFE-LLPTTSVVIVFHNEAWSTLLR 176
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSP+ ++KEI+LVDDAS + L L+ YV L VP V+R+P R GLI+ARL
Sbjct: 177 TVWSVINRSPKPLIKEIILVDDASVQPHLGKKLENYVKTLPVPVTVLRTPKRSGLIRARL 236
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 237 LGAKHVKGQVITFLDAHCECTEGWLEPLLAR 267
>gi|242001786|ref|XP_002435536.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215498872|gb|EEC08366.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 460
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 215/414 (51%), Positives = 279/414 (67%), Gaps = 18/414 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S+VIVFHNEAWS LLRTVHSVI SPR++L+EI+LVDDAS RE L L++YV
Sbjct: 14 EKLPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKQLEDYVV 73
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
KL P +V+R+ R GLI+ARLLGA +G+++ FLDAHCECT WLE L+AR+AEDRTR
Sbjct: 74 KLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLARIAEDRTR 133
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+ISD TF Y+ + +L WG FNW+L+FRWY + + D T P +TP MA
Sbjct: 134 VVCPVIDVISDETFEYISASDLTWGGFNWKLNFRWYRVPQRELDRRGGDRTLPVRTPTMA 193
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAID+ YF +G YDE M +WGGENLE+SFR+W CGG +EI PCSHV H+FRK++PY
Sbjct: 194 GGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHVGHVFRKSTPY 253
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG S+++ N AR+A VW+DEW EFYF NP A K DK + R LRK+LKC+SF
Sbjct: 254 TFPGGTSKIVNHNNARLAEVWLDEWKEFYFAINP-AAKNVDKGDLSHRRNLRKKLKCNSF 312
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +P+D G I+H + C++ + + SG + H
Sbjct: 313 RWYLENIYPESHMPLDYYHLGEIKHADSPVCLD------TFGRKSGENVAVSTCHGQGGN 366
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDKE 605
Q+F +++ D CLD SPR V++L C G Q W YDKE
Sbjct: 367 QVFAYTKRQQIMSDDN--CLDAS------SPRGPVKLLRCHGMGGNQLWIYDKE 412
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/123 (59%), Positives = 94/123 (76%), Gaps = 1/123 (0%)
Query: 77 PKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 136
P+CK KV+ E+ LP +S+VIVFHNEAWS LLRTVHSVI SPR++L+EI+LVDDAS RE
Sbjct: 5 PRCKDKVYPEK-LPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREH 63
Query: 137 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEF 196
L L++YV KL P +V+R+ R GLI+ARLLGA +G+++ FLDAHCECT + E
Sbjct: 64 LGKQLEDYVVKLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPL 123
Query: 197 LPK 199
L +
Sbjct: 124 LAR 126
>gi|195402751|ref|XP_002059968.1| GJ14949 [Drosophila virilis]
gi|194140834|gb|EDW57305.1| GJ14949 [Drosophila virilis]
Length = 666
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 220/467 (47%), Positives = 284/467 (60%), Gaps = 60/467 (12%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R FLK L+ Y+ L
Sbjct: 149 LPSTSVIIVFHNEAWSVLLRTITSVINRSPRQLLKEIILVDDASDRSFLKRQLEAYIKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARL+GA+ A G++L FLDAHCEC+ GWLE L+AR+ E R V+
Sbjct: 209 NVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLEPLLARIKESREVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYG-SSDAIIKRKDFTEPFKTPAMAG 375
CPVIDIISD F+Y ++FE HWGAFNW+L FRW++ +K KD T P TP MAG
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTSVKPKDSTAPIATPGMAG 328
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR YF+ +GAYD EM++WGGEN+EMSFR+WQCGG IEI+PCSHV H+FR ++PY+
Sbjct: 329 GLFAIDRKYFYEMGAYDSEMRIWGGENVEMSFRIWQCGGRIEISPCSHVGHIFRSSTPYT 388
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLKCHSF 494
FPGG+SEVL NLAR A VWMD+W F + RDK V RL LR++LKC F
Sbjct: 389 FPGGMSEVLTANLARAATVWMDDWQYFVMLYTAGLSLSARDKVNVTERLALREKLKCKPF 448
Query: 495 KWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT------------------ 522
WYL ++WP HF P D+FFG+I +H++
Sbjct: 449 SWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALSRDWKRAFEEID 508
Query: 523 --------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIAT 568
KC+ +PL P ++ CT MFV+ P I T
Sbjct: 509 NKAEEFMSLIDLERDKCL-RPLHDDVPRSTLQPVTVGDCTSHAQTMDMFVIT-PKGQIMT 566
Query: 569 DESVC----------LDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
+++VC L + N + V + C+ Q+WTYD +
Sbjct: 567 NDNVCLTYRPPKAGALKLLRNRNATTSNVMLTQCATEASQQWTYDMD 613
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 84/151 (55%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FN+L SDRIP+NRTL D R +C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFKLNSFNILASDRIPLNRTLKDYRTNECRDKRY-AHGLPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R FLK L+ Y+ L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRQLLKEIILVDDASDRSFLKRQLEAYIKVLNVPTRLYRMKERSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA+ A G++L FLDAHCEC+ + E L +
Sbjct: 229 MGAQHARGDVLTFLDAHCECSRGWLEPLLAR 259
>gi|158293352|ref|XP_314708.4| AGAP008613-PA [Anopheles gambiae str. PEST]
gi|157016664|gb|EAA10180.4| AGAP008613-PA [Anopheles gambiae str. PEST]
Length = 596
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 216/422 (51%), Positives = 280/422 (66%), Gaps = 14/422 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C LP +SIVIVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS RE
Sbjct: 130 HHDCKKKHYPAKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREH 189
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L L+EYV L VPT V+R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L
Sbjct: 190 LGRQLEEYVRTLPVPTFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 249
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ DR VVCP+ID+ISD TF YV + + WG FNW+L+FRWY + + + D
Sbjct: 250 LARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRRNHDR 309
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P +TP MAGGLF+IDR YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI+PCSHV
Sbjct: 310 TAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEISPCSHV 369
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FR SPY+FPGGV+ ++ N ARVA VW+DEW EFY++ +P A K V R
Sbjct: 370 GHVFRDKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFYYQMSPGARKA-SAGDVSERRA 428
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASL 544
LR++LKC SF+WYL +++P +P+D F G IR+V+TH C++ +M + S
Sbjct: 429 LRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVKTHNCLD------TMGRKSNEKIG 482
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
H Q+F +++ D CLD N + P V ++ C G Q W YD
Sbjct: 483 SSYCHGLGGNQVFAYTKRHQIMSDDN--CLDA---SNALGP-VNLVRCHGMGGNQEWIYD 536
Query: 604 KE 605
E
Sbjct: 537 DE 538
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR CK K + + LP +SIVIVFHNEAWS LLR
Sbjct: 103 KEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYPAK-LPTTSIVIVFHNEAWSTLLR 161
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS RE L L+EYV L VPT V+R+ R GLI+ARL
Sbjct: 162 TIWSVINRSPRPLLKEIILVDDASEREHLGRQLEEYVRTLPVPTFVLRTGKRSGLIRARL 221
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 222 LGAKHVKGQVITFLDAHCECTEGWLEPLLAR 252
>gi|383865231|ref|XP_003708078.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Megachile rotundata]
Length = 605
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/414 (50%), Positives = 288/414 (69%), Gaps = 18/414 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK L++YV
Sbjct: 158 KYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVK 217
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
L VPT V R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L+AR+AE+R+
Sbjct: 218 TLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAENRST 277
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P +TP MA
Sbjct: 278 VVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAPLRTPTMA 337
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+PCSHV H+FR SPY
Sbjct: 338 GGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISPCSHVGHVFRDKSPY 397
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGGVS+V+ N ARVA VWMDEW +FY+ NP A + V R++LR++LKC SF
Sbjct: 398 TFPGGVSKVVLHNAARVAEVWMDEWRDFYYAMNPGA-RNVAVGDVSERIKLRERLKCKSF 456
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +P+D + G ++++ T C++ +M + +G + H
Sbjct: 457 RWYLENIYPESPMPLDYYYLGDVQNIDTQTCLD------TMGRRTGENVGISYCHGLGGN 510
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDKE 605
Q+F +++ D +CLD SP+ V+I+ C G Q W Y++E
Sbjct: 511 QVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNEE 556
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK + ++LP +SIVIVFHNEAWS LLR
Sbjct: 121 QELFKLNQFNLMASDMISLNRSLRDIRLEGCKTKKY-PKYLPDTSIVIVFHNEAWSTLLR 179
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 180 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARL 239
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 240 LGAKHVKGQVITFLDAHCECTEGWLEPLLAR 270
>gi|328723396|ref|XP_001946856.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 1 [Acyrthosiphon pisum]
Length = 615
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/420 (49%), Positives = 284/420 (67%), Gaps = 14/420 (3%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC +P +SIVIVFHNEAWS LLRTV SVI+RSPRS+LKEILLVDDAS R+FL
Sbjct: 159 ECKSKQYPTLMPTTSIVIVFHNEAWSTLLRTVWSVINRSPRSLLKEILLVDDASERDFLG 218
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L++YVA L V T+V+R+ R GLI+ARLLGA+ G+++ FLDAHCEC GWLE L+A
Sbjct: 219 KKLEDYVATLPVETKVLRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECADGWLEPLLA 278
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+ +R VVCPVID+ISD TF YV + ++ WG FNW+L+FRWY + + +D T
Sbjct: 279 RIVLNRKTVVCPVIDVISDDTFEYVTASDMTWGGFNWKLNFRWYRVPQREMTRRNQDRTA 338
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P +TP MAGGLF+ID+ YF+ +G+YDE M +WGGENLEMSFR+W CGG++EI+PCSHV H
Sbjct: 339 PLRTPTMAGGLFSIDKDYFYQLGSYDEGMDIWGGENLEMSFRIWMCGGTLEISPCSHVGH 398
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK++PY+FPGG S ++ N AR+A VWMDEW FY+ NP A + V RL LR
Sbjct: 399 VFRKSTPYTFPGGTSHIVNHNNARLAEVWMDEWKHFYYAINPGASNV-EVGDVSERLALR 457
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLP 546
++LKC SF+WYL +++P +P+D + G I++V + +C++ +M++ SG +
Sbjct: 458 EKLKCKSFRWYLENIYPESQMPLDYYYLGEIKNVDSQQCLD------TMSRKSGEKVGMS 511
Query: 547 CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + +++ D CLD N + P V ++ C G Q W YD +
Sbjct: 512 YCHGLGGNQVFAYTKRSQIMSDDN--CLDA---SNIVGP-VSLIRCHGLEGNQAWVYDSK 565
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 85/151 (56%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+Q ++ N+FN++ SD I +NR+L D+R+ +CK+K + +P +SIVIVFHNEAWS LLR
Sbjct: 130 KQKFKENQFNIIASDMISLNRSLQDIRQGECKSKQY-PTLMPTTSIVIVFHNEAWSTLLR 188
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPRS+LKEILLVDDAS R+FL L++YVA L V T+V+R+ R GLI+ARL
Sbjct: 189 TVWSVINRSPRSLLKEILLVDDASERDFLGKKLEDYVATLPVETKVLRTEKRSGLIRARL 248
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ G+++ FLDAHCEC + E L +
Sbjct: 249 LGAKHVTGQVITFLDAHCECADGWLEPLLAR 279
>gi|307189895|gb|EFN74139.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Camponotus
floridanus]
Length = 608
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 208/414 (50%), Positives = 287/414 (69%), Gaps = 18/414 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
++LP +SIVIVFHNEAW+ LLRTV SVI+RSPRS+LKEI+LVDDAS RE LK L++++
Sbjct: 161 KYLPDTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKKELEKHIT 220
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
+L VPT V R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L++R+A DR
Sbjct: 221 ELPVPTYVYRTEKRSGLIRARLLGAKYVKGQVITFLDAHCECTEGWLEPLLSRIANDRHT 280
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P +TP MA
Sbjct: 281 VVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRNGDRTAPLRTPTMA 340
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG++EI+ CSHV H+FR SPY
Sbjct: 341 GGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFRVWQCGGTLEISSCSHVGHVFRDKSPY 400
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGGVS+++ N ARVA VWMDEW +FY+ NP A + D V R++LR++LKC SF
Sbjct: 401 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGA-RNVDVGDVSERIKLRERLKCKSF 459
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +P+D + G +++V+ C++ +M + +G + H
Sbjct: 460 RWYLENIYPESPMPLDYYYLGDVKNVEMQTCLD------TMGRRTGENVGISYCHGLGGN 513
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDKE 605
Q+F +++ D +CLD SP+ V+I+ C G Q W Y+ E
Sbjct: 514 QVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNNE 559
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 87/156 (55%), Positives = 119/156 (76%), Gaps = 1/156 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
N + Q+L+++N+FNL+ SD I +NR+L D+R CK K + ++LP +SIVIVFHNEAW
Sbjct: 119 NEAKQQELFKLNQFNLMASDLISLNRSLKDIRLEGCKNKKY-PKYLPDTSIVIVFHNEAW 177
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
+ LLRTV SVI+RSPRS+LKEI+LVDDAS RE LK L++++ +L VPT V R+ R GL
Sbjct: 178 TTLLRTVWSVINRSPRSLLKEIILVDDASEREHLKKELEKHITELPVPTYVYRTEKRSGL 237
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARLLGA+ +G+++ FLDAHCECT + E L +
Sbjct: 238 IRARLLGAKYVKGQVITFLDAHCECTEGWLEPLLSR 273
>gi|91088223|ref|XP_973543.1| PREDICTED: similar to polypeptide GalNAc transferase 5 CG31651-PA
[Tribolium castaneum]
gi|270011823|gb|EFA08271.1| hypothetical protein TcasGA2_TC005902 [Tribolium castaneum]
Length = 602
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 212/411 (51%), Positives = 280/411 (68%), Gaps = 14/411 (3%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
+ LP +SIVIVFHNEAWS LLRTV SVI+RSPR +LKEI+LVDDAS RE L L+EYV
Sbjct: 150 KLLPTTSIVIVFHNEAWSTLLRTVWSVINRSPRPLLKEIILVDDASEREHLGRKLEEYVQ 209
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
L VP V+R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L+AR+ +DR
Sbjct: 210 TLPVPVIVLRTHKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIVQDRKT 269
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P +TP MA
Sbjct: 270 VVCPIIDVISDETFEYITASDMTWGGFNWKLNFRWYRVPQREMERRNNDRTAPLRTPTMA 329
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF+ +G+YDE M +WGGENLEMSFRVWQCGG +EI PCSHV H+FR SPY
Sbjct: 330 GGLFSIDKEYFYELGSYDEGMDIWGGENLEMSFRVWQCGGKLEIIPCSHVGHVFRDKSPY 389
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGGVS+++ N ARVA VWMDEW +FY+ NP A + V +R ELR++LKC SF
Sbjct: 390 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGA-RSVPVGDVSARRELRERLKCKSF 448
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +V+P +P++ + G IR+V+T C++ +M + SG + H
Sbjct: 449 RWYLENVYPESQMPLEYYYLGDIRNVETKNCLD------TMGRKSGENLGMTYCHNLGGN 502
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDK 604
Q+F +++ D CLD N P V+++ C G Q W YD+
Sbjct: 503 QVFAYTKRQQIMSDDN--CLDA---SNKKGP-VKLVRCHGMGGNQAWAYDE 547
Score = 172 bits (435), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ +++N+FNLL SD I +NR+L DVR CK K + + LP +SIVIVFHNEAWS LLR
Sbjct: 113 KEKFKLNQFNLLASDMISLNRSLADVRLEGCKDKKY-PKLLPTTSIVIVFHNEAWSTLLR 171
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR +LKEI+LVDDAS RE L L+EYV L VP V+R+ R GLI+ARL
Sbjct: 172 TVWSVINRSPRPLLKEIILVDDASEREHLGRKLEEYVQTLPVPVIVLRTHKRSGLIRARL 231
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 232 LGAKHVKGQVITFLDAHCECTEGWLEPLLAR 262
>gi|442756891|gb|JAA70604.1| Putative polypeptide n-acetylgalactosaminyltransferase [Ixodes
ricinus]
Length = 582
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 213/414 (51%), Positives = 277/414 (66%), Gaps = 18/414 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S+ IVFHNEAWS LLRTVHSVI SPR++L+EI+LVDDAS RE L L++YV
Sbjct: 136 EKLPTTSVDIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKQLEDYVV 195
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
KL P +V+R+ R GLI+ARLLGA +G+++ FLDAHCECT WLE L+AR+AEDRTR
Sbjct: 196 KLDTPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQNWLEPLLARIAEDRTR 255
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+ISD TF Y+ + +L WG FNW+L+FR Y + + D T P +TP MA
Sbjct: 256 VVCPVIDVISDETFEYISASDLTWGGFNWKLNFRGYRVPQRELDRRGGDRTLPVRTPTMA 315
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAID+ YF +G YDE M +WGGENLE+SFR+W CGG +EI PCSHV H+FRK++PY
Sbjct: 316 GGLFAIDKDYFVELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHVGHVFRKSTPY 375
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG S+++ N AR+A VW+DEW EFYF NP A K DK + R LRK+LKC+SF
Sbjct: 376 TFPGGTSKIVNHNNARLAEVWLDEWKEFYFAINP-AAKNVDKGDLSHRRNLRKKLKCNSF 434
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +P+D G I+H + C++ + + SG + H
Sbjct: 435 RWYLENIYPESHMPLDYYHLGEIKHADSPVCLD------TFGRKSGENVAVSTCHGXXXN 488
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDKE 605
Q+F +++ D CLD SPR V++L C G Q W YDKE
Sbjct: 489 QVFAYTKRQQIMSDDN--CLDAS------SPRGPVKLLRCHGMGGNQLWIYDKE 534
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ +++N+FNLL SDRI +NR+LPDVR KCK KV+ E+ LP +S+ IVFHNEAWS LLR
Sbjct: 99 KEKFKLNQFNLLASDRIALNRSLPDVRLEKCKDKVYPEK-LPTTSVDIVFHNEAWSTLLR 157
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI SPR++L+EI+LVDDAS RE L L++YV KL P +V+R+ R GLI+ARL
Sbjct: 158 TVHSVIRTSPRALLEEIILVDDASEREHLGKQLEDYVVKLDTPVKVMRTGKRSGLIRARL 217
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA +G+++ FLDAHCECT + E L +
Sbjct: 218 LGAAAVKGQVITFLDAHCECTQNWLEPLLAR 248
>gi|427796213|gb|JAA63558.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 621
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 212/414 (51%), Positives = 279/414 (67%), Gaps = 18/414 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S+VIVFHNEAWS LLRTVHSVI SPR++L+EI+LVDDAS RE L L++YV
Sbjct: 175 EKLPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVDDASEREHLGKKLEDYVV 234
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
KL VP +V+R+ R GLI+ARLLGA +G+++ FLDAHCECT WLE L+AR+AEDRTR
Sbjct: 235 KLEVPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTQHWLEPLLARIAEDRTR 294
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P +TP MA
Sbjct: 295 VVCPVIDVISDETFEYISASDMTWGGFNWKLNFRWYRVPQREVERRGGDRTLPIRTPTMA 354
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF +G YDE M +WGGENLE+SFR+W CGG +EI PCSHV H+FRK++PY
Sbjct: 355 GGLFSIDKDYFNELGKYDEGMDIWGGENLELSFRIWMCGGELEIVPCSHVGHVFRKSTPY 414
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
SFPGG S ++ N AR+A VW+DEW +FYF NP A K DK + R +LR +LKC++F
Sbjct: 415 SFPGGTSRIVNHNNARLAEVWLDEWKDFYFAINP-AAKNVDKGDLSYRKQLRTKLKCNTF 473
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +P+D G I+H T C++ + + SG + H
Sbjct: 474 RWYLENIYPESHMPLDYYHLGEIKHADTSDCLD------TFGRKSGENVAVSKCHGMGGN 527
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDKE 605
Q+F +++ D CLD SPR V++L C G Q W Y+KE
Sbjct: 528 QVFAYTKRQQIMSDDN--CLDAS------SPRGPVKLLRCHGMGGNQLWIYNKE 573
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 98/190 (51%), Positives = 131/190 (68%), Gaps = 12/190 (6%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNR 69
++LA N D+ RG V +G + ++ +++N+FNLL SDRI +NR
Sbjct: 110 VELAENPSDWPGERGRGVEIGP-----------EEEALKKEKFKLNQFNLLASDRIALNR 158
Query: 70 TLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD 129
+LPDVR KCK KV+ E+ LP +S+VIVFHNEAWS LLRTVHSVI SPR++L+EI+LVD
Sbjct: 159 SLPDVRLEKCKDKVYPEK-LPTTSVVIVFHNEAWSTLLRTVHSVIRTSPRALLEEIILVD 217
Query: 130 DASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
DAS RE L L++YV KL VP +V+R+ R GLI+ARLLGA +G+++ FLDAHCECT
Sbjct: 218 DASEREHLGKKLEDYVVKLEVPVKVMRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECT 277
Query: 190 LVFNEEFLPK 199
+ E L +
Sbjct: 278 QHWLEPLLAR 287
>gi|195027660|ref|XP_001986700.1| GH20386 [Drosophila grimshawi]
gi|193902700|gb|EDW01567.1| GH20386 [Drosophila grimshawi]
Length = 666
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 219/466 (46%), Positives = 281/466 (60%), Gaps = 58/466 (12%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +L+EI+LVDDAS R FLK L+ Y+ L
Sbjct: 149 LPNTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASNRSFLKRQLEAYIQVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARLLGA+ A G++L FLDAHCEC+ GWLE L+AR+ E R V+
Sbjct: 209 AVPTRLYRMKERSGLVPARLLGAQHARGDVLTFLDAHCECSRGWLEPLLARIGESREVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYG-SSDAIIKRKDFTEPFKTPAMAG 375
CPVIDIISD F+Y ++FE HWGAFNW+L FRW++ KD T P TP MAG
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTTANTKDSTAPIATPGMAG 328
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR YF+ +GAYD +M++WGGEN+EMSFR+WQCGG IEI+PCSHV H+FR ++PY+
Sbjct: 329 GLFAIDRKYFYEMGAYDSDMRIWGGENVEMSFRIWQCGGRIEISPCSHVGHIFRSSTPYT 388
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLKCHSF 494
FPGG+SEVL NLAR A VWMD+W F + RDK V RL LR++L+C F
Sbjct: 389 FPGGMSEVLTANLARAATVWMDDWQYFVMLYTAGLSLSARDKVNVTDRLALREKLQCKPF 448
Query: 495 KWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT------------------ 522
WYL +WP HF P D+FFGRI +H++
Sbjct: 449 SWYLETIWPEHFFPAPDRFFGRIIWLDGETECTQAYSKHMKNLPGRALSREWKRAFEEID 508
Query: 523 HKCVE-------------KPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATD 569
HK E +PL + P ++ CT MFV+ P I T+
Sbjct: 509 HKAEEFMSLIDLERDKCLRPLHDNVPRSSLQPVTVGDCTSHAQTMDMFVIT-PKGQIMTN 567
Query: 570 ESVC----------LDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
++VC L + N V + C+ Q+WTYD +
Sbjct: 568 DNVCLTYRPPKQGALKLLRNRNATISNVMLTQCAIEPSQQWTYDMD 613
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 84/151 (55%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FN+L SDRIP+NRTL D R +C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFKLNSFNILASDRIPLNRTLKDYRTGECRDKRYANS-LPNTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +L+EI+LVDDAS R FLK L+ Y+ L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLREIILVDDASNRSFLKRQLEAYIQVLAVPTRLYRMKERSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAQHARGDVLTFLDAHCECSRGWLEPLLAR 259
>gi|157135226|ref|XP_001663438.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870268|gb|EAT34493.1| AAEL013274-PA [Aedes aegypti]
Length = 592
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 214/422 (50%), Positives = 280/422 (66%), Gaps = 14/422 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C LP +SIVIVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R+
Sbjct: 127 HHDCKKKHYPTKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L L++YV L V T V+R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L
Sbjct: 187 LGQQLEDYVQTLPVHTYVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 246
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ DR VVCP+ID+ISD TF YV + + WG FNW+L+FRWY + + + D
Sbjct: 247 LARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRRNHDR 306
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P +TP MAGGLF+IDR YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EIAPCSHV
Sbjct: 307 TAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHV 366
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FR SPY+FPGGV+ ++ N ARVA VW+DEW EFY++ +P A K V R E
Sbjct: 367 GHVFRDKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFYYQMSPGARKA-SAGDVSERKE 425
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASL 544
LR++LKC SF+WYL +++P +P+D F G IR+V+T C++ +M + S
Sbjct: 426 LRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVETGNCLD------TMGRKSNEKIG 479
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
H Q+F +++ D CLD N + P V ++ C G Q W YD
Sbjct: 480 SSYCHGLGGNQVFAYTKRHQVMSDDN--CLDA---SNALGP-VNLVRCHGMGGNQEWVYD 533
Query: 604 KE 605
+E
Sbjct: 534 EE 535
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/176 (51%), Positives = 118/176 (67%), Gaps = 6/176 (3%)
Query: 29 VGHYLGDSVDGGLHSNLSDAQQ-----LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKV 83
VG G + G + +QQ ++ N+FNLL SD I +NR+L DVR CK K
Sbjct: 75 VGENYGKPGELGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKH 134
Query: 84 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 143
+ + LP +SIVIVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R+ L L++
Sbjct: 135 YPTK-LPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDHLGQQLED 193
Query: 144 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
YV L V T V+R+ R GLI+ARLLGA+ +G+++ FLDAHCECT + E L +
Sbjct: 194 YVQTLPVHTYVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLAR 249
>gi|357624971|gb|EHJ75544.1| hypothetical protein KGM_17358 [Danaus plexippus]
Length = 626
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 210/411 (51%), Positives = 273/411 (66%), Gaps = 18/411 (4%)
Query: 196 FLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK 255
LP +S+VIVFHNEAW+ LLRT+ S I+RSPR +LKEI+LVDDAS +E L L+EY+
Sbjct: 178 LLPTTSVVIVFHNEAWTTLLRTIWSTINRSPRPLLKEIILVDDASEKEHLGKKLEEYIKT 237
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L V TR+ R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L++R+ EDR+ V
Sbjct: 238 LPVSTRLFRTESRSGLIRARLLGAKHVKGDVITFLDAHCECTEGWLEPLLSRIVEDRSTV 297
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCP+ID+ISD TF Y+++ ++ WG FNW+L+FRWY + + D T P +TP MAG
Sbjct: 298 VCPIIDVISDTTFEYIQASDMTWGGFNWKLNFRWYRVPEREMQRRGGDRTAPLRTPTMAG 357
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR YF+ IG+YDE M +WGGENLEMSFRVWQCGG +EI PCSHV H+FR SPYS
Sbjct: 358 GLFAIDREYFYKIGSYDEGMDIWGGENLEMSFRVWQCGGVLEIVPCSHVGHVFRDKSPYS 417
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
FPGGV V+ N ARVA VWMDEW EFY+ NP A V R LR++LKC SF+
Sbjct: 418 FPGGVQAVVLKNAARVAEVWMDEWGEFYYAMNPGA-LNVPVGDVSERKALRERLKCKSFR 476
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
WYL +++P +P+D + G IR+ +T C++ K A P + C H Q
Sbjct: 477 WYLENIYPESQMPLDYYYLGEIRNAETSNCLDTLGGK-----AGQPLGMGYC-HGMGGNQ 530
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYD 603
+F +++ D CLD PR ++++ C G Q WTYD
Sbjct: 531 VFAYTKRKQIMSDDN--CLDAAH------PRGPIKLIRCHGMRGNQEWTYD 573
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 83/148 (56%), Positives = 108/148 (72%), Gaps = 1/148 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
+Q N+FNLL SD I +NR+L DVR KCK K + LP +S+VIVFHNEAW+ LLRT+
Sbjct: 143 FQENQFNLLASDMISLNRSLTDVRFEKCKAKRY-PTLLPTTSVVIVFHNEAWTTLLRTIW 201
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S I+RSPR +LKEI+LVDDAS +E L L+EY+ L V TR+ R+ R GLI+ARLLGA
Sbjct: 202 STINRSPRPLLKEIILVDDASEKEHLGKKLEEYIKTLPVSTRLFRTESRSGLIRARLLGA 261
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ +G+++ FLDAHCECT + E L +
Sbjct: 262 KHVKGDVITFLDAHCECTEGWLEPLLSR 289
>gi|345492127|ref|XP_001602037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Nasonia vitripennis]
Length = 635
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 210/415 (50%), Positives = 283/415 (68%), Gaps = 13/415 (3%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
+ LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDDAS RE LK L++YV
Sbjct: 181 KLLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRALLKEIILVDDASEREHLKQKLEDYVE 240
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
L VPT V R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L+AR+A D+
Sbjct: 241 TLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLARIAHDKKT 300
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P +TP MA
Sbjct: 301 VVCPIIDVISDDTFEYITASDMTWGGFNWKLNFRWYRVAQREMDRRNGDRTAPLRTPTMA 360
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF+ +GAYDE M +WGGENLEMSFRVWQCGG +EI+PCSHV H+FR SPY
Sbjct: 361 GGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRVWQCGGILEISPCSHVGHVFRDKSPY 420
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGGVS+++ N ARVA VWMDEW +FY+ NP A + V R++LR+QLKC SF
Sbjct: 421 TFPGGVSKIVLHNAARVAEVWMDEWRDFYYAMNPGA-RNVPVGDVSERVKLREQLKCKSF 479
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE-KPLAKGSMNQASGPASLLPCTHLPVL 553
+WYL +++P +P+D + G I++ + + + +M + +G + H
Sbjct: 480 RWYLENIYPESPMPLDYYYLGDIKNADPNNPEKVQNYCLDTMGRRTGENVGMSYCHGLGG 539
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDKE 605
Q+F +++ D +CLD SP+ V+I+ C G Q W Y++E
Sbjct: 540 NQIFAYTKRQQIMSDD--MCLDA------ASPQGPVKIVRCHGMGGNQAWIYNEE 586
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L DVR CK+K F + LP +SIVIVFHNEAWS LLR
Sbjct: 144 QELFKLNQFNLMASDMISLNRSLKDVRLSGCKSKKF-PKLLPDTSIVIVFHNEAWSTLLR 202
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS RE LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 203 TVWSVINRSPRALLKEIILVDDASEREHLKQKLEDYVETLPVPTYVYRTEKRSGLIRARL 262
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 263 LGAKHVKGQVITFLDAHCECTEGWLEPLLAR 293
>gi|170043866|ref|XP_001849590.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167867153|gb|EDS30536.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 600
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 212/422 (50%), Positives = 279/422 (66%), Gaps = 14/422 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C LP +SIVIVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R+
Sbjct: 132 HHDCKKKHYSAKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASERDH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L L++YV+ L V T V+R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L
Sbjct: 192 LGKQLEDYVSTLPVSTFVLRTGKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 251
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ DR VVCP+ID+ISD TF YV + + WG FNW+L+FRWY S + + D
Sbjct: 252 LARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPSREMQRRNHDR 311
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P +TP MAGGLF+IDR YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EIAPCSHV
Sbjct: 312 TAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHV 371
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FR SPY+FPGGV+ ++ N ARVA VW+DEW EFY++ +P A K V R
Sbjct: 372 GHVFRDKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFYYQMSPGARKA-SAGDVSERRA 430
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASL 544
LR++LKC SF+WYL +++P +P+D F G IR+ ++ C++ +M + S
Sbjct: 431 LREKLKCKSFRWYLENIYPESQMPLDYYFLGEIRNEESQNCLD------TMGRKSNEKIG 484
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
H Q+F +++ D CLD N + P V ++ C G Q W YD
Sbjct: 485 SSYCHGLGGNQVFAYTKRHQIMSDDN--CLDA---SNALGP-VNLVRCHGMGGNQEWVYD 538
Query: 604 KE 605
+E
Sbjct: 539 EE 540
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 84/151 (55%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR CK K ++ + LP +SIVIVFHNEAWS LLR
Sbjct: 105 KEKFKENQFNLLASDMIWLNRSLTDVRHHDCKKKHYSAK-LPTTSIVIVFHNEAWSTLLR 163
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R+ L L++YV+ L V T V+R+ R GLI+ARL
Sbjct: 164 TIWSVINRSPRPLLKEIILVDDASERDHLGKQLEDYVSTLPVSTFVLRTGKRSGLIRARL 223
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 224 LGAKHVKGQVITFLDAHCECTEGWLEPLLAR 254
>gi|350402574|ref|XP_003486532.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
isoform 2 [Bombus impatiens]
Length = 606
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 205/421 (48%), Positives = 287/421 (68%), Gaps = 18/421 (4%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK
Sbjct: 152 CKTKKYNKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L++YV L VPT V R+ R GLI+ARLLGA+ G+++ FLDAHCECT GWLE L++R
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+AEDRT VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P
Sbjct: 272 IAEDRTTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAP 331
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+ID+ YF+ +GAYDE M +WGGENLEMSFR+W CGG++EIA CSHV H+
Sbjct: 332 LRTPTMAGGLFSIDKDYFYELGAYDEGMDIWGGENLEMSFRIWMCGGTLEIATCSHVGHV 391
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK++PY+FPGG S+++ N AR+A VW+D+W FY+ NP A + V R++LR+
Sbjct: 392 FRKSTPYTFPGGTSKIVNHNNARLAEVWLDQWKYFYYNINPGA-RNVAVGDVSERIKLRE 450
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+LKC SF+WYL +++P +P+D + G +++V+T C++ +M + +G +
Sbjct: 451 RLKCKSFRWYLENIYPESPMPLDYYYLGDVQNVETQSCLD------TMGRRTGENVGISY 504
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRWTYDK 604
H Q+F +++ D +CLD SP+ V+I+ C G Q W Y++
Sbjct: 505 CHGLGGNQVFAYTKRQQIMSDD--MCLDAA------SPQGPVKIVRCHGMGGNQAWVYNE 556
Query: 605 E 605
E
Sbjct: 557 E 557
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK +N+ +LP +SIVIVFHNEAWS LLR
Sbjct: 122 QELFKLNQFNLMASDMISLNRSLKDIRLEGCKTKKYNK-YLPDTSIVIVFHNEAWSTLLR 180
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 181 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARL 240
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ G+++ FLDAHCECT + E L +
Sbjct: 241 LGAKHVTGQVITFLDAHCECTEGWLEPLLSR 271
>gi|195120313|ref|XP_002004673.1| GI20058 [Drosophila mojavensis]
gi|193909741|gb|EDW08608.1| GI20058 [Drosophila mojavensis]
Length = 668
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 219/479 (45%), Positives = 288/479 (60%), Gaps = 62/479 (12%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + +P +S++IVFHNEAWS LLRT+ SVI+RSPR +L+EI+LVDDAS R FLK
Sbjct: 139 ECREKRYTQNMPTTSVIIVFHNEAWSVLLRTITSVINRSPRHLLREIILVDDASDRSFLK 198
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L+ Y+ L VPTR+ R R GL+ ARL+GA+ A G++L FLDAHCEC+ GWLE L+A
Sbjct: 199 RQLEAYIEVLKVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLEPLLA 258
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYG--SSDAIIK-RKD 363
R+ E R V+CPVIDIISD F+Y ++FE HWGAFNW+L FRW++ + AI K KD
Sbjct: 259 RIKESRNVVICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKTRQAIAKENKD 318
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P TP MAGGLFAIDR YF+ +GAYD +M++WGGEN+EMSFR+WQCGG IEI+PCSH
Sbjct: 319 STAPIATPGMAGGLFAIDRKYFYEMGAYDRDMRIWGGENVEMSFRIWQCGGRIEISPCSH 378
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSR 482
V H+FR ++PY+FPGG+SEVL NLAR A VWMD+W F + +DK V R
Sbjct: 379 VGHIFRSSTPYTFPGGMSEVLTSNLARAATVWMDDWQYFVMLYTAGLSLSAKDKVNVTER 438
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT------ 522
L LR++L C F WYL ++WP HF P D+FFG+I H++
Sbjct: 439 LALREKLNCKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSNHMKNLPGRAL 498
Query: 523 --------------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
KC+ +PL + + P ++ C+ M
Sbjct: 499 SREWKRAFEEIESKAEEFMSLIDLERDKCL-RPLHDDAPRTSLQPVTVGDCSSHAQTMDM 557
Query: 557 FVMKLPTDLIATDESVCL----------DVPEYENDISPRVRILACSGFNRQRWTYDKE 605
FV+ P I T+++VCL + + N S V + C Q+WTYD +
Sbjct: 558 FVIT-PKGQIMTNDNVCLTYRPPKNGAFKLLKNRNATSSNVVLTQCISDPSQQWTYDMD 615
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 83/151 (54%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R +C+ K + + +P +S++IVFHNEAWS LLR
Sbjct: 110 QRFFKLNSFNLLASDRIPLNRTLKDYRTNECREKRYTQN-MPTTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +L+EI+LVDDAS R FLK L+ Y+ L VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLREIILVDDASDRSFLKRQLEAYIEVLKVPTRLYRMKERSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA+ A G++L FLDAHCEC+ + E L +
Sbjct: 229 MGAQHARGDVLTFLDAHCECSRGWLEPLLAR 259
>gi|195430254|ref|XP_002063171.1| GK21532 [Drosophila willistoni]
gi|194159256|gb|EDW74157.1| GK21532 [Drosophila willistoni]
Length = 667
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 215/468 (45%), Positives = 285/468 (60%), Gaps = 61/468 (13%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSP+ +LKEI+LVDDAS R +LK L+ Y+ L
Sbjct: 149 LPNTSVIIVFHNEAWSVLLRTITSVINRSPKHLLKEIILVDDASDRTYLKRQLESYIKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARL+GA+ A G++L FLDAHCEC+ GWLE L+AR+ E R V+
Sbjct: 209 AVPTRLYRMKERSGLVPARLMGAQHARGDVLTFLDAHCECSRGWLEPLLARIRESRQVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR--KDFTEPFKTPAMA 374
CPVIDIISD F+Y ++FE HWGAFNW+L FRW++ + + KD T P TP MA
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSERKRSTTESSVKDLTAPIATPGMA 328
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAIDR YF+ +G+YD EM+VWGGEN+EMSFR+WQCGG IEI+PCSHV H+FR ++PY
Sbjct: 329 GGLFAIDRKYFYEMGSYDSEMRVWGGENVEMSFRIWQCGGRIEISPCSHVGHVFRSSTPY 388
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLKCHS 493
+FPGG+SEVL NLAR A VWMD+W F + +DK V R+ LR++LKC
Sbjct: 389 TFPGGMSEVLTNNLARAASVWMDDWQYFVMLYTSGLTLGAKDKINVTERVALREKLKCKP 448
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT----------------- 522
F WYL H+WP HF P D+FFG+I RH++
Sbjct: 449 FSWYLEHIWPEHFFPAPDRFFGKIIWLDGETECAQAYSRHMKNLPGRALSREWSRAFDEI 508
Query: 523 ---------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIA 567
KC+ +PL + P ++ CT MFV+ P I
Sbjct: 509 DNKAAEFTSLIDLERDKCL-RPLRDDVPRSSLQPVTVGDCTSHAQTMDMFVIT-PKGQIM 566
Query: 568 TDESVCL----------DVPEYENDISPRVRILACSGFNRQRWTYDKE 605
T+++VCL + + N + + + C+ Q W+YD +
Sbjct: 567 TNDNVCLTYRPPKQGVIKLLKNRNATTSNIMLTQCASEPSQFWSYDMD 614
Score = 178 bits (451), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 83/151 (54%), Positives = 114/151 (75%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ + + + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDRKYTKN-LPNTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSP+ +LKEI+LVDDAS R +LK L+ Y+ L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPKHLLKEIILVDDASDRTYLKRQLESYIKVLAVPTRLYRMKERSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA+ A G++L FLDAHCEC+ + E L +
Sbjct: 229 MGAQHARGDVLTFLDAHCECSRGWLEPLLAR 259
>gi|194753512|ref|XP_001959056.1| GF12252 [Drosophila ananassae]
gi|190620354|gb|EDV35878.1| GF12252 [Drosophila ananassae]
Length = 667
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 218/471 (46%), Positives = 287/471 (60%), Gaps = 67/471 (14%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L
Sbjct: 149 LPHTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDRTYLKRQLESYVKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
SVPT++ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L+AR+ E R V+
Sbjct: 209 SVPTKIYRMKKRSGLVPARLLGAEHARGDVLTFLDAHCECSRGWLEPLLARIKESRKVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSD-----AIIKRKDFTEPFKTP 371
CPVIDIISD F+Y ++FE HWGAFNW+L FRW+ SSD + KD T P TP
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWF---SSDRKRQATVSGAKDSTAPIATP 325
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLF+IDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV H+FR +
Sbjct: 326 GMAGGLFSIDRKYFYEMGSYDANMRVWGGENVEMSFRIWQCGGRVEISPCSHVGHVFRSS 385
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLK 490
+PY+FPGG+SEVL NLAR A VWMD+W F + +DK V RL LR++L+
Sbjct: 386 TPYTFPGGMSEVLTDNLARAATVWMDDWQYFVMLYTSGLTLDAKDKVNVTERLALREKLQ 445
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT-------------- 522
C F WYL ++WP HF P D+FFG+I +H+++
Sbjct: 446 CKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKSLPGRALSREWKRAF 505
Query: 523 ------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTD 564
KC+ +PL + + P ++ CT MFV+ P
Sbjct: 506 EEIESQSEQLVALIDLERDKCL-RPLKQDVPRSSLSPVTVGDCTSHAQTMDMFVIT-PKG 563
Query: 565 LIATDESVCL----------DVPEYENDISPRVRILACSGFNRQRWTYDKE 605
I T+++VCL + + N + V + C+ + Q WTYD +
Sbjct: 564 QIMTNDNVCLTYRQPKLGVIKMLKNRNATTSNVMLSQCASDSSQLWTYDMD 614
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYTGS-LPHTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV LSVPT++ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLKEIILVDDASDRTYLKRQLESYVKVLSVPTKIYRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAEHARGDVLTFLDAHCECSRGWLEPLLAR 259
>gi|125806852|ref|XP_001360187.1| GA18187 [Drosophila pseudoobscura pseudoobscura]
gi|54635358|gb|EAL24761.1| GA18187 [Drosophila pseudoobscura pseudoobscura]
Length = 667
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 216/478 (45%), Positives = 287/478 (60%), Gaps = 61/478 (12%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK
Sbjct: 139 ECRDKKYDSGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASERSYLK 198
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L+ Y+ L+VPTR+ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L+A
Sbjct: 199 RQLESYIRVLTVPTRIYRMKKRSGLVPARLLGAEHARGDVLTFLDAHCECSRGWLEPLLA 258
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAI--IKRKDF 364
R+ E R V+CPVIDIISD F+Y ++FE HWG FNW+L FRW++ I KD
Sbjct: 259 RIKEARNVVICPVIDIISDDNFSYTKTFENHWGGFNWQLSFRWFSSERKRQTTEITAKDS 318
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P TP MAGGLFAIDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV
Sbjct: 319 TAPIATPGMAGGLFAIDRNYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHV 378
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRL 483
H+FR ++PY+FPGG+SEVL NLAR A VWMD+W F + +DK V R+
Sbjct: 379 GHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYFVMLYTSGLSLSVKDKVNVTERV 438
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT------- 522
LR++L+C SF WYL ++WP HF P D+FFG+I +H++
Sbjct: 439 ALREKLQCKSFAWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALS 498
Query: 523 -------------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMF 557
KC+ +PL + + + P ++ CT MF
Sbjct: 499 REWKRAFEEIESKSEEFMSLIDLERDKCL-RPLKEVAPRSSLQPVTVGDCTSHAQTMDMF 557
Query: 558 VMKLPTDLIATDESVCL----------DVPEYENDISPRVRILACSGFNRQRWTYDKE 605
V+ P I T++++CL + + N + V + C+G Q W YD +
Sbjct: 558 VIT-PKGQIMTNDNICLAYRPPKQGVIKLLKNRNATTSNVMLTQCAGDASQLWNYDMD 614
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K ++ LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYDSG-LPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ Y+ L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLKEIILVDDASERSYLKRQLESYIRVLTVPTRIYRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAEHARGDVLTFLDAHCECSRGWLEPLLAR 259
>gi|195149249|ref|XP_002015570.1| GL10955 [Drosophila persimilis]
gi|194109417|gb|EDW31460.1| GL10955 [Drosophila persimilis]
Length = 667
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 216/478 (45%), Positives = 287/478 (60%), Gaps = 61/478 (12%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK
Sbjct: 139 ECRDKKYDSGLPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASERSYLK 198
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L+ Y+ L+VPTR+ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L+A
Sbjct: 199 RQLESYIRVLTVPTRIYRMKKRSGLVPARLLGAEHARGDVLTFLDAHCECSRGWLEPLLA 258
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAI--IKRKDF 364
R+ E R V+CPVIDIISD F+Y ++FE HWG FNW+L FRW++ I KD
Sbjct: 259 RIKEARNVVICPVIDIISDDNFSYTKTFENHWGGFNWQLSFRWFSSERKRQTTEITAKDS 318
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P TP MAGGLFAIDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV
Sbjct: 319 TAPIATPGMAGGLFAIDRNYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHV 378
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRL 483
H+FR ++PY+FPGG+SEVL NLAR A VWMD+W F + +DK V R+
Sbjct: 379 GHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDWQYFVMLYTSGLSLSVKDKVNVTERV 438
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT------- 522
LR++L+C SF WYL ++WP HF P D+FFG+I +H++
Sbjct: 439 ALREKLQCKSFAWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALS 498
Query: 523 -------------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMF 557
KC+ +PL + + + P ++ CT MF
Sbjct: 499 REWQRAFEEIESKSEEFMSLIDLERDKCL-RPLKEVAPRSSLQPVTVGDCTSHAQTMDMF 557
Query: 558 VMKLPTDLIATDESVCL----------DVPEYENDISPRVRILACSGFNRQRWTYDKE 605
V+ P I T++++CL + + N + V + C+G Q W YD +
Sbjct: 558 VIT-PKGQIMTNDNICLAYRPPKQGVIKLLKNRNATTSNVMLTQCAGDASQLWNYDMD 614
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K ++ LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYDSG-LPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ Y+ L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLKEIILVDDASERSYLKRQLESYIRVLTVPTRIYRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAEHARGDVLTFLDAHCECSRGWLEPLLAR 259
>gi|321456141|gb|EFX67256.1| hypothetical protein DAPPUDRAFT_218737 [Daphnia pulex]
Length = 639
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 205/411 (49%), Positives = 277/411 (67%), Gaps = 14/411 (3%)
Query: 196 FLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK 255
LP +SIVIVFHNEAWS LLRTV S+I+RSPR +L EI+LVDDAS R++L L+++VA
Sbjct: 188 LLPTTSIVIVFHNEAWSTLLRTVWSIITRSPRELLAEIILVDDASERDYLGKELEDHVAN 247
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
VP V+R+ R GLI+ARL+GA+Q +G+++ FLDAHCECT GWLE L+ARVAE+R V
Sbjct: 248 FPVPVHVLRTHKRSGLIRARLIGAKQVKGQVITFLDAHCECTEGWLEPLLARVAENRKIV 307
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCP+ID+ISD +F YV + ++ WG FNW+L+FRWY + + D T+P +TP MAG
Sbjct: 308 VCPIIDVISDESFEYVTASDMTWGGFNWKLNFRWYRVPQREMDRRNGDRTQPLRTPTMAG 367
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+ID+ YF IG YDE M +WGGENLEMSFRVWQCGG +EI PCSHV H+FR SPYS
Sbjct: 368 GLFSIDKDYFEEIGTYDEGMDIWGGENLEMSFRVWQCGGELEIIPCSHVGHVFRDKSPYS 427
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
FPGGV++++ N ARVA VWMD W +F+++ NP A + + V SR LRK+L+C SF+
Sbjct: 428 FPGGVAKIVNKNAARVAEVWMDRWKDFFYEMNPGA-RSVEVGDVSSRRSLRKKLQCKSFR 486
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
WYL +V+P +P+D F G IR+ +T C++ +M + G + H Q
Sbjct: 487 WYLENVYPESQMPLDYFFLGEIRNAETQTCLD------TMGRKGGENVGISYCHGLGGNQ 540
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
+F +++ D CLD + V+++ C G Q W Y+ +
Sbjct: 541 VFAYTKRQQIMSDDN--CLDATGTDG----IVKLIRCHGMGGNQAWLYEAQ 585
Score = 171 bits (434), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 94/195 (48%), Positives = 131/195 (67%), Gaps = 9/195 (4%)
Query: 10 LKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNL-SDAQQL----YQINRFNLLVSDR 64
LK A+NQ + + + VV G + G +L +D + L +++N+FNLL SD
Sbjct: 109 LKYAKNQ---LKKWSLAPVVPEQAGQPGEMGKPVHLPADQESLMREKFRLNQFNLLASDS 165
Query: 65 IPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 124
I +NR+LPDVR C+ K + LP +SIVIVFHNEAWS LLRTV S+I+RSPR +L E
Sbjct: 166 ISLNRSLPDVRLEGCRDKSY-PGLLPTTSIVIVFHNEAWSTLLRTVWSIITRSPRELLAE 224
Query: 125 ILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 184
I+LVDDAS R++L L+++VA VP V+R+ R GLI+ARL+GA+Q +G+++ FLDA
Sbjct: 225 IILVDDASERDYLGKELEDHVANFPVPVHVLRTHKRSGLIRARLIGAKQVKGQVITFLDA 284
Query: 185 HCECTLVFNEEFLPK 199
HCECT + E L +
Sbjct: 285 HCECTEGWLEPLLAR 299
>gi|116007284|ref|NP_001036338.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
gi|113194958|gb|ABI31292.1| polypeptide GalNAc transferase 5, isoform B [Drosophila
melanogaster]
Length = 630
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/409 (50%), Positives = 272/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL
Sbjct: 186 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKL 245
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 246 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 305
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 306 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 365
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFRVW CGG +EIAPCS V H+FRK++PY+F
Sbjct: 366 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRVWMCGGVLEIAPCSRVGHVFRKSTPYTF 425
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +E++ N AR+ VW+D+W EFY+ F P A K V R LR +LKC SF+W
Sbjct: 426 PGGTTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKA-SAGDVSDRKALRDRLKCKSFRW 484
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 485 YLENVYPESLMPLDYYYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQV 538
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 539 FAYTKRQQIMSDD--LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 582
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 147 KEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASK-LPTTSIVIVFHNEAWTTLLR 205
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL V T V+R+ R GLI+ARL
Sbjct: 206 TVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARL 265
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 266 LGAEHVSGEVITFLDAHCECTEGWLEPLLAR 296
>gi|19921720|ref|NP_610256.1| polypeptide GalNAc transferase 3 [Drosophila melanogaster]
gi|51316121|sp|Q9Y117.1|GALT3_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
Short=pp-GaNTase 3; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|5052600|gb|AAD38630.1|AF145655_1 BcDNA.GH09147 [Drosophila melanogaster]
gi|7304264|gb|AAF59298.1| polypeptide GalNAc transferase 3 [Drosophila melanogaster]
Length = 667
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 217/471 (46%), Positives = 286/471 (60%), Gaps = 67/471 (14%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L
Sbjct: 149 LPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L++R+ E R V+
Sbjct: 209 AVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRIKESRKVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSD-----AIIKRKDFTEPFKTP 371
CPVIDIISD F+Y ++FE HWGAFNW+L FRW+ SSD A KD T+P TP
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWF---SSDRKRQTAGNSSKDSTDPIATP 325
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLFAIDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV H+FR +
Sbjct: 326 GMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHVGHVFRSS 385
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLK 490
+PY+FPGG+SEVL NLAR A VWMD+W F + +DK V R+ LR++L+
Sbjct: 386 TPYTFPGGMSEVLTDNLARAATVWMDDWQYFIMLYTSGLTLGAKDKVNVTERVALRERLQ 445
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT-------------- 522
C F WYL ++WP HF P D+FFG+I +H++
Sbjct: 446 CKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALSREWKRAF 505
Query: 523 ------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTD 564
KC+ +PL + + ++ CT MFV+ P
Sbjct: 506 EEIDSKAEELMALIDLERDKCL-RPLKEDVPRSSLSAVTVGDCTSHAQSMDMFVIT-PKG 563
Query: 565 LIATDESVCLDVPEYE----------NDISPRVRILACSGFNRQRWTYDKE 605
I T+++VCL + + N + V + C+ + Q WTYD +
Sbjct: 564 QIMTNDNVCLTYRQQKLGVIKMLKNRNATTSNVMLAQCASDSSQLWTYDMD 614
Score = 181 bits (459), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKY-ASGLPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAENARGDVLTFLDAHCECSRGWLEPLLSR 259
>gi|24581865|ref|NP_608906.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|195342664|ref|XP_002037920.1| GM18035 [Drosophila sechellia]
gi|51315874|sp|Q6WV17.2|GALT5_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|22945641|gb|AAF52218.2| polypeptide GalNAc transferase 5, isoform A [Drosophila
melanogaster]
gi|194132770|gb|EDW54338.1| GM18035 [Drosophila sechellia]
Length = 630
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/409 (50%), Positives = 272/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL
Sbjct: 186 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKL 245
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 246 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 305
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 306 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 365
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 366 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 425
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R LR +LKC SF+W
Sbjct: 426 PGGVAKIVLHNAARVAEVWLDEWRDFYYSMSTGARKA-SAGDVSDRKALRDRLKCKSFRW 484
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 485 YLENVYPESLMPLDYYYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQV 538
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 539 FAYTKRQQIMSDD--LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 582
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 147 KEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASK-LPTTSIVIVFHNEAWTTLLR 205
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL V T V+R+ R GLI+ARL
Sbjct: 206 TVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARL 265
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 266 LGAEHVSGEVITFLDAHCECTEGWLEPLLAR 296
>gi|194863912|ref|XP_001970676.1| GG10775 [Drosophila erecta]
gi|190662543|gb|EDV59735.1| GG10775 [Drosophila erecta]
Length = 667
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 219/471 (46%), Positives = 285/471 (60%), Gaps = 67/471 (14%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L
Sbjct: 149 LPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L++R+ E R V+
Sbjct: 209 AVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRIKESRKVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSD-----AIIKRKDFTEPFKTP 371
CPVIDIISD F+Y ++FE HWGAFNW+L FRW+ SSD A KD T P TP
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWF---SSDRKRQTADNSSKDSTAPIATP 325
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLFAIDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV H+FR +
Sbjct: 326 GMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHVGHVFRSS 385
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLK 490
+PY+FPGG+SEVL NLAR A VWMD+W F + +DK V RL LR++L+
Sbjct: 386 TPYTFPGGMSEVLTENLARAATVWMDDWQYFVMLYTSGLTLGAKDKVNVTERLALRERLQ 445
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT-------------- 522
C F WYL ++WP HF P D+FFG+I +H++
Sbjct: 446 CKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALSREWKRAF 505
Query: 523 ------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTD 564
KC+ +PL + + ++ CT MFV+ P
Sbjct: 506 EEIDSKAEELLALIDLERDKCL-RPLKEDVPRSSLSAVTVGDCTSHAQSMDMFVIT-PKG 563
Query: 565 LIATDESVCLDVPEYE----------NDISPRVRILACSGFNRQRWTYDKE 605
I T+++VCL + + N + V + CS + Q WTYD +
Sbjct: 564 QIMTNDNVCLTFRQQKLGVIKMLKNRNATTSNVMLAQCSPDSSQLWTYDMD 614
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYASG-LPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAENARGDVLTFLDAHCECSRGWLEPLLSR 259
>gi|34042969|gb|AAQ56702.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 617
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/409 (50%), Positives = 272/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL
Sbjct: 173 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKL 232
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 233 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 292
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 293 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 352
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 353 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 412
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R LR +LKC SF+W
Sbjct: 413 PGGVAKIVLHNAARVAEVWLDEWRDFYYSMSTGARKA-SAGDVSDRKALRDRLKCKSFRW 471
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 472 YLENVYPESLMPLDYYYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQV 525
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 526 FAYTKRQQIMSDD--LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 569
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 134 KEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASK-LPTTSIVIVFHNEAWTTLLR 192
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL V T V+R+ R GLI+ARL
Sbjct: 193 TVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARL 252
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 253 LGAEHVSGEVITFLDAHCECTEGWLEPLLAR 283
>gi|16648224|gb|AAL25377.1| GH23657p [Drosophila melanogaster]
Length = 536
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 207/409 (50%), Positives = 272/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL
Sbjct: 92 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKL 151
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 152 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 212 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 272 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 331
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R LR +LKC SF+W
Sbjct: 332 PGGVAKIVLHNAARVAEVWLDEWRDFYYSMSTGARKA-SAGDVSDRKALRDRLKCKSFRW 390
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 391 YLENVYPESLMPLDYYYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQV 444
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 445 FAYTKRQQIMSDD--LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 488
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 53 KEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASK-LPTTSIVIVFHNEAWTTLLR 111
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL V T V+R+ R GLI+ARL
Sbjct: 112 TVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARL 171
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 172 LGAEHVSGEVITFLDAHCECTEGWLEPLLAR 202
>gi|194856530|ref|XP_001968770.1| GG24317 [Drosophila erecta]
gi|190660637|gb|EDV57829.1| GG24317 [Drosophila erecta]
Length = 630
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 206/409 (50%), Positives = 272/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL
Sbjct: 186 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKL 245
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 246 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 305
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 306 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 365
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ +G+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 366 LFSIDKDYFYELGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 425
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R LR +LKC SF+W
Sbjct: 426 PGGVAKIVLHNAARVAEVWLDEWRDFYYSMSTGARKA-SAGDVSDRKALRDRLKCKSFRW 484
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 485 YLENVYPESLMPLDYYYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQV 538
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 539 FAYTKRQQIMSDD--LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 582
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 147 KEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASK-LPTTSIVIVFHNEAWTTLLR 205
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL V T V+R+ R GLI+ARL
Sbjct: 206 TVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKLPVKTFVLRTEKRSGLIRARL 265
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 266 LGAEHVSGEVITFLDAHCECTEGWLEPLLAR 296
>gi|195474291|ref|XP_002089425.1| GE24246 [Drosophila yakuba]
gi|194175526|gb|EDW89137.1| GE24246 [Drosophila yakuba]
Length = 667
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 217/471 (46%), Positives = 285/471 (60%), Gaps = 67/471 (14%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L
Sbjct: 149 LPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L++R+ E R V+
Sbjct: 209 AVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRIKESRKVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSD-----AIIKRKDFTEPFKTP 371
CPVIDIISD F+Y ++FE HWGAFNW+L FRW+ SSD A KD T P TP
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWF---SSDRKRRTADNSSKDSTAPIATP 325
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLFAIDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV H+FR +
Sbjct: 326 GMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHVGHVFRSS 385
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLK 490
+PY+FPGG+SEVL NLAR A VWMD+W F + +DK V R+ LR++L+
Sbjct: 386 TPYTFPGGMSEVLTDNLARAATVWMDDWQYFIMLYTSGLTLGAKDKVNVTERVALRERLQ 445
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT-------------- 522
C F WYL ++WP HF P D+FFG+I +H++
Sbjct: 446 CKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALSREWKRAF 505
Query: 523 ------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTD 564
KC+ +PL + + ++ CT MFV+ P
Sbjct: 506 EEIDSKAEELMALIDLERDKCL-RPLKEDVPRSSLSAVTVGDCTSHAQSMDMFVIT-PKG 563
Query: 565 LIATDESVCLDVPEYE----------NDISPRVRILACSGFNRQRWTYDKE 605
I T+++VCL + + N + V + C+ + Q WTYD +
Sbjct: 564 QIMTNDNVCLTYRQQKLGVIKMLKNRNATTSNVMLAQCAPDSSQLWTYDMD 614
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYASG-LPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAENARGDVLTFLDAHCECSRGWLEPLLSR 259
>gi|195581118|ref|XP_002080381.1| GD10277 [Drosophila simulans]
gi|194192390|gb|EDX05966.1| GD10277 [Drosophila simulans]
Length = 667
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 213/468 (45%), Positives = 282/468 (60%), Gaps = 61/468 (13%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L
Sbjct: 149 LPSTSVIIVFHNEAWSVLLRTITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L++R+ E R V+
Sbjct: 209 AVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRIKESRKVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAII--KRKDFTEPFKTPAMA 374
CPVIDIISD F+Y ++FE HWGAFNW+L FRW++ KD T P TP MA
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQATGNSSKDSTAPIATPGMA 328
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAIDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV H+FR ++PY
Sbjct: 329 GGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHVGHVFRSSTPY 388
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLKCHS 493
+FPGG+SEVL NLAR A VWMD+W F + +DK V R+ LR++L+C
Sbjct: 389 TFPGGMSEVLTDNLARAATVWMDDWQYFIMLYTSGLTLGAKDKVNVTERVALRERLQCKP 448
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT----------------- 522
F WYL ++WP HF P D+FFG+I +H++
Sbjct: 449 FSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALSREWKRAFEEI 508
Query: 523 ---------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIA 567
KC+ +PL + + ++ CT MFV+ P I
Sbjct: 509 DSKAEELMALIDLERDKCL-RPLKEDVPRSSLSAVTVGDCTSHAQSMDMFVIT-PKGQIM 566
Query: 568 TDESVCLDVPEYE----------NDISPRVRILACSGFNRQRWTYDKE 605
T+++VCL + + N + V + C+ + Q WTYD +
Sbjct: 567 TNDNVCLTYRQQKLGVIKMLKNHNATTSNVMLAQCAPDSSQLWTYDMD 614
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYAGG-LPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L+VPTR+ R R GL+ ARL
Sbjct: 169 TITSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAENARGDVLTFLDAHCECSRGWLEPLLSR 259
>gi|125985507|ref|XP_001356517.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
gi|54644841|gb|EAL33581.1| GA16368 [Drosophila pseudoobscura pseudoobscura]
Length = 630
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/409 (50%), Positives = 273/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L++YVAKL
Sbjct: 186 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEDYVAKL 245
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA G+++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 246 PVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARIVQNRRTVV 305
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 306 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMSRRNNDRTAPLRTPTMAGG 365
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 366 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 425
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R +LR +LKC SF+W
Sbjct: 426 PGGVAKIVLHNAARVAEVWLDEWRDFYYAMSTGARKA-SAGDVSDRKDLRDRLKCKSFRW 484
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 485 YLENVYPESLMPLDYYYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQV 538
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 539 FAYTKRQQIMSDD--LCLDAASSNGPVN-MVRCHNMGG--NQEWVYDAE 582
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 147 KEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASK-LPTTSIVIVFHNEAWTTLLR 205
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS R+FL L++YVAKL V T V+R+ R GLI+ARL
Sbjct: 206 TVWSVINRSPRALLKEIILVDDASERDFLGKQLEDYVAKLPVRTFVLRTEKRSGLIRARL 265
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA G+++ FLDAHCECT + E L +
Sbjct: 266 LGAEHVSGDVITFLDAHCECTEGWLEPLLAR 296
>gi|195147490|ref|XP_002014712.1| GL18803 [Drosophila persimilis]
gi|194106665|gb|EDW28708.1| GL18803 [Drosophila persimilis]
Length = 630
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/409 (50%), Positives = 273/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L++YVAKL
Sbjct: 186 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEDYVAKL 245
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA G+++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 246 PVRTFVLRTEKRSGLIRARLLGAEHVSGDVITFLDAHCECTEGWLEPLLARIVQNRRTVV 305
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 306 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMSRRNNDRTAPLRTPTMAGG 365
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 366 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 425
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R +LR +LKC SF+W
Sbjct: 426 PGGVAKIVLHNAARVAEVWLDEWRDFYYAMSTGARKA-SAGDVSDRKDLRDRLKCKSFRW 484
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 485 YLENVYPESLMPLDYYYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQV 538
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 539 FAYTKRQQIMSDD--LCLDAASSNGPVN-MVRCHNMGG--NQEWVYDAE 582
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 147 KEKFKENQFNLLASDMISLNRSLTDVRHEGCRRKHYASK-LPTTSIVIVFHNEAWTTLLR 205
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDDAS R+FL L++YVAKL V T V+R+ R GLI+ARL
Sbjct: 206 TVWSVINRSPRALLKEIILVDDASERDFLGKQLEDYVAKLPVRTFVLRTEKRSGLIRARL 265
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA G+++ FLDAHCECT + E L +
Sbjct: 266 LGAEHVSGDVITFLDAHCECTEGWLEPLLAR 296
>gi|195332013|ref|XP_002032693.1| GM20824 [Drosophila sechellia]
gi|194124663|gb|EDW46706.1| GM20824 [Drosophila sechellia]
Length = 667
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 217/471 (46%), Positives = 285/471 (60%), Gaps = 67/471 (14%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L
Sbjct: 149 LPSTSVIIVFHNEAWSVLLRTLTSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVL 208
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VPTR+ R R GL+ ARLLGA A G++L FLDAHCEC+ GWLE L++R+ E R V+
Sbjct: 209 AVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRINESRKVVI 268
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSD-----AIIKRKDFTEPFKTP 371
CPVIDIISD F+Y ++FE HWGAFNW+L FRW+ SSD A KD T P TP
Sbjct: 269 CPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWF---SSDRKRQTAGNSSKDSTAPIATP 325
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLFAIDR YF+ +G+YD M+VWGGEN+EMSFR+WQCGG +EI+PCSHV H+FR +
Sbjct: 326 GMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISPCSHVGHVFRSS 385
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVRSRLELRKQLK 490
+PY+FPGG+SEVL NLAR A VWMD+W F + +DK V R+ LR++L+
Sbjct: 386 TPYTFPGGMSEVLTDNLARAATVWMDDWQYFIMLYTSGLTLGAKDKVNVTERVALRERLQ 445
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFFGRI--------------RHVQT-------------- 522
C F WYL ++WP HF P D+FFG+I +H++
Sbjct: 446 CKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLPGRALSREWKRAF 505
Query: 523 ------------------HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTD 564
KC+ +PL + + ++ CT MFV+ P
Sbjct: 506 EEIDSKAEELMALIDLERDKCL-RPLKEEVPRSSLSAVTVGDCTSHTQSMDMFVIT-PKG 563
Query: 565 LIATDESVCLDVPEYE----------NDISPRVRILACSGFNRQRWTYDKE 605
I T+++VCL + + N + V + C+ + Q WTYD +
Sbjct: 564 QIMTNDNVCLTYRQQKLGVIKMLKNRNATTSNVMLAQCAPDSSQLWTYDMD 614
Score = 180 bits (456), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 112/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ +++N FNLL SDRIP+NRTL D R P+C+ K + LP +S++IVFHNEAWS LLR
Sbjct: 110 QRFFRLNSFNLLASDRIPLNRTLKDYRTPECRDKKY-AGGLPSTSVIIVFHNEAWSVLLR 168
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+ SVI+RSPR +LKEI+LVDDAS R +LK L+ YV L+VPTR+ R R GL+ ARL
Sbjct: 169 TLTSVINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVLAVPTRIFRMKKRSGLVPARL 228
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A G++L FLDAHCEC+ + E L +
Sbjct: 229 LGAENARGDVLTFLDAHCECSRGWLEPLLSR 259
>gi|195114266|ref|XP_002001688.1| GI16986 [Drosophila mojavensis]
gi|193912263|gb|EDW11130.1| GI16986 [Drosophila mojavensis]
Length = 633
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 207/409 (50%), Positives = 273/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPRS+LKEI+LVDDAS R+FL L++YVAKL
Sbjct: 189 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLEDYVAKL 248
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 249 PVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 308
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY + + D T P +TP MAGG
Sbjct: 309 CPIIDVISDDTFEYITASDSTWGGFNWKLNFRWYRVPQREMARRNNDRTAPLRTPTMAGG 368
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 369 LFSIDKEYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 428
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R LR++L+C SF+W
Sbjct: 429 PGGVAKIVLHNAARVAEVWLDEWRDFYYAMSTGARKA-SAGDVSDRKALRERLQCKSFRW 487
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ K N+ G + H Q+
Sbjct: 488 YLENVYPESLMPLDYYYLGEIRNAETETCLDTMGRK--YNEKVGSSY----CHGLGGNQV 541
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 542 FAYTKRQQIMSDD--LCLDAASSNGPVN-MVRCHNMGG--NQEWVYDAE 585
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 150 KEKFKENQFNLLASDMISLNRSLTDVRHENCRHKHYPSK-LPTTSIVIVFHNEAWTTLLR 208
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPRS+LKEI+LVDDAS R+FL L++YVAKL V T V+R+ R GLI+ARL
Sbjct: 209 TVWSVINRSPRSLLKEIILVDDASERDFLGKQLEDYVAKLPVRTFVLRTEKRSGLIRARL 268
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 269 LGAEHVTGEVITFLDAHCECTEGWLEPLLAR 299
>gi|195035019|ref|XP_001989024.1| GH11491 [Drosophila grimshawi]
gi|193905024|gb|EDW03891.1| GH11491 [Drosophila grimshawi]
Length = 621
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 205/409 (50%), Positives = 271/409 (66%), Gaps = 12/409 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPRS+LKEI+LVDDAS R+FL L++YVAKL
Sbjct: 177 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLEDYVAKL 236
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 237 PVRTFVLRTEKRSGLIRARLLGAEHVAGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 296
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY + + D T P +TP MAGG
Sbjct: 297 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPQREMARRNNDRTAPLRTPTMAGG 356
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 357 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 416
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R LR +L+C SF+W
Sbjct: 417 PGGVAKIVLHNAARVAEVWLDEWRDFYYAMSTGARKA-SAGDVSDRKSLRDRLQCKSFRW 475
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ +M + + H Q+
Sbjct: 476 YLENVYPESLMPLDYYYLGEIRNSETETCLD------TMGRKYNEKVGISYCHGLGGNQV 529
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
F +++ D +CLD ++ VR G Q W YD E
Sbjct: 530 FAYTKRQQIMSDD--LCLDAASSNGPVN-MVRCHNMGG--NQEWVYDAE 573
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 138 KEKFKENQFNLLASDMISLNRSLTDVRHENCRHKHYASK-LPTTSIVIVFHNEAWTTLLR 196
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPRS+LKEI+LVDDAS R+FL L++YVAKL V T V+R+ R GLI+ARL
Sbjct: 197 TVWSVINRSPRSLLKEIILVDDASERDFLGKQLEDYVAKLPVRTFVLRTEKRSGLIRARL 256
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 257 LGAEHVAGEVITFLDAHCECTEGWLEPLLAR 287
>gi|195386582|ref|XP_002051983.1| GJ24116 [Drosophila virilis]
gi|194148440|gb|EDW64138.1| GJ24116 [Drosophila virilis]
Length = 632
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 207/410 (50%), Positives = 272/410 (66%), Gaps = 14/410 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPRS+LKEI+LVDDAS R+FL L++YVAKL
Sbjct: 188 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDFLGKQLEDYVAKL 247
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 248 PVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 307
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY + + D T P +TP MAGG
Sbjct: 308 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPQREMARRNNDRTAPLRTPTMAGG 367
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 368 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 427
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV++++ N ARVA VW+DEW +FY+ + A K V R LR +L+C SF+W
Sbjct: 428 PGGVAKIVLHNAARVAEVWLDEWRDFYYAMSTGARKA-SAGDVSDRKALRDRLQCKSFRW 486
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P+D + G IR+ +T C++ K N+ G + H Q+
Sbjct: 487 YLENVYPESLMPLDYYYLGEIRNAETETCLDTMGRK--YNEKVGSSY----CHGLGGNQV 540
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
F +++ D +CLD S V ++ C Q W YD E
Sbjct: 541 FAYTKRQQIMSDD--LCLDAASS----SGPVNMVRCHNMGGNQEWVYDAE 584
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 113/151 (74%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N+FNLL SD I +NR+L DVR C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 149 KEKFKENQFNLLASDMISLNRSLTDVRHENCRHKHYPSK-LPTTSIVIVFHNEAWTTLLR 207
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPRS+LKEI+LVDDAS R+FL L++YVAKL V T V+R+ R GLI+ARL
Sbjct: 208 TVWSVINRSPRSLLKEIILVDDASERDFLGKQLEDYVAKLPVRTFVLRTEKRSGLIRARL 267
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA GE++ FLDAHCECT + E L +
Sbjct: 268 LGAEHVTGEVITFLDAHCECTEGWLEPLLAR 298
>gi|326670471|ref|XP_002663357.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Danio rerio]
Length = 556
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 212/438 (48%), Positives = 275/438 (62%), Gaps = 38/438 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAWS LLRTVHS I+RSPR +L EILLVDDAS R+FLK L++YVA L
Sbjct: 114 LPNTSIVIVFHNEAWSTLLRTVHSAINRSPRQLLYEILLVDDASERDFLKEKLEDYVATL 173
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP R++R R GLI+ARL GA G+++ FLDAHCECT GWLE L+AR+ EDR VV
Sbjct: 174 EVPVRILRMEQRTGLIRARLRGAAATRGQVITFLDAHCECTTGWLEPLMARIKEDRRAVV 233
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 234 CPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 293
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PYSF
Sbjct: 294 LFSIDRTYFEEIGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSF 353
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +V+ N R+A VWMDE+ +F++ +P + D V SR LR+ LKC F W
Sbjct: 354 PGGTGQVINKNNRRLAEVWMDEFKDFFYIISPGVVRV-DYGDVSSRKALRESLKCKPFSW 412
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAK-----GSMN--------------- 536
YL +V+P +P G IR+V+T++CV+ K G N
Sbjct: 413 YLENVYPDSQIPRRYYSLGEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTAD 472
Query: 537 -------------QASGPASLLPCTHLPVLTQMFVMKLPT-DLIATDESVCLDVPEYEND 582
+ +GP +L C H+ QMF L+ + + CLD+P E+
Sbjct: 473 KEIRTDDLCLDASRLNGPVVMLKCHHMKG-NQMFEYDAERLTLLHVNSNQCLDMPSEEDK 531
Query: 583 ISPRVRILACSGFNRQRW 600
+ P ++ CSG Q+W
Sbjct: 532 MVPTLK--DCSGRRSQQW 547
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTK + ++ LP +SIVIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDMIALNRSLPDVRLDGCKTKTYPDD-LPNTSIVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHS I+RSPR +L EILLVDDAS R+FLK L++YVA L VP R++R R GLI+ARL
Sbjct: 134 TVHSAINRSPRQLLYEILLVDDASERDFLKEKLEDYVATLEVPVRILRMEQRTGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA G+++ FLDAHCECT + E + +
Sbjct: 194 RGAAATRGQVITFLDAHCECTTGWLEPLMAR 224
>gi|405975554|gb|EKC40113.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Crassostrea gigas]
Length = 624
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 203/410 (49%), Positives = 269/410 (65%), Gaps = 14/410 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRTVHS+I+RSPR +L EILLVDDAS RE L L++Y+A+L
Sbjct: 174 LPDTSVVIVFHNEAWSTLLRTVHSIINRSPRELLNEILLVDDASEREELGKKLEDYIARL 233
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V TRVIRS R GLI+ARL GA+QA G+++ FLDAHCECT GWLE L+ + +DRT VV
Sbjct: 234 PVSTRVIRSEERTGLIRARLKGAKQARGKVITFLDAHCECTEGWLEPLLYEIHKDRTAVV 293
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+I D +F Y+ ++ WG FNW+L+FRWY + + D + P KTP MAGG
Sbjct: 294 CPIIDVIGDDSFEYITGSDMTWGGFNWKLNFRWYPVPQRELDRRGGDRSNPTKTPTMAGG 353
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF+ +G+YDE M +WGGENLEMSFRVW CGG + I CS V H+FRK SPYS+
Sbjct: 354 LFSIDRDYFYEVGSYDEGMDIWGGENLEMSFRVWMCGGKVYIVTCSRVGHVFRKTSPYSW 413
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV+ ++ N R+ VWMDE+ +F++K NP + V R LR++L C SFKW
Sbjct: 414 PGGVARIINHNTQRIVEVWMDEYKDFFYKINP-GVRSTSYGDVSERKALREKLHCKSFKW 472
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P++ G IR+ T +C++ SM + SG + H Q+
Sbjct: 473 YLQNVYPESQMPVEYHALGEIRNKATGQCID------SMGRKSGEKVGMVQCHGMGGNQI 526
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
F L D VCLDV + + V++ C G Q+W YD++
Sbjct: 527 FSYTKKQALQTDD--VCLDV----SSLHGPVKLFQCHGLGGNQKWEYDRQ 570
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 89/153 (58%), Positives = 116/153 (75%), Gaps = 1/153 (0%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFN-EEFLPKSSIVIVFHNEAWS 104
+++++ ++IN+FNL+ SD I +NR+LPD R CK K + LP +S+VIVFHNEAWS
Sbjct: 130 AESKEKFKINQFNLVASDMISLNRSLPDYRMDACKRKSYPPNSDLPDTSVVIVFHNEAWS 189
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTVHS+I+RSPR +L EILLVDDAS RE L L++Y+A+L V TRVIRS R GLI
Sbjct: 190 TLLRTVHSIINRSPRELLNEILLVDDASEREELGKKLEDYIARLPVSTRVIRSEERTGLI 249
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ARL GA+QA G+++ FLDAHCECT + E L
Sbjct: 250 RARLKGAKQARGKVITFLDAHCECTEGWLEPLL 282
>gi|443683126|gb|ELT87494.1| hypothetical protein CAPTEDRAFT_198873 [Capitella teleta]
Length = 495
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 200/412 (48%), Positives = 276/412 (66%), Gaps = 14/412 (3%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S+VIVFHNEAWS LLRTVHSVI+RSP +LKEI+LVDDAS ++FL LDEY++
Sbjct: 54 ESLPTTSVVIVFHNEAWSTLLRTVHSVINRSPPPLLKEIILVDDASEKDFLGRQLDEYLS 113
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
KLSV V+R R GLI+ARL GA +AEG+++ FLDAHCECT GWLE L+ + ++R
Sbjct: 114 KLSVHVYVLRMEKRTGLIRARLKGAARAEGKVITFLDAHCECTEGWLEPLLFEIHKNRKS 173
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF Y+ ++ WG FNW+L+FRWY + + D + P ++P MA
Sbjct: 174 VVCPIIDVISDETFEYITGSDMTWGGFNWKLNFRWYPVPQREVERRGGDRSLPLRSPTMA 233
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGL AI+R YF+ IG+YD+ M +WGGENLEMSFR+W CGG++ I CSHV H+FRKA+PY
Sbjct: 234 GGLLAIERDYFYEIGSYDDGMDIWGGENLEMSFRIWMCGGTLLIVTCSHVGHVFRKATPY 293
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG ++ N AR+A VWMDEW FY+K NP KQ D + R++LR++L+C SF
Sbjct: 294 TFPGGTGRIINHNNARLAEVWMDEWRSFYYKINP-GVKQTDYGDLSPRIQLREKLECKSF 352
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +P+D G IR+ +T++C++ SM + +G + H
Sbjct: 353 RWYLQNIYPESQMPLDYYSLGEIRNKETNQCLD------SMGRKAGEKVGIVGCHGMGGN 406
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
Q+F D +CLDV + ++ V++ C G Q W +D+E
Sbjct: 407 QIFSYSKKKAFQTDD--LCLDV----SALTGPVKLYQCHGLGGNQLWEHDQE 452
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 89/155 (57%), Positives = 117/155 (75%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
S ++ ++ +++N+FNL+ S+ I VNRTL DVR CK+K + E LP +S+VIVFHNEA
Sbjct: 10 SQHAEMKEKFKVNQFNLMASELISVNRTLRDVRMDSCKSKTYPVESLPTTSVVIVFHNEA 69
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRTVHSVI+RSP +LKEI+LVDDAS ++FL LDEY++KLSV V+R R G
Sbjct: 70 WSTLLRTVHSVINRSPPPLLKEIILVDDASEKDFLGRQLDEYLSKLSVHVYVLRMEKRTG 129
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI+ARL GA +AEG+++ FLDAHCECT + E L
Sbjct: 130 LIRARLKGAARAEGKVITFLDAHCECTEGWLEPLL 164
>gi|391343213|ref|XP_003745907.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Metaseiulus occidentalis]
Length = 583
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 203/418 (48%), Positives = 272/418 (65%), Gaps = 14/418 (3%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + LP +SIVIVFHNEAW+ LLRTVHS+I SPR ++ EI+LVDDAS + L
Sbjct: 125 ECRKKTYPDRLPTTSIVIVFHNEAWTTLLRTVHSIIQMSPRELIAEIILVDDASEFDHLG 184
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L++YVAKL VP V+R+ R GLI+ARL+GA G+++ FLDAHCECT GWLE L+A
Sbjct: 185 QKLEDYVAKLPVPVHVLRTGKRSGLIRARLIGAETVTGQVITFLDAHCECTEGWLEPLLA 244
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+AED TRVVCPVID+ISD FAYV + + WG FNW+L+FRWY + + D T
Sbjct: 245 RIAEDNTRVVCPVIDVISDENFAYVPASDQTWGGFNWKLNFRWYRVPQRENDRRGGDRTL 304
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P +TP MAGGLFA+D+AYF +G YDE M +WGGENLEMSFR+W CGG++EI CSHV H
Sbjct: 305 PVRTPTMAGGLFAMDKAYFEKLGKYDEGMDIWGGENLEMSFRIWMCGGTLEIVTCSHVGH 364
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK++PY+FPGG +++ N AR+A VW+DEW +FYF NP A K+ D+ R +LR
Sbjct: 365 VFRKSTPYTFPGGTGKIVNHNNARLADVWLDEWKDFYFAINPVA-KKVDRGDTSGRHKLR 423
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLP 546
+ L+C SF+WYL +++P +P+D G I++ + C++ + + SG +
Sbjct: 424 QDLQCKSFRWYLENIYPESHMPLDYYHLGEIKNADGNLCLD------TYGKKSGDVLYMG 477
Query: 547 CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
H Q+F ++A D CLD + S V++ C Q WTYD
Sbjct: 478 KCHGLGGNQVFAYTKRQQIMADDS--CLDA----SSPSGPVKLFRCHNMGGNQMWTYD 529
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 82/151 (54%), Positives = 111/151 (73%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+ ++IN+FNLL S+RI +NR+LPDVR +C+ K + + LP +SIVIVFHNEAW+ LLR
Sbjct: 96 NEKFKINQFNLLASERIALNRSLPDVRLAECRKKTYPDR-LPTTSIVIVFHNEAWTTLLR 154
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHS+I SPR ++ EI+LVDDAS + L L++YVAKL VP V+R+ R GLI+ARL
Sbjct: 155 TVHSIIQMSPRELIAEIILVDDASEFDHLGQKLEDYVAKLPVPVHVLRTGKRSGLIRARL 214
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA G+++ FLDAHCECT + E L +
Sbjct: 215 IGAETVTGQVITFLDAHCECTEGWLEPLLAR 245
>gi|432932493|ref|XP_004081766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Oryzias latipes]
Length = 557
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 208/438 (47%), Positives = 277/438 (63%), Gaps = 38/438 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAWS LLRTVHSVISRSPR +L EI+LVDDAS R+FLK L+ YV L
Sbjct: 115 LPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTL 174
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP +++R R GLI+ARL GA G+++ FLDAHCECT GWLE L+AR+ EDRT VV
Sbjct: 175 EVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVV 234
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 235 CPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 294
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF IG+YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PYSF
Sbjct: 295 LFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSF 354
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +V+ N R+A VWMDE+ +F++ +P + D V SR LR+ LKC F W
Sbjct: 355 PGGTGQVINKNNRRLAEVWMDEFKDFFYIISPGVMRV-DYGDVSSRKALREALKCKPFAW 413
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++CV+ K +
Sbjct: 414 YLENIYPDSQIPRRYYSLGEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTAD 473
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLP-TDLIATDESVCLDVPEYEND 582
+++ +GP +L C H+ QMF L+ + + CLD+P E+
Sbjct: 474 KEIRTDDLCLDVSRLNGPVVMLKCHHMKG-NQMFEYDAERLTLLHVNSNQCLDMPSEEDK 532
Query: 583 ISPRVRILACSGFNRQRW 600
+ P +R C+G Q+W
Sbjct: 533 MVPTLR--DCNGSRSQQW 548
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ ++ LP +SIVIVFHNEAWS LLR
Sbjct: 76 KELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADD-LPTTSIVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVISRSPR +L EI+LVDDAS R+FLK L+ YV L VP +++R R GLI+ARL
Sbjct: 135 TVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLEVPVKILRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA G+++ FLDAHCECT + E L +
Sbjct: 195 RGAAATTGQVITFLDAHCECTEGWLEPLLAR 225
>gi|432932495|ref|XP_004081767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Oryzias latipes]
Length = 556
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 208/438 (47%), Positives = 277/438 (63%), Gaps = 38/438 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAWS LLRTVHSVISRSPR +L EI+LVDDAS R+FLK L+ YV L
Sbjct: 114 LPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTL 173
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP +++R R GLI+ARL GA G+++ FLDAHCECT GWLE L+AR+ EDRT VV
Sbjct: 174 EVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVV 233
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 234 CPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 293
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF IG+YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PYSF
Sbjct: 294 LFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSF 353
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +V+ N R+A VWMDE+ +F++ +P + D V SR LR+ LKC F W
Sbjct: 354 PGGTGQVINKNNRRLAEVWMDEFKDFFYIISPGVMRV-DYGDVSSRKALREALKCKPFAW 412
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++CV+ K +
Sbjct: 413 YLENIYPDSQIPRRYYSLGEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTAD 472
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLP-TDLIATDESVCLDVPEYEND 582
+++ +GP +L C H+ QMF L+ + + CLD+P E+
Sbjct: 473 KEIRTDDLCLDVSRLNGPVVMLKCHHMKG-NQMFEYDAERLTLLHVNSNQCLDMPSEEDK 531
Query: 583 ISPRVRILACSGFNRQRW 600
+ P +R C+G Q+W
Sbjct: 532 MVPTLR--DCNGSRSQQW 547
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ ++ LP +SIVIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADD-LPTTSIVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVISRSPR +L EI+LVDDAS R+FLK L+ YV L VP +++R R GLI+ARL
Sbjct: 134 TVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLEVPVKILRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA G+++ FLDAHCECT + E L +
Sbjct: 194 RGAAATTGQVITFLDAHCECTEGWLEPLLAR 224
>gi|432932497|ref|XP_004081768.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Oryzias latipes]
Length = 558
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 208/438 (47%), Positives = 277/438 (63%), Gaps = 38/438 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAWS LLRTVHSVISRSPR +L EI+LVDDAS R+FLK L+ YV L
Sbjct: 116 LPTTSIVIVFHNEAWSTLLRTVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTL 175
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP +++R R GLI+ARL GA G+++ FLDAHCECT GWLE L+AR+ EDRT VV
Sbjct: 176 EVPVKILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTEGWLEPLLARIKEDRTAVV 235
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 236 CPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 295
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF IG+YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PYSF
Sbjct: 296 LFSIDKMYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSF 355
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +V+ N R+A VWMDE+ +F++ +P + D V SR LR+ LKC F W
Sbjct: 356 PGGTGQVINKNNRRLAEVWMDEFKDFFYIISPGVMRV-DYGDVSSRKALREALKCKPFAW 414
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++CV+ K +
Sbjct: 415 YLENIYPDSQIPRRYYSLGEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTAD 474
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLP-TDLIATDESVCLDVPEYEND 582
+++ +GP +L C H+ QMF L+ + + CLD+P E+
Sbjct: 475 KEIRTDDLCLDVSRLNGPVVMLKCHHMKG-NQMFEYDAERLTLLHVNSNQCLDMPSEEDK 533
Query: 583 ISPRVRILACSGFNRQRW 600
+ P +R C+G Q+W
Sbjct: 534 MVPTLR--DCNGSRSQQW 549
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ ++ LP +SIVIVFHNEAWS LLR
Sbjct: 77 KELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYADD-LPTTSIVIVFHNEAWSTLLR 135
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVISRSPR +L EI+LVDDAS R+FLK L+ YV L VP +++R R GLI+ARL
Sbjct: 136 TVHSVISRSPRHLLVEIVLVDDASERDFLKKKLEGYVRTLEVPVKILRMEQRSGLIRARL 195
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA G+++ FLDAHCECT + E L +
Sbjct: 196 RGAAATTGQVITFLDAHCECTEGWLEPLLAR 226
>gi|357620060|gb|EHJ72385.1| hypothetical protein KGM_13871 [Danaus plexippus]
Length = 600
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 202/383 (52%), Positives = 259/383 (67%), Gaps = 7/383 (1%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP +S++IVFHNEAWS L+RTV SVI RSP +LKEI+LVDDAS R++L LD+
Sbjct: 155 YDIENLPTASVIIVFHNEAWSTLMRTVMSVILRSPDMLLKEIILVDDASERKYLGKELDD 214
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
VA L ++RS R GL+ ARL+GA+ A G +LVFLDAHCE T GWLE L+ R D
Sbjct: 215 AVANLD-KVVILRSLNRTGLVGARLMGAKTATGNVLVFLDAHCEVTKGWLEPLLDRAGSD 273
Query: 312 RTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
+CP ID++SD T AY +S + HWGAF+W LHFRW + + K + ++PF TP
Sbjct: 274 DV-FICPHIDLLSDDTLAYTKSIDAHWGAFSWRLHFRWLMPSNEIMMNKSRYPSKPFPTP 332
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
AMAGGLFA+ ++ F+ +G YDEEM +WGGENLE+S+R WQCG +EI CS V H+FR+
Sbjct: 333 AMAGGLFAVRKSLFWRLGGYDEEMSIWGGENLELSWRAWQCGARVEITHCSRVGHIFRRH 392
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKC 491
SPY +PGGV +VL NLAR A VWMDEWA+F+FKFNP RD Q V +R+ELRK LKC
Sbjct: 393 SPYKYPGGVFKVLNTNLARAATVWMDEWADFFFKFNPSVAAIRDTQNVANRIELRKNLKC 452
Query: 492 HSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP 551
SFKWYL +VWP +F P D+++FGRIR+ C+ G GPAS + C
Sbjct: 453 KSFKWYLENVWPKNFFPSDERWFGRIRN-DKEGCIG---VVGGTPGLGGPASGVHCGSDL 508
Query: 552 VLTQMFVMKLPTDLIATDESVCL 574
L ++ V P I DE +CL
Sbjct: 509 DLDRLLVYT-PDGNIMADEGLCL 530
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 132/222 (59%), Gaps = 27/222 (12%)
Query: 11 KLARNQQDYIDRRGVHVV-------------------VGHY--LGDSVDGG-----LHSN 44
L+ +Q Y+DR G+ V+ HY + + +GG L
Sbjct: 56 NLSYDQDSYMDRSGMRVIVGHYVGGHGGGNLSEDVINTNHYSPVQGAGEGGRPVQLLPKE 115
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
+ A++LY ++ +N+ VSDRI +NR LPD+R C+ ++ E LP +S++IVFHNEAWS
Sbjct: 116 IIPARELYSLHSYNIFVSDRISINRHLPDMRSESCRNVKYDIENLPTASVIIVFHNEAWS 175
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
L+RTV SVI RSP +LKEI+LVDDAS R++L LD+ VA L ++RS R GL+
Sbjct: 176 TLMRTVMSVILRSPDMLLKEIILVDDASERKYLGKELDDAVANLD-KVVILRSLNRTGLV 234
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVF 206
ARL+GA+ A G +LVFLDAHCE T + E L ++ VF
Sbjct: 235 GARLMGAKTATGNVLVFLDAHCEVTKGWLEPLLDRAGSDDVF 276
>gi|312377724|gb|EFR24483.1| hypothetical protein AND_10876 [Anopheles darlingi]
Length = 594
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 214/467 (45%), Positives = 279/467 (59%), Gaps = 59/467 (12%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C LP +SIVIVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS RE
Sbjct: 137 HHDCKKKHYPAKLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREH 196
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L L++YV L V T V+R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L
Sbjct: 197 LGRQLEDYVKTLPVSTIVLRTVKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPL 256
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ DR VVCP+ID+ISD TF YV + + WG FNW+L+FRWY + + + D
Sbjct: 257 LARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMQRRNHDR 316
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P +TP MAGGLF+IDR YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EIAPCSHV
Sbjct: 317 TAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHV 376
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWM---------------------------- 456
H+FR SPY+FPGGV+ ++ N ARVA VWM
Sbjct: 377 GHVFRDKSPYTFPGGVANIVLKNAARVAEVWMCGGTLEIAPCSRVGHVFRKSTPYSFPGG 436
Query: 457 -----------------DEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLT 499
D W+EFY+ NP A K V R ELR++LKC SF+WYL
Sbjct: 437 TSQIVNKNNARLAEVWLDGWSEFYYNINPGARKA-SAGDVSERRELRERLKCKSFRWYLE 495
Query: 500 HVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVM 559
+++P +P+D F G IR+V++ C++ +M + + H Q+F
Sbjct: 496 NIYPESQMPLDYYFLGEIRNVESQNCLD------TMGRKANEKIGSSYCHGLGGNQVFAY 549
Query: 560 KLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
+++ D CLD N + P V ++ C G Q W YD+E
Sbjct: 550 TKRHQIMSDDN--CLDA---SNALGP-VNLVRCHGMAGNQEWIYDEE 590
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 92/180 (51%), Positives = 119/180 (66%), Gaps = 6/180 (3%)
Query: 25 VHVVVGHYLGDSVDGGLHSNLSDAQQ-----LYQINRFNLLVSDRIPVNRTLPDVRKPKC 79
V VG G + G + +QQ ++ N+FNLL SD I +NR+L DVR C
Sbjct: 81 VAPTVGENYGRPGEMGKPVKIPSSQQELMKEKFKENQFNLLASDMIWLNRSLTDVRHHDC 140
Query: 80 KTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 139
K K + + LP +SIVIVFHNEAWS LLRT+ SVI+RSPR +LKEI+LVDDAS RE L
Sbjct: 141 KKKHYPAK-LPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREHLGR 199
Query: 140 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L++YV L V T V+R+ R GLI+ARLLGA+ +G+++ FLDAHCECT + E L +
Sbjct: 200 QLEDYVKTLPVSTIVLRTVKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLAR 259
>gi|348519902|ref|XP_003447468.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Oreochromis niloticus]
Length = 556
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 205/446 (45%), Positives = 284/446 (63%), Gaps = 39/446 (8%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V++++ LP +SIVIVFHNEAWS LLRTVHSVI+RSP+ +L EI+LVDDAS R+FLK
Sbjct: 107 TKVYSDD-LPNTSIVIVFHNEAWSTLLRTVHSVINRSPKHLLVEIILVDDASERDFLKKK 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP R++R R GLI+ARL GA G+++ FLDAHCECT+GWLE L+AR+
Sbjct: 166 LENYVRTLEVPVRILRMEQRSGLIRARLRGAAATTGQVITFLDAHCECTVGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDRT VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+ID+ YF IG+YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PYSFPGG +V+ N R+A VWMD++ +F++ +P + + V SR LR+
Sbjct: 346 RKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYIISPGVMRV-EYGDVSSRKALREA 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCMDNMGRKENEKVGFFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHLPVLTQMFVMKLP-TDLIATDESVCL 574
+++ +GP +L C H+ QMF L+ + + CL
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVVMLKCHHMKG-NQMFEYDAERLTLLHVNSNQCL 523
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D+P ++ + P +R C+G Q+W
Sbjct: 524 DMPSEDDKMVPTLR--DCNGSRSQQW 547
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV++++ LP +SIVIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYSDD-LPNTSIVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSP+ +L EI+LVDDAS R+FLK L+ YV L VP R++R R GLI+ARL
Sbjct: 134 TVHSVINRSPKHLLVEIILVDDASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA G+++ FLDAHCECT+ + E L +
Sbjct: 194 RGAAATTGQVITFLDAHCECTVGWLEPLLAR 224
>gi|195472767|ref|XP_002088670.1| GE18697 [Drosophila yakuba]
gi|194174771|gb|EDW88382.1| GE18697 [Drosophila yakuba]
Length = 675
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/454 (46%), Positives = 273/454 (60%), Gaps = 57/454 (12%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL
Sbjct: 186 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKL 245
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 246 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 305
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 306 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 365
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 366 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 425
Query: 437 PGGVSEVLYGNLARVALVWM---------------------------------------- 456
PGGV++++ N ARVA VWM
Sbjct: 426 PGGVAKIVLHNAARVAEVWMCGGVLEIAPCSRVGHVFRKSTPYTFPGGTTEIVNHNNARL 485
Query: 457 -----DEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD 511
D+W EFY+ F P A K V R LR +LKC SF+WYL +V+P +P+D
Sbjct: 486 VEVWLDDWKEFYYSFYPGARKA-SAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDY 544
Query: 512 KFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDES 571
+ G IR+ +T C++ +M + + H Q+F +++ D
Sbjct: 545 YYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDD-- 596
Query: 572 VCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
+CLD ++ VR G Q W YD E
Sbjct: 597 LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 627
>gi|410897068|ref|XP_003962021.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Takifugu rubripes]
Length = 556
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 204/438 (46%), Positives = 277/438 (63%), Gaps = 38/438 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
+P +SIVIVFHNEAWS LLRTVHSVI+RSPR +L EI+LVDDAS R+FLK L+ YV L
Sbjct: 114 VPNTSIVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASERDFLKKKLENYVRTL 173
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP R++R R GLI+ARL GA +G+++ FLDAHCECT+GWLE L+AR+ EDRT VV
Sbjct: 174 EVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPLLARIKEDRTAVV 233
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 234 CPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 293
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF IG+YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PYSF
Sbjct: 294 LFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSF 353
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +V+ N R+A VWMD++ +F++ +P + D V SR LR L+C F W
Sbjct: 354 PGGTGQVINKNNRRLAEVWMDDFKDFFYIISPGVMRV-DYGDVSSRKGLRDALRCKPFSW 412
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++CV+ K +
Sbjct: 413 YLENIYPDSQIPRRYYSLGEIRNVETNQCVDNMGRKENEKVGFFNCHGMGGNQVFSYTAD 472
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLP-TDLIATDESVCLDVPEYEND 582
+++ +GP +L C H+ QMF L+ + + CLD+P E+
Sbjct: 473 KEIRTDDLCLDVSRLNGPVLMLKCHHMKG-NQMFEYDAERLTLLHVNSNQCLDMPSEEDK 531
Query: 583 ISPRVRILACSGFNRQRW 600
+ P +R C+ Q+W
Sbjct: 532 LVPTLR--ECNNSRSQQW 547
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ ++ +P +SIVIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYPDD-VPNTSIVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR +L EI+LVDDAS R+FLK L+ YV L VP R++R R GLI+ARL
Sbjct: 134 TVHSVINRSPRHLLVEIVLVDDASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA +G+++ FLDAHCECT+ + E L +
Sbjct: 194 RGAAATKGQVITFLDAHCECTVGWLEPLLAR 224
>gi|241133788|ref|XP_002404588.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215493637|gb|EEC03278.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 459
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 210/418 (50%), Positives = 274/418 (65%), Gaps = 18/418 (4%)
Query: 190 LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
L +N E P +SI+IVFHNEAWS LLRTVHS I+RSPR +L+EILLVDDAS R +
Sbjct: 6 LKYNTEGYPDTSIIIVFHNEAWSTLLRTVHSAINRSPRHLLREILLVDDASERSKFTAP- 64
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGA--RQAEGEILVFLDAHCECTLGWLENLVAR 307
A +P + + G+ + A Q + VF+ + + NL R
Sbjct: 65 ----ASYFLPQTLHIFSIQGGIFRRIFASAVFHQNRSFLSVFVSGREDSLF--IANL--R 116
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ T VVCPVIDII+D TFAYVRSFE+HWGAFNWELHFRW+ G + + + T P
Sbjct: 117 ITRQATVVVCPVIDIINDETFAYVRSFEMHWGAFNWELHFRWFPVGEREHKRRSGNATAP 176
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
F+TP MAGGLF+IDR YF+ +GAYD++M +WGGEN+E+SFR+WQCGGS+E+ PCSHV HL
Sbjct: 177 FRTPVMAGGLFSIDRGYFYEMGAYDDQMDIWGGENMEISFRIWQCGGSVEVVPCSHVGHL 236
Query: 428 FRKASPYSF--PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
FR+ SPY+F PGGV VL+ NLARVA VWMDEWA FYF N EK+ Q V +R +L
Sbjct: 237 FRRTSPYTFPNPGGVGSVLFSNLARVAAVWMDEWAAFYFNMN-RGEKRHMLQDVTARKKL 295
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
R++L+C SFKWYL ++WP +FLP D+ FFG++R+ ++ KC +P +K + +Q G L
Sbjct: 296 REKLQCKSFKWYLKNIWPENFLPNDNIFFGKVRNKKSGKCFVRPSSK-NYHQPVGRVVLE 354
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYD 603
C Q FV I TDES+CLD PE + D + V ++AC+ RQ+W YD
Sbjct: 355 ECALTYYAMQHFVFT-EEGFIKTDESICLDSPESKADTN--VVMIACNDLQRQKWRYD 409
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 39/56 (69%), Positives = 45/56 (80%)
Query: 79 CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 134
CK +N E P +SI+IVFHNEAWS LLRTVHS I+RSPR +L+EILLVDDAS R
Sbjct: 3 CKKLKYNTEGYPDTSIIIVFHNEAWSTLLRTVHSAINRSPRHLLREILLVDDASER 58
>gi|195433228|ref|XP_002064617.1| GK23729 [Drosophila willistoni]
gi|194160702|gb|EDW75603.1| GK23729 [Drosophila willistoni]
Length = 677
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 212/466 (45%), Positives = 276/466 (59%), Gaps = 57/466 (12%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C LP +SIVIVFHNEAW+ LLRTV SVI+RSPRS+LKEI+LVDDAS R+F
Sbjct: 176 HENCRRKHYASKLPTTSIVIVFHNEAWTTLLRTVWSVINRSPRSLLKEIILVDDASERDF 235
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L L++YVAKL V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L
Sbjct: 236 LGKKLEDYVAKLPVRTFVLRTEKRSGLIRARLLGAEHVTGEVITFLDAHCECTEGWLEPL 295
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ ++R VVCP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D
Sbjct: 296 LARIVQNRRTVVCPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDR 355
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P +TP MAGGLF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV
Sbjct: 356 TAPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHV 415
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWM---------------------------- 456
H+FR SPY+FPGGV++++ N ARVA VWM
Sbjct: 416 GHVFRDKSPYTFPGGVAKIVLHNAARVAEVWMCGGILEIAPCSRVGHVFRKSTPYTFPGG 475
Query: 457 -----------------DEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLT 499
D+W EFY+ F P A K V R LR++LKC SF+WYL
Sbjct: 476 TTEIVNHNNARLVEVWLDDWKEFYYSFYPGARKA-SAGDVSDRKNLRERLKCKSFRWYLE 534
Query: 500 HVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVM 559
+V+P +P+D + G IR+ +T C++ +M + + H Q+F
Sbjct: 535 NVYPESLMPLDYYYLGEIRNSETETCLD------TMGRKYNEKVGISYCHGLGGNQVFAY 588
Query: 560 KLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
+++ D +CLD ++ VR G Q W YD E
Sbjct: 589 TKRQQIMSDD--LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 629
>gi|194761562|ref|XP_001962998.1| GF15722 [Drosophila ananassae]
gi|190616695|gb|EDV32219.1| GF15722 [Drosophila ananassae]
Length = 675
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 209/454 (46%), Positives = 274/454 (60%), Gaps = 57/454 (12%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L++YVAKL
Sbjct: 186 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEDYVAKL 245
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 246 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 305
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 306 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 365
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 366 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 425
Query: 437 PGGVSEVLYGNLARVALVWM---------------------------------------- 456
PGGV++++ N ARVA VWM
Sbjct: 426 PGGVAKIVLHNAARVAEVWMCGGVLEIAPCSRVGHVFRKSTPYTFPGGTTEIVNHNNARL 485
Query: 457 -----DEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD 511
D+W EFY+ F P A K V R LR++LKC SF+WYL +V+P +P+D
Sbjct: 486 VEVWLDDWKEFYYSFYPGARKA-SAGDVSDRKALRERLKCKSFRWYLENVYPESLMPLDY 544
Query: 512 KFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDES 571
+ G IR+ +T C++ +M + + H Q+F +++ D
Sbjct: 545 YYLGEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDD-- 596
Query: 572 VCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
+CLD ++ VR G Q W YD E
Sbjct: 597 LCLDASSSNGPVN-MVRCHNMGG--NQEWVYDAE 627
>gi|224045872|ref|XP_002187347.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Taeniopygia guttata]
Length = 559
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 277/446 (62%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKADRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR+
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRR 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L+ + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLLHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCTGSRSQQW 548
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYADN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|195550891|ref|XP_002076130.1| GD11982 [Drosophila simulans]
gi|194201779|gb|EDX15355.1| GD11982 [Drosophila simulans]
Length = 541
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 208/451 (46%), Positives = 273/451 (60%), Gaps = 54/451 (11%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAW+ LLRTV SVI+RSPR++LKEI+LVDDAS R+FL L+EYVAKL
Sbjct: 55 LPTTSIVIVFHNEAWTTLLRTVWSVINRSPRALLKEIILVDDASERDFLGKQLEEYVAKL 114
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V T V+R+ R GLI+ARLLGA GE++ FLDAHCECT GWLE L+AR+ ++R VV
Sbjct: 115 PVKTFVLRTEKRSGLIRARLLGAEHVSGEVITFLDAHCECTEGWLEPLLARIVQNRRTVV 174
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ + + WG FNW+L+FRWY S + + D T P +TP MAGG
Sbjct: 175 CPIIDVISDETFEYITASDSTWGGFNWKLNFRWYRVPSREMARRNNDRTAPLRTPTMAGG 234
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF+ IG+YDE M +WGGENLEMSFR+WQCGG +EI PCSHV H+FR SPY+F
Sbjct: 235 LFSIDKDYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIIPCSHVGHVFRDKSPYTF 294
Query: 437 PGGVSEVLYGNLARVAL------------------------------------------V 454
PGGV++++ N ARV + V
Sbjct: 295 PGGVAKIVLHNAARVWMCGGVLEIAPCSRVGHVFRKSTPYTFPGGTTEIVNHNNARLVEV 354
Query: 455 WMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFF 514
W+D+W EFY+ F P A K V R LR +LKC SF+WYL +V+P +P+D +
Sbjct: 355 WLDDWKEFYYSFYPGARKA-SAGDVSDRKALRDRLKCKSFRWYLENVYPESLMPLDYYYL 413
Query: 515 GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
G IR+ +T C++ +M + + H Q+F +++ D +CL
Sbjct: 414 GEIRNAETETCLD------TMGRKYNEKVGISYCHGLGGNQVFAYTKRQQIMSDD--LCL 465
Query: 575 DVPEYENDISPRVRILACSGFNRQRWTYDKE 605
D ++ VR G Q W YD E
Sbjct: 466 DASSSNGPVN-MVRCHNMGG--NQEWVYDAE 493
>gi|170046940|ref|XP_001851002.1| polypeptide N-acetylgalactosaminyltransferase 3 [Culex
quinquefasciatus]
gi|167869510|gb|EDS32893.1| polypeptide N-acetylgalactosaminyltransferase 3 [Culex
quinquefasciatus]
Length = 628
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 218/470 (46%), Positives = 273/470 (58%), Gaps = 97/470 (20%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAWS LLRTV SVI RSP+ ++KEILLVDDAS R FLK+ L+ YV KL
Sbjct: 142 LPTTSIIIVFHNEAWSVLLRTVWSVIIRSPKHLIKEILLVDDASDRRFLKNDLENYVQKL 201
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ ++R R GL+ ARL+GAR A G+ L FLDAHCEC+ GWLE L+AR+ E+ +VV
Sbjct: 202 PLVVSILRLNKREGLVAARLMGARVATGDTLTFLDAHCECSPGWLEPLLARIKENPKKVV 261
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVIDIISD F+Y++SFE HWGAFNW++HFRWYT + +RKD T PF TPAMAGG
Sbjct: 262 CPVIDIISDDNFSYIKSFEFHWGAFNWQMHFRWYTLSDEELAERRKDTTLPFHTPAMAGG 321
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF IDR YFF +G+YDE +++WGG+NLEMSFR+WQCGG IEIAPCSHV HLFRK+SPY+F
Sbjct: 322 LFTIDRKYFFDVGSYDERLKIWGGDNLEMSFRIWQCGGEIEIAPCSHVGHLFRKSSPYTF 381
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGVS GN V SR+ L+KQL C SF W
Sbjct: 382 PGGVS----GN-------------------------------VSSRVALKKQLNCKSFDW 406
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHV------------------------------------ 520
YL VWP +F P +KFFGRI+ +
Sbjct: 407 YLRKVWPQNFFPAANKFFGRIQPIDLSTFDHQEYITLMKKINLIVKNLNPELKWKFLIKY 466
Query: 521 ----------------QTHKCVEKPLAKGSMNQASGPASLLPCTHL-PVLTQMFVMKLPT 563
+ C++KP + Q G + L C+ L L + FV+
Sbjct: 467 LTENVKKIGDSMKAAKHSSYCLQKPKTNSLITQPFGQSFLRKCSLLINQLDEQFVID-DY 525
Query: 564 DLIATDESVCLD-----VPEYENDI--SPRVRILACSGFNR-QRWTYDKE 605
I TDE VCLD E E I + + +++ C R QRW Y+ +
Sbjct: 526 GRIMTDEGVCLDSYRKTTVEGEALIEGAKKTKMVTCGSNKRAQRWVYETD 575
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 105/213 (49%), Positives = 132/213 (61%), Gaps = 29/213 (13%)
Query: 5 SVVQLLKLARN--QQDYIDRRGVHVVVGHYLGDSVDGGLHSN------------------ 44
SV + + RN + DR + +VGHY+G G + N
Sbjct: 31 SVEKFINYQRNVRSSKFQDRPRIPQIVGHYVGVGSVGNVSKNFLNTNNFAPVPGVGENGD 90
Query: 45 --------LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
L QQL+QINR+NLL SDRI +NRTLPDVRKPKC +K + + LP +SI+I
Sbjct: 91 PVVVQAKDLLQMQQLFQINRYNLLASDRIALNRTLPDVRKPKCVSKEYPTK-LPTTSIII 149
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
VFHNEAWS LLRTV SVI RSP+ ++KEILLVDDAS R FLK+ L+ YV KL + ++R
Sbjct: 150 VFHNEAWSVLLRTVWSVIIRSPKHLIKEILLVDDASDRRFLKNDLENYVQKLPLVVSILR 209
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
R GL+ ARL+GAR A G+ L FLDAHCEC+
Sbjct: 210 LNKREGLVAARLMGARVATGDTLTFLDAHCECS 242
>gi|339242863|ref|XP_003377357.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
gi|316973849|gb|EFV57398.1| polypeptide N-acetylgalactosaminyltransferase 5 [Trichinella
spiralis]
Length = 383
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 185/324 (57%), Positives = 237/324 (73%), Gaps = 1/324 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV SVI+RSP+ +LKEI+LVDD S R FLK +LD +V L
Sbjct: 58 LPTTSVIIVFHNEAWSTLLRTVFSVINRSPKKLLKEIILVDDCSQRAFLKKALDNFVLNL 117
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP ++RS R+GLI+AR+LGA +A G++L FLD+HCECT GWLE L+ R+A DR V
Sbjct: 118 PVPVLIVRSKERIGLIQARILGAEKASGDVLTFLDSHCECTEGWLEPLLDRIAFDRKIAV 177
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVID+I+D TF Y + +++ G FNW L FRWY+ S+ + D T P +TP +AGG
Sbjct: 178 APVIDVINDETFQYQKGIDVYRGGFNWNLQFRWYSSPPSELKRRGNDVTHPVRTPTIAGG 237
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR +FF IGAYD+EM++WGGENLEMSFR+WQCGG +EI PCSHV H+FRK SP+ F
Sbjct: 238 LFSIDRQFFFEIGAYDKEMKIWGGENLEMSFRIWQCGGQLEIIPCSHVGHVFRKKSPHDF 297
Query: 437 P-GGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + L NL RVA VWMDEW ++ + A+ + V R ELRK+LKC SF
Sbjct: 298 PRGNSARTLTTNLVRVAEVWMDEWKSLFYIISSAAKNISEIIDVSERKELRKRLKCKSFA 357
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRH 519
WYL +VWP HF+P D+ FFGR+ +
Sbjct: 358 WYLDNVWPDHFMPQDNAFFGRVSN 381
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 91/158 (57%), Positives = 121/158 (76%), Gaps = 4/158 (2%)
Query: 33 LGDSVDGGLHSNLSD-AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPK 91
LG SV+ L+ N S A+ L+QIN+FN++ SDRIP+NRTL D R+ C+ K ++ LP
Sbjct: 4 LGRSVN--LNDNDSKLAKHLFQINQFNIVASDRIPLNRTLIDARRAACRNKTYSSA-LPT 60
Query: 92 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP 151
+S++IVFHNEAWS LLRTV SVI+RSP+ +LKEI+LVDD S R FLK +LD +V L VP
Sbjct: 61 TSVIIVFHNEAWSTLLRTVFSVINRSPKKLLKEIILVDDCSQRAFLKKALDNFVLNLPVP 120
Query: 152 TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
++RS R+GLI+AR+LGA +A G++L FLD+HCECT
Sbjct: 121 VLIVRSKERIGLIQARILGAEKASGDVLTFLDSHCECT 158
>gi|57530428|ref|NP_001006381.1| polypeptide N-acetylgalactosaminyltransferase 1 [Gallus gallus]
gi|326917238|ref|XP_003204908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Meleagris gallopavo]
gi|53133506|emb|CAG32082.1| hypothetical protein RCJMB04_17f16 [Gallus gallus]
Length = 559
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 277/446 (62%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYADNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKADRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR+
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRR 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L+ + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLLHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYADN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|13878612|sp|Q29121.1|GALT1_PIG RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1339955|dbj|BAA12800.1| N-acetylgalactosaminyl transferase [Sus sp.]
Length = 559
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKTFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCRPFSWYLENIYPDSQIPRHYSSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R CSG Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCSGSRSQQW 548
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|304259|gb|AAA68489.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase, partial
[Bos taurus]
Length = 519
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 66 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 125
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 126 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 185
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 186 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 245
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 246 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 305
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 306 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 364
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 365 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 424
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 425 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 484
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R CSG Q+W
Sbjct: 485 DKATDEDSQVPSIR--DCSGSRSQQW 508
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 36 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 94
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 95 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 154
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 155 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 185
>gi|431896245|gb|ELK05661.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Pteropus alecto]
Length = 559
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/448 (45%), Positives = 280/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR +L+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDIASRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + MKL + +++
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPMKLTLQHVNSNQ-- 522
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R CSG Q+W
Sbjct: 523 CLDKATEEDSQVPSIR--DCSGSRSQQW 548
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR +L+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHLLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|74004307|ref|XP_855648.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
3 [Canis lupus familiaris]
Length = 556
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 206/447 (46%), Positives = 282/447 (63%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSPR +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYADE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYADE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSPR +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|350586068|ref|XP_003482105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Sus scrofa]
Length = 559
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R CSG Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCSGSRSQQW 548
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|149639572|ref|XP_001511824.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Ornithorhynchus anatinus]
Length = 556
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 223/534 (41%), Positives = 311/534 (58%), Gaps = 65/534 (12%)
Query: 105 ALLRTVHSVISRS---PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
+LL + +VISRS P M K +L+ D + ++++ S + RS
Sbjct: 41 SLLPALRAVISRSQEGPGEMGKAVLISKDDQEKMKELFKINQFNLMASDLIALNRS---- 96
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSV 221
L RL G + T ++ +E LP + +VIVFHNEAWS LLRTV SV
Sbjct: 97 -LPDVRLEGCK----------------TKIYPDE-LPNTRVVIVFHNEAWSTLLRTVFSV 138
Query: 222 ISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQ 281
I+RSPRS+L E++LVDDAS R+FLK+SL+ YV L VP ++IR R GLI+ARL GA
Sbjct: 139 INRSPRSLLSEVILVDDASERDFLKTSLENYVKNLDVPVKIIRMEQRSGLIRARLRGAAA 198
Query: 282 AEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAF 341
+ G+++ FLDAHCECT GWLE L+AR+ EDR VVCP+ID+ISD TF Y+ ++ +G F
Sbjct: 199 SRGQVITFLDAHCECTFGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGF 258
Query: 342 NWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGE 401
NW+L+FRWY + ++ D T P +TP MAGGLF+IDR YF IG YD M +WGGE
Sbjct: 259 NWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGE 318
Query: 402 NLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE 461
NLEMSFR+WQCGGS+EI CSHV H+FRKA+PY+FPGG V+ N R+A VWMDE+ +
Sbjct: 319 NLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKD 378
Query: 462 FYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQ 521
F++ +P K D V R LR+ LKC F WYL +++P +P G IR+V+
Sbjct: 379 FFYIISPGVVKV-DYGDVSVRKALRQNLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVE 437
Query: 522 THKCVEKPLAKGS---------------------------------MNQASGPASLLPCT 548
T++C++ K + +++ +GP +L C
Sbjct: 438 TNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCH 497
Query: 549 HL--PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
H+ L + +L + +++ CLD P E+ + P ++ CSG Q+W
Sbjct: 498 HMRGNQLWEYDAERLTLRHVNSNQ--CLDEPSEEDKMVPTMK--DCSGSRSQQW 547
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTK++ +E LP + +VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE-LPNTRVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPRS+L E++LVDDAS R+FLK+SL+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVFSVINRSPRSLLSEVILVDDASERDFLKTSLENYVKNLDVPVKIIRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA + G+++ FLDAHCECT + E L +
Sbjct: 194 RGAAASRGQVITFLDAHCECTFGWLEPLLAR 224
>gi|29135331|ref|NP_803485.1| polypeptide N-acetylgalactosaminyltransferase 1 precursor [Bos
taurus]
gi|1171989|sp|Q07537.1|GALT1_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|289412|gb|AAA30532.1| UDP-GalNAc:polypeptide, N-acetylgalactosaminyltransferase [Bos
taurus]
gi|296473855|tpg|DAA15970.1| TPA: polypeptide N-acetylgalactosaminyltransferase 1 [Bos taurus]
Length = 559
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R CSG Q+W
Sbjct: 525 DKATDEDSQVPSIR--DCSGSRSQQW 548
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|147900163|ref|NP_001083410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Xenopus
laevis]
gi|38014522|gb|AAH60419.1| MGC68664 protein [Xenopus laevis]
Length = 559
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 201/446 (45%), Positives = 278/446 (62%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAW+ LLRTVHSVI+RSPR +L+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLREIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+L+ YV KLSVP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 ALETYVKKLSVPVHVIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + +R D T P
Sbjct: 226 INHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRRGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + +R+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDIATRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP +L C HL + L+ + + CL
Sbjct: 465 NQVFSYTASKEIRTDDLCLDVSKLNGPVIMLKCHHLRGNQLWEYDPVKLTLVHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G + Q+W
Sbjct: 525 DKAAEEDSQVPSIR--DCNGGHSQQW 548
Score = 185 bits (469), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 120/151 (79%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAW+ LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWTTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR +L+EI+LVDDAS R+FLK +L+ YV KLSVP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHLLREIVLVDDASERDFLKRALETYVKKLSVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 RGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|301766697|ref|XP_002918769.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Ailuropoda melanoleuca]
Length = 556
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/447 (45%), Positives = 282/447 (63%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T ++ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSPR +L E++LVDDAS R+FLK +
Sbjct: 107 TKIYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTK++ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSPR +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|335775065|gb|AEH58447.1| polypeptide N-acetylgalactosaminyltransferase 1-like protein [Equus
caballus]
Length = 453
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 201/437 (45%), Positives = 273/437 (62%), Gaps = 36/437 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL
Sbjct: 9 LPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKL 68
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR+ DR VV
Sbjct: 69 KVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLARIKHDRKTVV 128
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 129 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 188
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+FRKA+PY+F
Sbjct: 189 LFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTF 248
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +++ N R+A VWMDE+ F++ +P K D + SRL LR +L+C F W
Sbjct: 249 PGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRHKLQCRPFSW 307
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++C++ K +
Sbjct: 308 YLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTAN 367
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDI 583
+++ +GP ++L C HL + L + + CLD E+
Sbjct: 368 KEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCLDKATEEDSQ 427
Query: 584 SPRVRILACSGFNRQRW 600
P +R C+G Q+W
Sbjct: 428 VPSIR--DCTGSRSQQW 442
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 75/120 (62%), Positives = 93/120 (77%), Gaps = 1/120 (0%)
Query: 80 KTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 139
KTKV+ + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 1 KTKVYPDN-LPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 59
Query: 140 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 60 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 119
>gi|324503401|gb|ADY41481.1| N-acetylgalactosaminyltransferase 6 [Ascaris suum]
Length = 927
Score = 407 bits (1046), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/414 (48%), Positives = 271/414 (65%), Gaps = 11/414 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IV+HNEA+S LLRTV SVI RSP+ +LKEI+LVDD S+R FLK LD +V L
Sbjct: 158 LPTTSVIIVYHNEAFSTLLRTVVSVIDRSPKEVLKEIILVDDFSSRSFLKDDLDNFVVTL 217
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ ++IR+ RVGLI+ARL+GA +A+GE+L FLD+HCECT GWLE L+AR+ E+R VV
Sbjct: 218 GIRIKIIRAQRRVGLIRARLMGANEADGEVLTFLDSHCECTKGWLEPLLARIKENRKAVV 277
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+I+D TFAY + EL G FNW L FRWY + D T P ++P MAGG
Sbjct: 278 CPVIDVINDRTFAYQKGIELFRGGFNWNLQFRWYAVPPDIVKGRANDPTMPIQSPTMAGG 337
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YF +GAYD M++WGGEN+E+SFR+WQCGG IEI PCSHV H+FRKASP+ F
Sbjct: 338 LFSIDKRYFEELGAYDPGMEIWGGENIEISFRIWQCGGRIEILPCSHVGHIFRKASPHDF 397
Query: 437 PGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PG S ++L NL RVA VWMDEW ++K P+A + R V R+ELRK+L+C F
Sbjct: 398 PGKSSGKILNSNLLRVAEVWMDEWKYLFYKTAPQALQMRSSIDVSERIELRKRLQCKDFN 457
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHK-----CVEKPLAKGSMNQASGPASLLPCTHL 550
WYL +VWP +F+P + FGRI H + C+ + +G + + AS+ C+
Sbjct: 458 WYLQNVWPDNFMPRPEHTFGRILHSYSESVGDPCCLYWTIPRGGVVRV---ASMHNCSAT 514
Query: 551 PVLTQMFV-MKLPTDLIATDESVCLDVPEYENDISP-RVRILACSGFNRQRWTY 602
+ + + I +DE +CL + N +++ C ++ + W Y
Sbjct: 515 SRFDRTELWLYTNEGQIKSDEHLCLSAYQPLNSKGKWAIQLKECGQYDNEYWDY 568
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 99/233 (42%), Positives = 144/233 (61%), Gaps = 37/233 (15%)
Query: 3 LISVVQLLKLARNQQDYIDR-----RGVHVVVGHYLGDSVDGGLHSNLSDAQ-------- 49
LI + Q ++++ ID+ RG+ VVVGHY G+ + +NL++ Q
Sbjct: 37 LILLAQEKIFTTSEENIIDQNSYAGRGIQVVVGHYNGN-LPAEKRANLTEEQLNANRFSP 95
Query: 50 ----------------------QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEE 87
+ IN+FNL++SD+I +NR+LPDVRK +C+ K++
Sbjct: 96 VPGVGEDGRPVKLDELEDRLSDDTFGINQFNLIISDKIALNRSLPDVRKHQCRDKIYPAP 155
Query: 88 F-LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 146
LP +S++IV+HNEA+S LLRTV SVI RSP+ +LKEI+LVDD S+R FLK LD +V
Sbjct: 156 SELPTTSVIIVYHNEAFSTLLRTVVSVIDRSPKEVLKEIILVDDFSSRSFLKDDLDNFVV 215
Query: 147 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L + ++IR+ RVGLI+ARL+GA +A+GE+L FLD+HCECT + E L +
Sbjct: 216 TLGIRIKIIRAQRRVGLIRARLMGANEADGEVLTFLDSHCECTKGWLEPLLAR 268
>gi|149720888|ref|XP_001496819.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Equus caballus]
Length = 559
Score = 407 bits (1046), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCTGSRSQQW 548
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|432908535|ref|XP_004077909.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oryzias latipes]
Length = 557
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/437 (45%), Positives = 274/437 (62%), Gaps = 36/437 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VIVFHNEAWS LLRTVHSVI RSPRS+L+EI+LVDDAS R+FLK L++YV +L
Sbjct: 113 LPRTSVVIVFHNEAWSTLLRTVHSVIDRSPRSLLEEIVLVDDASERDFLKRQLEQYVRRL 172
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP RV+R R GLI+ARL GA + G+++ FLDAHCECTLGWLE L+ R+ +D+ VV
Sbjct: 173 EVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTLGWLEPLLTRIKQDKRTVV 232
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 233 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTIPVRTPTMAGG 292
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+FRKA+PY+F
Sbjct: 293 LFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTF 352
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +++ N R+A VWMDE+ F++ +P K D + +R LR++L+C F W
Sbjct: 353 PGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISTRTSLRQKLQCKPFSW 411
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++C++ K +
Sbjct: 412 YLENIYPDSQIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTAN 471
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDI 583
+++ +GP +L C HL + L+ + + CLD E+
Sbjct: 472 KEIRTDDLCLDVSKLNGPVMMLKCHHLKGNQLWEYDPVKLTLVHVNSNQCLDKASEEDSQ 531
Query: 584 SPRVRILACSGFNRQRW 600
P VR C+ Q+W
Sbjct: 532 VPSVR--DCTHVRSQQW 546
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 89/156 (57%), Positives = 121/156 (77%), Gaps = 1/156 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
N +++++IN+FNL+ S+ I +NR+LPDVR CK K++ ++ LP++S+VIVFHNEAW
Sbjct: 69 NQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDD-LPRTSVVIVFHNEAW 127
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTVHSVI RSPRS+L+EI+LVDDAS R+FLK L++YV +L VP RV+R R GL
Sbjct: 128 STLLRTVHSVIDRSPRSLLEEIVLVDDASERDFLKRQLEQYVRRLEVPVRVVRMEQRSGL 187
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARL GA + G+++ FLDAHCECTL + E L +
Sbjct: 188 IRARLKGASISTGQVITFLDAHCECTLGWLEPLLTR 223
>gi|344269062|ref|XP_003406374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Loxodonta africana]
Length = 559
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/446 (45%), Positives = 276/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP++S+VIVFHNEAWS LLRTVHSV++RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDALPRTSVVIVFHNEAWSTLLRTVHSVLNRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP++S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDA-LPRTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSV++RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVLNRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|348526962|ref|XP_003450988.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Oreochromis niloticus]
Length = 557
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 199/437 (45%), Positives = 272/437 (62%), Gaps = 36/437 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VIVFHNEAW+ LLRTVHSVI RSP ++L+EI+LVDDAS R+FLK L+ YV KL
Sbjct: 113 LPRTSVVIVFHNEAWTTLLRTVHSVIDRSPHTLLEEIVLVDDASERDFLKQQLERYVRKL 172
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP RV+R R GLI+ARL GA + G+++ FLDAHCECT GWLE L+AR+ +DR VV
Sbjct: 173 EVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLARIKQDRKTVV 232
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 233 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 292
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+FRKA+PY+F
Sbjct: 293 LFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTF 352
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +++ N R+A VWMDE+ F++ +P K D + SR LR++L+C F W
Sbjct: 353 PGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDITSRTALRQKLQCKPFSW 411
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++C++ K +
Sbjct: 412 YLENIYPDSQIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTAN 471
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDI 583
+++ +GP +L C HL L LI + + CLD E+
Sbjct: 472 KEIRTDDLCLDVSKLNGPVMMLKCHHLKGNQLWEYDPLKLTLIHVNSNQCLDKASEEDSQ 531
Query: 584 SPRVRILACSGFNRQRW 600
P VR C+ Q+W
Sbjct: 532 VPSVR--DCTHTRSQQW 546
Score = 179 bits (453), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 85/151 (56%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CK K++ + LP++S+VIVFHNEAW+ LLR
Sbjct: 74 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDN-LPRTSVVIVFHNEAWTTLLR 132
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI RSP ++L+EI+LVDDAS R+FLK L+ YV KL VP RV+R R GLI+ARL
Sbjct: 133 TVHSVIDRSPHTLLEEIVLVDDASERDFLKQQLERYVRKLEVPVRVVRMEQRSGLIRARL 192
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA + G+++ FLDAHCECT + E L +
Sbjct: 193 KGASISTGQVITFLDAHCECTTGWLEPLLAR 223
>gi|73961264|ref|XP_537284.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Canis lupus familiaris]
gi|301764431|ref|XP_002917637.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Ailuropoda melanoleuca]
gi|281348455|gb|EFB24039.1| hypothetical protein PANDA_005970 [Ailuropoda melanoleuca]
Length = 559
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|410977586|ref|XP_003995186.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Felis
catus]
Length = 559
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|444723970|gb|ELW64593.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Tupaia chinensis]
Length = 591
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 138 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 197
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 198 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 257
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 258 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 317
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 318 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 377
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 378 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 436
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 437 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 496
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 497 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 556
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 557 DKATEEDSQVPSIR--DCNGSRSQQW 580
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 108 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 166
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 167 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 226
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 227 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 257
>gi|426253597|ref|XP_004020479.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Ovis
aries]
Length = 559
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATDEDSQVPSIR--DCNGSRSQQW 548
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|327275061|ref|XP_003222292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Anolis carolinensis]
Length = 559
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/446 (44%), Positives = 276/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR +L+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYSDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHILEEIILVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL +P VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 LLENYVKKLQIPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKADRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L+ + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLLHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P ++ C+G Q+W
Sbjct: 525 DKATEEDSQVPSIK--DCNGNRSQQW 548
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+++ LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYSDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR +L+EI+LVDDAS R+FLK L+ YV KL +P VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHILEEIILVDDASERDFLKRLLENYVKKLQIPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|281347645|gb|EFB23229.1| hypothetical protein PANDA_007284 [Ailuropoda melanoleuca]
Length = 516
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 208/418 (49%), Positives = 276/418 (66%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T ++ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSPR +L E++LVDDAS R+FLK +
Sbjct: 62 TKIYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLT 120
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 121 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 180
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 181 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 240
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 241 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 300
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 301 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 359
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +++P +P G IR+V+T++C++ K N+ G + C
Sbjct: 360 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 413
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV +S V +L C Q W YD E
Sbjct: 414 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LSGPVIMLKCHHMRGNQLWEYDAE 465
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTK++ +E LP +S+VIVFHNEAWS LLR
Sbjct: 30 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE-LPNTSVVIVFHNEAWSTLLR 88
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSPR +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 89 TVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 148
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 149 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 179
>gi|56554527|pdb|1XHB|A Chain A, The Crystal Structure Of Udp-Galnac: Polypeptide Alpha-N-
Acetylgalactosaminyltransferase-T1
Length = 472
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/448 (45%), Positives = 279/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 19 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 78
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA + G+++ FLDAHCECT GWLE L+AR
Sbjct: 79 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLAR 138
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 139 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 198
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 199 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 258
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR+
Sbjct: 259 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRR 317
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 318 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 377
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 378 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 435
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R C+G Q+W
Sbjct: 436 CLDKATEEDSQVPSIR--DCTGSRSQQW 461
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 84/138 (60%), Positives = 104/138 (75%), Gaps = 1/138 (0%)
Query: 62 SDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSM 121
S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M
Sbjct: 2 SEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHM 60
Query: 122 LKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVF 181
++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL GA + G+++ F
Sbjct: 61 IEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITF 120
Query: 182 LDAHCECTLVFNEEFLPK 199
LDAHCECT + E L +
Sbjct: 121 LDAHCECTAGWLEPLLAR 138
>gi|33440465|gb|AAH56215.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/448 (45%), Positives = 279/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA + G+++ FLDAHCECT GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR+
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRR 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 522
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R C+G Q+W
Sbjct: 523 CLDKATEEDSQVPSIR--DCTGSRSQQW 548
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++ N+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKTNQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA + G+++ FLDAHCECT + E L +
Sbjct: 195 KGAAVSRGQVITFLDAHCECTAGWLEPLLAR 225
>gi|237874259|ref|NP_038842.3| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|237874270|ref|NP_001153876.1| polypeptide N-acetylgalactosaminyltransferase 1 [Mus musculus]
gi|13878613|sp|O08912.1|GALT1_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|2149049|gb|AAB58477.1| polypeptide GalNAc transferase-T1 [Mus musculus]
gi|60552620|gb|AAH90962.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mus musculus]
Length = 559
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/446 (45%), Positives = 274/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA + G+++ FLDAHCECT GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR+
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRR 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCTGSRSQQW 548
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA + G+++ FLDAHCECT + E L +
Sbjct: 195 KGAAVSRGQVITFLDAHCECTAGWLEPLLAR 225
>gi|301766699|ref|XP_002918770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Ailuropoda melanoleuca]
Length = 557
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 205/447 (45%), Positives = 282/447 (63%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T ++ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSPR +L E++LVDDAS R+FLK +
Sbjct: 108 TKIYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLT 166
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 167 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 226
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 227 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 286
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 287 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 346
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 347 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 405
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 406 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 465
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + +L + +++ C
Sbjct: 466 QVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 523
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 524 LDEPSEEDKMVPTMQ--DCSGSRSQQW 548
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTK++ +E LP +S+VIVFHNEAWS LLR
Sbjct: 76 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKIYPDE-LPNTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSPR +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 135 TVYSVINRSPRYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 195 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 225
>gi|149412842|ref|XP_001510290.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Ornithorhynchus anatinus]
Length = 559
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 274/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G ++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKFDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL L+ + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPTKLTLLHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 91/151 (60%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+RI NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASERIAFNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVRKLRVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G ++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAASKGRVITFLDAHCECTVGWLEPLLAR 225
>gi|345308178|ref|XP_003428667.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Ornithorhynchus anatinus]
Length = 558
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/446 (45%), Positives = 274/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 105 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 164
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G ++ FLDAHCECT+GWLE L+AR
Sbjct: 165 PLESYVRKLRVPVHVIRMEQRSGLIRARLKGAAASKGRVITFLDAHCECTVGWLEPLLAR 224
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 225 IKFDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 284
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 285 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 344
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 345 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 403
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 404 KLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 463
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL L+ + + CL
Sbjct: 464 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPTKLTLLHVNSNQCL 523
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 524 DKATEEDSQVPSIR--DCNGSRSQQW 547
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 91/151 (60%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+RI NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 75 KEMFKINQFNLMASERIAFNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 134 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVRKLRVPVHVIRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G ++ FLDAHCECT+ + E L +
Sbjct: 194 KGAAASKGRVITFLDAHCECTVGWLEPLLAR 224
>gi|410968681|ref|XP_003990830.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Felis
catus]
Length = 546
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/447 (46%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSPR +L E++LVDDAS R+FLK +
Sbjct: 97 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASERDFLKLT 155
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA + G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 156 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASRGQVITFLDAHCECTLGWLEPLLARI 215
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 216 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 275
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 276 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 335
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 336 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 394
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 395 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 454
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + +L + +++ C
Sbjct: 455 QVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 512
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 513 LDEPSEEDKMVPTMQ--DCSGSRSQQW 537
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 74/125 (59%), Positives = 96/125 (76%), Gaps = 1/125 (0%)
Query: 75 RKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 134
R +CKTKV+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSPR +L E++LVDDAS R
Sbjct: 91 RFDRCKTKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPRYLLSEVILVDDASER 149
Query: 135 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNE 194
+FLK +L+ YV L VP ++IR R GLI+ARL GA + G+++ FLDAHCECTL + E
Sbjct: 150 DFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASRGQVITFLDAHCECTLGWLE 209
Query: 195 EFLPK 199
L +
Sbjct: 210 PLLAR 214
>gi|291391573|ref|XP_002712184.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Oryctolagus cuniculus]
Length = 557
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 205/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 108 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 166
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 167 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 226
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 227 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 286
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 287 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 346
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 347 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 405
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 406 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 465
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + KL + +++ C
Sbjct: 466 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAEKLTLRHVNSNQ--C 523
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 524 LDEPSEEDKMVPTMQ--DCSGSRSQQW 548
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 76 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 135 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 195 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 225
>gi|126320794|ref|XP_001362869.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Monodelphis domestica]
Length = 559
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/448 (45%), Positives = 280/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKVDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + +R+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISTRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 522
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R CSG Q+W
Sbjct: 523 CLDKATEEDSQVPSIR--DCSGSRSQQW 548
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 91/154 (59%), Positives = 119/154 (77%), Gaps = 1/154 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NRTLPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRTLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVRKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSSI 202
GA ++G+++ FLDAHCECT+ + E L + +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLARIKV 228
>gi|13242273|ref|NP_077349.1| polypeptide N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1709559|sp|Q10473.1|GALT1_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|1141792|gb|AAC52511.1| polypeptide GalNAc transferase [Rattus norvegicus]
gi|149017082|gb|EDL76133.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Rattus norvegicus]
gi|1587757|prf||2207253A UDP-GalNAc polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/448 (45%), Positives = 280/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDSLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 522
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R C+G Q+W
Sbjct: 523 CLDKATEEDSQVPSIR--DCTGSRSQQW 548
Score = 182 bits (462), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIAFNRSLPDVRLEGCKTKVYPDS-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|440911421|gb|ELR61095.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Bos grunniens
mutus]
Length = 564
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 203/448 (45%), Positives = 279/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHS+I+ SPR ML+EI+LVDDAS R+FLK
Sbjct: 111 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSIINHSPRHMLEEIVLVDDASERDFLKR 170
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 171 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 230
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 231 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 290
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 291 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 350
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 351 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 409
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 410 KLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 469
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 470 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 527
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R CSG Q+W
Sbjct: 528 CLDKATDEDSQVPSIR--DCSGSRSQQW 553
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 88/156 (56%), Positives = 117/156 (75%), Gaps = 6/156 (3%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPK-----CKTKVFNEEFLPKSSIVIVFHNEAW 103
+++++IN+FNL+ S+ I +NR+LPDV P CKTKV+ + LP +S+VIVFHNEAW
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAW 134
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTVHS+I+ SPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GL
Sbjct: 135 STLLRTVHSIINHSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGL 194
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARL GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 IRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 230
>gi|348576706|ref|XP_003474127.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Cavia porcellus]
Length = 559
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/446 (44%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRIGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|296222514|ref|XP_002757211.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Callithrix jacchus]
gi|403265072|ref|XP_003924779.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Saimiri
boliviensis boliviensis]
Length = 559
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/448 (45%), Positives = 280/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 522
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R C+G Q+W
Sbjct: 523 CLDKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|40018588|ref|NP_954537.1| polypeptide N-acetylgalactosaminyltransferase 13 [Rattus
norvegicus]
gi|51315705|sp|Q6UE39.1|GLT13_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|34577141|gb|AAQ75749.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Rattus norvegicus]
gi|149047803|gb|EDM00419.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047804|gb|EDM00420.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
gi|149047805|gb|EDM00421.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a
[Rattus norvegicus]
Length = 556
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/447 (46%), Positives = 280/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + +L L + + C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYDAERLT--LRHANSNQC 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|26337335|dbj|BAC32353.1| unnamed protein product [Mus musculus]
Length = 556
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/447 (46%), Positives = 280/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFKCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + +L L + + C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYDAERLT--LRHANSNQC 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|395510712|ref|XP_003759616.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1
[Sarcophilus harrisii]
Length = 559
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVRKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKVDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRHYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + +R+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISTRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +V+P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R CSG Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCSGSRSQQW 548
Score = 184 bits (468), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 91/154 (59%), Positives = 119/154 (77%), Gaps = 1/154 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NRTLPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRTLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVRKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSSI 202
GA ++G+++ FLDAHCECT+ + E L + +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLARIKV 228
>gi|15620895|dbj|BAB67811.1| KIAA1918 protein [Homo sapiens]
Length = 516
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 222/534 (41%), Positives = 312/534 (58%), Gaps = 65/534 (12%)
Query: 105 ALLRTVHSVISRS---PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
+LL + +VISR+ P M K +L+ D + ++++ S + RS
Sbjct: 1 SLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRS---- 56
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSV 221
L RL G + T V+ +E LP +S+VIVFHNEAWS LLRTV+SV
Sbjct: 57 -LPDVRLEGCK----------------TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSV 98
Query: 222 ISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQ 281
I+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL GA
Sbjct: 99 INRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAA 158
Query: 282 AEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAF 341
++G+++ FLDAHCECTLGWLE L+AR+ EDR VVCP+ID+ISD TF Y+ ++ +G F
Sbjct: 159 SKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGF 218
Query: 342 NWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGE 401
NW+L+FRWY + ++ D T P +TP MAGGLF+IDR YF IG YD M +WGGE
Sbjct: 219 NWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGE 278
Query: 402 NLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE 461
NLEMSFR+WQCGGS+EI CSHV H+FRKA+PY+FPGG V+ N R+A VWMDE+ +
Sbjct: 279 NLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKD 338
Query: 462 FYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQ 521
F++ +P K D V R LR+ LKC F WYL +++P +P G IR+V+
Sbjct: 339 FFYIISPGVVKV-DYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVE 397
Query: 522 THKCVEKPLAKGS---------------------------------MNQASGPASLLPCT 548
T++C++ K + +++ +GP +L C
Sbjct: 398 TNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCH 457
Query: 549 HL--PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
H+ L + +L + +++ CLD P E+ + P ++ CSG Q+W
Sbjct: 458 HMRGNQLWEYDAERLTLRHVNSNQ--CLDEPSEEDKMVPTMQ--DCSGSRSQQW 507
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 35 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 93
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 94 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 153
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 154 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 184
>gi|449278148|gb|EMC86104.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Columba livia]
Length = 553
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 205/417 (49%), Positives = 273/417 (65%), Gaps = 14/417 (3%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLENYVKKLKVPVHVIRMEQRSGLIRARLKGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ + +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKADRRTVVCPIIDVISDDTFEYMAGSDKTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR+
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRR 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+L+C F WYL +V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 405 KLQCRPFSWYLENVYPDSQIPRHYFSLGEIRNVETNQCLDNMARK--ENEKVG---IFNC 459
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
H Q+F + I TD+ +CLDV + ++ +L C Q W YD
Sbjct: 460 -HGMGGNQVFSYTANKE-IRTDD-LCLDVSKLNGPVT----MLKCHHLKGNQLWEYD 509
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEIIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLENYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|13124891|ref|NP_065207.2| polypeptide N-acetylgalactosaminyltransferase 1 [Homo sapiens]
gi|386780838|ref|NP_001247531.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|332225596|ref|XP_003261968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Nomascus leucogenys]
gi|332849764|ref|XP_001135802.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pan troglodytes]
gi|397520346|ref|XP_003830280.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Pan
paniscus]
gi|426385782|ref|XP_004059381.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Gorilla
gorilla gorilla]
gi|1709558|sp|Q10472.1|GALT1_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
AltName: Full=Polypeptide GalNAc transferase 1;
Short=GalNAc-T1; Short=pp-GaNTase 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 1;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 1
soluble form
gi|971459|emb|CAA59380.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|119621764|gb|EAX01359.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|119621765|gb|EAX01360.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1), isoform
CRA_a [Homo sapiens]
gi|261861328|dbj|BAI47186.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [synthetic
construct]
gi|355701910|gb|EHH29263.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|355754989|gb|EHH58856.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Macaca
fascicularis]
gi|380784241|gb|AFE63996.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|383411871|gb|AFH29149.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|384942418|gb|AFI34814.1| polypeptide N-acetylgalactosaminyltransferase 1 [Macaca mulatta]
gi|410258728|gb|JAA17331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410292416|gb|JAA24808.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
gi|410338657|gb|JAA38275.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 (GalNAc-T1) [Pan
troglodytes]
Length = 559
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/446 (44%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|417402739|gb|JAA48205.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 559
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 202/448 (45%), Positives = 279/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSV RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVTDRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGASVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ +DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKQDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D V SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDVASRIGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 522
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD ++ P +R C+G Q+W
Sbjct: 523 CLDKATEDDSQVPSIR--DCTGSRSQQW 548
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSV RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVTDRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGASVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|76677928|ref|NP_766618.2| polypeptide N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|51315989|sp|Q8CF93.1|GLT13_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|27531011|dbj|BAC54546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297181|gb|AAI31652.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Mus musculus]
gi|124297498|gb|AAI31653.1| Galnt13 protein [Mus musculus]
gi|148694972|gb|EDL26919.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694973|gb|EDL26920.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
gi|148694975|gb|EDL26922.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_a [Mus
musculus]
Length = 556
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 206/447 (46%), Positives = 280/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + +L L + + C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLSGPVIMLKCHHMRGNQLWEYDAERLT--LRHANSNQC 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|332251760|ref|XP_003275017.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|402902957|ref|XP_003914352.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 [Papio
anubis]
Length = 559
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/446 (44%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLERYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLERYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|27530993|dbj|BAC54545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|193785960|dbj|BAG54747.1| unnamed protein product [Homo sapiens]
Length = 556
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|326674972|ref|XP_687472.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
2 [Danio rerio]
Length = 557
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/437 (45%), Positives = 272/437 (62%), Gaps = 36/437 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VIVFHNEAW+ LLRTVHSVI RSPR +L+EI+LVDDAS R+FLK L+ YV KL
Sbjct: 113 LPRTSVVIVFHNEAWTTLLRTVHSVIDRSPRHLLEEIVLVDDASERDFLKRQLEHYVRKL 172
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP RV+R R GLI+ARL GA + G+++ FLDAHCECT GWLE L++R+ D+ VV
Sbjct: 173 EVPVRVVRMEQRSGLIRARLKGASISTGQVITFLDAHCECTTGWLEPLLSRIKLDKKTVV 232
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 233 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 292
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+FRKA+PY+F
Sbjct: 293 LFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTF 352
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +++ N R+A VWMDE+ F++ +P K D + +R LR++L+C F W
Sbjct: 353 PGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISTRTSLRQRLQCKPFSW 411
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +V+P +P G IR+V+T++C++ K +
Sbjct: 412 YLENVYPDSQIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTAN 471
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDI 583
+++ +GP +L C HL + L+ + + CLD ++
Sbjct: 472 KEIRTDDLCLDVSKLNGPVMMLKCHHLKGNQLWEYDPVKLSLVHVNSNQCLDKASEDDSQ 531
Query: 584 SPRVRILACSGFNRQRW 600
P VR CSG Q+W
Sbjct: 532 VPSVR--DCSGARSQQW 546
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ ++ LP++S+VIVFHNEAW+ LLR
Sbjct: 74 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDD-LPRTSVVIVFHNEAWTTLLR 132
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI RSPR +L+EI+LVDDAS R+FLK L+ YV KL VP RV+R R GLI+ARL
Sbjct: 133 TVHSVIDRSPRHLLEEIVLVDDASERDFLKRQLEHYVRKLEVPVRVVRMEQRSGLIRARL 192
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA + G+++ FLDAHCECT + E L +
Sbjct: 193 KGASISTGQVITFLDAHCECTTGWLEPLLSR 223
>gi|145309313|ref|NP_443149.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|114581261|ref|XP_515839.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pan troglodytes]
gi|297668636|ref|XP_002812536.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Pongo abelii]
gi|297668638|ref|XP_002812537.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Pongo abelii]
gi|397525640|ref|XP_003832767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Pan
paniscus]
gi|116242497|sp|Q8IUC8.2|GLT13_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 13;
AltName: Full=Polypeptide GalNAc transferase 13;
Short=GalNAc-T13; Short=pp-GaNTase 13; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 13;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 13
gi|51490969|emb|CAD44533.2| polypeptide N-acetylgalactosaminyltransferase 13 [Homo sapiens]
gi|71680339|gb|AAI01032.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|71681791|gb|AAI01034.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|115528820|gb|AAI01035.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Homo
sapiens]
gi|119631869|gb|EAX11464.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|119631870|gb|EAX11465.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
isoform CRA_a [Homo sapiens]
gi|380783281|gb|AFE63516.1| polypeptide N-acetylgalactosaminyltransferase 13 [Macaca mulatta]
Length = 556
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|332251762|ref|XP_003275018.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Nomascus leucogenys]
Length = 557
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 108 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 166
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 167 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 226
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 227 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 286
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 287 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 346
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 347 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 405
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 406 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 465
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 466 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 523
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 524 LDEPSEEDKMVPTMQ--DCSGSRSQQW 548
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 76 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 135 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 195 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 225
>gi|116003987|ref|NP_001070354.1| polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
gi|115304963|gb|AAI23663.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13) [Bos
taurus]
gi|296490573|tpg|DAA32686.1| TPA: polypeptide N-acetylgalactosaminyltransferase 13 [Bos taurus]
Length = 556
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TRVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGTRSQQW 547
Score = 181 bits (460), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKT+V+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTRVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|426221079|ref|XP_004004739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Ovis
aries]
Length = 556
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 523 LDEPSEEDKMVPTMQ--DCSGTRSQQW 547
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|1582794|prf||2119305A UDP-GalNAc/polypeptide N-acetylgalactosaminyltransferase
Length = 559
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/446 (44%), Positives = 274/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLDFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|351714454|gb|EHB17373.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 559
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/446 (45%), Positives = 275/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMVEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ +DR VVCP+I +ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKQDRRTVVCPIICVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R CSG Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCSGSRSQQW 548
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMVEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|395846602|ref|XP_003795992.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Otolemur garnettii]
Length = 556
Score = 404 bits (1037), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/445 (45%), Positives = 276/445 (62%), Gaps = 37/445 (8%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLD 575
+++ +GP +L C H+ L + + CLD
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWDYDSERLTLRHVNSNQCLD 524
Query: 576 VPEYENDISPRVRILACSGFNRQRW 600
P E+ + P ++ CSG Q+W
Sbjct: 525 EPSEEDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLDVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|354486376|ref|XP_003505357.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cricetulus griseus]
Length = 497
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 204/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 48 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 106
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 107 LENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 166
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 167 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 226
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 227 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 286
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 287 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 345
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 346 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 405
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 406 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 463
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ CSG Q+W
Sbjct: 464 LDEPSEEDKMVPTMQ--DCSGSRSQQW 488
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 16 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 74
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 75 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARL 134
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 135 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 165
>gi|395846604|ref|XP_003795993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Otolemur garnettii]
Length = 558
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/445 (45%), Positives = 276/445 (62%), Gaps = 37/445 (8%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 109 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 167
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 168 LENYVKNLDVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 227
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 228 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 287
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 288 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 347
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 348 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 406
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 407 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 466
Query: 535 -------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLD 575
+++ +GP +L C H+ L + + CLD
Sbjct: 467 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWDYDSERLTLRHVNSNQCLD 526
Query: 576 VPEYENDISPRVRILACSGFNRQRW 600
P E+ + P ++ CSG Q+W
Sbjct: 527 EPSEEDKMVPTMQ--DCSGSRSQQW 549
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 77 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 135
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 136 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLDVPVKIIRMEERSGLIRARL 195
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 196 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 226
>gi|393911417|gb|EFO27036.2| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 597
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 209/434 (48%), Positives = 273/434 (62%), Gaps = 25/434 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
C LP +S++IV+HNEA+S L+RTV SVI RSP LKEI+LVDD STR F
Sbjct: 110 QCRAKTYLPSSELPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTF 169
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK+ LD +VA+L +VIR+ RVGLI+ARL+GA +A+G++L FLD+HCECT GW+E L
Sbjct: 170 LKAELDNFVAQLGTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPL 229
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ E+R VVCPVID+I++ TFAY + EL G FNW L FRWY + D
Sbjct: 230 LARIKENRKAVVCPVIDVINERTFAYQKGIELFRGGFNWNLQFRWYALPPEMIKSRSNDP 289
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+P +P MAGGLF+IDR YF IG YD EM +WGGEN+E+S RVWQCGG IEI PCSHV
Sbjct: 290 TKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPCSHV 349
Query: 425 AHLFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
H+FR+ASP+ FP S +L NL RVA VWMDEW +++ P+ K R+ V R+
Sbjct: 350 GHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYRTAPQVYKMRETVDVSDRV 409
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT-----HKCVE--KPLAK---- 532
ELRK+L C SFKW+L +VW HFLP FGR+ H ++ + C+ PL +
Sbjct: 410 ELRKRLHCKSFKWFLDNVWKDHFLPQPGSAFGRVIHSRSQIGGLNGCLHWTVPLGESMRV 469
Query: 533 GSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR---VRI 589
+M+ S P S T L + T+ MK TDE +CL Y+ PR V++
Sbjct: 470 ATMHNCSAPES-FDRTELWLFTKEGQMK-------TDEHLCLSA--YQPVQGPRNWKVQL 519
Query: 590 LACSGFNRQRWTYD 603
C + + W Y+
Sbjct: 520 KECGQYESEYWDYN 533
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 96/214 (44%), Positives = 133/214 (62%), Gaps = 37/214 (17%)
Query: 23 RGVHVVVGHYLGD-------------------------------SVDGGLHSNLSDAQQ- 50
RG+ +VVGHY G+ + + G LS+A +
Sbjct: 19 RGIQIVVGHYNGNLARDKLSNLTAGKFLLNFEQLNANNYAPITGAGEDGRPVKLSEADER 78
Query: 51 ----LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVF-NEEFLPKSSIVIVFHNEAWSA 105
+ IN+FNL+VSDRI +NR+LPD+RK +C+ K + LP +S++IV+HNEA+S
Sbjct: 79 LSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSELPTTSVIIVYHNEAFST 138
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
L+RTV SVI RSP LKEI+LVDD STR FLK+ LD +VA+L +VIR+ RVGLI+
Sbjct: 139 LMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQLGTHIKVIRANERVGLIR 198
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
ARL+GA +A+G++L FLD+HCECT + E L +
Sbjct: 199 ARLIGATEAKGDVLTFLDSHCECTKGWMEPLLAR 232
>gi|431894826|gb|ELK04619.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Pteropus alecto]
Length = 519
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/439 (45%), Positives = 276/439 (62%), Gaps = 40/439 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L
Sbjct: 77 LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNL 136
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+ EDR VV
Sbjct: 137 EVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVV 196
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 197 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 256
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PY+F
Sbjct: 257 LFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTF 316
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V+ N R+A VWMDE+ +F++ +P K D V R LR+ LKC F W
Sbjct: 317 PGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLRENLKCKPFSW 375
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++C++ K +
Sbjct: 376 YLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTAD 435
Query: 535 -----------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVCLDVPEYEN 581
+++ +GP +L C H+ L + +L + +++ CLD P E+
Sbjct: 436 KEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--CLDEPSEED 493
Query: 582 DISPRVRILACSGFNRQRW 600
+ P ++ CSG Q+W
Sbjct: 494 KMVPTMQ--DCSGNRSQQW 510
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 87/151 (57%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ ++ LP +S+VIVFHNEAWS LLR
Sbjct: 38 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDQ-LPNTSVVIVFHNEAWSTLLR 96
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 97 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 156
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 157 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 187
>gi|291243604|ref|XP_002741691.1| PREDICTED: Polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 565
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 196/411 (47%), Positives = 269/411 (65%), Gaps = 14/411 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+SIVIVFHNEAWS L+R VHS+I+RSPR +L+EI+LVDDAS R+FL L++YV KL
Sbjct: 121 LPKTSIVIVFHNEAWSTLIRNVHSIINRSPRMLLEEIILVDDASERDFLGKELEDYVKKL 180
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV R R GLI+ARL GA + GE++ FLDAHCECT GWLE L+AR+AEDR+RVV
Sbjct: 181 PVRVRVERMDKRSGLIRARLRGAGVSTGEVITFLDAHCECTQGWLEPLMARIAEDRSRVV 240
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF + ++ +G FNW+L+FRWY+ + ++ D T P TP MAGG
Sbjct: 241 CPIIDVISDETFEFHAGSDMTYGGFNWKLNFRWYSVPKREMDRRKGDRTIPLNTPTMAGG 300
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI + YF IG YD M +WGGENLEMSFR+W CGG++EI CSHV H+FRK +PYSF
Sbjct: 301 LFAIHKDYFEEIGTYDAGMDIWGGENLEMSFRIWMCGGTLEIVTCSHVGHVFRKTTPYSF 360
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG ++ N R+A VWMD++ F++K +P K+ + V +R +LR +L+C SFKW
Sbjct: 361 PGGTGAIINKNNRRLAEVWMDDYKTFFYKISP-GSKKSEYGDVTNRKQLRDKLQCKSFKW 419
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +++P MD G IR+++T +C++ +M + + H Q+
Sbjct: 420 YLENIYPESQFMMDYNMIGEIRNMETKQCLD------NMGRKENNKVGIYACHGQGGNQI 473
Query: 557 FVMKLPTDLIATDESVCLDVPEYE--NDISPRVRILACSGFNRQRWTYDKE 605
F +L D +CLD NDI +++ + Q W++++E
Sbjct: 474 FAWTKKKELKHDD--LCLDASRQSGFNDI---MQLRCHNQGGNQEWSFNRE 519
Score = 175 bits (443), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 85/142 (59%), Positives = 109/142 (76%), Gaps = 1/142 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFN-EEFLPKSSIVIVFHNEAWSALL 107
+++++IN+FNLL S++I VNR+LPDVR CK K + LPK+SIVIVFHNEAWS L+
Sbjct: 80 KEMFKINQFNLLASNKISVNRSLPDVRMDGCKKKTYPPHNTLPKTSIVIVFHNEAWSTLI 139
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
R VHS+I+RSPR +L+EI+LVDDAS R+FL L++YV KL V RV R R GLI+AR
Sbjct: 140 RNVHSIINRSPRMLLEEIILVDDASERDFLGKELEDYVKKLPVRVRVERMDKRSGLIRAR 199
Query: 168 LLGARQAEGEILVFLDAHCECT 189
L GA + GE++ FLDAHCECT
Sbjct: 200 LRGAGVSTGEVITFLDAHCECT 221
>gi|126326410|ref|XP_001373038.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Monodelphis domestica]
Length = 556
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 205/447 (45%), Positives = 279/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L EI+LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEIILVDDASERDFLKMA 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
E R VVCP+ID+ISD F Y ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KESRKTVVCPIIDLISDDNFEYTAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IGAYD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGAYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKALREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKDNEKVGMFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ SGP +L C H+ L + KL I +++ C
Sbjct: 465 QVFSYTAEKEIRTDDFCLDVSRLSGPVIMLKCHHMRGNQLWEYDAEKLTLRHINSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P ++ + P ++ C+G Q+W
Sbjct: 523 LDEPSEDDKMVPTMK--DCNGSRSQQW 547
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L EI+LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEIILVDDASERDFLKMALENYVKNLEVPVKIIRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|403258987|ref|XP_003922020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Saimiri boliviensis boliviensis]
Length = 556
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS REFLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASEREFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LQCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P ++ + P ++ CSG Q+W
Sbjct: 523 LDEPSEDDKMVPTMQ--DCSGSRSQQW 547
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS REFLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASEREFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|115528959|gb|AAI01033.1| GALNT13 protein [Homo sapiens]
gi|355564904|gb|EHH21393.1| hypothetical protein EGK_04446 [Macaca mulatta]
Length = 561
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/418 (49%), Positives = 275/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +++P +P G IR+V+T++C++ K N+ G + C
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 458
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 459 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 510
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|395749824|ref|XP_002828218.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1 isoform
1 [Pongo abelii]
Length = 612
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/417 (48%), Positives = 274/417 (65%), Gaps = 14/417 (3%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+L+C F WYL +++P +P G IR+V+T++C++ K N+ G + C
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARK--ENEKVG---IFNC 459
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
H Q+F + I TD+ +CLDV + ++ V +L C Q W YD
Sbjct: 460 -HGMGGNQVFSYTANKE-IRTDD-LCLDVSK----LNGPVTMLKCHHLKGNQLWEYD 509
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|116284114|gb|AAH38440.1| GALNT1 protein [Homo sapiens]
Length = 499
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/448 (44%), Positives = 279/448 (62%), Gaps = 40/448 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 46 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 105
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 106 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 165
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 166 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 225
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+ID YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 226 VRTPTMAGGLFSIDIDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 285
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 286 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 344
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 345 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGG 404
Query: 535 --------------------MNQASGPASLLPCTHLP--VLTQMFVMKLPTDLIATDESV 572
+++ +GP ++L C HL L + +KL + +++
Sbjct: 405 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQ-- 462
Query: 573 CLDVPEYENDISPRVRILACSGFNRQRW 600
CLD E+ P +R C+G Q+W
Sbjct: 463 CLDKATEEDSQVPSIR--DCNGSRSQQW 488
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 16 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 74
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 75 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 134
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 135 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 165
>gi|112418488|gb|AAI21876.1| galnt13 protein [Xenopus (Silurana) tropicalis]
Length = 483
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/418 (49%), Positives = 275/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +SIVIVFHNEAWS LLRTVHSVI+RSP ++ EI+LVDD+S R+FLKS
Sbjct: 34 TKVYPDE-LPNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDSSERDFLKSP 92
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP +++R R GLI+ARL GA A+G+I+ FLDAHCECT+GWLE L+AR+
Sbjct: 93 LENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQIITFLDAHCECTIGWLEPLLARI 152
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 153 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 212
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+ID+ YF +G YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 213 RTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 272
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMD++ +F++ +P K D V R LR+
Sbjct: 273 RKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPGVVKV-DYGDVSERKALREN 331
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC+ F WYL V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 332 LKCNPFSWYLETVYPDSQIPRRYFSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 385
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 386 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 437
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +SIVIVFHNEAWS LLR
Sbjct: 2 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSIVIVFHNEAWSTLLR 60
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSP ++ EI+LVDD+S R+FLKS L+ YV L VP +++R R GLI+ARL
Sbjct: 61 TVHSVINRSPHRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARL 120
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA A+G+I+ FLDAHCECT+ + E L +
Sbjct: 121 RGANVAKGQIITFLDAHCECTIGWLEPLLAR 151
>gi|296204781|ref|XP_002749478.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
1 [Callithrix jacchus]
Length = 556
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/447 (45%), Positives = 281/447 (62%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKILREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P ++ + P ++ CSG Q+W
Sbjct: 523 LDEPSEDDKMVPTMQ--DCSGSRSQQW 547
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|390464496|ref|XP_003733230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 isoform
2 [Callithrix jacchus]
Length = 561
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/418 (49%), Positives = 275/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRTYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKILREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +++P +P G IR+V+T++C++ K N+ G + C
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 458
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 459 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 510
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|402888363|ref|XP_003907534.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13,
partial [Papio anubis]
Length = 444
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/418 (49%), Positives = 275/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 41 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 99
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 100 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 159
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 160 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 219
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 220 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 279
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 280 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 338
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +++P +P G IR+V+T++C++ K N+ G + C
Sbjct: 339 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 392
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 393 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 444
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 72/122 (59%), Positives = 95/122 (77%), Gaps = 1/122 (0%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+CKTKV+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FL
Sbjct: 38 RCKTKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFL 96
Query: 138 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
K +L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTL + E L
Sbjct: 97 KLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 156
Query: 198 PK 199
+
Sbjct: 157 AR 158
>gi|291238116|ref|XP_002738977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 561
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 201/412 (48%), Positives = 270/412 (65%), Gaps = 15/412 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAWS L+R +HS+I+RSPR +L+EI+LVDDAS R+FL LD+YV L
Sbjct: 113 LPTTSIIIVFHNEAWSTLIRNIHSIINRSPREILEEIILVDDASERDFLGKQLDDYVRGL 172
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
SV RV+R R G++ ARL GA + GE+L FLDAHCECT GWLE L+AR+AEDRTRVV
Sbjct: 173 SVRVRVVRMAERSGIVGARLRGAAISTGEVLTFLDAHCECTKGWLEPLIARIAEDRTRVV 232
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVID ISD TF Y EL G FNW L+FRWY + ++ D T P TP MAGG
Sbjct: 233 SPVIDSISDETFEYNSVPELGCGGFNWRLNFRWYPMSKREKKRRKGDATIPINTPTMAGG 292
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+I + YF+ IG YDE M +WGGENLEMSFR+W CGG++EI PCSHV H+FR SPY+F
Sbjct: 293 LFSIHKEYFYRIGTYDEGMDIWGGENLEMSFRIWMCGGTLEIVPCSHVGHVFRGKSPYTF 352
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGGV+ V++ N R+A VWMDE+ FY+K P A + + + R +LR++L+C+SF+W
Sbjct: 353 PGGVATVVHNNNRRLAEVWMDEYKSFYYKTVPNA-RNAEYGDIEDRKQLREKLQCNSFRW 411
Query: 497 YLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
YL +++P +D+ F F +R+++T +C++ +M Q + L H Q
Sbjct: 412 YLENIFPDSQFLLDNYFRFCEVRNMETKQCLD------NMGQKEKSKAALSRCHGQGGHQ 465
Query: 556 MFVMKLPTDLIATDESVCLD--VPEYENDISPRVRILACSGFNRQRWTYDKE 605
++ +L D +CLD P D+ + R + G Q W Y++E
Sbjct: 466 IYAWSKLNELKHDD--LCLDASAPSGFKDVE-QSRCNSHGG--TQEWRYNEE 512
Score = 175 bits (443), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 83/140 (59%), Positives = 108/140 (77%), Gaps = 1/140 (0%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFN-EEFLPKSSIVIVFHNEAWSALLRT 109
+++IN+FNLL S++I VNRTLPDVR CK K++ + LP +SI+IVFHNEAWS L+R
Sbjct: 74 MFKINQFNLLASNKISVNRTLPDVRIDGCKKKIYPPSQKLPTTSIIIVFHNEAWSTLIRN 133
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
+HS+I+RSPR +L+EI+LVDDAS R+FL LD+YV LSV RV+R R G++ ARL
Sbjct: 134 IHSIINRSPREILEEIILVDDASERDFLGKQLDDYVRGLSVRVRVVRMAERSGIVGARLR 193
Query: 170 GARQAEGEILVFLDAHCECT 189
GA + GE+L FLDAHCECT
Sbjct: 194 GAAISTGEVLTFLDAHCECT 213
>gi|62859717|ref|NP_001017277.1| polypeptide N-acetylgalactosaminyltransferase 13 [Xenopus
(Silurana) tropicalis]
gi|89267464|emb|CAJ81616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus (Silurana) tropicalis]
Length = 498
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 207/418 (49%), Positives = 275/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +SIVIVFHNEAWS LLRTVHSVI+RSP ++ EI+LVDD+S R+FLKS
Sbjct: 34 TKVYPDE-LPNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDSSERDFLKSP 92
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP +++R R GLI+ARL GA A+G+I+ FLDAHCECT+GWLE L+AR+
Sbjct: 93 LENYVKHLEVPVKILRMEQRSGLIRARLRGANVAKGQIITFLDAHCECTIGWLEPLLARI 152
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 153 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 212
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+ID+ YF +G YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 213 RTPTMAGGLFSIDKTYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 272
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMD++ +F++ +P K D V R LR+
Sbjct: 273 RKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPGVVKV-DYGDVSERKALREN 331
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC+ F WYL V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 332 LKCNPFSWYLETVYPDSQIPRRYFSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 385
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 386 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 437
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +SIVIVFHNEAWS LLR
Sbjct: 2 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSIVIVFHNEAWSTLLR 60
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSP ++ EI+LVDD+S R+FLKS L+ YV L VP +++R R GLI+ARL
Sbjct: 61 TVHSVINRSPHRLISEIILVDDSSERDFLKSPLENYVKHLEVPVKILRMEQRSGLIRARL 120
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA A+G+I+ FLDAHCECT+ + E L +
Sbjct: 121 RGANVAKGQIITFLDAHCECTIGWLEPLLAR 151
>gi|1136285|gb|AAC50327.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 559
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 273/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR +T++C++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRKEETNQCLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|158259585|dbj|BAF85751.1| unnamed protein product [Homo sapiens]
Length = 559
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 199/446 (44%), Positives = 274/446 (61%), Gaps = 36/446 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR M++EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SR+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------------- 534
+L+C F WYL +++P +P G IR+V+T++ ++ K +
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQFLDNMARKENEKVGIFNCHGMGG 464
Query: 535 --------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCL 574
+++ +GP ++L C HL + L + + CL
Sbjct: 465 NQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHVNSNQCL 524
Query: 575 DVPEYENDISPRVRILACSGFNRQRW 600
D E+ P +R C+G Q+W
Sbjct: 525 DKATEEDSQVPSIR--DCNGSRSQQW 548
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 89/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR M++EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|268574330|ref|XP_002642142.1| C. briggsae CBR-GLY-6 protein [Caenorhabditis briggsae]
Length = 617
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 207/418 (49%), Positives = 273/418 (65%), Gaps = 8/418 (1%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK- 246
C + E LP +S++IV+HNEA+S LLRTV SVI RSP+ +LKEI+LVDD S REFL+
Sbjct: 147 CRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLKEIILVDDFSDREFLRY 206
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDE + + ++IRS RVGLI+AR++GA++A+G++L FLD+HCECT GWLE L+
Sbjct: 207 PKLDESIKPIPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+ +R V CPVIDII+D TF Y + E+ G FNW L FRWY SS A D T
Sbjct: 267 RIKLNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPSSMAKQHLLDPTG 326
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++P MAGGLF+IDR YF +G YD M +WGGENLEMSFR+WQCGG +EI PCSHV H
Sbjct: 327 PIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGH 386
Query: 427 LFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
+FRK+SP+ FPG S +VL NL RVA VWMDEW +++K P+A + R V R+EL
Sbjct: 387 VFRKSSPHDFPGKSSGKVLNANLLRVAEVWMDEWKYYFYKIAPQAYRMRPSIDVSERVEL 446
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
RK L C SFKWYL +V+ HFLP FGRI + + + +P G N A
Sbjct: 447 RKTLNCKSFKWYLQNVFKDHFLPTPLDKFGRISNSK-YCAAYRPGDTGPKNHRLLGA--- 502
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP-RVRILACSGFNRQRWTY 602
PCT L Q+++ L TDE +CL V + + S ++++ C+GF+ + W +
Sbjct: 503 PCTMGLDLWQLWLYTGDHRL-RTDEHLCLTVVQLLHTSSDWKIQLKECAGFDTEYWDF 559
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 95/211 (45%), Positives = 131/211 (62%), Gaps = 32/211 (15%)
Query: 20 IDRRGVHVVVGHYLG------------DSVDGGLHSNLSD------------------AQ 49
I R HVVVGHY G D ++ L+S D A
Sbjct: 58 IGRSNFHVVVGHYNGNLPEEKKRNLTSDELNANLYSPHDDWGEGGTGVSHLTPEQQKRAD 117
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+ +N+FNLLVSD I V R+LP++RKP C+ + E+ LP +S++IV+HNEA+S LLRT
Sbjct: 118 STFAVNQFNLLVSDGISVRRSLPEIRKPSCRNITYPED-LPTTSVIIVYHNEAYSTLLRT 176
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
V SVI RSP+ +LKEI+LVDD S REFL+ LDE + + ++IRS RVGLI+AR+
Sbjct: 177 VWSVIDRSPKHLLKEIILVDDFSDREFLRYPKLDESIKPIPTDIKIIRSKERVGLIRARM 236
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA++A+G++L FLD+HCECT + E L +
Sbjct: 237 MGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
>gi|341896063|gb|EGT51998.1| CBN-GLY-6 protein [Caenorhabditis brenneri]
Length = 617
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 207/419 (49%), Positives = 275/419 (65%), Gaps = 10/419 (2%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK- 246
C + + LP +S++IV+HNEA+S LLRTV SVI RSP+ +LKEI+LVDD S REFLK
Sbjct: 147 CRNITFPDNLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLKY 206
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDE + L +++RS RVGLI+AR++GA++A+G++L FLD+HCECT GWLE L+
Sbjct: 207 PKLDESLKPLPTDIKIVRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+ +R V CPVIDII+D TF Y + E+ G FNW L FRWY SS A D T
Sbjct: 267 RIKLNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPSSMAKEHLLDPTG 326
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++P MAGGLF+IDR YF +G YD M +WGGENLEMSFR+WQCGG +EI PCSHV H
Sbjct: 327 PIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGH 386
Query: 427 LFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
+FRK+SP+ FPG S ++L NL RVA VWMDEW +++K P A + R V R+EL
Sbjct: 387 VFRKSSPHDFPGKSSGKILNANLLRVAEVWMDEWKYYFYKLAPVAYRMRQSIDVSERVEL 446
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE-KPLAKGSMNQASGPASL 544
RK+L C SFKWYL +V+ HFLP FGRI + ++ C +P G N A
Sbjct: 447 RKKLNCKSFKWYLQNVFKDHFLPTPLDKFGRISN--SNYCAAFRPGDTGPKNHRLLGA-- 502
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP-RVRILACSGFNRQRWTY 602
PCT L Q+++ + I TDE +CL V + + S ++++ C+GF+ + W +
Sbjct: 503 -PCTMGFDLWQLWLYTGDSR-IRTDEHLCLSVVQLLHTSSDWKIQLKECAGFDTEYWDF 559
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/209 (46%), Positives = 131/209 (62%), Gaps = 32/209 (15%)
Query: 22 RRGVHVVVGHYLG------------------------DSVDGGLH-SNLSDAQQ-----L 51
R HVVVGHY G D +GG S+L+ QQ
Sbjct: 60 RSNYHVVVGHYNGNLPEDKKRNLTSEELNANLYSPHDDWGEGGAGVSHLTPEQQKLADST 119
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
+ +N+FNLLVSD I V R+LP++RKP C+ F + LP +S++IV+HNEA+S LLRTV
Sbjct: 120 FAVNQFNLLVSDGISVRRSLPEIRKPSCRNITFPDN-LPTTSVIIVYHNEAYSTLLRTVW 178
Query: 112 SVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLG 170
SVI RSP+ +LKEI+LVDD S REFLK LDE + L +++RS RVGLI+AR++G
Sbjct: 179 SVIDRSPKELLKEIILVDDFSDREFLKYPKLDESLKPLPTDIKIVRSKERVGLIRARMMG 238
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPK 199
A++A+G++L FLD+HCECT + E L +
Sbjct: 239 AQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
>gi|355689583|gb|AER98881.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1 [Mustela putorius
furo]
Length = 461
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 184/341 (53%), Positives = 243/341 (71%), Gaps = 1/341 (0%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKHDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + SRL LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISSRLGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEK 528
+L+C F WYL +++P +P G IR+V+T++C++
Sbjct: 405 KLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDN 445
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|26332527|dbj|BAC29981.1| unnamed protein product [Mus musculus]
Length = 592
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 203/427 (47%), Positives = 275/427 (64%), Gaps = 26/427 (6%)
Query: 105 ALLRTVHSVISRS---PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
+LL + +VISR+ P M K +L+ D + ++++ S + RS
Sbjct: 41 SLLPALRAVISRNQEGPGEMGKAVLIPKDDQEKMKELFKINQFNLMASDLIALNRS---- 96
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSV 221
L RL G + T V+ +E LP +S+VIVFHNEAWS LLRTV+SV
Sbjct: 97 -LPDVRLEGCK----------------TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSV 138
Query: 222 ISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQ 281
I+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL GA
Sbjct: 139 INRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAA 198
Query: 282 AEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAF 341
++G+++ FLDAHCECTLGWLE L+AR+ EDR VVCP+ID+ISD TF Y+ ++ +G F
Sbjct: 199 SKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGF 258
Query: 342 NWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGE 401
NW+L+FRWY + ++ D T P +TP MAGGLF+IDR YF IG YD M +WGGE
Sbjct: 259 NWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGE 318
Query: 402 NLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE 461
NLEMSFR+WQCGGS+EI CSHV H+FRKA+PY+FPGG V+ N R+A VWMDE+ +
Sbjct: 319 NLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKD 378
Query: 462 FYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQ 521
F++ +P K D V R LR+ LKC F WYL +++P +P G IR+V+
Sbjct: 379 FFYIISPGVVKV-DYGDVSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVE 437
Query: 522 THKCVEK 528
T++C++
Sbjct: 438 TNQCLDN 444
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|148694974|gb|EDL26921.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13, isoform CRA_b [Mus
musculus]
Length = 594
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 185/340 (54%), Positives = 244/340 (71%), Gaps = 2/340 (0%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 109 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 167
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 168 LENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 227
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 228 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 287
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 288 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 347
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 348 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLREN 406
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEK 528
LKC F WYL +++P +P G IR+V+T++C++
Sbjct: 407 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDN 446
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 77 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 135
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 136 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVKIIRMEERSGLIRARL 195
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 196 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 226
>gi|312068074|ref|XP_003137043.1| polypeptide N-acetylgalactosaminyltransferase [Loa loa]
Length = 547
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 208/427 (48%), Positives = 269/427 (62%), Gaps = 25/427 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
C LP +S++IV+HNEA+S L+RTV SVI RSP LKEI+LVDD STR F
Sbjct: 121 QCRAKTYLPSSELPTTSVIIVYHNEAFSTLMRTVMSVILRSPHENLKEIILVDDFSTRTF 180
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK+ LD +VA+L +VIR+ RVGLI+ARL+GA +A+G++L FLD+HCECT GW+E L
Sbjct: 181 LKAELDNFVAQLGTHIKVIRANERVGLIRARLIGATEAKGDVLTFLDSHCECTKGWMEPL 240
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ E+R VVCPVID+I++ TFAY + EL G FNW L FRWY + D
Sbjct: 241 LARIKENRKAVVCPVIDVINERTFAYQKGIELFRGGFNWNLQFRWYALPPEMIKSRSNDP 300
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+P +P MAGGLF+IDR YF IG YD EM +WGGEN+E+S RVWQCGG IEI PCSHV
Sbjct: 301 TKPIISPTMAGGLFSIDRKYFEEIGTYDHEMNIWGGENIEISLRVWQCGGRIEILPCSHV 360
Query: 425 AHLFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
H+FR+ASP+ FP S +L NL RVA VWMDEW +++ P+ K R+ V R+
Sbjct: 361 GHVFRRASPHDFPSHKSGTILNSNLLRVAEVWMDEWKFHFYRTAPQVYKMRETVDVSDRV 420
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT-----HKCVE--KPLAK---- 532
ELRK+L C SFKW+L +VW HFLP FGR+ H ++ + C+ PL +
Sbjct: 421 ELRKRLHCKSFKWFLDNVWKDHFLPQPGSAFGRVIHSRSQIGGLNGCLHWTVPLGESMRV 480
Query: 533 GSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR---VRI 589
+M+ S P S T L + T+ MK TDE +CL Y+ PR V++
Sbjct: 481 ATMHNCSAPES-FDRTELWLFTKEGQMK-------TDEHLCLSA--YQPVQGPRNWKVQL 530
Query: 590 LACSGFN 596
C N
Sbjct: 531 KECGQGN 537
Score = 175 bits (443), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 96/214 (44%), Positives = 133/214 (62%), Gaps = 37/214 (17%)
Query: 23 RGVHVVVGHYLGD-------------------------------SVDGGLHSNLSDAQQ- 50
RG+ +VVGHY G+ + + G LS+A +
Sbjct: 30 RGIQIVVGHYNGNLARDKLSNLTAGKFLLNFEQLNANNYAPITGAGEDGRPVKLSEADER 89
Query: 51 ----LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVF-NEEFLPKSSIVIVFHNEAWSA 105
+ IN+FNL+VSDRI +NR+LPD+RK +C+ K + LP +S++IV+HNEA+S
Sbjct: 90 LSDDTFAINQFNLVVSDRIALNRSLPDIRKHQCRAKTYLPSSELPTTSVIIVYHNEAFST 149
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
L+RTV SVI RSP LKEI+LVDD STR FLK+ LD +VA+L +VIR+ RVGLI+
Sbjct: 150 LMRTVMSVILRSPHENLKEIILVDDFSTRTFLKAELDNFVAQLGTHIKVIRANERVGLIR 209
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
ARL+GA +A+G++L FLD+HCECT + E L +
Sbjct: 210 ARLIGATEAKGDVLTFLDSHCECTKGWMEPLLAR 243
>gi|327281385|ref|XP_003225429.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 2 [Anolis carolinensis]
Length = 557
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 206/418 (49%), Positives = 273/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRT++SVI+R+P +L EI+LVDDAS R+FLK
Sbjct: 108 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFLKVP 166
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP +++R R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 167 LENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 226
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 227 KEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 286
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 287 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 346
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR
Sbjct: 347 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVTVRKALRDN 405
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 406 LKCKPFSWYLENVYPDSQIPRRYFSLGEIRNVETNQCLDNMGRKE--NEKVG---IFNC- 459
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 460 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 511
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 76 KELFKINQFNLMASDMIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T++SVI+R+P +L EI+LVDDAS R+FLK L+ YV L VP +++R R GLI+ARL
Sbjct: 135 TIYSVINRAPHYLLAEIILVDDASERDFLKVPLENYVKTLQVPVKIMRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 195 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 225
>gi|148223895|ref|NP_001086128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13)
[Xenopus laevis]
gi|49258003|gb|AAH74234.1| MGC83963 protein [Xenopus laevis]
Length = 556
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 206/418 (49%), Positives = 273/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +SIVIVFHNEAWS LLRTVHSVI+RSP ++ EI+LVDDAS R+FLK+
Sbjct: 107 TKVYPDE-LPNTSIVIVFHNEAWSTLLRTVHSVINRSPHRLISEIILVDDASERDFLKTP 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L V +++R R GLI+ARL GA A+G+I+ FLDAHCECT GWLE L+AR+
Sbjct: 166 LENYVKHLEVAVKILRMEQRSGLIRARLSGANVAKGKIITFLDAHCECTFGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+ID+ YF +G YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDKKYFEELGTYDSGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMD++ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPGVVKV-DYGDVSERKALREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC+ F WYL V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 405 LKCNPFSWYLETVYPDSQIPRRYFSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 458
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 459 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 510
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 116/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPD+R CKTKV+ +E LP +SIVIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDIRLEGCKTKVYPDE-LPNTSIVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSP ++ EI+LVDDAS R+FLK+ L+ YV L V +++R R GLI+ARL
Sbjct: 134 TVHSVINRSPHRLISEIILVDDASERDFLKTPLENYVKHLEVAVKILRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA A+G+I+ FLDAHCECT + E L +
Sbjct: 194 SGANVAKGKIITFLDAHCECTFGWLEPLLAR 224
>gi|327281383|ref|XP_003225428.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 1 [Anolis carolinensis]
Length = 556
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 206/418 (49%), Positives = 273/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRT++SVI+R+P +L EI+LVDDAS R+FLK
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFLKVP 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP +++R R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVTVRKALRDN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 405 LKCKPFSWYLENVYPDSQIPRRYFSLGEIRNVETNQCLDNMGRKE--NEKVG---IFNC- 458
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 459 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 510
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDMIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T++SVI+R+P +L EI+LVDDAS R+FLK L+ YV L VP +++R R GLI+ARL
Sbjct: 134 TIYSVINRAPHYLLAEIILVDDASERDFLKVPLENYVKTLQVPVKIMRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|387017208|gb|AFJ50722.1| Polypeptide N-acetylgalactosaminyltransferase 13-like [Crotalus
adamanteus]
Length = 556
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 205/418 (49%), Positives = 273/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRT++SV++RSP +L EI+LVDDAS R+FLK
Sbjct: 107 TKVYPDE-LPTTSVVIVFHNEAWSTLLRTIYSVMNRSPHYLLSEIILVDDASERDFLKLP 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECT GWLE L+AR+
Sbjct: 166 LENYVRNLQVPVKIIRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTTGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKALREN 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 405 LKCKPFSWYLEYVYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKE--NEKVG---IFNC- 458
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +C+DV ++ V +L C Q W YD E
Sbjct: 459 HGMGGNQVFSYTADKE-IRTDD-LCMDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 510
Score = 178 bits (452), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDMIAFNRSLPDVRLEGCKTKVYPDE-LPTTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T++SV++RSP +L EI+LVDDAS R+FLK L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TIYSVMNRSPHYLLSEIILVDDASERDFLKLPLENYVRNLQVPVKIIRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTTGWLEPLLAR 224
>gi|326923136|ref|XP_003207797.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Meleagris gallopavo]
Length = 556
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/447 (46%), Positives = 284/447 (63%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ EE LP +S+VIVFHNEAWS LLRTVHSV++RSPR +L EI+LVDDAS REFLK+S
Sbjct: 107 TKVYPEE-LPNTSVVIVFHNEAWSTLLRTVHSVLARSPRRLLAEIILVDDASEREFLKAS 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV KL VP +++R R GLI+ARL GA A G+++ FLDAHCECT GWLE L+AR+
Sbjct: 166 LENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVVTFLDAHCECTRGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 REDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG+YD M +WGGENLEMSFRVWQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V +R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSARKALREA 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVDTNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP ++L C H+ L + KL + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVTMLKCHHMRGNQLWEYDAEKLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ C+G Q+W
Sbjct: 523 LDGPSEEDKLVPSMK--ECAGGRSQQW 547
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/151 (61%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ EE LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLDGCKTKVYPEE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSV++RSPR +L EI+LVDDAS REFLK+SL+ YV KL VP +++R R GLI+ARL
Sbjct: 134 TVHSVLARSPRRLLAEIILVDDASEREFLKASLENYVKKLEVPVKILRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA A G+++ FLDAHCECT + E L +
Sbjct: 194 RGAAAARGQVVTFLDAHCECTRGWLEPLLAR 224
>gi|327281387|ref|XP_003225430.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
isoform 3 [Anolis carolinensis]
Length = 498
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 206/418 (49%), Positives = 273/418 (65%), Gaps = 15/418 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRT++SVI+R+P +L EI+LVDDAS R+FLK
Sbjct: 34 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTIYSVINRAPHYLLAEIILVDDASERDFLKVP 92
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP +++R R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 93 LENYVKTLQVPVKIMRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 152
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 153 KEDRKIVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 212
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 213 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 272
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR
Sbjct: 273 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVTVRKALRDN 331
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
LKC F WYL +V+P +P G IR+V+T++C++ K N+ G + C
Sbjct: 332 LKCKPFSWYLENVYPDSQIPRRYFSLGEIRNVETNQCLDNMGRK--ENEKVG---IFNC- 385
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q+F + I TD+ +CLDV ++ V +L C Q W YD E
Sbjct: 386 HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVIMLKCHHMRGNQLWEYDAE 437
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 2 KELFKINQFNLMASDMIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 60
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T++SVI+R+P +L EI+LVDDAS R+FLK L+ YV L VP +++R R GLI+ARL
Sbjct: 61 TIYSVINRAPHYLLAEIILVDDASERDFLKVPLENYVKTLQVPVKIMRMEQRSGLIRARL 120
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 121 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 151
>gi|118093951|ref|XP_422165.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Gallus
gallus]
Length = 556
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 209/447 (46%), Positives = 284/447 (63%), Gaps = 41/447 (9%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTVHSV++RSPR +L EI+LVDDAS REFLK+S
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVHSVVARSPRRLLAEIILVDDASEREFLKAS 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV KL VP +++R R GLI+ARL GA A G+++ FLDAHCECT GWLE L+AR+
Sbjct: 166 LENYVKKLEVPVKILRMEQRSGLIRARLRGAAAARGQVITFLDAHCECTRGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 WEDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG+YD M +WGGENLEMSFRVWQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGSYDAGMDIWGGENLEMSFRVWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V +R LR+
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSARKALREA 404
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V T++C++ K +
Sbjct: 405 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVDTNQCLDNMGRKENEKVGIFNCHGMGGN 464
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP ++L C H+ L + KL + +++ C
Sbjct: 465 QVFSYTADKEIRTDDLCLDVSRLNGPVTMLKCHHMRGNQLWEYDAEKLTLRHVNSNQ--C 522
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P E+ + P ++ C+G Q+W
Sbjct: 523 LDGPSEEDKLVPSMK--ECAGGRSQQW 547
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 92/151 (60%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLDGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSV++RSPR +L EI+LVDDAS REFLK+SL+ YV KL VP +++R R GLI+ARL
Sbjct: 134 TVHSVVARSPRRLLAEIILVDDASEREFLKASLENYVKKLEVPVKILRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA A G+++ FLDAHCECT + E L +
Sbjct: 194 RGAAAARGQVITFLDAHCECTRGWLEPLLAR 224
>gi|308481980|ref|XP_003103194.1| CRE-GLY-3 protein [Caenorhabditis remanei]
gi|308260299|gb|EFP04252.1| CRE-GLY-3 protein [Caenorhabditis remanei]
Length = 615
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 185/333 (55%), Positives = 239/333 (71%), Gaps = 1/333 (0%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +SI+IVFHNEAW+ LLRT+HSVI+RSPR +L+EI+LVDD S R++L LD Y+
Sbjct: 169 EGLPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIILVDDKSDRDYLVKPLDAYIK 228
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K VP ++ R GLI+ARL G+ A+G+IL+FLDAH E T GWLE LV RVAEDR R
Sbjct: 229 KFPVPVHLVHLEDRSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPLVTRVAEDRKR 288
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VV P+ID+ISD TF YV + E WG FNW L+FRWY + + D + P +TP +A
Sbjct: 289 VVAPIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRGADRSMPIQTPTIA 348
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAID+ +F+ IG+YDE MQVWGGENLE+SFRVW CGGS+EI PCS V H+FRK +PY
Sbjct: 349 GGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTPY 408
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG ++V++ N AR A VWMDE+ F++K P A + + V R +LR+ L+C SF
Sbjct: 409 TFPGGTAKVIHHNAARTAEVWMDEYKAFFYKMVP-AARNVEAGDVTERKKLRETLQCKSF 467
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
KWYL +++P LP D K G I + T KC++
Sbjct: 468 KWYLENIYPEAPLPADFKSLGAIVNRFTEKCID 500
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/137 (54%), Positives = 98/137 (71%), Gaps = 2/137 (1%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKT--KVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
N+FN++ S+ I +NRTLPD R C+T E LP +SI+IVFHNEAW+ LLRT+HS
Sbjct: 135 NQFNVVASEMISINRTLPDYRSEACRTTGNSLKTEGLPTTSIIIVFHNEAWTTLLRTLHS 194
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
VI+RSPR +L+EI+LVDD S R++L LD Y+ K VP ++ R GLI+ARL G+
Sbjct: 195 VINRSPRHLLEEIILVDDKSDRDYLVKPLDAYIKKFPVPVHLVHLEDRSGLIRARLTGSG 254
Query: 173 QAEGEILVFLDAHCECT 189
A+G+IL+FLDAH E T
Sbjct: 255 MAKGKILLFLDAHVEVT 271
>gi|17553814|ref|NP_498722.1| Protein GLY-3 [Caenorhabditis elegans]
gi|21264486|sp|P34678.2|GALT3_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=GalNAc-T1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 3; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3; Short=pp-GaNTase 3
gi|3047187|gb|AAC13669.1| GLY3 [Caenorhabditis elegans]
gi|351020565|emb|CCD62541.1| Protein GLY-3 [Caenorhabditis elegans]
Length = 612
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 182/331 (54%), Positives = 239/331 (72%), Gaps = 1/331 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
+PK+SI+IVFHNEAW+ LLRT+HSVI+RSPR +L+EI+LVDD S R++L LD Y+
Sbjct: 169 MPKTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIILVDDKSDRDYLVKPLDSYIKMF 228
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+P ++ R GLI+ARL G+ A+G+IL+FLDAH E T GWLE LV+RVAEDR RVV
Sbjct: 229 PIPIHLVHLENRSGLIRARLTGSEMAKGKILLFLDAHVEVTDGWLEPLVSRVAEDRKRVV 288
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+ISD TF YV + E WG FNW L+FRWY + + D + P +TP +AGG
Sbjct: 289 APIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRGSDRSMPIQTPTIAGG 348
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAID+ +F+ IG+YDE MQVWGGENLE+SFRVW CGGS+EI PCS V H+FRK +PY+F
Sbjct: 349 LFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTPYTF 408
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG ++V++ N AR A VWMDE+ F++K P A + + V R +LR+ L+C SFKW
Sbjct: 409 PGGTAKVIHHNAARTAEVWMDEYKAFFYKMVP-AARNVEAGDVSERKKLRETLQCKSFKW 467
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
YL +++P LP D + G I + T KCV+
Sbjct: 468 YLENIYPEAPLPADFRSLGAIVNRFTEKCVD 498
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 73/137 (53%), Positives = 98/137 (71%), Gaps = 2/137 (1%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFN--EEFLPKSSIVIVFHNEAWSALLRTVHS 112
N+FN++ S+ I VNRTLPD R C+T N +PK+SI+IVFHNEAW+ LLRT+HS
Sbjct: 133 NQFNVVASEMISVNRTLPDYRSDACRTSGNNLKTAGMPKTSIIIVFHNEAWTTLLRTLHS 192
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
VI+RSPR +L+EI+LVDD S R++L LD Y+ +P ++ R GLI+ARL G+
Sbjct: 193 VINRSPRHLLEEIILVDDKSDRDYLVKPLDSYIKMFPIPIHLVHLENRSGLIRARLTGSE 252
Query: 173 QAEGEILVFLDAHCECT 189
A+G+IL+FLDAH E T
Sbjct: 253 MAKGKILLFLDAHVEVT 269
>gi|71987788|ref|NP_001022645.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
gi|3047199|gb|AAC13675.1| GLY6b [Caenorhabditis elegans]
gi|14530524|emb|CAC42317.1| Protein GLY-6, isoform b [Caenorhabditis elegans]
Length = 617
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 202/412 (49%), Positives = 273/412 (66%), Gaps = 14/412 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAK 255
LP +S++IV+HNEA+S LLRTV SVI RSP+ +LKEI+LVDD S REFL+ +LD +
Sbjct: 156 LPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLRYPTLDTTLKP 215
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L ++IRS RVGLI+AR++GA++A+G++L FLD+HCECT GWLE L+ R+ +R V
Sbjct: 216 LPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTRIKLNRKAV 275
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVIDII+D TF Y + E+ G FNW L FRWY ++ A D T P ++P MAG
Sbjct: 276 PCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQHLLDPTGPIESPTMAG 335
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+I+R YF +G YD M +WGGENLEMSFR+WQCGG +EI PCSHV H+FRK+SP+
Sbjct: 336 GLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGHVFRKSSPHD 395
Query: 436 FPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
FPG S +VL NL RVA VWMD+W +++K P+A + R V R+ELRK+L C SF
Sbjct: 396 FPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAPQAHRMRSSIDVSERVELRKKLNCKSF 455
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE-KPLAKGSMNQASGPASLL--PCTHLP 551
KWYL +V+ HFLP FGR+ + ++ C +P G N LL PCT
Sbjct: 456 KWYLQNVFQDHFLPTPLDRFGRMSN--SNYCTAFRPGDTGPKNH-----RLLGSPCTMGF 508
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISP-RVRILACSGFNRQRWTY 602
L Q+++ I TDE +CL V + + S ++++ C+GF+ + W +
Sbjct: 509 DLWQLWLYT-GDRRIRTDEHLCLSVVQLLHTTSDWKIQLKECAGFDTEYWDF 559
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 132/211 (62%), Gaps = 32/211 (15%)
Query: 20 IDRRGVHVVVGHYLG------------------------DSVDGGLH-SNLSDAQQ---- 50
I R HVVVGHY G D +GG S+L+ QQ
Sbjct: 58 IARSNYHVVVGHYNGNLPEDKKRNLTSEELNANLYAPHDDWGEGGAGVSHLTPEQQKLAD 117
Query: 51 -LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+ +N+FNLLVSD I V R+LP++RKP C+ + + LP +S++IV+HNEA+S LLRT
Sbjct: 118 STFAVNQFNLLVSDGISVRRSLPEIRKPSCRNMTYPDN-LPTTSVIIVYHNEAYSTLLRT 176
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
V SVI RSP+ +LKEI+LVDD S REFL+ +LD + L ++IRS RVGLI+AR+
Sbjct: 177 VWSVIDRSPKELLKEIILVDDFSDREFLRYPTLDTTLKPLPTDIKIIRSKERVGLIRARM 236
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA++A+G++L FLD+HCECT + E L +
Sbjct: 237 MGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
>gi|71987784|ref|NP_001022644.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
gi|51315809|sp|O61394.1|GALT6_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 6;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 6; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6; Short=pp-GaNTase 6
gi|3047197|gb|AAC13674.1| GLY6a [Caenorhabditis elegans]
gi|3878104|emb|CAA19707.1| Protein GLY-6, isoform a [Caenorhabditis elegans]
Length = 618
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 198/410 (48%), Positives = 268/410 (65%), Gaps = 9/410 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAK 255
LP +S++IV+HNEA+S LLRTV SVI RSP+ +LKEI+LVDD S REFL+ +LD +
Sbjct: 156 LPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLRYPTLDTTLKP 215
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L ++IRS RVGLI+AR++GA++A+G++L FLD+HCECT GWLE L+ R+ +R V
Sbjct: 216 LPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTRIKLNRKAV 275
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVIDII+D TF Y + E+ G FNW L FRWY ++ A D T P ++P MAG
Sbjct: 276 PCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQHLLDPTGPIESPTMAG 335
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+I+R YF +G YD M +WGGENLEMSFR+WQCGG +EI PCSHV H+FRK+SP+
Sbjct: 336 GLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGHVFRKSSPHD 395
Query: 436 FPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
FPG S +VL NL RVA VWMD+W +++K P+A + R V R+ELRK+L C SF
Sbjct: 396 FPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAPQAHRMRSSIDVSERVELRKKLNCKSF 455
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCTHLPVL 553
KWYL +V+ HFLP FGR+ C+ L + AS C +
Sbjct: 456 KWYLQNVFQDHFLPTPLDRFGRMTSSSNSSVCLAWTLRSSGIKTASTAD----CLKIFHK 511
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISP-RVRILACSGFNRQRWTY 602
TQ+++ I TDE +CL V + + S ++++ C+GF+ + W +
Sbjct: 512 TQLWLYT-GDRRIRTDEHLCLSVVQLLHTTSDWKIQLKECAGFDTEYWDF 560
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 132/211 (62%), Gaps = 32/211 (15%)
Query: 20 IDRRGVHVVVGHYLG------------------------DSVDGGLH-SNLSDAQQ---- 50
I R HVVVGHY G D +GG S+L+ QQ
Sbjct: 58 IARSNYHVVVGHYNGNLPEDKKRNLTSEELNANLYAPHDDWGEGGAGVSHLTPEQQKLAD 117
Query: 51 -LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+ +N+FNLLVSD I V R+LP++RKP C+ + + LP +S++IV+HNEA+S LLRT
Sbjct: 118 STFAVNQFNLLVSDGISVRRSLPEIRKPSCRNMTYPDN-LPTTSVIIVYHNEAYSTLLRT 176
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
V SVI RSP+ +LKEI+LVDD S REFL+ +LD + L ++IRS RVGLI+AR+
Sbjct: 177 VWSVIDRSPKELLKEIILVDDFSDREFLRYPTLDTTLKPLPTDIKIIRSKERVGLIRARM 236
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA++A+G++L FLD+HCECT + E L +
Sbjct: 237 MGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
>gi|268575444|ref|XP_002642701.1| C. briggsae CBR-GLY-3 protein [Caenorhabditis briggsae]
Length = 611
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 181/331 (54%), Positives = 238/331 (71%), Gaps = 1/331 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAW+ LLRT+HSVI+RSPR +L+EI+++DD S R++L LD Y+ K
Sbjct: 170 LPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKKF 229
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+P ++ R GLI+ARL G+ A+G+IL+FLDAH E T GWLE LV RVAEDR RVV
Sbjct: 230 PIPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTDGWLEPLVHRVAEDRKRVV 289
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+ISD TF YV + E WG FNW L+FRWY + + D + P +TP +AGG
Sbjct: 290 APIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRGSDRSMPIQTPTIAGG 349
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAID+ +F+ IG+YDE MQVWGGENLE+SFRVW CGGS+EI PCS V H+FRK +PY+F
Sbjct: 350 LFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTPYTF 409
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG ++V++ N AR A VWMDE+ F++K P A K + V R +LR+ L+C SFKW
Sbjct: 410 PGGTAKVIHHNAARTAEVWMDEYKAFFYKMVP-AAKNVEAGDVTDRKKLRETLQCKSFKW 468
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
YL +++P LP D + G I + T KC++
Sbjct: 469 YLENIYPEAPLPADFRSLGSIVNRFTEKCID 499
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 72/138 (52%), Positives = 99/138 (71%), Gaps = 3/138 (2%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEF---LPKSSIVIVFHNEAWSALLRTVH 111
N+FN++ S+ I VNRTLPD R C+ NE+ LP +SI+IVFHNEAW+ LLRT+H
Sbjct: 133 NQFNVVASEMISVNRTLPDYRSEACRNAAGNEKTTVGLPTTSIIIVFHNEAWTTLLRTLH 192
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SVI+RSPR +L+EI+++DD S R++L LD Y+ K +P ++ R GLI+ARL G+
Sbjct: 193 SVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKKFPIPVHLVHLEERSGLIRARLTGS 252
Query: 172 RQAEGEILVFLDAHCECT 189
A+G+IL+FLDAH E T
Sbjct: 253 GMAKGKILLFLDAHVEVT 270
>gi|341900678|gb|EGT56613.1| CBN-GLY-3 protein [Caenorhabditis brenneri]
Length = 613
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 180/331 (54%), Positives = 240/331 (72%), Gaps = 1/331 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
+P +SI+IVFHNEAW+ LLRT+HSVI+RSPR +L+EI+++DD S R++L LD Y+ L
Sbjct: 170 MPTTSIIIVFHNEAWTTLLRTLHSVINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKAL 229
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP ++ R GLI+ARL G+ A+G+IL+FLDAH E T GWLE L++RVAEDR RVV
Sbjct: 230 PVPVHLVHLEERSGLIRARLTGSGMAKGKILLFLDAHVEVTEGWLEPLISRVAEDRKRVV 289
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+ISD TF YV + E WG FNW L+FRWY+ + + D + P +TP +AGG
Sbjct: 290 APIIDVISDDTFEYVTASETTWGGFNWHLNFRWYSVPKRELNRRGSDRSMPIQTPTIAGG 349
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAID+ +F+ IG+YDE MQVWGGENLE+SFRVW CGGS+EI PCS V H+FRK +PY+F
Sbjct: 350 LFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKQTPYTF 409
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG ++V++ N AR A VWMDE+ F++K P A + + V R +LR+ L+C SFKW
Sbjct: 410 PGGTAKVIHHNAARTAEVWMDEYKAFFYKMVP-AARNVEAGDVTERKKLRETLQCKSFKW 468
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
YL +++P LP D + G I + T KCV+
Sbjct: 469 YLENIYPEAPLPADFRSLGAIVNRFTEKCVD 499
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 71/137 (51%), Positives = 97/137 (70%), Gaps = 2/137 (1%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKT--KVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
N+FN++ S+ I VNRTLPD R C+T +P +SI+IVFHNEAW+ LLRT+HS
Sbjct: 134 NQFNVVASEMISVNRTLPDYRSEACRTAGNSIKTTGMPTTSIIIVFHNEAWTTLLRTLHS 193
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
VI+RSPR +L+EI+++DD S R++L LD Y+ L VP ++ R GLI+ARL G+
Sbjct: 194 VINRSPRHLLEEIIMIDDKSDRDYLVKPLDAYIKALPVPVHLVHLEERSGLIRARLTGSG 253
Query: 173 QAEGEILVFLDAHCECT 189
A+G+IL+FLDAH E T
Sbjct: 254 MAKGKILLFLDAHVEVT 270
>gi|402594510|gb|EJW88436.1| hypothetical protein WUBG_00649 [Wuchereria bancrofti]
Length = 612
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/459 (45%), Positives = 276/459 (60%), Gaps = 52/459 (11%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IV+HNEA+S L+RTV SVI RSPR LKEI+LVDD STR FLK L++ VA+L
Sbjct: 104 LPTTSVIIVYHNEAFSTLMRTVMSVILRSPRENLKEIILVDDFSTRTFLKVELEKLVAQL 163
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR+ RVGLI+ARL+GA +AEG++L FLD+HCECT GW+E L+AR+ E+R VV
Sbjct: 164 GTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPLLARIKENRKAVV 223
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVIDII++ TFAY + EL G FNW L FRWY + D T+P +P MAGG
Sbjct: 224 CPVIDIINERTFAYQKGIELFRGGFNWNLQFRWYALPPEMIKSRSDDPTKPIISPTMAGG 283
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFR----------VWQCGGSIEIAPCSHVAH 426
LF+IDR YF IG YD EM +WGGEN+E+S R VWQCGG +EI PCSHV H
Sbjct: 284 LFSIDRKYFEEIGTYDHEMDIWGGENIEISLRLKLLKKNCFLVWQCGGRVEILPCSHVGH 343
Query: 427 LFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
+FR+ SP+ FPG S +L NL RVA VWMDEW +++ P+A K R+ V R+EL
Sbjct: 344 VFRRTSPHDFPGRKSGTILNSNLLRVAEVWMDEWKFHFYRTAPQAYKMRETVDVSDRVEL 403
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT-----HKCVE--KPLAK----GS 534
RK+L C SFKW+L +VW HFLP FGR+ H ++ + C+ PL + +
Sbjct: 404 RKRLHCKSFKWFLDNVWKDHFLPQPGSAFGRVIHSRSQIGGMNGCLHWTVPLGESVRVAT 463
Query: 535 MNQASGPAS--------------LLPCTHLPVLTQMFVM----KLPTDL----------- 565
M+ S P S L+ T P Q +M L DL
Sbjct: 464 MHNCSAPDSFDRTEISHSTSRNCLVARTGEPGSKQTSIMMARCTLGFDLWQLWLFTEEGQ 523
Query: 566 IATDESVCLDVPEYENDISP-RVRILACSGFNRQRWTYD 603
+ TDE +CL + D ++++ C + + W Y+
Sbjct: 524 MKTDEHLCLSAYQPVQDPQNWKIQLKECGQYESEYWDYN 562
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 85/153 (55%), Positives = 117/153 (76%), Gaps = 1/153 (0%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVF-NEEFLPKSSIVIVFHNEAWSAL 106
++ + IN+FNL+VSDRI +NR+LPD+RK +C+TK + LP +S++IV+HNEA+S L
Sbjct: 62 SEDTFVINQFNLVVSDRIALNRSLPDIRKHQCRTKTYLPSSELPTTSVIIVYHNEAFSTL 121
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
+RTV SVI RSPR LKEI+LVDD STR FLK L++ VA+L ++IR+ RVGLI+A
Sbjct: 122 MRTVMSVILRSPRENLKEIILVDDFSTRTFLKVELEKLVAQLGTRIKIIRANERVGLIRA 181
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RL+GA +AEG++L FLD+HCECT + E L +
Sbjct: 182 RLMGANEAEGDVLTFLDSHCECTKGWMEPLLAR 214
>gi|71896287|ref|NP_001025547.1| polypeptide N-acetylgalactosaminyltransferase 1 [Xenopus (Silurana)
tropicalis]
gi|60649677|gb|AAH90583.1| galnt1 protein [Xenopus (Silurana) tropicalis]
Length = 452
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 178/338 (52%), Positives = 237/338 (70%), Gaps = 1/338 (0%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAW+ LLRTVHSVI+RSPR +L+EI+LVDDAS REFLK
Sbjct: 106 CKTKVYPDSLPTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLQEIILVDDASEREFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL+VP V+R R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLETYVKKLTVPVHVLRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + +R D T P
Sbjct: 226 IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRRGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + +R+ LR
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISTRVGLRH 404
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKC 525
+L+C F WYL +V+P +P G + ++ C
Sbjct: 405 KLQCKPFSWYLENVYPDSQIPRHYYSLGEVSNIIIKLC 442
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 119/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAW+ LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDS-LPTTSVVIVFHNEAWTTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR +L+EI+LVDDAS REFLK L+ YV KL+VP V+R R GLI+ARL
Sbjct: 135 TVHSVINRSPRHLLQEIILVDDASEREFLKRPLETYVKKLTVPVHVLRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 RGAAASKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|313227425|emb|CBY22572.1| unnamed protein product [Oikopleura dioica]
Length = 588
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/407 (45%), Positives = 260/407 (63%), Gaps = 12/407 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+SI+ VFHNEAWS LLR++HSVI+RSPR ML+EI+LVDD S ++FL LD+YV L
Sbjct: 143 LPKTSIIFVFHNEAWSTLLRSIHSVINRSPREMLEEIILVDDKSEKDFLGKQLDDYVKNL 202
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP +IR R GLI+ARL GA+ A+GE+L FLDAH E + GWLE L+ + +DRT V+
Sbjct: 203 PVPVHIIRQQHREGLIRARLEGAKIAKGEVLTFLDAHIEASPGWLEPLLYEIKKDRTNVI 262
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF ++ +L +G FNW+L+FRWY + + D + P +TP MAGG
Sbjct: 263 CPIIDVISDDTFEFLTGSDLTYGGFNWKLNFRWYPVPQREVDRRGGDRSLPMQTPTMAGG 322
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID++YF+ IG+YD M +WGGENLEMSFR+W CGG++ IA CSHV H+FRKA+PY+F
Sbjct: 323 LFSIDKSYFYEIGSYDSGMDIWGGENLEMSFRIWMCGGTVLIATCSHVGHVFRKATPYTF 382
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG S+++ N R+A VWMD++ +F++ NP K + V R LR L+C SF+W
Sbjct: 383 PGGTSQIINKNNRRLAEVWMDDYKKFFYIVNPTVMKHK-YGDVSDRKTLRNDLQCKSFQW 441
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P +P K G I++ + C++ +M + H Q+
Sbjct: 442 YLDNVYPDAQIPRRYKVLGEIKNTGANICLD------TMGRKENKKVGCYSCHGQGGNQV 495
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYD 603
F + ++ D +CLDV N P + + Q W Y+
Sbjct: 496 FSFTMDNEIRIDD--LCLDVA---NSKGPVMMVKCHHQKGNQYWEYN 537
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 82/152 (53%), Positives = 110/152 (72%), Gaps = 1/152 (0%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVF-NEEFLPKSSIVIVFHNEAWSA 105
++++++Q N+FNL+ S+ I +NRTL DVR CK + N LPK+SI+ VFHNEAWS
Sbjct: 100 ESKKMFQENQFNLMASNMISLNRTLKDVRMSGCKKHDYANLGALPKTSIIFVFHNEAWST 159
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLR++HSVI+RSPR ML+EI+LVDD S ++FL LD+YV L VP +IR R GLI+
Sbjct: 160 LLRSIHSVINRSPREMLEEIILVDDKSEKDFLGKQLDDYVKNLPVPVHIIRQQHREGLIR 219
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
ARL GA+ A+GE+L FLDAH E + + E L
Sbjct: 220 ARLEGAKIAKGEVLTFLDAHIEASPGWLEPLL 251
>gi|339244173|ref|XP_003378012.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
gi|316973116|gb|EFV56743.1| polypeptide N-acetylgalactosaminyltransferase 3 [Trichinella
spiralis]
Length = 670
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 200/420 (47%), Positives = 268/420 (63%), Gaps = 15/420 (3%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + E+ K+S+VIVFHNEAWS L+RTV SVI+RS L+EI+LVDDAS ++ L +
Sbjct: 121 CRSIKYEKISLKTSVVIVFHNEAWSTLMRTVQSVINRSSVDYLEEIILVDDASEKDELIA 180
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
++ ++ + V +IR P R GLI R+ GA A+G++L FLDAH E T GWLE L++R
Sbjct: 181 LVESFLKTIPVAHTLIRLPQRSGLIVGRVRGAEIAKGDVLTFLDAHVEVTDGWLEPLLSR 240
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
++EDRTRVV PVID+ISD TF YV + E WG F+W ++FRWY + + + K+ T P
Sbjct: 241 ISEDRTRVVAPVIDVISDDTFQYVTAAESTWGGFSWTMNFRWYQASAREQKRRGKNKTTP 300
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP +AGGLF+IDR YFF IGAYDE M++WGGENLE+SFRVW CGG++EI PCSHV H+
Sbjct: 301 IRTPTIAGGLFSIDRKYFFDIGAYDEGMRIWGGENLEISFRVWMCGGTLEINPCSHVGHV 360
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK +PY+F GG S V+YGN R A VWMDE+ EFY+K P A + R+ LRK
Sbjct: 361 FRKQTPYTFEGGTSNVIYGNARRTAEVWMDEYKEFYYKMTPSA-MFAPLGNISDRIALRK 419
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
+L C SFKWYL +++P +P G I++ + C++ K ASG +LL C
Sbjct: 420 RLGCKSFKWYLKNIYPESNIPPTYYSIGYIKNEKNDLCLDTMGRK-----ASGSPALLTC 474
Query: 548 THLPVLTQMFVMKL----PTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYD 603
H Q+ MK+ T I DE +CL + SP + + C+ Q W YD
Sbjct: 475 -HNSGGNQVLFMKVWSYTGTLNIRADE-LCLQASRKAD--SP-IFLQQCNNDESQIWDYD 529
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/156 (45%), Positives = 105/156 (67%), Gaps = 1/156 (0%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
S+ + ++ N+FN++ S+RI +NRTLPD R C++ + E+ K+S+VIVFHNEAWS
Sbjct: 88 SEVKARFKENQFNVVASERISLNRTLPDYRSSACRSIKY-EKISLKTSVVIVFHNEAWST 146
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
L+RTV SVI+RS L+EI+LVDDAS ++ L + ++ ++ + V +IR P R GLI
Sbjct: 147 LMRTVQSVINRSSVDYLEEIILVDDASEKDELIALVESFLKTIPVAHTLIRLPQRSGLIV 206
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
R+ GA A+G++L FLDAH E T + E L + S
Sbjct: 207 GRVRGAEIAKGDVLTFLDAHVEVTDGWLEPLLSRIS 242
>gi|47226346|emb|CAG09314.1| unnamed protein product [Tetraodon nigroviridis]
Length = 632
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 183/355 (51%), Positives = 240/355 (67%), Gaps = 24/355 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
+P +S+VIVFHNEAWS LLRTVHSVI+RSPR +L EI+LVDDAS R+FLK L+ YV L
Sbjct: 114 VPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASERDFLKKKLENYVRTL 173
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDR---- 312
VP R++R R GLI+ARL GA +G+++ FLDAHCECT+GWLE L+AR+ EDR
Sbjct: 174 EVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPLLARIKEDRWDCN 233
Query: 313 -------------------TRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYG 353
T VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY
Sbjct: 234 TALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMAGSDMTYGGFNWKLNFRWYPVP 293
Query: 354 SSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCG 413
+ ++ D T P +TP MAGGLF+ID+ YF IG+YD M +WGGENLEMSFR+WQCG
Sbjct: 294 QREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGENLEMSFRIWQCG 353
Query: 414 GSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQ 473
GS+EI CSHV H+FRKA+PYSFPGG +V+ N R+A VWMD++ +F++ +P +
Sbjct: 354 GSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKDFFYIISPGVMRV 413
Query: 474 RDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEK 528
D V SR LR L C F WYL +++P +P G IR+V+T++CV+
Sbjct: 414 -DYGDVSSRKGLRDALHCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCVDN 467
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ ++ +P +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDMIALNRSLPDVRLDGCKTKVYPDD-VPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR +L EI+LVDDAS R+FLK L+ YV L VP R++R R GLI+ARL
Sbjct: 134 TVHSVINRSPRHLLVEIVLVDDASERDFLKKKLENYVRTLEVPVRILRMEQRSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA +G+++ FLDAHCECT+ + E L +
Sbjct: 194 RGAAATKGQVITFLDAHCECTVGWLEPLLAR 224
>gi|71987795|ref|NP_001022646.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
gi|3047201|gb|AAC13676.1| GLY6c [Caenorhabditis elegans]
gi|14530525|emb|CAC42318.1| Protein GLY-6, isoform c [Caenorhabditis elegans]
Length = 562
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/367 (51%), Positives = 247/367 (67%), Gaps = 10/367 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAK 255
LP +S++IV+HNEA+S LLRTV SVI RSP+ +LKEI+LVDD S REFL+ +LD +
Sbjct: 156 LPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKELLKEIILVDDFSDREFLRYPTLDTTLKP 215
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L ++IRS RVGLI+AR++GA++A+G++L FLD+HCECT GWLE L+ R+ +R V
Sbjct: 216 LPTDIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTRIKLNRKAV 275
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVIDII+D TF Y + E+ G FNW L FRWY ++ A D T P ++P MAG
Sbjct: 276 PCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQHLLDPTGPIESPTMAG 335
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+I+R YF +G YD M +WGGENLEMSFR+WQCGG +EI PCSHV H+FRK+SP+
Sbjct: 336 GLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGHVFRKSSPHD 395
Query: 436 FPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
FPG S +VL NL RVA VWMD+W +++K P+A + R V R+ELRK+L C SF
Sbjct: 396 FPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAPQAHRMRSSIDVSERVELRKKLNCKSF 455
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHK-CVE---KPLAKGSMNQASGPASLLP---- 546
KWYL +V+ HFLP FGR+ C+ + + Q SG + +P
Sbjct: 456 KWYLQNVFQDHFLPTPLDRFGRMTSSSNSSVCLAWTLRSCPTQTTAQPSGQVTRVPKITD 515
Query: 547 CTHLPVL 553
C+ PVL
Sbjct: 516 CSDHPVL 522
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 132/211 (62%), Gaps = 32/211 (15%)
Query: 20 IDRRGVHVVVGHYLG------------------------DSVDGGLH-SNLSDAQQ---- 50
I R HVVVGHY G D +GG S+L+ QQ
Sbjct: 58 IARSNYHVVVGHYNGNLPEDKKRNLTSEELNANLYAPHDDWGEGGAGVSHLTPEQQKLAD 117
Query: 51 -LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+ +N+FNLLVSD I V R+LP++RKP C+ + + LP +S++IV+HNEA+S LLRT
Sbjct: 118 STFAVNQFNLLVSDGISVRRSLPEIRKPSCRNMTYPDN-LPTTSVIIVYHNEAYSTLLRT 176
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
V SVI RSP+ +LKEI+LVDD S REFL+ +LD + L ++IRS RVGLI+AR+
Sbjct: 177 VWSVIDRSPKELLKEIILVDDFSDREFLRYPTLDTTLKPLPTDIKIIRSKERVGLIRARM 236
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA++A+G++L FLD+HCECT + E L +
Sbjct: 237 MGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
>gi|312075557|ref|XP_003140470.1| Gly-3 protein [Loa loa]
gi|307764367|gb|EFO23601.1| Gly-3 protein [Loa loa]
Length = 584
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 178/351 (50%), Positives = 244/351 (69%), Gaps = 6/351 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SI+IVFHNEAWS LLRT+HSVI+RSP ++KE++L+DD S R +L+S LD Y+ +
Sbjct: 145 LPTVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRSPLDLYIKRF 204
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+P +I P R GLI+ARL GA+ A+G++L+FLDAH E T GWLE L+ RV+ DR RVV
Sbjct: 205 SLPFHLIHLPERSGLIRARLQGAKIAKGKVLLFLDAHVEVTEGWLEPLLDRVSVDRKRVV 264
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+ISD F Y+ + ++ WG FNW L+FRWY + + D + P +TP +AGG
Sbjct: 265 APIIDVISDENFEYITASDITWGGFNWHLNFRWYPVPMREMERRNHDRSVPLQTPTIAGG 324
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAIDR +F+ IG+YDE M+VWGGENLE+SFRVW CGGS+EI PCS V H+FRK +PYSF
Sbjct: 325 LFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKHTPYSF 384
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG + V++ N AR A VWMDE+ + ++K P A K D + R LR+ L+C SF+W
Sbjct: 385 PGGTANVIHRNAARTAEVWMDEYKDIFYKMVP-AAKNVDIGDLTERKVLRENLQCKSFRW 443
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
YL ++P +P+D G+I+++ C++ + A ++LPC
Sbjct: 444 YLETIYPESPIPIDFLSLGQIQNMGVVGCLDT-----AGRSAGDSPAILPC 489
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 76/162 (46%), Positives = 112/162 (69%), Gaps = 2/162 (1%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCK--TKVFNEEFLPKSSIVIVFHN 100
S + ++ + N+F+++ SD I +NR LPD R KC+ + ++ LP SI+IVFHN
Sbjct: 97 SQQEERKKKFNENQFDVMASDLISINRALPDYRSSKCREAARKYDITSLPTVSIIIVFHN 156
Query: 101 EAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGR 160
EAWS LLRT+HSVI+RSP ++KE++L+DD S R +L+S LD Y+ + S+P +I P R
Sbjct: 157 EAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRSPLDLYIKRFSLPFHLIHLPER 216
Query: 161 VGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSI 202
GLI+ARL GA+ A+G++L+FLDAH E T + E L + S+
Sbjct: 217 SGLIRARLQGAKIAKGKVLLFLDAHVEVTEGWLEPLLDRVSV 258
>gi|326508656|dbj|BAJ95850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 637
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/411 (46%), Positives = 264/411 (64%), Gaps = 13/411 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST-REFLKSSLDEYVAK 255
LP +SIVIVFHNE S LLRT+ S++ RSP ++EI++VDDAS RE+LK L+ +V +
Sbjct: 190 LPTTSIVIVFHNEGNSTLLRTLTSIVMRSPTEFIQEIIMVDDASVDREYLKDILETFVKE 249
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L V +IR+ R+GL+K+RL GA +A G+ L FLDAH EC+ GWLE L+ V +DRT V
Sbjct: 250 LPVRVEIIRNTQRLGLMKSRLKGAEKATGDTLTFLDAHIECSPGWLEYLLYEVKKDRTAV 309
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCP+ID+I+D FAY+ ++ WG FNW L+FRWY + + + + D + P +P MAG
Sbjct: 310 VCPIIDVINDDDFAYLTGSDMTWGGFNWRLNFRWYPVPNREEVRRNYDHSLPLLSPTMAG 369
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF IDR YF+ IGAYD M+VWGGENLEMSFRVWQCGG + I PCSHV H+FRK +PY+
Sbjct: 370 GLFTIDRKYFYEIGAYDPGMEVWGGENLEMSFRVWQCGGKVLIHPCSHVGHVFRKQTPYT 429
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
FPGG +V++ N R+ VW+D++ +F + PE K D V RL LR++L+C F+
Sbjct: 430 FPGGTGKVIFHNNKRLVEVWLDKYKDFVYAIMPEL-KNVDAGDVSERLALRERLQCKDFR 488
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
WYL +++P +P+D G +R+ Q H C + L S N + A + PC H Q
Sbjct: 489 WYLQNIYPESSMPVDFHHVGALRN-QDHGCADS-LGYDSENGVNQNAGIFPC-HNQGGNQ 545
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS-GFNRQRWTYDKE 605
+ V +L D +C+ E + V++ C+ G +Q W Y+KE
Sbjct: 546 IVVFSKSGELKFDD--LCM-----EGSKNSAVKLQKCTEGNQKQVWEYNKE 589
Score = 135 bits (339), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 67/144 (46%), Positives = 102/144 (70%), Gaps = 2/144 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ + N+FN++ SD + +NR++ D R +C++ F + LP +SIVIVFHNE S L
Sbjct: 149 EAKKRFPENQFNIVASDLMALNRSINDQRSSRCRSHEFPSD-LPTTSIVIVFHNEGNSTL 207
Query: 107 LRTVHSVISRSPRSMLKEILLVDDAST-REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LRT+ S++ RSP ++EI++VDDAS RE+LK L+ +V +L V +IR+ R+GL+K
Sbjct: 208 LRTLTSIVMRSPTEFIQEIIMVDDASVDREYLKDILETFVKELPVRVEIIRNTQRLGLMK 267
Query: 166 ARLLGARQAEGEILVFLDAHCECT 189
+RL GA +A G+ L FLDAH EC+
Sbjct: 268 SRLKGAEKATGDTLTFLDAHIECS 291
>gi|410905319|ref|XP_003966139.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Takifugu rubripes]
Length = 557
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/437 (43%), Positives = 264/437 (60%), Gaps = 36/437 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VIVFHNEAWS LLRTVHSVI RSP ++L+EI+LVDDAS R+FLK L++YV +L
Sbjct: 113 LPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPLEQYVRRL 172
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP RV+R R GLI+ARL GA + G+++ FLDAHCECT GWLE L+AR+ +DR VV
Sbjct: 173 EVPVRVVRMDQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLARIKKDRKTVV 232
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P + AGG
Sbjct: 233 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRWVRCAGG 292
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
R YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+FRKA+PY+F
Sbjct: 293 XXXXXRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTF 352
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +++ N R+A VWMDE+ F++ +P K D + +R LR++L+C F W
Sbjct: 353 PGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDIATRTALRQKLQCKPFSW 411
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL ++P +P G IR+V+T++C++ K +
Sbjct: 412 YLESIYPDSQIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTAN 471
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDI 583
+++ +GP +L C HL + LI + + CLD E+
Sbjct: 472 KEIRTDDLCLDVSKLNGPVMMLKCHHLKGNQLWDYDPVKLTLIHVNSNQCLDKASEEDSQ 531
Query: 584 SPRVRILACSGFNRQRW 600
P +R C+ Q+W
Sbjct: 532 VPSIR--DCTHSRSQQW 546
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/156 (55%), Positives = 118/156 (75%), Gaps = 1/156 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
N +++++IN+FNL+ S+ I +NR+LPDVR CK K++ + LP++S+VIVFHNEAW
Sbjct: 69 NQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDN-LPRTSVVIVFHNEAW 127
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTVHSVI RSP ++L+EI+LVDDAS R+FLK L++YV +L VP RV+R R GL
Sbjct: 128 STLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPLEQYVRRLEVPVRVVRMDQRSGL 187
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARL GA + G+++ FLDAHCECT + E L +
Sbjct: 188 IRARLKGASLSTGQVITFLDAHCECTTGWLEPLLAR 223
>gi|308487864|ref|XP_003106127.1| CRE-GLY-6 protein [Caenorhabditis remanei]
gi|308254701|gb|EFO98653.1| CRE-GLY-6 protein [Caenorhabditis remanei]
Length = 693
Score = 380 bits (976), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/332 (54%), Positives = 234/332 (70%), Gaps = 2/332 (0%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK- 246
C + E LP +S++IV+HNEA+S LLRTV SVI RSP+ +L+EILLVDD S R+FL+
Sbjct: 147 CRNITYPEDLPTTSVIIVYHNEAYSTLLRTVWSVIDRSPKHLLREILLVDDFSDRDFLRY 206
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDE + L ++IRS RVGLI+AR++GA++A+G++L FLD+HCECT GWLE L+
Sbjct: 207 PKLDESLKPLPTDIKIIRSNQRVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLT 266
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+ +R V CPVIDII+D TF Y + E+ G FNW L FRWY + A D T
Sbjct: 267 RIKLNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTEMAKQHLLDPTG 326
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++P MAGGLF+IDR YF +G YD M +WGGENLEMSFR+WQCGG +EI PCSHV H
Sbjct: 327 PIESPTMAGGLFSIDRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGH 386
Query: 427 LFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
+FRK+SP+ FPG S +VL NL RVA VWMDEW +++K P A + R+ V R+EL
Sbjct: 387 VFRKSSPHDFPGKSSGKVLNANLLRVAEVWMDEWKYYFYKIAPVAFRMRESIDVSERVEL 446
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRI 517
RK+L C SFKWYL +++ HFLP FGR+
Sbjct: 447 RKKLNCKSFKWYLQNIFKDHFLPTPLDRFGRV 478
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 136/219 (62%), Gaps = 32/219 (14%)
Query: 12 LARNQQDYIDRRGVHVVVGHYLGD------------SVDGGLHS-------------NLS 46
LA+ + + HVVVGHY G+ ++ L+S +L+
Sbjct: 50 LAQKASKQVTKSNYHVVVGHYNGNLPEDKKRNLTSEELNANLYSPREEWGEGGSGVTHLT 109
Query: 47 DAQQ-----LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
QQ + +N+FNL VSD I V R+LP++RKP C+ + E+ LP +S++IV+HNE
Sbjct: 110 PEQQKLADSTFAVNQFNLFVSDGISVRRSLPEIRKPSCRNITYPED-LPTTSVIIVYHNE 168
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAKLSVPTRVIRSPGR 160
A+S LLRTV SVI RSP+ +L+EILLVDD S R+FL+ LDE + L ++IRS R
Sbjct: 169 AYSTLLRTVWSVIDRSPKHLLREILLVDDFSDRDFLRYPKLDESLKPLPTDIKIIRSNQR 228
Query: 161 VGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
VGLI+AR++GA++A+G++L FLD+HCECT + E L +
Sbjct: 229 VGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTR 267
>gi|256052108|ref|XP_002569620.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 573
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/365 (48%), Positives = 245/365 (67%), Gaps = 5/365 (1%)
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 222
L+ + L+G R+ + F C + ++ K+S++IVFHNEAWSALLRTVHSV+
Sbjct: 95 LVVSNLIGTRRNLDD---FRHPSCRHQIPLDKLLPFKTSVIIVFHNEAWSALLRTVHSVL 151
Query: 223 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 282
R+P +L EI+LVDDAST+ L L YV L+ P R+ R R GLI+ARL GA+ +
Sbjct: 152 DRTPVQLLHEIILVDDASTQSHLGDQLKNYVKSLNKPVRIERMSSRSGLIRARLHGAKIS 211
Query: 283 EGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFN 342
G+ L FLDAHCE T+GWLE L+ ++E++ R+VCP+ID+IS TF Y+ + WG F+
Sbjct: 212 TGKTLTFLDAHCEVTIGWLETLLKHISENQKRIVCPIIDVISHDTFEYLLGSDRTWGTFD 271
Query: 343 WELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGEN 402
W+ +F W T + + P +TP MAGGLF I R YF+ IGAYDE+M++WGGEN
Sbjct: 272 WQFNFHWETVVDREIDRINDEHNVPLRTPTMAGGLFTITREYFYEIGAYDEDMEIWGGEN 331
Query: 403 LEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEF 462
+E+SFRVWQCGG + I PCS V H+FRK+SPY++PGGVS +L+ N R ALVW+D+++ F
Sbjct: 332 IELSFRVWQCGGELLIDPCSRVGHVFRKSSPYTWPGGVSHILHKNFVRTALVWLDQYSRF 391
Query: 463 YFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT 522
YF NP A D V R +LR+QL C SF+WYL H++P +P+D G IRH ++
Sbjct: 392 YFMLNPSA-LSVDYGDVTKRKKLRQQLNCKSFRWYLEHIYPESSIPIDVIRLGEIRH-KS 449
Query: 523 HKCVE 527
+C++
Sbjct: 450 GQCLD 454
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 105/153 (68%), Gaps = 1/153 (0%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLP-KSSIVIVFHNEAWS 104
++++ + +N FNL+VS+ I R L D R P C+ ++ ++ LP K+S++IVFHNEAWS
Sbjct: 82 AESKLTFHLNEFNLVVSNLIGTRRNLDDFRHPSCRHQIPLDKLLPFKTSVIIVFHNEAWS 141
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
ALLRTVHSV+ R+P +L EI+LVDDAST+ L L YV L+ P R+ R R GLI
Sbjct: 142 ALLRTVHSVLDRTPVQLLHEIILVDDASTQSHLGDQLKNYVKSLNKPVRIERMSSRSGLI 201
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ARL GA+ + G+ L FLDAHCE T+ + E L
Sbjct: 202 RARLHGAKISTGKTLTFLDAHCEVTIGWLETLL 234
>gi|198415713|ref|XP_002128877.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
1 [Ciona intestinalis]
Length = 573
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 171/333 (51%), Positives = 238/333 (71%), Gaps = 1/333 (0%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +SIVIVFHNEAWS LLRTVHS+I+RSP +L+EI+LVDDAS R+FL + L+ YV
Sbjct: 126 EKLPTTSIVIVFHNEAWSTLLRTVHSIINRSPSHLLEEIILVDDASERDFLGAPLERYVR 185
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
KL RV+R R GLI+ARL GA + G+++ FLDAHCECT GWLE L++ +A+DRT
Sbjct: 186 KLRTLVRVVRMEKRTGLIRARLRGASVSTGQVITFLDAHCECTEGWLEPLLSEIAKDRTT 245
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD TF ++ ++ +G FNW+L+FRWY + ++ D T P ++P MA
Sbjct: 246 VVCPIIDVISDETFEFMVGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRSPTMA 305
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID++YF +G YD M +WGGENLE+SFR+WQCGG++ I CSHV H+FRKA+PY
Sbjct: 306 GGLFSIDKSYFEELGTYDAGMDIWGGENLEISFRIWQCGGTLLIVTCSHVGHVFRKATPY 365
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG +++ N R+A VWMD + F++ P KQ + + R+ LR++L+C SF
Sbjct: 366 TFPGGTGQIINKNNRRLAEVWMDSFKNFFYIITPGVLKQ-EYGDISERVRLREKLQCKSF 424
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
KWYL +++P +P + G IR+ + C++
Sbjct: 425 KWYLENIYPDSQIPGEYYSLGEIRNEEGGLCLD 457
Score = 168 bits (425), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 82/136 (60%), Positives = 105/136 (77%), Gaps = 1/136 (0%)
Query: 54 INRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSV 113
IN+FNL+ S+ I +NR+LPDVR CK+K + E+ LP +SIVIVFHNEAWS LLRTVHS+
Sbjct: 94 INQFNLMASEMIALNRSLPDVRMEGCKSKKYPEK-LPTTSIVIVFHNEAWSTLLRTVHSI 152
Query: 114 ISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQ 173
I+RSP +L+EI+LVDDAS R+FL + L+ YV KL RV+R R GLI+ARL GA
Sbjct: 153 INRSPSHLLEEIILVDDASERDFLGAPLERYVRKLRTLVRVVRMEKRTGLIRARLRGASV 212
Query: 174 AEGEILVFLDAHCECT 189
+ G+++ FLDAHCECT
Sbjct: 213 STGQVITFLDAHCECT 228
>gi|402592820|gb|EJW86747.1| hypothetical protein WUBG_02341 [Wuchereria bancrofti]
Length = 584
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 174/351 (49%), Positives = 243/351 (69%), Gaps = 6/351 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SI+IVFHNEAWS LLRT+HSVI+RSP ++KE++L+DD S R +L+ LD Y+ +
Sbjct: 145 LPMVSIIIVFHNEAWSTLLRTIHSVINRSPLHLIKEVILIDDLSNRTYLRKPLDTYIKRF 204
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+P +I P R GLI+ARL GA+ A+G++L+FLDAH E T GWLE L+ RV+ DR RVV
Sbjct: 205 SLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWLEPLLDRVSTDRKRVV 264
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+ISD F Y+ + ++ WG FNW L+FRWY + + D + P +TP +AGG
Sbjct: 265 APIIDVISDENFEYITASDVTWGGFNWHLNFRWYPVPMREMERRNHDRSVPLQTPTIAGG 324
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAIDR +F+ IG+YDE M+VWGGENLE+SFRVW CGGS+EI PCS V H+FRK +PYSF
Sbjct: 325 LFAIDRQFFYDIGSYDEGMEVWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKHTPYSF 384
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG + V++ N AR A VWMDE+ + ++ P A + D + R LR+ L+C SF+W
Sbjct: 385 PGGTARVIHHNTARTAEVWMDEYKDIFYSMVP-AARNVDVGDLTERKILRENLQCKSFRW 443
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
YL ++P +P+D G+++++ +C++ + A ++LPC
Sbjct: 444 YLETIYPESPIPIDFFSLGQVQNMGVMECLDT-----AGRSAGDSPAMLPC 489
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/152 (48%), Positives = 107/152 (70%), Gaps = 2/152 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCK--TKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
++ N+F+++ SD I +NR LPD R KC+ + ++ LP SI+IVFHNEAWS LLRT
Sbjct: 106 FKENQFDVMASDLISINRALPDYRSSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRT 165
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
+HSVI+RSP ++KE++L+DD S R +L+ LD Y+ + S+P +I P R GLI+ARL
Sbjct: 166 IHSVINRSPLHLIKEVILIDDLSNRTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQ 225
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
GA+ A+G++L+FLDAH E T + E L + S
Sbjct: 226 GAKVAKGKVLLFLDAHVEVTEGWLEPLLDRVS 257
>gi|170592315|ref|XP_001900914.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Brugia malayi]
gi|158591609|gb|EDP30214.1| Polypeptide N-acetylgalactosaminyltransferase 3, putative [Brugia
malayi]
Length = 584
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 174/351 (49%), Positives = 243/351 (69%), Gaps = 6/351 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SI+IVFHNEAWS LLRT+HSVI+RSP ++KE++L+DD S R +L+ LD Y+ +
Sbjct: 145 LPMVSIIIVFHNEAWSTLLRTLHSVINRSPLHLIKEVILIDDLSNRTYLRKPLDTYIKRF 204
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+P +I P R GLI+ARL GA+ A+G++L+FLDAH E T GWLE L+ RV+ DR RVV
Sbjct: 205 SLPFHLIHLPERSGLIRARLQGAKVAKGKVLLFLDAHVEVTEGWLEPLLDRVSTDRKRVV 264
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+ISD F Y+ + ++ WG FNW L+FRWY + + D + P +TP +AGG
Sbjct: 265 APIIDVISDENFEYITASDVTWGGFNWHLNFRWYPVPMREMERRNHDRSVPLQTPTIAGG 324
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAIDR +F+ IG+YDE M++WGGENLE+SFRVW CGGS+EI PCS V H+FRK +PYSF
Sbjct: 325 LFAIDRQFFYDIGSYDEGMEIWGGENLEISFRVWMCGGSLEIHPCSRVGHVFRKHTPYSF 384
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG + V++ N AR A VWMDE+ + ++ P A K D + R LR+ L+C SF+W
Sbjct: 385 PGGTARVIHHNAARTAEVWMDEYKDIFYGMVP-AAKNVDVGDLTERKILRENLQCKSFRW 443
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
YL ++P +P+D G+++++ +C++ + A ++LPC
Sbjct: 444 YLETIYPESPIPIDFFSLGQVQNMGVMECLDT-----AGRSAGDSPAMLPC 489
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 74/152 (48%), Positives = 107/152 (70%), Gaps = 2/152 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCK--TKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
++ N+F+++ SD I +NR LPD R KC+ + ++ LP SI+IVFHNEAWS LLRT
Sbjct: 106 FKENQFDVMASDLISINRALPDYRSSKCREAARKYDVTSLPMVSIIIVFHNEAWSTLLRT 165
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
+HSVI+RSP ++KE++L+DD S R +L+ LD Y+ + S+P +I P R GLI+ARL
Sbjct: 166 LHSVINRSPLHLIKEVILIDDLSNRTYLRKPLDTYIKRFSLPFHLIHLPERSGLIRARLQ 225
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
GA+ A+G++L+FLDAH E T + E L + S
Sbjct: 226 GAKVAKGKVLLFLDAHVEVTEGWLEPLLDRVS 257
>gi|390336582|ref|XP_001187912.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 490
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 188/411 (45%), Positives = 257/411 (62%), Gaps = 18/411 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+++V+HNEA S LLR VHS+I+RSP +L EI+LVDDAS +E L SL++Y+AKL
Sbjct: 55 LPTTSVILVYHNEARSTLLRNVHSIINRSPHDLLAEIILVDDASDQEHLGKSLEDYIAKL 114
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V V++ GR GLI+AR+ GA A+G++L FLD+HCE T GWLE ++AR+AEDRT V
Sbjct: 115 PVSVYVVKMKGRSGLIRARMAGAAVAKGQVLTFLDSHCEVTEGWLEPMLARIAEDRTTSV 174
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+ISD TF Y + G F W L F+W+ + I ++ D TEP + MAGG
Sbjct: 175 CPVIDVISDDTFQYQHGNDPQMGGFGWSLFFKWFPVPKREQIRRKGDPTEPVRVSTMAGG 234
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAID++YF +G YD +WGGENLE+SF++W CGG +E PCSHV H+FRK SPY F
Sbjct: 235 LFAIDKSYFEELGQYDPGFNIWGGENLELSFKLWMCGGKLEFIPCSHVGHVFRKKSPYHF 294
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
P G + V N R+A VW+DE+ FY++ +P K D + RL LRK L C SFKW
Sbjct: 295 PPGTNYVNKNN-KRLAEVWLDEYKNFYYRISPSVAKT-DPGDISDRLNLRKSLSCKSFKW 352
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +++P P++ +F G +R+ + H C++ M +A L C H Q+
Sbjct: 353 YLENIYPESSWPVNYQFMGEVRNTEAHVCLDT-----MMKEAGNKVGLYGC-HGQGGNQI 406
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFNRQRWTYDKE 605
+ +L D +CLDV V +L+C G N Q W YD++
Sbjct: 407 WAFTKNNELRHDD--LCLDVAR-----GGPVMMLSCHMQGGN-QHWNYDEK 449
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 79/154 (51%), Positives = 114/154 (74%), Gaps = 1/154 (0%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
+++++L+ IN FNL+ SDRI +NR+LPDVR C KV+ ++ LP +S+++V+HNEA S
Sbjct: 13 AESERLWPINEFNLMASDRIALNRSLPDVRPRGCANKVYPKK-LPTTSVILVYHNEARST 71
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLR VHS+I+RSP +L EI+LVDDAS +E L SL++Y+AKL V V++ GR GLI+
Sbjct: 72 LLRNVHSIINRSPHDLLAEIILVDDASDQEHLGKSLEDYIAKLPVSVYVVKMKGRSGLIR 131
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
AR+ GA A+G++L FLD+HCE T + E L +
Sbjct: 132 ARMAGAAVAKGQVLTFLDSHCEVTEGWLEPMLAR 165
>gi|348585735|ref|XP_003478626.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Cavia porcellus]
Length = 568
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 195/464 (42%), Positives = 278/464 (59%), Gaps = 63/464 (13%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+F
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVF 345
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE-------------AEKQRD 475
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P E D
Sbjct: 346 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPAKCNFLTRDLDVFMGETDSD 405
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLP----MDDKFFGRIRHVQTHKCVEKPLA 531
+ +L+++ ++ L+H ++P M + IR+V+T++C++
Sbjct: 406 IVGTKYTYKLKEE------RFVLSHRNYSPYIPSQGNMAKEKQSMIRNVETNQCLDNMGR 459
Query: 532 KGS---------------------------------MNQASGPASLLPCTHL--PVLTQM 556
K + +++ +GP +L C H+ L +
Sbjct: 460 KENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEY 519
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
+L + +++ CLD P E+ + P ++ CSG Q+W
Sbjct: 520 DAERLTLRHVNSNQ--CLDEPSEEDKMVPTMQ--DCSGSRSQQW 559
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|432098984|gb|ELK28470.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Myotis davidii]
Length = 501
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 190/417 (45%), Positives = 263/417 (63%), Gaps = 36/417 (8%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+RSPR ML+EI+LVDDAS R+FLK
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINRSPRHMLEEIVLVDDASERDFLKR 165
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ YV KL VP VIR R GLI+ARL GA ++G+++ FLDAHCECT+GWLE L+AR
Sbjct: 166 PLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ +DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 IKQDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 285
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 286 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHV 345
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P+ + +
Sbjct: 346 FRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPDIGR------------IEH 393
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHK-CVEKPLAKGSMNQASGPASLL 545
L C S H + + F + + ++T C++ +++ +GP ++L
Sbjct: 394 WLYCDSL---------HGGMLVFQVFSYTANKEIRTDDLCLD-------VSKLNGPVTML 437
Query: 546 PCTHLP--VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
C HL L + +KL + +++ CLD E+ P +R CSG Q+W
Sbjct: 438 KCHHLKGNQLWEYDPVKLTLQHVNSNQ--CLDKATEEDSQVPSIR--DCSGGRSQQW 490
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 90/151 (59%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSPR ML+EI+LVDDAS R+FLK L+ YV KL VP VIR R GLI+ARL
Sbjct: 135 TVHSVINRSPRHMLEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARL 194
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECT+ + E L +
Sbjct: 195 KGAAVSKGQVITFLDAHCECTVGWLEPLLAR 225
>gi|351712481|gb|EHB15400.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Heterocephalus
glaber]
Length = 399
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 168/316 (53%), Positives = 225/316 (71%), Gaps = 1/316 (0%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSVI+ SPR M++EI+LVDDA+ R+FLK
Sbjct: 85 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVINCSPRHMVEEIVLVDDANERDFLKR 144
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+L+ YV KL VP VIR R GLI+ RL G ++G++++FLDAHCECT+GWLE L+ R
Sbjct: 145 TLESYVKKLKVPVHVIRMEHRSGLIRDRLKGDAVSKGQVIIFLDAHCECTVGWLEPLLTR 204
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ +DR VVCP+ID+ISD TF + ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 205 IKQDRRTVVCPIIDVISDDTFECMAGSDMTYGGFNWKLNFRWYLVPQREMDRRKGDRTLP 264
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP MAGG F+IDR YF IG YD M +WG ENLE+SFR+WQCGG++EI CSHV H+
Sbjct: 265 VRTPTMAGGCFSIDRDYFQEIGTYDAGMDIWGRENLEISFRIWQCGGTLEIVTCSHVGHV 324
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
F+KA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D V SRL LR
Sbjct: 325 FQKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDVSSRLGLRH 383
Query: 488 QLKCHSFKWYLTHVWP 503
+L+C F WYL +++P
Sbjct: 384 KLQCKPFSWYLENIYP 399
Score = 168 bits (425), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 82/151 (54%), Positives = 115/151 (76%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+ NL+ S+ I +NR+LP+ R CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 55 KEMFKINQVNLMASEMIALNRSLPNDRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 113
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+ SPR M++EI+LVDDA+ R+FLK +L+ YV KL VP VIR R GLI+ RL
Sbjct: 114 TVHSVINCSPRHMVEEIVLVDDANERDFLKRTLESYVKKLKVPVHVIRMEHRSGLIRDRL 173
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
G ++G++++FLDAHCECT+ + E L +
Sbjct: 174 KGDAVSKGQVIIFLDAHCECTVGWLEPLLTR 204
>gi|390347277|ref|XP_780324.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 580
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 187/412 (45%), Positives = 257/412 (62%), Gaps = 14/412 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVIVFHNEAWS LLR++HS+I+RSPR +L EI+LVDDAS R+FL LD+YV +L
Sbjct: 133 LPSTSIVIVFHNEAWSTLLRSIHSIINRSPRELLTEIILVDDASERDFLGQQLDDYVKRL 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE--NLVARVAEDRTR 314
VP V R R GLI+ARL GA +G +L FL +H +C+ L L A DR
Sbjct: 193 QVPVHVERMGTRSGLIRARLRGAGLVKGHVLGFLXSHDQCSASSLRPVYLEASRRHDRRN 252
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCP+ID+ISD FA+ ++ +G FNW+L FRWY +A + D T P ++P MA
Sbjct: 253 VVCPIIDVISDDNFAFHTGSDMTYGGFNWKLQFRWYPVPQREADRRGGDRTIPLRSPTMA 312
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ YF IG YD M VWGGENLE+SFR+W CGG++EI CSHV H+FRK++PY
Sbjct: 313 GGLFSIDKTYFEEIGTYDAGMDVWGGENLEISFRIWMCGGTLEIVTCSHVGHVFRKSTPY 372
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG ++ N R+A VWMD++ FY++ +P K + V R +LR +LKCH+F
Sbjct: 373 TFPGGTGRIINRNNQRLAEVWMDDFRHFYYRISPGVRKT-EFGDVSQRKKLRDRLKCHTF 431
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
+WYL +++P +D K G IR+++THKC++ K N+ G + C H
Sbjct: 432 EWYLENIYPESQFRLDFKTIGEIRNIETHKCLDNMGRK--ENEKVG---IFSC-HGQGGN 485
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG-FNRQRWTYDKE 605
Q+F + ++ D +CLD N V ++ C G Q W Y ++
Sbjct: 486 QIFALTKQNEIKHDD--LCLDAS--ANSHYKDVVMIKCHGKHGNQEWLYKED 533
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 80/142 (56%), Positives = 103/142 (72%), Gaps = 1/142 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALL 107
++++IN+FNLL SD I +NRTLPDVR CK K + LP +SIVIVFHNEAWS LL
Sbjct: 92 NEMFRINQFNLLASDMISINRTLPDVRMDGCKRKSYPPVSELPSTSIVIVFHNEAWSTLL 151
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
R++HS+I+RSPR +L EI+LVDDAS R+FL LD+YV +L VP V R R GLI+AR
Sbjct: 152 RSIHSIINRSPRELLTEIILVDDASERDFLGQQLDDYVKRLQVPVHVERMGTRSGLIRAR 211
Query: 168 LLGARQAEGEILVFLDAHCECT 189
L GA +G +L FL +H +C+
Sbjct: 212 LRGAGLVKGHVLGFLXSHDQCS 233
>gi|260788889|ref|XP_002589481.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
gi|229274659|gb|EEN45492.1| hypothetical protein BRAFLDRAFT_125191 [Branchiostoma floridae]
Length = 488
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 185/410 (45%), Positives = 253/410 (61%), Gaps = 28/410 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP+ S+VIVFHNEAW LLR+V+S+I+R+PR L+EI+LVDDAS R
Sbjct: 55 LPRMSVVIVFHNEAWCTLLRSVNSIINRTPRPYLEEIILVDDASER-------------- 100
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP ++ R R GLI+ARL G+ A+G ++ FLDAH ECT GW E L+ R+AEDRT VV
Sbjct: 101 GVPVKLERMGKRSGLIRARLRGSGAAKGPVITFLDAHIECTEGWAEPLLTRIAEDRTTVV 160
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + + D T P +TP MAGG
Sbjct: 161 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRGGDRTMPLRTPTMAGG 220
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAID++YF IG YD M +WGGENLE+SFR+WQCGG++EI CSHV H+FRKA+PY+F
Sbjct: 221 LFAIDKSYFEEIGTYDSGMDIWGGENLEISFRIWQCGGTLEIVTCSHVGHVFRKATPYTF 280
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +++ N R+A VWMD + +F++ +P K D V R ELR +L C FKW
Sbjct: 281 PGGTGQIINKNNRRLAEVWMDNFKDFFYIISPGVTKV-DYGDVTGRKELRDKLNCKPFKW 339
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +++P +P G IR+V +++C++ K N+ G + C H Q+
Sbjct: 340 YLENIYPDSQIPTSYHSLGEIRNVDSNQCIDNMARK--ENEKVG---IFSC-HGMGGNQV 393
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
F +L D +CLDV + V + C Q W YD++
Sbjct: 394 FSYTKEKELRTDD--LCLDVSKPGGP----VMLFKCHHLGGNQLWEYDEK 437
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/146 (49%), Positives = 97/146 (66%), Gaps = 15/146 (10%)
Query: 54 INRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSV 113
IN+FNL+ + I +NRTLPDVR CK+K + +E LP+ S+VIVFHNEAW LLR+V+S+
Sbjct: 21 INQFNLMACEMIALNRTLPDVRMEGCKSKTYPKE-LPRMSVVIVFHNEAWCTLLRSVNSI 79
Query: 114 ISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQ 173
I+R+PR L+EI+LVDDAS R VP ++ R R GLI+ARL G+
Sbjct: 80 INRTPRPYLEEIILVDDASER--------------GVPVKLERMGKRSGLIRARLRGSGA 125
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPK 199
A+G ++ FLDAH ECT + E L +
Sbjct: 126 AKGPVITFLDAHIECTEGWAEPLLTR 151
>gi|170591418|ref|XP_001900467.1| Polypeptide N-acetylgalactosaminyltransferase [Brugia malayi]
gi|158592079|gb|EDP30681.1| Polypeptide N-acetylgalactosaminyltransferase, putative [Brugia
malayi]
Length = 575
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 205/463 (44%), Positives = 270/463 (58%), Gaps = 60/463 (12%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
C LP +S++IV+HNEA+S L+RTV SVI RSPR LKEI+LVDD STR F
Sbjct: 76 QCRTKTYLPSSELPTTSVIIVYHNEAFSTLMRTVMSVIQRSPRENLKEIILVDDFSTRTF 135
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+++VA+L ++IR+ RVGLI+ARL+GA +AEG++L FLD+HCECT GW+E L
Sbjct: 136 LKVELEKFVAQLGTRIKIIRANERVGLIRARLMGANEAEGDVLTFLDSHCECTKGWMEPL 195
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+AR+ E+R VVCPVIDII+D TFAY +S EL G FNW L FRWY S + D
Sbjct: 196 LARIKENRKAVVCPVIDIINDRTFAYQKSIELFRGGFNWNLQFRWYALPSEMIKSRSDDP 255
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+P +P MAGGLF+IDR YF IG YD EM +WGGEN+E+S RV+ EI PCSHV
Sbjct: 256 TKPIISPTMAGGLFSIDRKYFEEIGTYDHEMDIWGGENIEISLRVF------EILPCSHV 309
Query: 425 AHLFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
H+FR+ SP+ FPG S +L NL RVA VWMDEW +++ P +R V S
Sbjct: 310 GHVFRRTSPHDFPGRKSGTILNSNLLRVAEVWMDEWKFHFYRTAP----RRFGCVVNS-- 363
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT-----HKCVE--KPLAK---- 532
RK+L C SFKW+L +VW HFLP FGR+ H ++ + C+ PL +
Sbjct: 364 --RKRLHCKSFKWFLDNVWKDHFLPQPGSAFGRVIHSRSQIGGMNGCLHWTVPLGESMRV 421
Query: 533 GSMNQASGPAS--------------LLPCTHLPVLTQMFVM----KLPTDL--------- 565
+M+ S P S L+ T P Q +M L DL
Sbjct: 422 AAMHNCSAPDSFDRTEISHSTSRNCLVARTGEPGSKQTSIMMARCTLGFDLWQLWLFTEE 481
Query: 566 --IATDESVCLDVPEYENDISPR---VRILACSGFNRQRWTYD 603
+ TD+ +CL Y+ PR +++ C + + W Y+
Sbjct: 482 GQMKTDDHLCLSA--YQPIQEPRNWKIQLKECGQYESEYWDYN 522
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/153 (54%), Positives = 118/153 (77%), Gaps = 1/153 (0%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVF-NEEFLPKSSIVIVFHNEAWSAL 106
++ + IN+F+L+VSDRI +NR+LPD+RK +C+TK + LP +S++IV+HNEA+S L
Sbjct: 46 SEDTFVINQFSLVVSDRIALNRSLPDIRKHQCRTKTYLPSSELPTTSVIIVYHNEAFSTL 105
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
+RTV SVI RSPR LKEI+LVDD STR FLK L+++VA+L ++IR+ RVGLI+A
Sbjct: 106 MRTVMSVIQRSPRENLKEIILVDDFSTRTFLKVELEKFVAQLGTRIKIIRANERVGLIRA 165
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RL+GA +AEG++L FLD+HCECT + E L +
Sbjct: 166 RLMGANEAEGDVLTFLDSHCECTKGWMEPLLAR 198
>gi|351714167|gb|EHB17086.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Heterocephalus
glaber]
Length = 330
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 163/280 (58%), Positives = 212/280 (75%), Gaps = 1/280 (0%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 48 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKFT 106
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 107 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 166
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 167 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 226
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGGS+EI CSHV H+F
Sbjct: 227 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEISFRIWQCGGSLEIVTCSHVGHVF 286
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNP 468
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P
Sbjct: 287 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISP 326
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 16 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 74
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 75 TVYSVINRSPHYLLSEVILVDDASERDFLKFTLENYVKNLEVPVKIIRMEERSGLIRARL 134
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 135 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 165
>gi|196001849|ref|XP_002110792.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
gi|190586743|gb|EDV26796.1| hypothetical protein TRIADDRAFT_22976 [Trichoplax adhaerens]
Length = 515
Score = 358 bits (920), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 189/414 (45%), Positives = 265/414 (64%), Gaps = 18/414 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRT+HSV+ R+ +L EI+LVDD S + L + LD Y+AKL
Sbjct: 67 LPPTSVVIVFHNEAWSTLLRTIHSVLDRTAPDLLIEIILVDDKSVVKELHAPLDAYIAKL 126
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQA--EGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
+ ++IR+ R GLI++RL G A + ++ FLDAHCE GWLE L+ R+ DR+
Sbjct: 127 A-KVKIIRNKKREGLIRSRLNGKSFAASKAPVVTFLDAHCEANTGWLEPLLERIYNDRST 185
Query: 315 VVCPVIDIISDVTFAYVRSFE-LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VVCP ID+ISD FAY L G FNW+LHFRW S++ +R+ +P +TP M
Sbjct: 186 VVCPEIDVISDENFAYQYGPSGLMRGIFNWDLHFRWRAV-STEEQKRRQSPIDPVRTPTM 244
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLFAI+R YF IG YDEEM +WGGENLE+SFR+WQCGG++EI PCSHV H+FRK+ P
Sbjct: 245 AGGLFAINRDYFKEIGTYDEEMDIWGGENLEISFRIWQCGGTLEIVPCSHVGHVFRKSQP 304
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y FP GV + L N RVA VWMD + EF+++ P + + RLE+RK+LKC S
Sbjct: 305 YGFPKGVVDTLGKNSQRVAEVWMDGYKEFFYQRQPHL-RGHAYGDISKRLEIRKKLKCKS 363
Query: 494 FKWYLTHVWPHHFLPMDDKFF-GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
FKWYL +++ LP + G++R+ ++ C++ L++ ++ L PCT L
Sbjct: 364 FKWYLENIYTDAVLPNESVIAKGKVRNPASNMCLDS-LSRPKLSY----IGLSPCT-LSA 417
Query: 553 LTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
+T + + +L+ D +CLDV +Y + +V++ C G Q W ++K+
Sbjct: 418 MTMIISFTVRQELVVQD--ICLDVSDY--NPGTKVQLYECHGMKGNQLWMHEKD 467
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 104/155 (67%), Gaps = 4/155 (2%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++++L++ N+FN SD+I ++R LPD R CK KV+ + LP +S+VIVFHNEAWS
Sbjct: 24 LIESKRLFERNKFNQWASDKISLHRILPDARPKLCKDKVYPGD-LPPTSVVIVFHNEAWS 82
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ R+ +L EI+LVDD S + L + LD Y+AKL+ ++IR+ R GLI
Sbjct: 83 TLLRTIHSVLDRTAPDLLIEIILVDDKSVVKELHAPLDAYIAKLA-KVKIIRNKKREGLI 141
Query: 165 KARLLGARQA--EGEILVFLDAHCECTLVFNEEFL 197
++RL G A + ++ FLDAHCE + E L
Sbjct: 142 RSRLNGKSFAASKAPVVTFLDAHCEANTGWLEPLL 176
>gi|268580247|ref|XP_002645106.1| Hypothetical protein CBG16794 [Caenorhabditis briggsae]
Length = 568
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 181/415 (43%), Positives = 257/415 (61%), Gaps = 14/415 (3%)
Query: 193 NEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEY 252
N LP++SI+I FHNEAW+ ++RT+HS+ +RSPR +++EI+LVDD S + +LK LD Y
Sbjct: 125 NTTELPRASIIITFHNEAWTTIIRTLHSISNRSPRHLIEEIVLVDDYSDKYWLKGPLDIY 184
Query: 253 VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDR 312
V + +P V P R GLI+ARL GA+ A+G IL+FLD+H E + GWLE L++RVA+DR
Sbjct: 185 VRQFEIPVHVTHLPERSGLIRARLTGAKIAKGPILLFLDSHIEVSEGWLEPLISRVADDR 244
Query: 313 TRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDA--IIKRKDFTEPFKT 370
TR++ P+ID ISD F + WG F+W L F+W+ +D +I +K EP +T
Sbjct: 245 TRIIAPIIDNISDEDFGFSTGRTDLWGGFSWILSFKWFDMNGNDTQRLIAKK--AEPIRT 302
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P +AGGLFAI+R YF+ +GAYDE M+VWGGEN+E+SFR+W CGGS+EI PCSHV H+FR
Sbjct: 303 PTIAGGLFAINREYFYEMGAYDEGMEVWGGENVEISFRIWMCGGSMEIHPCSHVGHVFRT 362
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYSF V+ V+ N AR A VWMDE+ EF+FK P A+K + ++ R LR++LK
Sbjct: 363 KTPYSFTKEVNFVIRRNQARTAEVWMDEYKEFFFKMVPSAQKM-EIGDLQERKSLRERLK 421
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
C FKWYL +V +P + G I + KCV++ GP L C H
Sbjct: 422 CKPFKWYLKNVCSECHMPSEYHSLGAIVNKLNGKCVDR-----GGRVLGGPPGLGTCIHS 476
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
V + ++ CL N ++I C+G Q++ ++++
Sbjct: 477 HEQQGNQVWSWTGNKEIRSQNFCLS----SNKKGSELKIEMCNGSEDQKFEFNRK 527
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 74/159 (46%), Positives = 105/159 (66%), Gaps = 5/159 (3%)
Query: 34 GDSVDGGLH-SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTK--VFNEEFLP 90
GD V H NL +++ + N FN++ S+ I VNRTLPD R C+ N LP
Sbjct: 73 GDGVKIPDHLKNLEESR--FSENNFNVVASEMISVNRTLPDYRSDACRISGGKINTTELP 130
Query: 91 KSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSV 150
++SI+I FHNEAW+ ++RT+HS+ +RSPR +++EI+LVDD S + +LK LD YV + +
Sbjct: 131 RASIIITFHNEAWTTIIRTLHSISNRSPRHLIEEIVLVDDYSDKYWLKGPLDIYVRQFEI 190
Query: 151 PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
P V P R GLI+ARL GA+ A+G IL+FLD+H E +
Sbjct: 191 PVHVTHLPERSGLIRARLTGAKIAKGPILLFLDSHIEVS 229
>gi|156407314|ref|XP_001641489.1| predicted protein [Nematostella vectensis]
gi|156228628|gb|EDO49426.1| predicted protein [Nematostella vectensis]
Length = 353
Score = 357 bits (917), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 174/327 (53%), Positives = 226/327 (69%), Gaps = 4/327 (1%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
L + +C+ +LP +++VI FHNEAWS LLRTVHSVI RSP +L+EILL+DD ST
Sbjct: 20 LGSQGKCSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFST 79
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
++LKS L YVAKL RV+R+ R GLI+ARL+GAR A+G+++ FLDAHCE + WL
Sbjct: 80 HDYLKSKLTAYVAKLR-NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWL 138
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
+ L++R+ DRT V PVIDIIS F Y + G F+W++ F W++ ++ +R
Sbjct: 139 QPLLSRIHSDRTIVAVPVIDIISSTNFMYSGTPSAVIGGFSWDMQFTWHSL-PNNRQSER 197
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
KD T P +TP MAGGLF+IDR YFF G+YDE M VWGGENLEMSFR+WQCGG +EI PC
Sbjct: 198 KDRTAPIRTPTMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPC 257
Query: 422 SHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
S V H+FR PYSFPGG SEV NLARV VWMDE+ ++ + P+ + + + S
Sbjct: 258 SRVGHVFRTRFPYSFPGGYSEVSV-NLARVVHVWMDEYNQYVYMKRPDLQSLKYGD-ITS 315
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLP 508
R+ LR +LKC SFKWYL +V+P P
Sbjct: 316 RVALRNKLKCKSFKWYLENVYPEQTFP 342
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/122 (54%), Positives = 90/122 (73%), Gaps = 2/122 (1%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
KC +K + +LP +++VI FHNEAWS LLRTVHSVI RSP +L+EILL+DD ST ++L
Sbjct: 25 KCSSKSY-PSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYL 83
Query: 138 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
KS L YVAKL RV+R+ R GLI+ARL+GAR A+G+++ FLDAHCE + + + L
Sbjct: 84 KSKLTAYVAKLR-NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPLL 142
Query: 198 PK 199
+
Sbjct: 143 SR 144
>gi|291243602|ref|XP_002741690.1| PREDICTED: polypeptide GalNAc transferase 5-like [Saccoglossus
kowalevskii]
Length = 753
Score = 355 bits (911), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 185/420 (44%), Positives = 255/420 (60%), Gaps = 27/420 (6%)
Query: 196 FLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE--FLKSSLDEYV 253
LPK+S++IVFHNEAW+ LLRTV SVI RSP +L+EILLVDDAST E +L+S LDEYV
Sbjct: 297 ILPKTSVIIVFHNEAWTTLLRTVISVIDRSPWQLLEEILLVDDASTSEKYWLQSELDEYV 356
Query: 254 AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
AKL V TRVIR+ RVGLI+ RL G +A GE+L FLD+HCEC +GWLE L++ + DRT
Sbjct: 357 AKLPVITRVIRTGKRVGLIQGRLRGVEEARGEVLTFLDSHCECNIGWLEPLLSEIVNDRT 416
Query: 314 RVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
VV P +D+ISD TF Y ++ + G F W + F+WY+ + + D + P +TP
Sbjct: 417 TVVAPNLDVISDKTFGYTFIKPEQTMIGGFGWLVDFKWYSLPKRERLRVNNDMSRPLRTP 476
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
+AGGLFAID YF IG YD WG ENLE+SFRVWQCGG++EI PCSHV H+FR +
Sbjct: 477 TIAGGLFAIDADYFHRIGLYDPGFDTWGAENLELSFRVWQCGGTLEIVPCSHVGHVFRSS 536
Query: 432 SPYSFPGGVSE--VLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQL 489
PY + + + N R+ VWMD+ F+ P +Q + R +LR L
Sbjct: 537 IPYKYKDNKNPGLTIAKNNMRLMDVWMDDLKYFFLAILPHYAEQ-EFGDTSERKQLRSNL 595
Query: 490 KCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTH 549
KC FKWYL +++P + +PM + G I+HV++ +C+E G+ PA + PC
Sbjct: 596 KCKDFKWYLENIYPENTMPMQYQILGHIKHVESGECLEMSRKDGNT-----PA-IQPCGG 649
Query: 550 LPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQR------WTYD 603
++ + ++L + +CL EY+N ++ R C+ Q+ W+YD
Sbjct: 650 H--FDEVLMYTKQSNL--QHDYLCLYANEYDNRVTER----NCAYKEEQKNNPNFQWSYD 701
Score = 169 bits (427), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 97/202 (48%), Positives = 130/202 (64%), Gaps = 19/202 (9%)
Query: 13 ARNQQDYIDRRGVHVV--VGH-------YLGDSVDGGLHSNLSDAQ------QLYQINRF 57
+ +Q I RR +++ +GH Y +GG+ +L+ + YQ F
Sbjct: 209 TQTEQQQIQRRSGNILPQLGHRPLEQPWYPDSPGEGGMPVDLTPQEARLSKATFYQF-EF 267
Query: 58 NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 117
N++ SD+I +NRTLPD R C+ + + LPK+S++IVFHNEAW+ LLRTV SVI RS
Sbjct: 268 NIIASDKIALNRTLPDSRPVACEHREY-PHILPKTSVIIVFHNEAWTTLLRTVISVIDRS 326
Query: 118 PRSMLKEILLVDDASTRE--FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAE 175
P +L+EILLVDDAST E +L+S LDEYVAKL V TRVIR+ RVGLI+ RL G +A
Sbjct: 327 PWQLLEEILLVDDASTSEKYWLQSELDEYVAKLPVITRVIRTGKRVGLIQGRLRGVEEAR 386
Query: 176 GEILVFLDAHCECTLVFNEEFL 197
GE+L FLD+HCEC + + E L
Sbjct: 387 GEVLTFLDSHCECNIGWLEPLL 408
>gi|291235412|ref|XP_002737638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 497
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 227/343 (66%), Gaps = 8/343 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+++VFHNEAWS LLRT HS+I+RSPR +L E++LVDD ST+E LK LD+YVAKL
Sbjct: 77 LPTTSVIVVFHNEAWSTLLRTTHSIINRSPRELLMEVILVDDCSTQEHLKKPLDDYVAKL 136
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP V R R GLI++RL G A+G++L +LD+HCECT GWLE LV+R+ +DR V
Sbjct: 137 PVPVHVERMEVRSGLIRSRLRGGSVAKGDVLTYLDSHCECTEGWLEPLVSRIGDDRKTRV 196
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF------TEPFKT 370
P+IDII D +FAY+ + E + G F W+L +W + + ++ T +T
Sbjct: 197 QPIIDIIDDRSFAYIGASESNSGGFTWQLQHQWVRIPEYEQNRRVSEYDNIRQVTLFHRT 256
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLF+I++ YF +GAYD M VWGGEN+EMSFR+W CGG IEI PCS + H++R+
Sbjct: 257 PTMAGGLFSINKTYFEKMGAYDTGMDVWGGENIEMSFRIWMCGGKIEIIPCSRIGHVYRR 316
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PYSFP G +Y N RVA VWMD + +F++ + D V RLELR++L
Sbjct: 317 YIPYSFPNGSDPTIYRNAMRVAEVWMDHYKKFFYATQTKLH-MVDYGDVSDRLELRRKLG 375
Query: 491 CHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAK 532
CH+F WYL ++ P LP+DD +FG IR+ T C++ K
Sbjct: 376 CHNFTWYLKNIIPEMILPVDDANYFGEIRNDATGLCLDSASGK 418
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 78/138 (56%), Positives = 104/138 (75%), Gaps = 1/138 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++IN FNL+ SD+I +NR+LPDVR C K + + LP +S+++VFHNEAWS LLRT H
Sbjct: 41 WKINEFNLMASDKIALNRSLPDVRPRGCNDKKYPGK-LPTTSVIVVFHNEAWSTLLRTTH 99
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+RSPR +L E++LVDD ST+E LK LD+YVAKL VP V R R GLI++RL G
Sbjct: 100 SIINRSPRELLMEVILVDDCSTQEHLKKPLDDYVAKLPVPVHVERMEVRSGLIRSRLRGG 159
Query: 172 RQAEGEILVFLDAHCECT 189
A+G++L +LD+HCECT
Sbjct: 160 SVAKGDVLTYLDSHCECT 177
>gi|449676829|ref|XP_002167311.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 603
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 194/418 (46%), Positives = 263/418 (62%), Gaps = 29/418 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEAWS LLRTVHSVI+R+P LKEI+LVDDAST + LK LD+Y+ L
Sbjct: 164 LPPTSVIICFHNEAWSTLLRTVHSVINRTPPQYLKEIILVDDASTSDDLKQRLDDYIPNL 223
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ + ++R R GLI+ARL GA++A+G IL FLDAHCECTLGW E L+A++ EDR VV
Sbjct: 224 KIVS-IVRLRDRQGLIRARLEGAKKAKGPILTFLDAHCECTLGWAEPLLAKIKEDRQNVV 282
Query: 317 CPVIDIISDVTFAYVRSFE-LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
PVID IS+ F Y E G F W L F W S + +RK ++ KTP MAG
Sbjct: 283 MPVIDEISETNFNYNAVPEPFQRGVFKWRLEFTWRPIPSYEEQ-RRKHESDGIKTPVMAG 341
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+I+R YF+ +G+YD M +WGGEN+E+SFR+W CGGSIE+ PCS V H+FR PYS
Sbjct: 342 GLFSINRDYFYEMGSYDTGMDIWGGENIEISFRIWMCGGSIEMLPCSRVGHVFRPRFPYS 401
Query: 436 FP---GGVSEVLYGNLARVALVWMDEWAEFYF--KFNPEAEKQRDKQAVRSRLELRKQLK 490
FP GG +V+ NL RVA VWMDE+A+ ++ +F+ + +K D V +R++LR +L+
Sbjct: 402 FPNRRGGDGDVVSRNLMRVADVWMDEYAKHFYNIRFDLKRKKHDD---VTARVKLRSKLQ 458
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFF--GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C SF+WYL +V+P +P DDKF G IR+ ++ C++ L K Q P L C
Sbjct: 459 CKSFQWYLENVYPELEIP-DDKFLAAGEIRNPESGICLDT-LGK----QEGAPVGLYAC- 511
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRWTYDKE 605
H Q + ++ A D C+D ++ + I C G Q+WTY +
Sbjct: 512 HGQGGNQYYTYNNKGEIKAEDN--CMDFNGHD------LYIRECDGLGLNQKWTYKNQ 561
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 81/148 (54%), Positives = 108/148 (72%), Gaps = 2/148 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FN LVSD+I ++R+L D R +CK K + + LP +S++I FHNEAWS LLRTVH
Sbjct: 128 YEKHAFNQLVSDKISIHRSLKDYRNDQCKVKKYPVD-LPPTSVIICFHNEAWSTLLRTVH 186
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SVI+R+P LKEI+LVDDAST + LK LD+Y+ L + + ++R R GLI+ARL GA
Sbjct: 187 SVINRTPPQYLKEIILVDDASTSDDLKQRLDDYIPNLKIVS-IVRLRDRQGLIRARLEGA 245
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
++A+G IL FLDAHCECTL + E L K
Sbjct: 246 KKAKGPILTFLDAHCECTLGWAEPLLAK 273
>gi|196001819|ref|XP_002110777.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
gi|190586728|gb|EDV26781.1| hypothetical protein TRIADDRAFT_22201 [Trichoplax adhaerens]
Length = 518
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 191/422 (45%), Positives = 259/422 (61%), Gaps = 18/422 (4%)
Query: 187 ECT-LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC+ L + LP++S++IVFHNEAWS LLRTVHSV+ RSP +L+EI+LVDD+S E L
Sbjct: 61 ECSSLKYPIHKLPQTSVIIVFHNEAWSTLLRTVHSVLDRSPPELLREIILVDDSSDHEEL 120
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
S+L++YVAKLS +++R+ R GLI++RL G A + FLDAHCE +GWLE L+
Sbjct: 121 HSTLEKYVAKLS-KVKIVRNKAREGLIRSRLNGFAHATSPTVTFLDAHCEANVGWLEPLL 179
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
R+ ++RT VVCP ID+ISD TF Y S G+FNW L+FRW + +R T
Sbjct: 180 YRIMQNRTIVVCPEIDVISDETFEYTYSSGNVRGSFNWNLNFRWKAVPEYENK-RRAART 238
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+ ++P MAGGLF I YF IG YD++M++WGGENLE+SFR+WQCGG +EI PCSHV
Sbjct: 239 DGIRSPTMAGGLFTIHSQYFKDIGLYDKQMEIWGGENLELSFRIWQCGGQLEIIPCSHVG 298
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
H+FRK+ PYSFP G E L NL RVA VWMD + +++K P K + RLEL
Sbjct: 299 HVFRKSQPYSFPKGTGETLSKNLQRVAEVWMDGYKRYFYKRQPHL-KGHPFGDISKRLEL 357
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFF-GRIRHVQTHKCVEKPLAKGSMNQASGPASL 544
RK+LKC +F WY+ +V P FLP G +R+ + C++ + G+ A +
Sbjct: 358 RKKLKCKNFDWYIKNVVPEIFLPNSSIIARGELRNPASGDCID---SLGAGEHAY--IGI 412
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYEN-DISPRVRILACSGF-NRQRWTY 602
C H + Q V ++I D C D Y N S +V++L C Q+W +
Sbjct: 413 YKC-HKQMGNQYLVYTKNEEIIVDDN--CFD---YANSQPSSKVKMLDCHSMKGNQQWLH 466
Query: 603 DK 604
K
Sbjct: 467 TK 468
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 75/151 (49%), Positives = 107/151 (70%), Gaps = 1/151 (0%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
++++L+Q NRFN SDRI ++R+LPD R +C + + LP++S++IVFHNEAWS L
Sbjct: 30 ESRKLFQRNRFNQWASDRISLHRSLPDARILECSSLKYPIHKLPQTSVIIVFHNEAWSTL 89
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LRTVHSV+ RSP +L+EI+LVDD+S E L S+L++YVAKLS +++R+ R GLI++
Sbjct: 90 LRTVHSVLDRSPPELLREIILVDDSSDHEELHSTLEKYVAKLS-KVKIVRNKAREGLIRS 148
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL G A + FLDAHCE + + E L
Sbjct: 149 RLNGFAHATSPTVTFLDAHCEANVGWLEPLL 179
>gi|344266859|ref|XP_003405496.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Loxodonta africana]
Length = 622
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P LKEI+LVDDAST E+LK LD+YV +L
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTAPAIFLKEIILVDDASTEEYLKEQLDQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 236 QI-VRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDETVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIITIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWETVPLHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G++ V+ N R+A VWMD++ E +++ N +A K ++++ + RL+L++QL
Sbjct: 414 PHTFPKGIN-VIARNQVRLAEVWMDDYKEIFYRRNLQAAKMAEEKSFGDISERLKLKEQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F W+L +V+P F+P + F+G I+ + T C L G N P + PC
Sbjct: 473 HCHNFSWFLHNVYPEMFVPDLKPTFYGAIKSLGTDHC----LDVGENNHGGKPLIMYPCH 528
Query: 549 HL 550
L
Sbjct: 529 SL 530
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 78/145 (53%), Positives = 100/145 (68%), Gaps = 3/145 (2%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAW 103
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAW
Sbjct: 131 QEKEEGYKKHCFNAFASDRISLQRALGPDTRPPECLDQKFRRCPQLPTTSVIIVFHNEAW 190
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTV+SV+ +P LKEI+LVDDAST E+LK LD+YV +L + RV+R R GL
Sbjct: 191 STLLRTVYSVLHTAPAIFLKEIILVDDASTEEYLKEQLDQYVKQLQI-VRVVRQQERKGL 249
Query: 164 IKARLLGARQAEGEILVFLDAHCEC 188
I ARLLGA A+ E+L FLDAHCEC
Sbjct: 250 ITARLLGASVAQAEVLTFLDAHCEC 274
>gi|240120031|ref|NP_766039.2| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120034|ref|NP_001155239.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|240120036|ref|NP_001155240.1| polypeptide N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|51315988|sp|Q8C7U7.1|GALT6_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|26339910|dbj|BAC33618.1| unnamed protein product [Mus musculus]
gi|74196150|dbj|BAE32989.1| unnamed protein product [Mus musculus]
gi|74198297|dbj|BAE35316.1| unnamed protein product [Mus musculus]
gi|111601267|gb|AAI19325.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
gi|111601271|gb|AAI19327.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 [Mus musculus]
Length = 622
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 179/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKERLEQYVQQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QI-VRVVRQRERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTAVV 294
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + R + G F+W L F W + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEK---QRDKQAVRSRLELRKQL 489
P++FP G S V+ N R+A VWMD++ + +++ N +A K + + + RL LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+CH+F WYL +V+P F+P ++ F+G I+++ T++C L G N+ P + C
Sbjct: 473 RCHNFSWYLHNVYPEMFVPDLNPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYVCH 528
Query: 549 HL 550
+L
Sbjct: 529 NL 530
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/144 (54%), Positives = 101/144 (70%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLPTTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP +LKEI+LVDDAST E LK L++YV +L + RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKERLEQYVQQLQI-VRVVRQRERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|449667968|ref|XP_002168066.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Hydra magnipapillata]
Length = 548
Score = 348 bits (893), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 195/458 (42%), Positives = 267/458 (58%), Gaps = 41/458 (8%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVH 219
R+G +K L ++Q E LP +S++I FHNEA SALLRTVH
Sbjct: 119 RLGNVKGDLCSSKQYPAE-------------------LPNTSVIICFHNEATSALLRTVH 159
Query: 220 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR-----VIRSPGRVGLIKA 274
SVI+ +P ++L I+LVDDAS LK L Y+ +L+ + R+ R GL+++
Sbjct: 160 SVINETPPNILSNIVLVDDASVGAALKKPLRNYINELNRKLGEEMVILYRNAKRQGLVRS 219
Query: 275 RLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSF 334
RL GA A G +L FLD+HCE T GW+E L+ R+ ED+ VVCPVI++I V +Y ++
Sbjct: 220 RLKGAELASGTVLTFLDSHCEATEGWVEPLLFRIKEDKRNVVCPVIEVIDAVDLSYKKT- 278
Query: 335 EL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGA 390
EL G F W+L F W + D R D T+P K+P MAGGLFAID++YF+ IG+
Sbjct: 279 ELDRITQVGGFTWDLFFNWKEI-TEDEKRLRADGTQPLKSPTMAGGLFAIDKSYFYEIGS 337
Query: 391 YDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA-SPYSFPGGVSEVLYGNLA 449
YD +M++WGGENLEMSFR+W CGG +EI PCS V H+FRK SPYSFP GVS+ L N
Sbjct: 338 YDNQMEIWGGENLEMSFRIWMCGGKLEIIPCSRVGHIFRKENSPYSFPNGVSKTLAKNFN 397
Query: 450 RVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHF-LP 508
R+A VWMDE+ E Y++ P +K + R+ELRK+L C SFKWY+ +V P
Sbjct: 398 RLAEVWMDEYKELYYRRKPPEDKLVKYGDISERVELRKKLGCKSFKWYIDNVIPDMIGAD 457
Query: 509 MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIAT 568
+ G +R+V ++ C++ KG+ Q + PC L Q FV+ ++I
Sbjct: 458 PNPPAHGEVRNVASNMCLDSMGNKGNRAQ----IKVFPCHRLGG-NQFFVLSKRGEIIHN 512
Query: 569 DESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
DES CLD EN+ +V + C G Q W Y K+
Sbjct: 513 DES-CLDY-SLENE-ENKVDMWNCHGLGGNQEWIYTKD 547
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 97/149 (65%), Gaps = 6/149 (4%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
S ++ L++ + F+ L+SDRI +NR L +V+ C +K + E LP +S++I FHNEA SA
Sbjct: 95 SQSKNLFKNHSFDSLLSDRISLNRRLGNVKGDLCSSKQYPAE-LPNTSVIICFHNEATSA 153
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR-----VIRSPGR 160
LLRTVHSVI+ +P ++L I+LVDDAS LK L Y+ +L+ + R+ R
Sbjct: 154 LLRTVHSVINETPPNILSNIVLVDDASVGAALKKPLRNYINELNRKLGEEMVILYRNAKR 213
Query: 161 VGLIKARLLGARQAEGEILVFLDAHCECT 189
GL+++RL GA A G +L FLD+HCE T
Sbjct: 214 QGLVRSRLKGAELASGTVLTFLDSHCEAT 242
>gi|417403183|gb|JAA48410.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 599
Score = 348 bits (893), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 177/362 (48%), Positives = 242/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L++YV +L
Sbjct: 168 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEPLEQYVQQL 227
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+ ED T VV
Sbjct: 228 RI-VRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARITEDETAVV 286
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V+ +H G F+W L F W T + + +RKD T+P K+P
Sbjct: 287 SPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQ-RRKDETDPIKSPT 345
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 346 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 405
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G + V+ N R+A VWMDE+ E +++ N +A K +++ + RL+LR+QL
Sbjct: 406 PHTFPKGTN-VIARNQVRLAEVWMDEYKEIFYRRNIQAAKMAREKSFGDISERLQLREQL 464
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F WYL +++P F+P + F+G I+++ +C L G N+ P + PC
Sbjct: 465 HCRNFSWYLHNIYPEMFVPDLKPTFYGAIKNLGIDQC----LDVGENNRGGKPLIMYPCH 520
Query: 549 HL 550
L
Sbjct: 521 SL 522
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/145 (52%), Positives = 101/145 (69%), Gaps = 3/145 (2%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAW 103
+ ++ Y+ + FN SD+I + R L PD R P+C + F LP +S++IVFHNEAW
Sbjct: 123 QEKEEGYKKHCFNAFASDQISLQRALGPDTRPPECVNQKFRRCPPLPTTSVIIVFHNEAW 182
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTV+SV+ +P +LKEI+LVDDAST E+LK L++YV +L + RV+R R GL
Sbjct: 183 STLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEPLEQYVQQLRI-VRVVRQERRKGL 241
Query: 164 IKARLLGARQAEGEILVFLDAHCEC 188
I ARLLGA A+ E+L FLDAHCEC
Sbjct: 242 ITARLLGASVAQAEVLTFLDAHCEC 266
>gi|417412000|gb|JAA52417.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 624
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 177/362 (48%), Positives = 242/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L++YV +L
Sbjct: 178 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEPLEQYVQQL 237
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+ ED T VV
Sbjct: 238 RI-VRVVRQERRKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARITEDETAVV 296
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V+ +H G F+W L F W T + + +RKD T+P K+P
Sbjct: 297 SPDIVTIDLNTFEFSKPVQKGRVHSRGNFDWSLTFGWETLPAHERQ-RRKDETDPIKSPT 355
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 356 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 415
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G + V+ N R+A VWMDE+ E +++ N +A K +++ + RL+LR+QL
Sbjct: 416 PHTFPKGTN-VIARNQVRLAEVWMDEYKEIFYRRNIQAAKMAREKSFGDISERLQLREQL 474
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F WYL +++P F+P + F+G I+++ +C L G N+ P + PC
Sbjct: 475 HCRNFSWYLHNIYPEMFVPDLKPTFYGAIKNLGIDQC----LDVGENNRGGKPLIMYPCH 530
Query: 549 HL 550
L
Sbjct: 531 SL 532
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 101/144 (70%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SD+I + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 134 EKEEGYKKHCFNAFASDQISLQRALGPDTRPPECVNQKFRRCPPLPTTSVIIVFHNEAWS 193
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E+LK L++YV +L + RV+R R GLI
Sbjct: 194 TLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEPLEQYVQQLRI-VRVVRQERRKGLI 252
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 253 TARLLGASVAQAEVLTFLDAHCEC 276
>gi|291220820|ref|XP_002730422.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Saccoglossus kowalevskii]
Length = 1082
Score = 347 bits (890), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 181/422 (42%), Positives = 262/422 (62%), Gaps = 14/422 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V + LP +S++I FHNEAWS LLRT++SVI+RS +L+EI+LVDD S R+
Sbjct: 629 HSLCDTVRYNKDLPTASVIISFHNEAWSTLLRTIYSVINRSKIKLLQEIILVDDYSDRDE 688
Query: 245 LKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LK +LDEY+ + + +++ + R GLI+ARL+GA +A G+ILVFLD+HCE WLE
Sbjct: 689 LKVALDEYIQSNFNNKVKILHTTEREGLIRARLIGASKATGKILVFLDSHCEVNYNWLEP 748
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ R+ D + + CPVIDII +FAY S L G NW L F+W + +++R
Sbjct: 749 LIERIYRDSSTIACPVIDIIDPDSFAYSAS-PLVRGGVNWGLQFKWKNVPPVE-LLRRNS 806
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
EP K+P MAGGLFA+DR YF HIG+YD++MQ+WGGE+LE+SFR+WQCGG++EI PCS
Sbjct: 807 EIEPIKSPIMAGGLFAVDRNYFEHIGSYDKDMQIWGGEHLELSFRIWQCGGTLEIVPCSR 866
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK+ PY+ PGG+ V N RVA VWMD++ F++ P+A+ + + RL
Sbjct: 867 VGHIFRKSHPYTIPGGMENVFTHNSIRVAEVWMDDYKRFFYATRPDAQG-KTYGDLSERL 925
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
+L+ +LKC FKWYL +V+P +P ++ + +G ++ ++ C++ M +A P
Sbjct: 926 KLKSRLKCKDFKWYLDNVYPELSVPNENAYAWGECQNAASNVCLDT-----LMREAGQPV 980
Query: 543 SLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
L C H Q+F ++ E +CLDV + +P G N Q W +
Sbjct: 981 GLYIC-HGGGGNQVFSYTKLGEV--RHEELCLDVSTKKVGETPVFEQCHALGGN-QMWEH 1036
Query: 603 DK 604
K
Sbjct: 1037 RK 1038
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 68/142 (47%), Positives = 105/142 (73%), Gaps = 2/142 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A + + ++ FN++VSD I + R++ DV+ C T +N++ LP +S++I FHNEAWS L
Sbjct: 600 EADETFDVHGFNVVVSDMISLERSITDVKHSLCDTVRYNKD-LPTASVIISFHNEAWSTL 658
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIK 165
LRT++SVI+RS +L+EI+LVDD S R+ LK +LDEY+ + + +++ + R GLI+
Sbjct: 659 LRTIYSVINRSKIKLLQEIILVDDYSDRDELKVALDEYIQSNFNNKVKILHTTEREGLIR 718
Query: 166 ARLLGARQAEGEILVFLDAHCE 187
ARL+GA +A G+ILVFLD+HCE
Sbjct: 719 ARLIGASKATGKILVFLDSHCE 740
>gi|260823684|ref|XP_002606210.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
gi|229291550|gb|EEN62220.1| hypothetical protein BRAFLDRAFT_246892 [Branchiostoma floridae]
Length = 595
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 193/430 (44%), Positives = 258/430 (60%), Gaps = 36/430 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+SIVI F NEAWS LLRTVHSV+ R+PR +L+EI+L+DD S + LK L+EY+
Sbjct: 140 LPKTSIVICFFNEAWSTLLRTVHSVLDRTPRELLQEIILIDDFSDQSHLKEELEEYIRDH 199
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++ R+ R GLI+AR+ GA A G++L+FLD+HCE + WLE L+AR+AEDRTRVV
Sbjct: 200 LPMVQLYRTDKREGLIRARVKGATHASGDVLMFLDSHCEVSKQWLEPLLARIAEDRTRVV 259
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD-FTEPFKTPAMAG 375
CP+IDII+ TF Y S L G FNW LHF+W +++ D P +P MAG
Sbjct: 260 CPIIDIINSDTFEYTAS-PLVRGGFNWGLHFKWDQV--PQQLLQGPDGAAAPINSPTMAG 316
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR YF +G YDE M +WGGENLE+SFR+W CGG++EI PCS V H+FRK PY
Sbjct: 317 GLFAIDREYFDELGRYDEGMDIWGGENLEISFRIWMCGGTLEIIPCSRVGHVFRKRRPYG 376
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + N R+A VWMDE+ + YF PE K R + RL+LR++L CHSFK
Sbjct: 377 SPNG-EDTMSKNSLRMAHVWMDEYKDQYFSLRPEM-KTRTYGDISDRLKLREKLNCHSFK 434
Query: 496 WYLTHVWPHHFLPMDDKF----------------FGRIRHVQTHKCV---EKPLAKGSMN 536
WYL +++P F+P DK G I+H+ + C+ P KGS+
Sbjct: 435 WYLDNIYPELFVPGGDKLKQVGVGQLPPRPKVIKKGHIKHLDSGLCLISQNGPNEKGSLV 494
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN 596
S S Q++ + +L T +CLDV END RI+ C G +
Sbjct: 495 VVSECLS-------EDKNQVWYLTDQDELQLTG-LLCLDV--NENDPKSFPRIMKCHGTS 544
Query: 597 -RQRWTYDKE 605
Q+W + ++
Sbjct: 545 GGQQWKFAED 554
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 106/163 (65%), Gaps = 3/163 (1%)
Query: 39 GGLHS--NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
G LHS + Y+ + FNLL+SDRI +R +PD R KC+ K + LPK+SIVI
Sbjct: 89 GNLHSPEDQETRDMGYRRHAFNLLISDRIGFHRNIPDTRNDKCRGKSY-PSGLPKTSIVI 147
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
F NEAWS LLRTVHSV+ R+PR +L+EI+L+DD S + LK L+EY+ ++ R
Sbjct: 148 CFFNEAWSTLLRTVHSVLDRTPRELLQEIILIDDFSDQSHLKEELEEYIRDHLPMVQLYR 207
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ R GLI+AR+ GA A G++L+FLD+HCE + + E L +
Sbjct: 208 TDKREGLIRARVKGATHASGDVLMFLDSHCEVSKQWLEPLLAR 250
>gi|156397426|ref|XP_001637892.1| predicted protein [Nematostella vectensis]
gi|156225008|gb|EDO45829.1| predicted protein [Nematostella vectensis]
Length = 513
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 185/413 (44%), Positives = 256/413 (61%), Gaps = 15/413 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FH E S LLRTVHSVI+R+P +L E+++VDD S L LD++VA+
Sbjct: 67 LPTTSVIICFHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHVAQF 126
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GL++ARL GA A+G++L FLD+HCE T GW E L+AR+A DR VV
Sbjct: 127 T-KVKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLARIAADRRNVV 185
Query: 317 CPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CP I++I+ TFAY S G F+W+L F+W + + R D ++P +TP MAG
Sbjct: 186 CPAIEVINADTFAYQGSTNADQRGGFSWDLFFKWKGIPPEEQKL-RNDDSDPIRTPTMAG 244
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK-ASPY 434
GLF+I R YFF IG+YDEEM +WGGENLE+SFRVW CGG +EI CS V H+FRK SPY
Sbjct: 245 GLFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCSRVGHVFRKYTSPY 304
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
FP GV L N R+A VWMDE+ + Y+ P+A K D + RLELRK+LKC SF
Sbjct: 305 KFPDGVERTLTKNFNRLAEVWMDEYKDLYYNKKPQA-KNSDYGDISKRLELRKRLKCKSF 363
Query: 495 KWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVL 553
KWY+ +++P +P +D G +R+ +++C++ AK N G + C H
Sbjct: 364 KWYINNIYPDVQMPELDPPARGEVRNPSSNQCLDSLGAKPEHNARVG---IYTC-HGQGG 419
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
Q+ +P +LI +E+ C DV + V ++ C G Q W +D+E
Sbjct: 420 NQV-SKYMPRELIFEEEN-CFDVSKTHPGAP--VELMKCHGMRGNQEWKHDRE 468
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 100/153 (65%), Gaps = 2/153 (1%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFN-EEFLPKSSIVIVFHNEAWSAL 106
A++ + + FN L+SD+I ++RTL DVR +CK K LP +S++I FH E S L
Sbjct: 25 AEKYFANHSFNWLLSDKISLDRTLDDVRSERCKAKHNTYPAKLPTTSVIICFHKERLSVL 84
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LRTVHSVI+R+P +L E+++VDD S L LD++VA+ + +V+R R GL++A
Sbjct: 85 LRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHVAQFT-KVKVLRMKKREGLVRA 143
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RL GA A+G++L FLD+HCE T + E L +
Sbjct: 144 RLQGANTAKGDVLTFLDSHCEATPGWAEPLLAR 176
>gi|281348732|gb|EFB24316.1| hypothetical protein PANDA_010523 [Ailuropoda melanoleuca]
Length = 621
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 179/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +L+EI+LVDDAST ++LK L++YV KL
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDDYLKDQLEQYVKKL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEEETAVV 294
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V S +H G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G+S V+ N R+A VWMD + E +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGIS-VIARNQVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F W+LT+++P F+P + F+G IR++ ++C L G N P + C
Sbjct: 473 HCRNFSWFLTNIYPEMFVPDLKPTFYGAIRNLGINQC----LDVGENNHGGKPLIMYTCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 78/144 (54%), Positives = 101/144 (70%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPATSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP +L+EI+LVDDAST ++LK L++YV KL V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPAILLREIILVDDASTDDYLKDQLEQYVKKLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|301772392|ref|XP_002921627.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Ailuropoda melanoleuca]
Length = 622
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 179/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +L+EI+LVDDAST ++LK L++YV KL
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLREIILVDDASTDDYLKDQLEQYVKKL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEEETAVV 294
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V S +H G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFSKPVPSGRIHSRGNFDWSLTFGWEALPAHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G+S V+ N R+A VWMD + E +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGIS-VIARNQVRLAEVWMDSYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F W+LT+++P F+P + F+G IR++ ++C L G N P + C
Sbjct: 473 HCRNFSWFLTNIYPEMFVPDLKPTFYGAIRNLGINQC----LDVGENNHGGKPLIMYTCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 78/144 (54%), Positives = 101/144 (70%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPATSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP +L+EI+LVDDAST ++LK L++YV KL V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPAILLREIILVDDASTDDYLKDQLEQYVKKLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|301608339|ref|XP_002933739.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Xenopus (Silurana) tropicalis]
Length = 622
Score = 345 bits (885), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 239/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS E+LK LD+YV L
Sbjct: 174 LPTTSVIIVFHNEAWSTLLRTVYSVLYTSPAILLKEIILVDDASEDEYLKEKLDDYVKAL 233
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ ++ R R GL ARLLGA A GE+L FLDAHCEC GWLE L++R+AED T VV
Sbjct: 234 QI-VKIARQKERKGLTTARLLGASIATGEVLTFLDAHCECFHGWLEPLLSRIAEDHTAVV 292
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I II +F + + + + G F+W L F W +++ +RKD T P KTP
Sbjct: 293 SPDIPIIDLNSFEFHKPVQYGKTHNRGNFDWSLTFGWEAIPAAEKE-RRKDETYPIKTPT 351
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG+YDEEM++WGGENLEMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 352 FAGGLFSISKAYFEHIGSYDEEMEIWGGENLEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 411
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQ-RDKQ--AVRSRLELRKQL 489
P++FP G ++V++ NL R+A VWMD++ Y++ N +A K R+K + RL+L+ L
Sbjct: 412 PHTFPKG-TQVIFRNLVRLAEVWMDDYKLLYYQRNEQAAKMVREKSFGDISKRLKLKADL 470
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P F+P D ++G++++ + C L G N P + C
Sbjct: 471 QCKNFTWYLENIYPEMFVPDRDPTYYGKVKNEGSQNC----LDAGEKNHGGKPLIMNLCN 526
Query: 549 HL 550
+
Sbjct: 527 GM 528
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 75/139 (53%), Positives = 93/139 (66%), Gaps = 3/139 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRT 109
+ + FN SDRI ++R L PD R P C + F LP +S++IVFHNEAWS LLRT
Sbjct: 135 FDKHHFNAFASDRISLHRALGPDTRPPVCIEQKFKRCPSLPTTSVIIVFHNEAWSTLLRT 194
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
V+SV+ SP +LKEI+LVDDAS E+LK LD+YV L + ++ R R GL ARLL
Sbjct: 195 VYSVLYTSPAILLKEIILVDDASEDEYLKEKLDDYVKALQI-VKIARQKERKGLTTARLL 253
Query: 170 GARQAEGEILVFLDAHCEC 188
GA A GE+L FLDAHCEC
Sbjct: 254 GASIATGEVLTFLDAHCEC 272
>gi|22760242|dbj|BAC11118.1| unnamed protein product [Homo sapiens]
Length = 622
Score = 345 bits (885), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMTQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|296211689|ref|XP_002752525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Callithrix jacchus]
Length = 622
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 177/362 (48%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASMAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPIQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G + V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTN-VIARNQVRLAEVWMDSFKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 77/145 (53%), Positives = 100/145 (68%), Gaps = 3/145 (2%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAW 103
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAW
Sbjct: 131 QEKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAW 190
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GL
Sbjct: 191 STLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGL 249
Query: 164 IKARLLGARQAEGEILVFLDAHCEC 188
I ARLLGA A+ E+L FLDAHCEC
Sbjct: 250 ITARLLGASMAQAEVLTFLDAHCEC 274
>gi|397479051|ref|XP_003810846.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Pan paniscus]
gi|397479053|ref|XP_003810847.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pan paniscus]
Length = 622
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|332206188|ref|XP_003252173.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Nomascus leucogenys]
Length = 622
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|115298684|ref|NP_009141.2| polypeptide N-acetylgalactosaminyltransferase 6 [Homo sapiens]
gi|51316028|sp|Q8NCL4.2|GALT6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|37572269|gb|AAH35822.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|119578594|gb|EAW58190.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
gi|123980642|gb|ABM82150.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
gi|123995463|gb|ABM85333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6)
[synthetic construct]
Length = 622
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|426372562|ref|XP_004053192.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Gorilla
gorilla gorilla]
Length = 622
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFQRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|297691860|ref|XP_002823292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Pongo abelii]
gi|395744294|ref|XP_002823293.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
3 [Pongo abelii]
Length = 622
Score = 345 bits (884), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|73996388|ref|XP_850161.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Canis lupus familiaris]
Length = 622
Score = 344 bits (883), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 179/362 (49%), Positives = 240/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L++YV KL
Sbjct: 176 LPTTSVVIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEQLEQYVKKL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ ++L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAQVLTFLDAHCECFHGWLEPLLARIAEDETVVV 294
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V+ +H G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAIPAHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP GVS V+ N R+A VWMD + E +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGVS-VIARNQVRLAEVWMDNYKEIFYRRNMQAAKMAQEKSFGDISERLKLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F W+L +++P F+P + +G IR++ ++C L G N P + C
Sbjct: 473 HCHNFSWFLHNIYPEMFVPDLKPTLYGAIRNLGINQC----LDVGENNHGGKPLIMYTCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 81/153 (52%), Positives = 103/153 (67%), Gaps = 3/153 (1%)
Query: 38 DGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIV 95
D H + ++ Y+ + FN SDRI + R L PD R P+C + F LP +S+V
Sbjct: 123 DKWTHQETQEKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPTTSVV 182
Query: 96 IVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVI 155
IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L++YV KL V RV+
Sbjct: 183 IVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEQLEQYVKKLQV-VRVV 241
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
R R GLI ARLLGA A+ ++L FLDAHCEC
Sbjct: 242 RQEERKGLITARLLGASVAQAQVLTFLDAHCEC 274
>gi|156392174|ref|XP_001635924.1| predicted protein [Nematostella vectensis]
gi|156223022|gb|EDO43861.1| predicted protein [Nematostella vectensis]
Length = 415
Score = 344 bits (883), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 230/337 (68%), Gaps = 10/337 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPKSSI+I FHNEAWS LLRTVHSVI+R+P +L+EILL+DDAS R+ LK L+EYVAKL
Sbjct: 57 LPKSSIIICFHNEAWSTLLRTVHSVINRTPPRLLEEILLIDDASNRDELKEKLEEYVAKL 116
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V R+IR R GLI+ARL GA A+G IL FLDAHCEC+ GWLE L A++AE+ + VV
Sbjct: 117 KV-VRIIRLSKRQGLIRARLKGAAAAKGSILTFLDAHCECSKGWLEPLAAKIAENSSNVV 175
Query: 317 CPVIDIISDVTFAYVRSFE-LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
PVID ISD TF Y E H G F W L F W + + +RKD + +TP MAG
Sbjct: 176 MPVIDEISDTTFYYHAVPEPFHRGVFRWRLEFGWKPVPQYE-MERRKDEADGIRTPVMAG 234
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+ID+ YF IG YD M +WGGENLE+SFR+W CGG+IE+ PCS V H+FR PYS
Sbjct: 235 GLFSIDKNYFEKIGTYDTGMDIWGGENLEISFRIWMCGGAIEMLPCSRVGHVFRPRFPYS 294
Query: 436 F---PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
F PG ++V+ NL RVA VWMDE+ + ++ + K++ V RL LR++LKC
Sbjct: 295 FPARPGHNTDVVSNNLMRVADVWMDEYKKHFYNIRFDL-KRKQHDDVSQRLALREKLKCK 353
Query: 493 SFKWYLTHVWPHHFLPMDDKFF--GRIRHVQTHKCVE 527
+FKWYL +V+P +P D F G++R+ + C++
Sbjct: 354 NFKWYLDNVYPELEVP-DTNFAASGQVRNPSSDMCLD 389
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 83/141 (58%), Positives = 104/141 (73%), Gaps = 2/141 (1%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ Y+++ FNL+ SD + + R LPD R CK K + LPKSSI+I FHNEAWS LLR
Sbjct: 18 EEGYELHSFNLVASDMMSLYRRLPDYRNDACKAKKY-PLHLPKSSIIICFHNEAWSTLLR 76
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+R+P +L+EILL+DDAS R+ LK L+EYVAKL V R+IR R GLI+ARL
Sbjct: 77 TVHSVINRTPPRLLEEILLIDDASNRDELKEKLEEYVAKLKV-VRIIRLSKRQGLIRARL 135
Query: 169 LGARQAEGEILVFLDAHCECT 189
GA A+G IL FLDAHCEC+
Sbjct: 136 KGAAAAKGSILTFLDAHCECS 156
>gi|291230378|ref|XP_002735140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 621
Score = 344 bits (883), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 164/362 (45%), Positives = 235/362 (64%), Gaps = 12/362 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S+++V+HNEAWS L+RTVHSVI+RSPR +L+EILL+DDASTRE+L LD+Y+ KL
Sbjct: 158 LPKTSVILVYHNEAWSVLMRTVHSVINRSPRHLLEEILLIDDASTREYLGRPLDDYITKL 217
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP RV + R GLI ARL GA A+ +L FLD+HCEC+ GWLE L+ R+A +R+ VV
Sbjct: 218 PVPVRVHHAKERRGLIGARLKGAELAKAPVLTFLDSHCECSKGWLEPLLDRIAANRSTVV 277
Query: 317 CPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVI+ I D +FA+V + E+ H G F+W + F WY S+ D +EP ++P MAG
Sbjct: 278 CPVINQIDDRSFAFVNATEVSHIGGFDWNIIFNWYNIPQSEKDRIGGDKSEPVRSPTMAG 337
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+ID++YF +G+YD E + WGGEN+E+S ++W CGG +E PCSHV H+FRK +P+
Sbjct: 338 GLFSIDKSYFEELGSYDPEFEFWGGENIELSLKIWMCGGILEFVPCSHVGHVFRKHNPHK 397
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
+ V+ N R+A VW+DE+ ++ PE K D + R++LRK L+C SF+
Sbjct: 398 YKNTTYNVVGRNNRRLAEVWLDEYKYLFYANQPETMKI-DPGDISQRVQLRKNLQCKSFR 456
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKG----------SMNQASGPASLL 545
W+L +++P G++++V + C++ A G + N S L
Sbjct: 457 WFLQNIYPDSHYNFAFVGVGQLKNVASGACLDFGKAAGHGGKEFKGKDATNVTSNTVELW 516
Query: 546 PC 547
PC
Sbjct: 517 PC 518
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 78/148 (52%), Positives = 109/148 (73%), Gaps = 1/148 (0%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+++++N FNL++S+RI +R+L D R C K ++ LPK+S+++V+HNEAWS L+RT
Sbjct: 120 KMFKVNYFNLMISNRISNDRSLADYRPQGCFAKKYSRN-LPKTSVILVYHNEAWSVLMRT 178
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSVI+RSPR +L+EILL+DDASTRE+L LD+Y+ KL VP RV + R GLI ARL
Sbjct: 179 VHSVINRSPRHLLEEILLIDDASTREYLGRPLDDYITKLPVPVRVHHAKERRGLIGARLK 238
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFL 197
GA A+ +L FLD+HCEC+ + E L
Sbjct: 239 GAELAKAPVLTFLDSHCECSKGWLEPLL 266
>gi|5834600|emb|CAA69876.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
gi|300470331|dbj|BAJ10977.1| UDP-N-acetyl-alpha-D-galactosamine: polypeptide
N-acetylgalactosaminyltransferase 6 [Homo sapiens]
Length = 622
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSIPKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|390333619|ref|XP_785951.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 756
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 168/332 (50%), Positives = 221/332 (66%), Gaps = 4/332 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWSALLRTVHSVI+R+PR L EI+LVDDAS L LD YVAKL
Sbjct: 307 LPNTSVIIVFHNEAWSALLRTVHSVINRTPRQYLSEIILVDDASIHAHLGHQLDSYVAKL 366
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP V R R GLI+AR+ GA A+G++L FLD+HCE + GWLE L+AR+AEDR+ VV
Sbjct: 367 PVPVHVERMGVRSGLIRARMRGALVAQGQVLTFLDSHCEASHGWLEPLLARIAEDRSNVV 426
Query: 317 CPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
PVID+I+ AY + G F+W L FRW + D + + D T P +P MAG
Sbjct: 427 TPVIDVINAQNLAYEADNQTPAIGVFDWSLTFRWQSIQRRDLPLLKHDPTHPIPSPTMAG 486
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR+YF G YD ++WG ENLE+SF+ W CGG IEI PCSHV H+FRK +PYS
Sbjct: 487 GLFAIDRSYFIETGMYDSGFEIWGAENLEISFKTWMCGGRIEILPCSHVGHIFRKHAPYS 546
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
+++ + N R+A VW+D + EF++ +P A K + R+ELR +L C SF+
Sbjct: 547 --NTLTDFISYNNKRLAEVWLDGYKEFFYFMSPSALKV-NAGNYTDRVELRDRLGCRSFQ 603
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
WYL +V+P P +K +G +RH T+ C++
Sbjct: 604 WYLENVFPEGGWPGRNKIYGEVRHTATNWCLD 635
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 85/154 (55%), Positives = 108/154 (70%), Gaps = 1/154 (0%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
S+A L+ IN FNL+ SD I +NR+LPDVR +C K ++ LP +S++IVFHNEAWSA
Sbjct: 265 SEADDLFIINSFNLMASDMIGINRSLPDVRPKQCLYKQYSSA-LPNTSVIIVFHNEAWSA 323
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLRTVHSVI+R+PR L EI+LVDDAS L LD YVAKL VP V R R GLI+
Sbjct: 324 LLRTVHSVINRTPRQYLSEIILVDDASIHAHLGHQLDSYVAKLPVPVHVERMGVRSGLIR 383
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
AR+ GA A+G++L FLD+HCE + + E L +
Sbjct: 384 ARMRGALVAQGQVLTFLDSHCEASHGWLEPLLAR 417
>gi|402593617|gb|EJW87544.1| glycosyltransferase [Wuchereria bancrofti]
Length = 520
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 180/402 (44%), Positives = 251/402 (62%), Gaps = 25/402 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C + + LP +S+VI+F +EAWS L+RTVHSVI+R+P +L+EI+LVDD S R+
Sbjct: 60 HKQCRKISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDE 119
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+ + R++R+P R GLI+A+LLGA++A G++LVFLD+HCE GWLE L
Sbjct: 120 LKGKLEEYIKRFGDKVRLVRAPERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPL 179
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGS--SDAIIKR 361
+AR+ + R+ V+CP+I+ IS T Y H G F W LHFRW SDA
Sbjct: 180 LARIKDKRSAVLCPIINHISPETLTYSANDRPAHVGGFWWSLHFRWDPMPKEYSDA---- 235
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
D TEP ++P MAGGL A+DR YFF +G YD EM +WGGENLEMSFRVW CGGS+E PC
Sbjct: 236 -DPTEPIRSPTMAGGLLAVDRLYFFEVGGYDPEMDIWGGENLEMSFRVWMCGGSVEFIPC 294
Query: 422 SHVAHLFRKASPYSF--PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
SHV H+FR PY+ PG +V N R+A VWMD++ +FY+ + K++D +
Sbjct: 295 SHVGHIFRAGHPYNMIGPGNNKDVHGTNSKRLAEVWMDDYKKFYYIHRLDL-KEKDVGDL 353
Query: 480 RSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF--FGRIRHVQTHKCVEKPLAKGSMNQ 537
R LR++LKC SFKWYL +V + F+ +D+ FG +R+ + C++ ++ +
Sbjct: 354 SERKALRQKLKCKSFKWYLENVAKNKFV-LDENVAAFGSLRNPSSELCLD------TLQR 406
Query: 538 ASG---PASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDV 576
G P S+ PC + Q+F L D I E C +
Sbjct: 407 DEGEAIPLSVFPCQNGKSEAQIF--SLTNDGILRRELTCAKI 446
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 66/131 (50%), Positives = 101/131 (77%), Gaps = 1/131 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N++ SD I ++R+LPD R +C+ ++++ LP +S+VI+F +EAWS L+RTVHSVI+R
Sbjct: 41 MNVVASDLISLDRSLPDRRHKQCRKISYSDD-LPVASVVIIFTDEAWSPLMRTVHSVINR 99
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P +L+EI+LVDD S R+ LK L+EY+ + R++R+P R GLI+A+LLGA++A G
Sbjct: 100 TPLKLLQEIILVDDFSQRDELKGKLEEYIKRFGDKVRLVRAPERQGLIRAKLLGAKEAVG 159
Query: 177 EILVFLDAHCE 187
++LVFLD+HCE
Sbjct: 160 DVLVFLDSHCE 170
>gi|410210024|gb|JAA02231.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410247040|gb|JAA11487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
gi|410351197|gb|JAA42202.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Pan
troglodytes]
Length = 622
Score = 344 bits (882), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLMPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|270011456|gb|EFA07904.1| hypothetical protein TcasGA2_TC005479 [Tribolium castaneum]
Length = 621
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 185/428 (43%), Positives = 260/428 (60%), Gaps = 26/428 (6%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 242
D C+ F +E LP++S++I FHNEAWS LLRTVHSV+ RSP ++KE++LVDD S
Sbjct: 131 DEWCKAPGRF-QEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDM 189
Query: 243 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
+ LK L +Y A ++IR+ R GLI+ARLLGA AEGE+L +LD+HCECT GWLE
Sbjct: 190 DHLKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLE 248
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYV--RSFELHWGAFNWELHFRWYTYGSSDAIIK 360
L+ R+A D T VVCPVID+I D T Y S ++ G F+W L F W+ + +
Sbjct: 249 PLLDRIARDPTTVVCPVIDVIDDTTLEYHFHDSGGVNVGGFDWNLQFNWHAVPEHEKK-R 307
Query: 361 RKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
K+ EP +P MAGGLF+ID+ +F +G YD +WGGENLE+SF+ W CGG++EI P
Sbjct: 308 HKNPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVP 367
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
CSHV H+FRK SPY + GV+ VL N R+A VW+DE+A++Y++ ++ D +
Sbjct: 368 CSHVGHIFRKRSPYKWRSGVN-VLRRNSVRLAEVWLDEYAKYYYQRI--GNEKGDFGDIT 424
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH--VQTHKCVEKPLAKGSMNQA 538
SR LR++L C SFKWYL +++P F+P + G IR+ + C++ P + +++
Sbjct: 425 SRKALREKLGCKSFKWYLDNIYPELFIPGEAVASGEIRNLGIGGKTCLDSPARRSDLHK- 483
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFN 596
P L PC H Q+ V+ D +C+D P D+ + + C G N
Sbjct: 484 --PVGLYPC-HRQGGNQISVL---------DRELCIDSPCKPEDLHNPIGLWPCHKQGGN 531
Query: 597 RQRWTYDK 604
Q W Y K
Sbjct: 532 -QFWMYSK 538
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 125/209 (59%), Gaps = 28/209 (13%)
Query: 41 LHSNLS-DAQQL----YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIV 95
L SNLS D ++L +Q N FN VSD I V+R+LPD R CK +E LP++S++
Sbjct: 91 LPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRDEWCKAPGRFQEALPQTSVI 150
Query: 96 IVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVI 155
I FHNEAWS LLRTVHSV+ RSP ++KE++LVDD S + LK L +Y A ++I
Sbjct: 151 ICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDHLKQQLVDYFAS-EPKVKII 209
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALL 215
R+ R GLI+ARLLGA AEGE+L +LD+HCECT + E L +
Sbjct: 210 RAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPLLDR---------------- 253
Query: 216 RTVHSVISRSPRSMLKEILLVDDASTREF 244
I+R P +++ ++ V D +T E+
Sbjct: 254 ------IARDPTTVVCPVIDVIDDTTLEY 276
>gi|285026454|ref|NP_001165534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Rattus norvegicus]
Length = 622
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 182/362 (50%), Positives = 238/362 (65%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAST E LK L+ YV +L
Sbjct: 176 LPTTSVVIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKEKLERYVQQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QI-VRVVRQQERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTAVV 294
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + +R + H G F+W L F W + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFQFSKPMRRGKAHSRGNFDWSLTFGWEMLPEHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD++ + +++ N +A K + V RL LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRNLQAAKMAKENNFGDVSERLRLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P ++ F G I+++ T +C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLNPTFSGAIKNLGTSQC----LDVGENNRGGKPLIMYVCH 528
Query: 549 HL 550
+L
Sbjct: 529 NL 530
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/139 (56%), Positives = 97/139 (69%), Gaps = 3/139 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRT 109
Y+ + FN SDRI + R+L PD R P+C + F LP +S+VIVFHNEAWS LLRT
Sbjct: 137 YKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLPTTSVVIVFHNEAWSTLLRT 196
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
V+SV+ SP +LKEI+LVDDAST E LK L+ YV +L + RV+R R GLI ARLL
Sbjct: 197 VYSVLHTSPAILLKEIILVDDASTDEHLKEKLERYVQQLQI-VRVVRQQERKGLITARLL 255
Query: 170 GARQAEGEILVFLDAHCEC 188
GA A+ E+L FLDAHCEC
Sbjct: 256 GASVAQAEVLTFLDAHCEC 274
>gi|324510655|gb|ADY44456.1| N-acetylgalactosaminyltransferase 9 [Ascaris suum]
Length = 577
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 183/426 (42%), Positives = 256/426 (60%), Gaps = 15/426 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H EC V ++ LP +S+VI+F +EAW+ LLRTVHSV++RSP +L E++L+DD S RE
Sbjct: 119 HAECRSVHYDDDLPSASVVIIFTDEAWTPLLRTVHSVVNRSPLHLLHEVILLDDFSQREE 178
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK LDEY+ + ++IR R GLI+A+L GA +A GE++VFLD+HCE GWLE L
Sbjct: 179 LKGKLDEYIKRFGGIVKLIRKKERHGLIRAKLAGAHEATGEVIVFLDSHCEANEGWLEPL 238
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKD 363
+AR+ E RT V+CP+ID IS T Y ++ G F W LHFRW + G ++ +RK
Sbjct: 239 LARIKEKRTAVLCPIIDYISAETMQYSGDANVNAVGGFWWSLHFRWDSIGKAER-DRRKS 297
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
EP ++P MAGGL A +R YF +G YD M +WGGENLE+SFRVW CGGSIE PCSH
Sbjct: 298 AIEPVRSPTMAGGLLAANREYFLEVGGYDPGMDIWGGENLEISFRVWMCGGSIEFIPCSH 357
Query: 424 VAHLFRKASPYSF--PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
V H+FR PY+ PGG +V N R+A VWMD++ Y+ P+ K +D +
Sbjct: 358 VGHIFRAGHPYNMTGPGGNLDVHGTNSKRLAEVWMDDYKRLYYLHRPDL-KTKDVGDLSE 416
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASG 540
R LRK+LKC SFKWYL +V PH F+P + +G +R+ + C++ L + + +
Sbjct: 417 RKALRKKLKCKSFKWYLDNVIPHKFIPDEGVVGYGALRNPNSGLCLDT-LQRDEKSTIT- 474
Query: 541 PASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGFNRQ 598
+ C Q+F + L E C V D P+ +R+ +C N
Sbjct: 475 -LGIFACQTGGSSAQVFSLTKSGQL--RREITCASVLRSRED-GPKGTIRMPSCDD-NND 529
Query: 599 RWTYDK 604
+W+++K
Sbjct: 530 KWSHEK 535
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 102/143 (71%), Gaps = 1/143 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N++ SD++ ++R++PD R +C++ ++++ LP +S+VI+F +EAW+ LLRTVHSV++R
Sbjct: 100 MNVVASDKLSMDRSIPDTRHAECRSVHYDDD-LPSASVVIIFTDEAWTPLLRTVHSVVNR 158
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L E++L+DD S RE LK LDEY+ + ++IR R GLI+A+L GA +A G
Sbjct: 159 SPLHLLHEVILLDDFSQREELKGKLDEYIKRFGGIVKLIRKKERHGLIRAKLAGAHEATG 218
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
E++VFLD+HCE + E L +
Sbjct: 219 EVIVFLDSHCEANEGWLEPLLAR 241
>gi|403296667|ref|XP_003939220.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Saimiri boliviensis boliviensis]
gi|403296669|ref|XP_003939221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Saimiri boliviensis boliviensis]
Length = 622
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 177/362 (48%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G + V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTN-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLKLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/145 (52%), Positives = 100/145 (68%), Gaps = 3/145 (2%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAW 103
+ ++ ++ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAW
Sbjct: 131 QEKEEGFKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAW 190
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GL
Sbjct: 191 STLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGL 249
Query: 164 IKARLLGARQAEGEILVFLDAHCEC 188
I ARLLGA A+ E+L FLDAHCEC
Sbjct: 250 ITARLLGASVAQAEVLTFLDAHCEC 274
>gi|26324460|dbj|BAC25984.1| unnamed protein product [Mus musculus]
Length = 622
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 176/362 (48%), Positives = 239/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +L EI+L+DDAST E LK L++YV +L
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLNEIILMDDASTDEHLKERLEQYVQQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QI-VRVVRQRERGGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTAVV 294
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + R + G F+W L F W + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPEHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG YD +M++WGGEN+EMSFRVWQCGG + I PCS V H+FR S
Sbjct: 354 FAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLGIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEK---QRDKQAVRSRLELRKQL 489
P++FP G S V+ N R+A VWMD++ + +++ N +A K + + + RL LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLRLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+CH+F WYL +V+P F+P ++ F+G I+++ T++C L G N+ P + C
Sbjct: 473 RCHNFSWYLHNVYPEMFVPDLNPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYVCH 528
Query: 549 HL 550
+L
Sbjct: 529 NL 530
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLPTTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP +L EI+L+DDAST E LK L++YV +L + RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPAILLNEIILMDDASTDEHLKERLEQYVQQLQI-VRVVRQRERGGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|89365963|gb|AAI14506.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Homo
sapiens]
Length = 622
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 177/362 (48%), Positives = 240/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + R +LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERPQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|410964449|ref|XP_003988767.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Felis
catus]
Length = 622
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 176/362 (48%), Positives = 238/362 (65%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAST E+LK LD+YV KL
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEYLKEQLDQYVKKL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 236 QI-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDETVVV 294
Query: 317 CPVIDIISDVTFAYVRSF---ELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + +H G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFSKPVPRGRVHSRGNFDWSLTFGWEALPAHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G+S V+ N R+A VWMD + E +++ N +A K +++ + RL+L+++L
Sbjct: 414 PHTFPKGIS-VIARNQVRLAEVWMDSYKEIFYRRNLQAAKMAQEKSFGDISERLQLKERL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F W+L +++P F+P + F+G IR++ +C L G N P + C
Sbjct: 473 HCRNFSWFLHNIYPEMFVPDLKPTFYGAIRNLGVDQC----LDVGENNHGGKPLIMYTCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 80/144 (55%), Positives = 101/144 (70%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPTTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP +LKEI+LVDDAST E+LK LD+YV KL + RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPAILLKEIILVDDASTDEYLKEQLDQYVKKLQI-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|449683613|ref|XP_002154358.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Hydra magnipapillata]
Length = 641
Score = 342 bits (876), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 179/414 (43%), Positives = 256/414 (61%), Gaps = 20/414 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA+S LLRTVHSV++R+P +L +I+LVDD S E LK LD++VA+L
Sbjct: 185 LPTASVIICFHNEAYSVLLRTVHSVLNRTPPDLLTDIILVDDKSEYENLKRPLDDHVAQL 244
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S ++IR+ R GLI++R+ GA + G++L+FLD+HCE T GW E L+AR+AE + VV
Sbjct: 245 SKKIKIIRNAKRSGLIRSRINGADLSRGDVLIFLDSHCETTPGWAEPLLARIAEKSSNVV 304
Query: 317 CPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
P+I++I+ T Y + G F+W+L ++W + + RK + +TP MAG
Sbjct: 305 VPIIEVINADTLQYAAAANPDQRGGFSWDLFYKWKPIPLDEQHL-RKSPIDVIRTPTMAG 363
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK-ASPY 434
GLFAIDR YF+ +G YDEEM +WGGENLEMSFR+W CGG I+I PCS V H+FRK SPY
Sbjct: 364 GLFAIDRKYFYDMGTYDEEMDIWGGENLEMSFRIWMCGGRIDIIPCSRVGHIFRKFTSPY 423
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
FP GV + L NL R+A VW+DE+ E Y++ P++ K +D + RL LR +L C SF
Sbjct: 424 KFPDGVEKTLSKNLNRLAEVWLDEYKELYYQKRPQS-KGKDYGDISQRLALRNKLNCKSF 482
Query: 495 KWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASG----PASLLPCTH 549
KWY+ +++P LP + G I++ + C++ SM G + PC H
Sbjct: 483 KWYIENIYPDVQLPDLYPPARGEIKNPASSYCLD------SMGDMKGNNVKKLGIFPC-H 535
Query: 550 LPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTY 602
Q FV +++ DE CLDV + + + I+ C F Q+W +
Sbjct: 536 GQGGNQNFVFSRKGEIV-FDEEYCLDVSSSKPGV--LIDIMKCHNFGGNQQWIH 586
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/164 (45%), Positives = 114/164 (69%), Gaps = 7/164 (4%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKC--KTKVFNEEFLPKSSIVIVFHNEAWSA 105
A++++ + FN ++SD+I ++RT+ DVR C K K + + LP +S++I FHNEA+S
Sbjct: 143 AEKIFANHSFNSVLSDKISLDRTMRDVRGDLCIEKHKTYPRK-LPTASVIICFHNEAYSV 201
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLRTVHSV++R+P +L +I+LVDD S E LK LD++VA+LS ++IR+ R GLI+
Sbjct: 202 LLRTVHSVLNRTPPDLLTDIILVDDKSEYENLKRPLDDHVAQLSKKIKIIRNAKRSGLIR 261
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLP----KSSIVIV 205
+R+ GA + G++L+FLD+HCE T + E L KSS V+V
Sbjct: 262 SRINGADLSRGDVLIFLDSHCETTPGWAEPLLARIAEKSSNVVV 305
>gi|149714568|ref|XP_001504374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Equus
caballus]
Length = 622
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 238/362 (65%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +L+EI+LVDDAST E+LK L++YV +L
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLREIILVDDASTDEYLKEQLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 236 QV-VRVVRQKERTGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDETAVV 294
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V+ +H G F+W L F W + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLSFGWEALPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G+S V+ N R+A VWMD + E +++ N +A K +++ + RL+LR++L
Sbjct: 414 PHTFPKGIS-VIARNQVRLAEVWMDGYKEIFYRRNMQAAKMAQEKSFGDISERLQLRERL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F W+L +++P F+P + F+G I+++ C L G N P + C
Sbjct: 473 HCHNFSWFLQNIYPEMFVPDLKPTFYGAIKNLGIDHC----LDVGENNHGGKPLIMYTCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 101/144 (70%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPTTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +L+EI+LVDDAST E+LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLREIILVDDASTDEYLKEQLEQYVKQLQV-VRVVRQKERTGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|291389167|ref|XP_002711235.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6
[Oryctolagus cuniculus]
Length = 622
Score = 341 bits (875), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 242/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +L+EI+LVDDAST E+LK L++YV +L
Sbjct: 176 LPSTSVIIVFHNEAWSTLLRTVYSVLHTAPAILLREIILVDDASTEEYLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFTGWLEPLLARIAEDETVVV 294
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V+ +H G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFSKPVQRGRVHSRGNFDWSLTFGWEAVPAHENR-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G + V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTN-VIARNQVRLAEVWMDNYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F W+L +V+P F+P ++ F+G I+++ +C++ G N+ P + C
Sbjct: 473 HCHNFSWFLHNVYPEMFVPDLNPTFYGAIKNLGLGQCLD----VGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/150 (52%), Positives = 102/150 (68%), Gaps = 3/150 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRT 109
Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS LLRT
Sbjct: 137 YKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPSTSVIIVFHNEAWSTLLRT 196
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
V+SV+ +P +L+EI+LVDDAST E+LK L++YV +L V RV+R R GLI ARLL
Sbjct: 197 VYSVLHTAPAILLREIILVDDASTEEYLKEKLEQYVKQLQV-VRVVRQEERKGLITARLL 255
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA A+ E+L FLDAHCEC + E L +
Sbjct: 256 GASVAQAEVLTFLDAHCECFTGWLEPLLAR 285
>gi|327290100|ref|XP_003229762.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Anolis carolinensis]
Length = 634
Score = 341 bits (874), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 235/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L+ LD+YV +
Sbjct: 185 LPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDDYLQDKLDDYVKQF 244
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A GE L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 245 HI-VKVVRQKERKGLITARLLGASIATGETLTFLDAHCECFYGWLEPLLARIAENNTYVV 303
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + ++ KRKD T P KTP
Sbjct: 304 SPDISSIDLNTFEFSKPSPYGQSHNRGNFDWSLSFGWESLPEHESK-KRKDETYPIKTPT 362
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF++IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 363 FAGGLFSISKDYFYNIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIIPCSVVGHVFRSKS 422
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ +++ N EA K +Q + R EL+++L
Sbjct: 423 PHSFPKG-TQVITRNQVRLAEVWMDEYKNIFYRRNTEAAKIVKQQTFGDISKRHELKQRL 481
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C FKWYL++V+P ++P ++ G +++V C L G N P + C
Sbjct: 482 QCKDFKWYLSNVYPEAYVPDLNPPLSGFLKNVGRRAC----LDVGENNHGGKPLIMYTCH 537
Query: 549 HL 550
L
Sbjct: 538 GL 539
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/134 (55%), Positives = 93/134 (69%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 151 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVM 210
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L+ LD+YV + + +V+R R GLI ARLLGA A
Sbjct: 211 YTSPAILLKEIILVDDASVDDYLQDKLDDYVKQFHI-VKVVRQKERKGLITARLLGASIA 269
Query: 175 EGEILVFLDAHCEC 188
GE L FLDAHCEC
Sbjct: 270 TGETLTFLDAHCEC 283
>gi|351697576|gb|EHB00495.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Heterocephalus
glaber]
Length = 622
Score = 340 bits (873), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 243/362 (67%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAST E+LK L+ YV +L
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEYLKEKLERYVEQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+ VV
Sbjct: 236 QI-VKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFYGWLEPLLARIAEDQVAVV 294
Query: 317 CPVIDIISDVTFAYVRSF---ELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I+ TF + + +H G F+W L F W T + + +R+D T P K+P
Sbjct: 295 SPDIVTINLDTFEFSKPIPGGRVHSRGNFDWSLTFGWETLPAQEKQ-RREDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EIAPCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGRLEIAPCSVVGHVFRSRS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD++ + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VISRNQVRLAEVWMDDYKKIFYRRNLQAAKIAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F W+L +++P F+P ++ F+G I+++ ++C L G N+ P + C
Sbjct: 473 HCRNFSWFLHNIYPEMFVPDLNPTFYGAIKNLGINQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/155 (50%), Positives = 104/155 (67%), Gaps = 3/155 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVNQKFRRCPSLPATSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP +LKEI+LVDDAST E+LK L+ YV +L + +V+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPAILLKEIILVDDASTDEYLKEKLERYVEQLQI-VKVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
ARLLGA A+ E+L FLDAHCEC + E L +
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCECFYGWLEPLLAR 285
>gi|194384516|dbj|BAG59418.1| unnamed protein product [Homo sapiens]
Length = 603
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 177/362 (48%), Positives = 240/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 157 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 216
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC G LE L+AR+AED+T VV
Sbjct: 217 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGRLEPLLARIAEDKTVVV 275
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 276 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 334
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 335 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 394
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 395 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 453
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 454 HCHNFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 509
Query: 549 HL 550
L
Sbjct: 510 GL 511
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 113 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 172
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 173 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 231
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 232 TARLLGASVAQAEVLTFLDAHCEC 255
>gi|118093614|ref|XP_422023.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Gallus
gallus]
Length = 632
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 176/362 (48%), Positives = 234/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 242
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A GE L FLDAHCEC GWLE L+AR+AE+ VV
Sbjct: 243 QI-VKVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLLARIAENSVAVV 301
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P +TP
Sbjct: 302 SPDIASIDLNTFEFSKPSPYGHNHNRGNFDWSLSFGWESLPKYENK-RRKDETYPIRTPT 360
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG+YD+EM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 361 FAGGLFSISKEYFEHIGSYDDEMEIWGGENIEMSFRVWQCGGLLEIMPCSVVGHVFRSKS 420
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N EA KQ+ + RL+LR++L
Sbjct: 421 PHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLDLRQRL 479
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +V+P ++P ++ F G +++V H C L G N P + C
Sbjct: 480 QCKNFTWYLNNVYPEVYVPDLNPLFSGYLKNVGNHMC----LDVGENNHGGKPLIMYSCH 535
Query: 549 HL 550
L
Sbjct: 536 GL 537
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 81/154 (52%), Positives = 101/154 (65%), Gaps = 6/154 (3%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 149 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVM 208
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + + +V+R R GLI ARLLGA A
Sbjct: 209 YTSPAILLKEIILVDDASVDEYLHDKLDEYVKQFQI-VKVVRQKERKGLITARLLGASVA 267
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK---SSIVIV 205
GE L FLDAHCEC + E L + +S+ +V
Sbjct: 268 TGETLTFLDAHCECFYGWLEPLLARIAENSVAVV 301
>gi|326436254|gb|EGD81824.1| hypothetical protein PTSG_02538 [Salpingoeca sp. ATCC 50818]
Length = 604
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 172/364 (47%), Positives = 230/364 (63%), Gaps = 10/364 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEA + LLRTV S++ RSP S++ EILL+DDAST E LK+ LDE +A +
Sbjct: 153 LPDTTVIIPFHNEARTTLLRTVWSILDRSPPSLINEILLIDDASTMEHLKAPLDEELATI 212
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
TRV+R R GLI+A++ GA QA+G+++ FLD+HCEC +GWLE L+ R+ DRT VV
Sbjct: 213 P-KTRVLRLSERSGLIRAKVFGAEQAKGKVVTFLDSHCECNVGWLEPLLERIYLDRTTVV 271
Query: 317 CPVIDIISDVTFAYVRSFE-LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
PVID I TFAY S + G F W L F W + KRKD P +P MAG
Sbjct: 272 TPVIDNIDKKTFAYTGSPTVITRGIFTWSLTFSWLDLPWFEQK-KRKDPIAPLPSPTMAG 330
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++DR YFF IG+YD M VWGGENLE+SFR+WQCGG++E PCS V H++R PY
Sbjct: 331 GLFSMDREYFFEIGSYDMGMDVWGGENLEISFRIWQCGGTLEFIPCSRVGHVYRDFHPYK 390
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
FP G + + NL RVA VWMDE+ E Y+ P + + RLELRK+L C FK
Sbjct: 391 FPSGAVQTINKNLNRVAEVWMDEYKELYYGVRPH-HRAIGTGDISDRLELRKKLNCKPFK 449
Query: 496 WYLTHVWPHHFLPMDDKFFGR--IRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVL 553
WYL +V+P +P+ + G+ +++ T+ C++ S + A L PC +
Sbjct: 450 WYLDNVFPDMMVPLPENLLGKGAVKNAATNMCLDSL----SSREVDMKAGLYPCANGKSE 505
Query: 554 TQMF 557
QMF
Sbjct: 506 NQMF 509
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 103/148 (69%), Gaps = 1/148 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FN +SDRI ++R + D R CK + + + LP ++++I FHNEA + LLRTV
Sbjct: 116 WKRNNFNQYISDRISLHRPIKDTRHAMCKDRTYPLDKLPDTTVIIPFHNEARTTLLRTVW 175
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S++ RSP S++ EILL+DDAST E LK+ LDE +A + TRV+R R GLI+A++ GA
Sbjct: 176 SILDRSPPSLINEILLIDDASTMEHLKAPLDEELATIP-KTRVLRLSERSGLIRAKVFGA 234
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
QA+G+++ FLD+HCEC + + E L +
Sbjct: 235 EQAKGKVVTFLDSHCECNVGWLEPLLER 262
>gi|326922813|ref|XP_003207639.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Meleagris
gallopavo]
Length = 632
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 233/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEY+ +
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDEYLHDKLDEYMKQF 242
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A GE L FLDAHCEC GWLE L+AR+AE+ VV
Sbjct: 243 QI-VKVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLLARIAENSVAVV 301
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P +TP
Sbjct: 302 SPDIASIDLNTFEFSKPSPYGHNHNRGNFDWSLSFGWESLPKYENK-RRKDETYPIRTPT 360
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG+YD+EM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 361 FAGGLFSISKKYFEHIGSYDDEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 420
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N EA KQ+ + RL LR++L
Sbjct: 421 PHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLNLRQRL 479
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +V+P ++P ++ F G ++++ H C L G N P + C
Sbjct: 480 QCKNFTWYLNNVYPEVYVPDLNPLFSGYLKNIGNHMC----LDVGENNHGGKPLIMYSCH 535
Query: 549 HL 550
L
Sbjct: 536 GL 537
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 80/154 (51%), Positives = 101/154 (65%), Gaps = 6/154 (3%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 149 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVM 208
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEY+ + + +V+R R GLI ARLLGA A
Sbjct: 209 YTSPAILLKEIILVDDASVDEYLHDKLDEYMKQFQI-VKVVRQKERKGLITARLLGASVA 267
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK---SSIVIV 205
GE L FLDAHCEC + E L + +S+ +V
Sbjct: 268 TGETLTFLDAHCECFYGWLEPLLARIAENSVAVV 301
>gi|449275388|gb|EMC84260.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Columba livia]
Length = 632
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/362 (48%), Positives = 233/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 183 LPTTSIIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 242
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A GE L FLDAHCEC GWLE L+AR+AE+ VV
Sbjct: 243 QI-VKVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLLARIAENPVAVV 301
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P +TP
Sbjct: 302 SPDIASIDLNTFEFTKPSPYGHGHNRGNFDWSLSFGWESLPKHENK-RRKDETYPIRTPT 360
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 361 FAGGLFSISKDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 420
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N EA KQ+ + RL+LR++L
Sbjct: 421 PHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRLDLRQRL 479
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL++V+P ++P ++ F G +++ C L G N P + C
Sbjct: 480 QCKNFTWYLSNVYPEAYVPDLNPLFSGYLKNTGNRMC----LDVGENNHGGKPLIMYSCH 535
Query: 549 HL 550
L
Sbjct: 536 GL 537
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +SI+IVFHNEAWS LLRTVHSV+
Sbjct: 149 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSIIIVFHNEAWSTLLRTVHSVM 208
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + + +V+R R GLI ARLLGA A
Sbjct: 209 YTSPAILLKEIILVDDASVDEYLHDKLDEYVKQFQI-VKVVRQKERKGLITARLLGASVA 267
Query: 175 EGEILVFLDAHCEC 188
GE L FLDAHCEC
Sbjct: 268 TGETLTFLDAHCEC 281
>gi|440905500|gb|ELR55875.1| Polypeptide N-acetylgalactosaminyltransferase 13 [Bos grunniens
mutus]
Length = 412
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 165/334 (49%), Positives = 213/334 (63%), Gaps = 53/334 (15%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 18 TRVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 76
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 77 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 136
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 137 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 196
Query: 369 ----------------------------------------------------KTPAMAGG 376
+TP MAGG
Sbjct: 197 RLLPEKWLSLKVIEHTSPAQCLASAISLLDKEKASTSGESGGTGFKQSQLIERTPTMAGG 256
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PY+F
Sbjct: 257 LFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTF 316
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA 470
PGG V+ N R+A VWMDE+ +F++ +PE
Sbjct: 317 PGGTGHVINKNNRRLAEVWMDEFKDFFYIISPET 350
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 73/129 (56%), Positives = 100/129 (77%), Gaps = 1/129 (0%)
Query: 71 LPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDD 130
L +++K +CKT+V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDD
Sbjct: 8 LKELKKRRCKTRVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDD 66
Query: 131 ASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTL 190
AS R+FLK +L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTL
Sbjct: 67 ASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTL 126
Query: 191 VFNEEFLPK 199
+ E L +
Sbjct: 127 GWLEPLLAR 135
>gi|196000745|ref|XP_002110240.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
gi|190586191|gb|EDV26244.1| hypothetical protein TRIADDRAFT_22839 [Trichoplax adhaerens]
Length = 481
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 182/387 (47%), Positives = 245/387 (63%), Gaps = 22/387 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST---REFLKSSLDEYV 253
LP +S+VIVFHNEAWS LLRTVHSV+ RS +++EI+LVDD S E L++ L++Y+
Sbjct: 74 LPPASLVIVFHNEAWSTLLRTVHSVLDRSDPRLMREIILVDDCSEIKGHEELQAPLEKYI 133
Query: 254 AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
KL + +++R+ R GLI+ARL G ++ ++VFLDAHCE GWLE L+AR+ E+R+
Sbjct: 134 QKLKI-VKLVRNKKRQGLIRARLRGYKEVTSPVIVFLDAHCEVVDGWLEPLLARIHENRS 192
Query: 314 RVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VVCP ID+IS F Y + + G FNW LHFRW T + + +RK +P ++P M
Sbjct: 193 NVVCPEIDVISFENFGYSYASGIR-GVFNWNLHFRWRTLPAVEQQ-RRKSVIDPIRSPTM 250
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLFAI + YF IG YD+EM +WGGENLEMSFR+WQCGG++EI PCSHV H+FRK+ P
Sbjct: 251 AGGLFAIHKKYFEDIGLYDDEMDIWGGENLEMSFRIWQCGGNLEIIPCSHVGHVFRKSQP 310
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y+FP G E L NL RVA VWMD + + ++ P +Q + R+ELRK+LKC S
Sbjct: 311 YTFPKGAGETLNKNLQRVAEVWMDNYKDIFYNRFPNL-RQHSYGDISKRIELRKKLKCKS 369
Query: 494 FKWYLTHVWPHHFLPMDDKFF---GRIRHVQTHKCVEKPLAKGSM-NQASGPASLLPCTH 549
F WYL +V+ P D F G +R+ T C++ SM N+ + PC H
Sbjct: 370 FDWYLKNVFTDVQYP--DMIFLAKGELRNPSTGYCLD------SMGNKEYADIGIYPC-H 420
Query: 550 LPVLTQMFVMKLPTDLIATDESVCLDV 576
Q+ + +L DE VCLD
Sbjct: 421 GQGGNQLLTYTIRKEL-EMDE-VCLDA 445
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 101/144 (70%), Gaps = 5/144 (3%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
++ +L+ NRFN SDRI ++RTLPD R C+ ++F LP +S+VIVFHNEAWS L
Sbjct: 33 ESDRLFSRNRFNQWASDRISLHRTLPDQRPAACRKQLFPTN-LPPASLVIVFHNEAWSTL 91
Query: 107 LRTVHSVISRSPRSMLKEILLVDDAST---REFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
LRTVHSV+ RS +++EI+LVDD S E L++ L++Y+ KL + +++R+ R GL
Sbjct: 92 LRTVHSVLDRSDPRLMREIILVDDCSEIKGHEELQAPLEKYIQKLKI-VKLVRNKKRQGL 150
Query: 164 IKARLLGARQAEGEILVFLDAHCE 187
I+ARL G ++ ++VFLDAHCE
Sbjct: 151 IRARLRGYKEVTSPVIVFLDAHCE 174
>gi|91089275|ref|XP_970398.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
Length = 586
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 187/426 (43%), Positives = 259/426 (60%), Gaps = 23/426 (5%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 242
D C+ F +E LP++S++I FHNEAWS LLRTVHSV+ RSP ++KE++LVDD S
Sbjct: 131 DEWCKAPGRF-QEALPQTSVIICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDM 189
Query: 243 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
+ LK L +Y A ++IR+ R GLI+ARLLGA AEGE+L +LD+HCECT GWLE
Sbjct: 190 DHLKQQLVDYFAS-EPKVKIIRAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLE 248
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYV--RSFELHWGAFNWELHFRWYTYGSSDAIIK 360
L+ R+A D T VVCPVID+I D T Y S ++ G F+W L F W+ + +
Sbjct: 249 PLLDRIARDPTTVVCPVIDVIDDTTLEYHFHDSGGVNVGGFDWNLQFNWHAVPEHEKK-R 307
Query: 361 RKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
K+ EP +P MAGGLF+ID+ +F +G YD +WGGENLE+SF+ W CGG++EI P
Sbjct: 308 HKNPAEPVYSPTMAGGLFSIDKKFFERLGTYDNGFDIWGGENLELSFKTWMCGGTLEIVP 367
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
CSHV H+FRK SPY + GV+ VL N R+A VW+DE+A++Y++ ++ D +
Sbjct: 368 CSHVGHIFRKRSPYKWRSGVN-VLRRNSVRLAEVWLDEYAKYYYQRI--GNEKGDFGDIT 424
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH--VQTHKCVEKPLAKGSMNQA 538
SR LR++L C SFKWYL +++P F+P + G IR+ + C++ P + +++
Sbjct: 425 SRKALREKLGCKSFKWYLDNIYPELFIPGEAVASGEIRNLGIGGKTCLDSPARRSDLHK- 483
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NR 597
P L PC H Q F M + I DE+ CLD E V + C G
Sbjct: 484 --PVGLYPC-HRQGGNQ-FWMYSKSGEIRRDEA-CLDYSGQE------VILYPCHGSKGN 532
Query: 598 QRWTYD 603
Q W Y+
Sbjct: 533 QFWDYN 538
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 91/209 (43%), Positives = 125/209 (59%), Gaps = 28/209 (13%)
Query: 41 LHSNLS-DAQQL----YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIV 95
L SNLS D ++L +Q N FN VSD I V+R+LPD R CK +E LP++S++
Sbjct: 91 LPSNLSADVKKLVDAGWQKNAFNQYVSDMISVHRSLPDPRDEWCKAPGRFQEALPQTSVI 150
Query: 96 IVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVI 155
I FHNEAWS LLRTVHSV+ RSP ++KE++LVDD S + LK L +Y A ++I
Sbjct: 151 ICFHNEAWSVLLRTVHSVLDRSPSHLIKEVILVDDFSDMDHLKQQLVDYFAS-EPKVKII 209
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALL 215
R+ R GLI+ARLLGA AEGE+L +LD+HCECT + E L +
Sbjct: 210 RAKKREGLIRARLLGAAHAEGEVLTYLDSHCECTTGWLEPLLDR---------------- 253
Query: 216 RTVHSVISRSPRSMLKEILLVDDASTREF 244
I+R P +++ ++ V D +T E+
Sbjct: 254 ------IARDPTTVVCPVIDVIDDTTLEY 276
>gi|47226381|emb|CAG09349.1| unnamed protein product [Tetraodon nigroviridis]
Length = 631
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 238/362 (65%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS E LK LDEY+ +L
Sbjct: 159 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEIILVDDASEDEALKDGLDEYLKRL 218
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ RV+R R GLI ARLLGA A G+ L FLDAHCEC GWLE L+AR+A++RT VV
Sbjct: 219 SI-VRVVRQRERKGLITARLLGASVATGDTLTFLDAHCECFNGWLEPLLARIAKNRTAVV 277
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF +++ + G F+W L F W + + +RKD T P KTP
Sbjct: 278 SPDITTIDLNTFEFMKPSPYGQNHNRGNFDWSLAFGWESLPDHEKK-RRKDETYPIKTPT 336
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF+ IG+YD+ M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 337 FAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSFRVWQCGGQLEIIPCSIVGHVFRTKS 396
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEK-QRDKQ--AVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMD++ E +++ N +A + RDK + RL+ R +L
Sbjct: 397 PHSFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRNQQAAQLARDKAFGDISERLDFRVRL 455
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C SF WYL +++P F+P ++ FG +++V C++ G N+ + PC
Sbjct: 456 RCKSFSWYLKNIYPEAFIPDLNPLSFGSVKNVGKDSCLD----AGENNEGGKKLIMYPCH 511
Query: 549 HL 550
L
Sbjct: 512 GL 513
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 94/134 (70%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FNL SDRI ++R L D R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 125 FNLYASDRISLSRDLGADTRPPECIEQTFKRCPPLPTTSVIIVFHNEAWSTLLRTVYSVL 184
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E LK LDEY+ +LS+ RV+R R GLI ARLLGA A
Sbjct: 185 HTSPAVLLKEIILVDDASEDEALKDGLDEYLKRLSI-VRVVRQRERKGLITARLLGASVA 243
Query: 175 EGEILVFLDAHCEC 188
G+ L FLDAHCEC
Sbjct: 244 TGDTLTFLDAHCEC 257
>gi|291290949|ref|NP_001167507.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Xenopus
laevis]
gi|83405263|gb|AAI10707.1| Unknown (protein for MGC:130697) [Xenopus laevis]
Length = 622
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 173/359 (48%), Positives = 233/359 (64%), Gaps = 15/359 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS E+LK LD+YV L
Sbjct: 174 LPTTSVIIVFHNEAWSTLLRTVYSVLYTSPAILLKEIILVDDASEDEYLKEKLDDYVKAL 233
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ ++ R R GLI ARLLGA A GE+L FLDAHCEC GWLE L++R+AED T VV
Sbjct: 234 QI-VKIARQKERKGLITARLLGASIATGEVLTFLDAHCECFHGWLEPLLSRIAEDYTAVV 292
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I +F + + + G F+W L F W ++ + +RK+ T P KTP
Sbjct: 293 SPDITTIDLNSFEFAKPVQYGKTHSRGNFDWSLTFGWEAIPEAEKL-RRKNETYPIKTPT 351
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG+YDE+M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 352 FAGGLFSISKAYFEHIGSYDEDMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 411
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMD++ Y++ N +A K +++ V RL+L+ L
Sbjct: 412 PHSFPKG-TQVISRNQVRLAEVWMDDYKIIYYRRNDQAAKMVKEKSFGDVSKRLKLKADL 470
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
C +F WYL +++P F+P D + G +++ KC L G N P + PC
Sbjct: 471 HCKNFTWYLENIYPELFVPDRDPTYSGAVKNEGAQKC----LDVGENNHGGKPLIMYPC 525
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 140 FNAYASDRISLHRALGPDTRPPVCIEQKFKRCPSLPTTSVIIVFHNEAWSTLLRTVYSVL 199
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+LK LD+YV L + ++ R R GLI ARLLGA A
Sbjct: 200 YTSPAILLKEIILVDDASEDEYLKEKLDDYVKALQI-VKIARQKERKGLITARLLGASIA 258
Query: 175 EGEILVFLDAHCEC 188
GE+L FLDAHCEC
Sbjct: 259 TGEVLTFLDAHCEC 272
>gi|357619954|gb|EHJ72323.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 533
Score = 338 bits (868), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 178/415 (42%), Positives = 251/415 (60%), Gaps = 24/415 (5%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S+VI FHNEAWS LLRTVHSVI RSP ++KEI+LVDD S L LD+Y++
Sbjct: 48 EDLPQTSVVICFHNEAWSVLLRTVHSVIDRSPAHLIKEIILVDDFSDMPHLMQQLDDYMS 107
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
L R++R+ R GLI+ARLLGA+ +L +LD+HCECT GWLE L+ R+A ++T
Sbjct: 108 SLP-KVRIVRATQREGLIRARLLGAKYVTAPVLTYLDSHCECTEGWLEPLLDRIARNKTN 166
Query: 315 VVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
VVCPVID+I D T + Y S ++ G F+W L F W+ + + + K EP +P
Sbjct: 167 VVCPVIDVIDDNTLEYHYRDSTSVNVGGFDWNLQFNWHPVPARERA-RHKHTAEPVWSPT 225
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLFAID+ +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK S
Sbjct: 226 MAGGLFAIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRS 285
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
PY + GV+ VL N R+A VW+D+++++Y++ + D + R ELR++LKC
Sbjct: 286 PYKWRTGVN-VLKKNSVRLAEVWLDDYSKYYYQR--VGNDKGDYGDISGRKELREKLKCK 342
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCTHLP 551
SF WYL +++P F+P + G IR++ + C++ P K ++ P L PC
Sbjct: 343 SFDWYLKNIYPELFIPGESVAHGEIRNIGFERTCLDSPTRKSDHHK---PVGLYPC---- 395
Query: 552 VLTQMFVMKLPTDLIATDES-VCLDVPEYENDISPRVRILACSG-FNRQRWTYDK 604
+ + IA S +C+D D+ V C G + Q W Y K
Sbjct: 396 -------HRQGGNQIANPSSDMCVDSAAGPEDMKKPVNPWPCHGEYGNQYWMYSK 443
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/148 (50%), Positives = 96/148 (64%), Gaps = 1/148 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FN SD I + RTLPD R CK E LP++S+VI FHNEAWS LLRTVH
Sbjct: 13 WKKNAFNQYASDLISIRRTLPDPRDEWCKQPGRYLEDLPQTSVVICFHNEAWSVLLRTVH 72
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SVI RSP ++KEI+LVDD S L LD+Y++ L R++R+ R GLI+ARLLGA
Sbjct: 73 SVIDRSPAHLIKEIILVDDFSDMPHLMQQLDDYMSSLP-KVRIVRATQREGLIRARLLGA 131
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 132 KYVTAPVLTYLDSHCECTEGWLEPLLDR 159
>gi|149639508|ref|XP_001513185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Ornithorhynchus anatinus]
Length = 634
Score = 338 bits (868), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 233/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS ++L LDEYV +
Sbjct: 185 LPTTSVVIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDASVDDYLHDKLDEYVKQF 244
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A GE L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 245 QI-VKVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLLARIAENYTAVV 303
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P +TP
Sbjct: 304 SPDIASIDLNTFEFSKPSPYGNNHNRGNFDWSLSFGWESLPEHEKQ-RRKDETYPIRTPT 362
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 363 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 422
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N EA K ++A + RLELR +L
Sbjct: 423 PHSFPKG-TQVIARNQVRLAEVWMDEFKEIFYRRNTEAAKIVKQKAFGDLSKRLELRDRL 481
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ V H C L G NQ + P + C
Sbjct: 482 QCKNFTWYLNTIYPEVYVPDLNPVLSGYIKSVGRHVC----LDVGENNQGTKPLIMYTCH 537
Query: 549 HL 550
L
Sbjct: 538 GL 539
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S+VIVFHNEAWS LLRTV+SV+
Sbjct: 151 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVVIVFHNEAWSTLLRTVYSVL 210
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L LDEYV + + +V+R R GLI ARLLGA A
Sbjct: 211 YSSPAILLKEIILVDDASVDDYLHDKLDEYVKQFQI-VKVVRQKERKGLITARLLGASVA 269
Query: 175 EGEILVFLDAHCEC 188
GE L FLDAHCEC
Sbjct: 270 TGETLTFLDAHCEC 283
>gi|410897032|ref|XP_003962003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Takifugu rubripes]
Length = 624
Score = 338 bits (868), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 238/362 (65%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS E LK LDEY+ +L
Sbjct: 178 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASEDEALKDELDEYLKRL 237
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +V+R R GLI ARLLGA A G+ L FLDAHCEC GWLE L+AR+AE+ + VV
Sbjct: 238 SI-VQVVRQRERKGLITARLLGASVATGDTLTFLDAHCECFNGWLEPLLARIAENHSAVV 296
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF +V+ + G F+W L F W + + +RKD T P KTP
Sbjct: 297 SPDITTIDLNTFEFVKPSPYGQNHNRGNFDWSLAFGWESLPDHEKR-RRKDETYPIKTPT 355
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF+ IG+YD+ M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 356 FAGGLFSISKDYFYQIGSYDKHMEIWGGENIEMSFRVWQCGGQLEIIPCSIVGHVFRTKS 415
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQ-RDKQ--AVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMD++ E +++ N +A + RDK + R++LR +L
Sbjct: 416 PHSFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRNQQAAQLVRDKAFGDISQRMDLRARL 474
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
KC SF WYL +++P F+P ++ FG +++V C++ G N+ + PC
Sbjct: 475 KCKSFSWYLKNIYPEAFIPDLNPLGFGSVKNVGKDSCLD----AGENNEGGKRVIMYPCH 530
Query: 549 HL 550
L
Sbjct: 531 GL 532
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 83/164 (50%), Positives = 105/164 (64%), Gaps = 6/164 (3%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FNL SDRI ++R L D R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 144 FNLYASDRISLSRDLGADTRPPECIEQTFMRCPPLPTTSVIIVFHNEAWSTLLRTVYSVL 203
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E LK LDEY+ +LS+ +V+R R GLI ARLLGA A
Sbjct: 204 HTSPAILLKEIILVDDASEDEALKDELDEYLKRLSI-VQVVRQRERKGLITARLLGASVA 262
Query: 175 EGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTV 218
G+ L FLDAHCEC FN P + + H+ S + T+
Sbjct: 263 TGDTLTFLDAHCEC---FNGWLEPLLARIAENHSAVVSPDITTI 303
>gi|348580113|ref|XP_003475823.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Cavia porcellus]
Length = 622
Score = 338 bits (867), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 243/362 (67%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP ++LKEI+LVDDAST E+LK L+ YV +L
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPATLLKEIILVDDASTDEYLKDELERYVQQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AE++ VV
Sbjct: 236 QI-VKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAENKMAVV 294
Query: 317 CPVIDIISDVTFAYVRSF---ELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I+ TF + + +H G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPDIVTINLNTFEFSKPIPEGRIHSRGNFDWILTFGWEALPAHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD++ + +++ N +A K +++ + RL+LR++L
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRNLQAAKIAQEKSFGDISERLQLRERL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F W+L++++P F+P + F+G I+++ ++C L G N+ P + C
Sbjct: 473 HCHNFSWFLSNIYPEMFVPDLSPTFYGAIKNLGINQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R +C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPSECIHQKFRRCPPLPTTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP ++LKEI+LVDDAST E+LK L+ YV +L + +V+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPATLLKEIILVDDASTDEYLKDELERYVQQLQI-VKVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|3047195|gb|AAC13673.1| GLY5c [Caenorhabditis elegans]
Length = 624
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 181/431 (41%), Positives = 264/431 (61%), Gaps = 29/431 (6%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S++I FHNEAWS LLRTVHSV+ R+P +L+E++LVDD S
Sbjct: 161 IDAECK-TEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSD 218
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 219 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWM 278
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 279 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 335
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 336 -RKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 394
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 395 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERIN---NQLGD 450
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSM 535
+ SR +LR+ L C SFKWYL +++P F+P + G +R+ QT +C++ + +
Sbjct: 451 FGDISSRKKLREDLGCKSFKWYLDNIYPELFVPGESVAKGELRNAQTSQCLDSAVGEEVE 510
Query: 536 NQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF 595
N+A P PC H Q +++ + I DES C+D V + C G
Sbjct: 511 NKAITP---YPC-HEQGGNQYWMLSKDGE-IRRDES-CVDY------AGSDVMVFPCHGM 558
Query: 596 -NRQRWTYDKE 605
Q W Y+ +
Sbjct: 559 KGNQEWRYNHD 569
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 98/145 (67%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S++I FHNEAWS LLRTVHSV+
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVL 199
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+E++LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 200 ERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVA 259
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
GE+L +LD+HCEC + E L +
Sbjct: 260 TGEVLTYLDSHCECMEGWMEPLLDR 284
>gi|71993517|ref|NP_001022852.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
gi|14530627|emb|CAC42369.1| Protein GLY-5, isoform c [Caenorhabditis elegans]
Length = 624
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 181/431 (41%), Positives = 264/431 (61%), Gaps = 29/431 (6%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S++I FHNEAWS LLRTVHSV+ R+P +L+E++LVDD S
Sbjct: 161 IDAECK-TEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSD 218
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 219 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWM 278
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 279 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 335
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 336 -RKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 394
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 395 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERIN---NQLGD 450
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSM 535
+ SR +LR+ L C SFKWYL +++P F+P + G +R+ QT +C++ + +
Sbjct: 451 FGDISSRKKLREDLGCKSFKWYLDNIYPELFVPGESVAKGELRNAQTSQCLDSAVGEEVE 510
Query: 536 NQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF 595
N+A P PC H Q +++ + I DES C+D V + C G
Sbjct: 511 NKAITP---YPC-HEQGGNQYWMLSKDGE-IRRDES-CVDY------AGSDVMVFPCHGM 558
Query: 596 -NRQRWTYDKE 605
Q W Y+ +
Sbjct: 559 KGNQEWRYNHD 569
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/143 (48%), Positives = 97/143 (67%), Gaps = 1/143 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S++I FHNEAWS LLRTVHSV+
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVL 199
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+E++LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 200 ERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVA 259
Query: 175 EGEILVFLDAHCECTLVFNEEFL 197
GE+L +LD+HCEC + E L
Sbjct: 260 TGEVLTYLDSHCECMEGWMEPLL 282
>gi|350646654|emb|CCD58681.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 400
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 154/306 (50%), Positives = 210/306 (68%), Gaps = 3/306 (0%)
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 222
L+ + L+G R+ + F C + ++ K+S++IVFHNEAWSALLRTVHSV+
Sbjct: 95 LVVSNLIGTRRNLDD---FRHPSCRHQIPLDKLLPFKTSVIIVFHNEAWSALLRTVHSVL 151
Query: 223 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 282
R+P +L EI+LVDDAST+ L L YV L+ P R+ R R GLI+ARL GA+ +
Sbjct: 152 DRTPVQLLHEIILVDDASTQSHLGDQLKNYVKSLNKPVRIERMSSRSGLIRARLHGAKIS 211
Query: 283 EGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFN 342
G+ L FLDAHCE T+GWLE L+ ++E++ R+VCP+ID+IS TF Y+ + WG F+
Sbjct: 212 TGKTLTFLDAHCEVTIGWLETLLKHISENQKRIVCPIIDVISHDTFEYLLGSDRTWGTFD 271
Query: 343 WELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGEN 402
W+ +F W T + + P +TP MAGGLF I R YF+ IGAYDE+M++WGGEN
Sbjct: 272 WQFNFHWETVVDREIDRINDEHNVPLRTPTMAGGLFTITREYFYEIGAYDEDMEIWGGEN 331
Query: 403 LEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEF 462
+E+SFRVWQCGG + I PCS V H+FRK+SPY++PGGVS +L+ N R ALVW+D+++ F
Sbjct: 332 IELSFRVWQCGGELLIDPCSRVGHVFRKSSPYTWPGGVSHILHKNFVRTALVWLDQYSRF 391
Query: 463 YFKFNP 468
YF NP
Sbjct: 392 YFMLNP 397
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 76/153 (49%), Positives = 105/153 (68%), Gaps = 1/153 (0%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLP-KSSIVIVFHNEAWS 104
++++ + +N FNL+VS+ I R L D R P C+ ++ ++ LP K+S++IVFHNEAWS
Sbjct: 82 AESKLTFHLNEFNLVVSNLIGTRRNLDDFRHPSCRHQIPLDKLLPFKTSVIIVFHNEAWS 141
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
ALLRTVHSV+ R+P +L EI+LVDDAST+ L L YV L+ P R+ R R GLI
Sbjct: 142 ALLRTVHSVLDRTPVQLLHEIILVDDASTQSHLGDQLKNYVKSLNKPVRIERMSSRSGLI 201
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ARL GA+ + G+ L FLDAHCE T+ + E L
Sbjct: 202 RARLHGAKISTGKTLTFLDAHCEVTIGWLETLL 234
>gi|444515344|gb|ELV10843.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Tupaia chinensis]
Length = 614
Score = 337 bits (865), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST ++LK L++YV +L
Sbjct: 168 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEDYLKDKLEQYVKEL 227
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V +V+R R GLI ARLLGA+ A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 228 QV-VKVVRQVERKGLITARLLGAKVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 286
Query: 317 CPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V+S +H G F+W L F W T + + KD T P K+P
Sbjct: 287 SPDIVTIDLNTFEFSKPVQSGRVHSRGNFDWSLTFGWETLPPHEKQ-RHKDETYPIKSPT 345
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 346 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 405
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G++ V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+ L
Sbjct: 406 PHTFPKGIN-VIARNQVRLAEVWMDSYKQIFYRRNLQAAKMAQEKSFGDISERLKLRELL 464
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F W+L +V+P F+P + F+G I+++ ++C L G N P + C
Sbjct: 465 HCRNFSWFLHNVYPEMFVPDLKPTFYGAIKNLGINQC----LDVGENNHGGKPLIMYACH 520
Query: 549 HL 550
L
Sbjct: 521 GL 522
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/145 (52%), Positives = 102/145 (70%), Gaps = 3/145 (2%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAW 103
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAW
Sbjct: 123 QEKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPTTSVIIVFHNEAW 182
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTV+SV+ +P +LKEI+LVDDAST ++LK L++YV +L V +V+R R GL
Sbjct: 183 STLLRTVYSVLHTTPAILLKEIILVDDASTEDYLKDKLEQYVKELQV-VKVVRQVERKGL 241
Query: 164 IKARLLGARQAEGEILVFLDAHCEC 188
I ARLLGA+ A+ E+L FLDAHCEC
Sbjct: 242 ITARLLGAKVAQAEVLTFLDAHCEC 266
>gi|312379012|gb|EFR25425.1| hypothetical protein AND_09241 [Anopheles darlingi]
Length = 671
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 209/568 (36%), Positives = 301/568 (52%), Gaps = 56/568 (9%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
L Q+ + DR + + D+ +P K ++ P S ++ AL +
Sbjct: 101 LEQVEDVEVKFVDRNQLEDGVADLDEPPVAVKTHAKQATP-SRPARQPDDQGGLALPGVI 159
Query: 111 HSVISRSPRSMLKEILLVDDAS-------TREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
P + K ++L D S + K++ ++YVA + R + P
Sbjct: 160 APPSEGGPGELGKPVVLPKDLSPEVKKLVDEGWAKNAFNQYVADMISIRRTLPDP----- 214
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVIS 223
DA C+ + E+ LP +S++I FHNEAWS LLRTVHSV+
Sbjct: 215 ------------------RDAWCKEPGRYRED-LPPTSVIICFHNEAWSVLLRTVHSVLD 255
Query: 224 RSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQA 282
RSP ++KE++LVDD S + L+EY L+ P +++R+ R GLI+ARLLGAR A
Sbjct: 256 RSPEHLVKEVILVDDFSDMPHTQKQLEEYF--LAYPRVKIVRAAKREGLIRARLLGARHA 313
Query: 283 EGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGA 340
+L +LD+HCECT GWLE L+ R+A + T VVCPVID+I D T + Y S ++ G
Sbjct: 314 TAPVLTYLDSHCECTTGWLEPLLDRIARNSTTVVCPVIDVIDDNTMEYHYRDSGGVNVGG 373
Query: 341 FNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGG 400
F+W L F W+ + K K EP +P MAGGLFAIDR +F +G YD +WGG
Sbjct: 374 FDWNLQFNWHAVPEREKR-KHKSAAEPVWSPTMAGGLFAIDRVFFERLGTYDSGFDIWGG 432
Query: 401 ENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWA 460
ENLE+SF+ W CGGS+EI PCSHV H+FRK SPY + GV+ V+ N R+A VWMDE+A
Sbjct: 433 ENLELSFKTWMCGGSLEIIPCSHVGHIFRKRSPYKWRTGVN-VIKRNSVRLAEVWMDEYA 491
Query: 461 EFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV 520
++Y++ + D V SR +LR++L C F+WYL +++P F+P D G +R++
Sbjct: 492 QYYYQRI--GNDKGDFGDVSSRKKLREELHCKPFRWYLDNIYPELFVPGDAVASGEVRNM 549
Query: 521 --QTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPE 578
C++ P K ++ +A G L PC + + M T I DE+ CLD
Sbjct: 550 GYGNRTCLDAPAGKRNLRKAVG---LYPCHNQG--GNQYWMLSKTGEIRRDEA-CLD--- 600
Query: 579 YENDISPRVRILACSGF-NRQRWTYDKE 605
Y D V + C G Q W Y +
Sbjct: 601 YAGD---DVVLYPCHGSRGNQYWNYSDD 625
>gi|348521382|ref|XP_003448205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oreochromis niloticus]
Length = 620
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 176/362 (48%), Positives = 234/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV SV+ SP +LKEI+LVDDAST E LKS L+EY+ +L
Sbjct: 172 LPTTSVIIVFHNEAWSTLLRTVFSVLHTSPAILLKEIILVDDASTAEHLKSRLEEYIRQL 231
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R P R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+ E+ T VV
Sbjct: 232 KI-VRVVRQPERKGLITARLLGASIAQAEVLTFLDAHCECFHGWLEPLLARIVEEPTAVV 290
Query: 317 CPVIDIISDVTFAYVRSFELHW----GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I +F + + + G F+W L F W DA RKD T P KTP
Sbjct: 291 SPEISSIDLNSFQFHKPVATNRAYNRGNFDWSLTFGWEAI-PEDAKRLRKDETYPVKTPT 349
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLFAI + YF HIG YD++M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 350 FAGGLFAISKKYFEHIGTYDDQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 409
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQ---AVRSRLELRKQL 489
P++FP G +EV+ N R+A VWMD++ + Y++ N A + + +RL LR++L
Sbjct: 410 PHTFPKG-TEVITRNQVRLAEVWMDDYKKIYYRRNKNAAIMASEHRFGDISARLNLRERL 468
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F WYL V+P F+P ++ + G I+++ ++ C L G NQ P + C
Sbjct: 469 HCKNFSWYLNTVYPEIFIPDLNPEKSGSIKNLGSNMC----LDAGENNQGGKPLIMYHCH 524
Query: 549 HL 550
++
Sbjct: 525 NM 526
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 96/134 (71%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R+L D R P+C + F LP +S++IVFHNEAWS LLRTV SV+
Sbjct: 138 FNQFASDRISLSRSLGDDTRPPECVERKFRRCPPLPTTSVIIVFHNEAWSTLLRTVFSVL 197
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAST E LKS L+EY+ +L + RV+R P R GLI ARLLGA A
Sbjct: 198 HTSPAILLKEIILVDDASTAEHLKSRLEEYIRQLKI-VRVVRQPERKGLITARLLGASIA 256
Query: 175 EGEILVFLDAHCEC 188
+ E+L FLDAHCEC
Sbjct: 257 QAEVLTFLDAHCEC 270
>gi|156373014|ref|XP_001629329.1| predicted protein [Nematostella vectensis]
gi|156216327|gb|EDO37266.1| predicted protein [Nematostella vectensis]
Length = 499
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 177/414 (42%), Positives = 255/414 (61%), Gaps = 17/414 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTVHSV++ SP ++ +I+LVDD S + LK L ++++ L
Sbjct: 61 LPTTSVIICFHNEALSVLLRTVHSVLNESPPRLIADIILVDDYSEYDDLKQPLIDHISML 120
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ ++IR P R GL+ ARL GA +A GE+L FLD+HCE T GWLE L+ R+AEDR VV
Sbjct: 121 N-KVKLIRMPSRQGLVPARLRGAEEARGEVLTFLDSHCEATPGWLEPLLVRIAEDRRNVV 179
Query: 317 CPVIDIISDVTFAYVRSFELH-WGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVI++I+ F Y S +H G F W+L F W ++ +RKD T+ ++P MAG
Sbjct: 180 CPVIEVINADDFRYQASDVIHERGGFTWDLFFTWKAIPEAEKK-RRKDETDYIRSPTMAG 238
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK-ASPY 434
GLFAI + YF+ +G+YD +M++WGGENLEMSFR+W CGG +EI PCS V H+FRK SPY
Sbjct: 239 GLFAIHKKYFYDLGSYDSKMEIWGGENLEMSFRIWMCGGQLEIVPCSRVGHVFRKYTSPY 298
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
FP G + L N R+A VWMDE+ + Y++ E E+ D + R+ LRK+L C SF
Sbjct: 299 KFPKGTTTTLARNFNRLAEVWMDEYKDHYYRKKTEEERNVDIGDISDRVALRKRLGCKSF 358
Query: 495 KWYLTHVWP---HHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP 551
KWYL +++P + P + +IR+ ++ C++ L + ++ + L C H
Sbjct: 359 KWYLDNIYPDMTNKLPPKSYLYSHQIRNKESSLCLDT-LGEKNIKR----VGLYTC-HGM 412
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDK 604
Q F + +++ D+ CLD P D V ++ C G Q W ++K
Sbjct: 413 GGNQFFTLTKSNEILFNDDK-CLDSP--NGDPGSYVEMITCHGLKGNQEWKHNK 463
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 71/151 (47%), Positives = 100/151 (66%), Gaps = 2/151 (1%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTK-VFNEEFLPKSSIVIVFHNEAWSAL 106
A + + FN+++SDRI ++R L DVR PKCK K LP +S++I FHNEA S L
Sbjct: 19 AAHAFDNHSFNVMLSDRISLDRRLKDVRGPKCKRKHKLYPRALPTTSVIICFHNEALSVL 78
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LRTVHSV++ SP ++ +I+LVDD S + LK L ++++ L+ ++IR P R GL+ A
Sbjct: 79 LRTVHSVLNESPPRLIADIILVDDYSEYDDLKQPLIDHISMLN-KVKLIRMPSRQGLVPA 137
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA +A GE+L FLD+HCE T + E L
Sbjct: 138 RLRGAEEARGEVLTFLDSHCEATPGWLEPLL 168
>gi|224054950|ref|XP_002197786.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Taeniopygia guttata]
Length = 631
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 234/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 182 LPTTSIIIVFHNEAWSTLLRTVHSVMYTSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 241
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A GE L FLDAHCEC GWLE L+AR+AE+ VV
Sbjct: 242 QI-VKVVRQKERKGLITARLLGASVATGETLTFLDAHCECFYGWLEPLLARIAENPVAVV 300
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P +TP
Sbjct: 301 SPDIASIDLNTFEFSKPSPYGHSHNRGNFDWSLSFGWESLPKHENK-RRKDETYPIRTPT 359
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 360 FAGGLFSISKDYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 419
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N EA KQ+ + R++LR++L
Sbjct: 420 PHTFPKG-TQVITRNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDISKRIDLRQRL 478
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL++V+P ++P ++ F G ++++ C L G N P + C
Sbjct: 479 QCKNFTWYLSNVYPEAYVPDLNPLFSGYLKNIGNRMC----LDVGENNHGGKPLIMYSCH 534
Query: 549 HL 550
L
Sbjct: 535 GL 536
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/134 (58%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +SI+IVFHNEAWS LLRTVHSV+
Sbjct: 148 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSIIIVFHNEAWSTLLRTVHSVM 207
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + + +V+R R GLI ARLLGA A
Sbjct: 208 YTSPAILLKEIILVDDASVDEYLHDKLDEYVKQFQI-VKVVRQKERKGLITARLLGASVA 266
Query: 175 EGEILVFLDAHCEC 188
GE L FLDAHCEC
Sbjct: 267 TGETLTFLDAHCEC 280
>gi|390341984|ref|XP_003725567.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Strongylocentrotus purpuratus]
Length = 654
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 185/435 (42%), Positives = 254/435 (58%), Gaps = 34/435 (7%)
Query: 191 VFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD 250
V++EE LP SIVI F+NEAWS LLRTV+SV+ R+PR ++ E++LVDD S LK LD
Sbjct: 183 VYSEE-LPTVSIVICFYNEAWSTLLRTVYSVLDRTPRRLIHELILVDDFSELTHLKKELD 241
Query: 251 EYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
+Y++K + VI + R GLI+AR +GAR A G++L+FLD+HCE WLE L+ R+
Sbjct: 242 QYMSKNFNGLVHVIHNGQREGLIRARTIGARYATGDVLMFLDSHCEVNEQWLEPLLERIK 301
Query: 310 EDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFK 369
D VVCP+IDII+ TFAY S L G FNW +HF+W T S ++ ++D+ +P +
Sbjct: 302 ADSHTVVCPIIDIINHDTFAYTAS-PLVKGGFNWGMHFKWDTI-RSRQLVGKEDYVKPIE 359
Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
+P MAGGLFA++R YF +G YDE M +WGGENLE+SFR+WQCGG +EI PCS V H+FR
Sbjct: 360 SPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGGKLEIVPCSRVGHVFR 419
Query: 430 KASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQL 489
K PY P + N RVA VWMDE+ E +++ P+A K D + SR+ LR++L
Sbjct: 420 KRRPYGSP-NRQDTTTKNAVRVAEVWMDEYKEHFYQVQPKA-KNIDYGDISSRVALREEL 477
Query: 490 KCHSFKWYLTHVWPHHFLPMDDKFFGRIR---------HVQTHKCVEKPLAKGSMNQASG 540
KC SFKWYL V+P P D K GR K + L KG + +
Sbjct: 478 KCKSFKWYLDTVYPEMRTPNDTK--GRTGGVVEGGGGMQRMQRKAPQTSLRKGKLKHSLT 535
Query: 541 PASLLP--------------CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR 586
L+P C L M + + + +++C+D+ E + P+
Sbjct: 536 GLCLVPKAELVKGTELILGDCDRLGDTNTMTWYQTSVEELKLGDAICMDMSESNSASLPQ 595
Query: 587 VRILACSGF-NRQRW 600
+R C G QRW
Sbjct: 596 LR--KCDGMGGSQRW 608
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 102/147 (69%), Gaps = 6/147 (4%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FN L+S RI +R + D R P CK +V++EE LP SIVI F+NEAWS LLRTV+
Sbjct: 152 YRQHAFNELISQRIGFHRNVTDTRNPLCKYQVYSEE-LPTVSIVICFYNEAWSTLLRTVY 210
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+PR ++ E++LVDD S LK LD+Y++K + VI + R GLI+AR +G
Sbjct: 211 SVLDRTPRRLIHELILVDDFSELTHLKKELDQYMSKNFNGLVHVIHNGQREGLIRARTIG 270
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
AR A G++L+FLD+HCE NE++L
Sbjct: 271 ARYATGDVLMFLDSHCE----VNEQWL 293
>gi|291225677|ref|XP_002732827.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Saccoglossus kowalevskii]
Length = 633
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 180/425 (42%), Positives = 252/425 (59%), Gaps = 29/425 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C LP +S+VI F NEAWS LLRTV+SVI RSP ++L EI+LVDD S+ +LK
Sbjct: 167 CAYQVYSNNLPSTSVVICFFNEAWSTLLRTVYSVIDRSPANLLHEIILVDDYSSSTYLKD 226
Query: 248 SLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD+++ L ++I + R GLI+AR++GA A G++++FLD+HCE + WLE L+
Sbjct: 227 YLDDFIKTNLFQIVKIIHNKKREGLIRARMIGAAAATGDVVMFLDSHCEVSTQWLEPLLE 286
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+ D VVCP+IDII+ TF Y +S L G FNW LHF+W T SS ++D+ +
Sbjct: 287 RIKFDPHTVVCPIIDIINADTFEYQQS-PLVRGGFNWGLHFKWDTIPSS-QFKGKEDYIK 344
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++P MAGGLFA+DR YF +G YD+ M +WGGENLE+SFR+WQCGG++EI PCS V H
Sbjct: 345 PVRSPTMAGGLFAMDRKYFHELGEYDDGMDIWGGENLEISFRIWQCGGTLEIIPCSRVGH 404
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + N RVA VWMDE+ E YF+ + + +D + SRL LR
Sbjct: 405 VFRKRRPYGSPNG-EDTMSKNSLRVAHVWMDEYKEHYFELKKD-NRNKDYGDISSRLALR 462
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFF-----------------GRIRHVQTHKCVEKP 529
++L+C SFKWYL +V+P LP + G+I H+ T C+
Sbjct: 463 ERLQCQSFKWYLENVYPEIRLPNQKVSYPVDVERRQPVKAEIIKRGQIVHLLTGLCL--- 519
Query: 530 LAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRI 589
++ Q L C+ QM + + +S+CLD PE ++ P R+
Sbjct: 520 TSENDFTQKGTLVVLHDCSDKD--KQMIWSQSTSHEFLLKDSLCLDTPETDSKAFP--RL 575
Query: 590 LACSG 594
+ C G
Sbjct: 576 MKCHG 580
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/149 (46%), Positives = 101/149 (67%), Gaps = 2/149 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN L+SDRI +R LPD R C +V++ LP +S+VI F NEAWS LLRTV+
Sbjct: 140 YQQHAFNQLISDRIGFHRGLPDTRNGLCAYQVYSNN-LPSTSVVICFFNEAWSTLLRTVY 198
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLG 170
SVI RSP ++L EI+LVDD S+ +LK LD+++ L ++I + R GLI+AR++G
Sbjct: 199 SVIDRSPANLLHEIILVDDYSSSTYLKDYLDDFIKTNLFQIVKIIHNKKREGLIRARMIG 258
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPK 199
A A G++++FLD+HCE + + E L +
Sbjct: 259 AAAATGDVVMFLDSHCEVSTQWLEPLLER 287
>gi|221042448|dbj|BAH12901.1| unnamed protein product [Homo sapiens]
Length = 527
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 194/438 (44%), Positives = 262/438 (59%), Gaps = 31/438 (7%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 60 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + + T
Sbjct: 180 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGRAEGATA 237
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 238 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 297
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 298 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 355
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 356 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 413
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + P R
Sbjct: 414 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPP--R 468
Query: 589 ILACSG-FNRQRWTYDKE 605
++ C G Q+WT+ K
Sbjct: 469 LMKCHGSGGSQQWTFGKN 486
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/164 (50%), Positives = 111/164 (67%), Gaps = 7/164 (4%)
Query: 39 GGLHSNLSDAQQL----YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSI 94
GG+ N D Q+L YQ + FN+L+SDR+ +R +PD R CK K + + LP +S+
Sbjct: 17 GGMIFNERD-QELRDLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPD-LPAASV 74
Query: 95 VIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTR 153
VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK LDEYV K L +
Sbjct: 75 VICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIK 134
Query: 154 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
VIR+ R GLI+ R++GA A GE+LVFLD+HCE +++ + L
Sbjct: 135 VIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLL 178
>gi|391342054|ref|XP_003745339.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 641
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 180/412 (43%), Positives = 252/412 (61%), Gaps = 24/412 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S++I FHNEAWS L+RTVHSVI RSP+++LKEI+LVDD S + LK L++Y KL
Sbjct: 199 LPQTSVIICFHNEAWSVLMRTVHSVIDRSPKNLLKEIILVDDFSDMKHLKEQLEDYTRKL 258
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +++R+ R GLI+ARLLGA+ A +L +LD+HCEC+ GWLE L+ R+AE T VV
Sbjct: 259 GI-VKIVRASKREGLIRARLLGAKFATAPVLTYLDSHCECSTGWLEPLLDRIAEADTNVV 317
Query: 317 CPVIDIISDVTFAYVR---SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
CPVID+ISD TF Y + ++ G F+W L F W++ D +++ ++ +P M
Sbjct: 318 CPVIDVISDSTFEYPHRRAGYTVNVGGFDWNLQFSWHSLPQRDKDARKQSWSA-VPSPTM 376
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+I +AYF +G YD +WG ENLE+SF+VW CGG +EI PCSHV H+FRK SP
Sbjct: 377 AGGLFSISKAYFEKLGLYDSGFDIWGAENLELSFKVWMCGGRLEIVPCSHVGHVFRKRSP 436
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELRKQLKCH 492
Y + GV+ VL N R+A VWMDE+A++YF + P+ D + R+ELR+ L C
Sbjct: 437 YKWLKGVN-VLKKNSVRLAKVWMDEYAQYYFDRIGPDLGDYGD---ISERVELRRSLNCK 492
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
SF WY+ +++P F+P D G +R + E+ S S+ PC H
Sbjct: 493 SFDWYVKNIYPDLFIPGDAAASGEVR----NSGFERKWCLDSAATVHATVSVYPC-HGQG 547
Query: 553 LTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
Q ++ T I DE +CLD + V + +C G Q W YD
Sbjct: 548 GNQYWLFS-KTGEIRRDE-LCLDYSGGD------VVLYSCHGSKGNQYWRYD 591
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 75/148 (50%), Positives = 107/148 (72%), Gaps = 2/148 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
+Q N FN SD I ++R+LPD+R P CKT+ F + LP++S++I FHNEAWS L+RTVH
Sbjct: 163 WQNNAFNQYASDMISLHRSLPDMRDPGCKTQKFRRD-LPQTSVIICFHNEAWSVLMRTVH 221
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SVI RSP+++LKEI+LVDD S + LK L++Y KL + +++R+ R GLI+ARLLGA
Sbjct: 222 SVIDRSPKNLLKEIILVDDFSDMKHLKEQLEDYTRKLGI-VKIVRASKREGLIRARLLGA 280
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ A +L +LD+HCEC+ + E L +
Sbjct: 281 KFATAPVLTYLDSHCECSTGWLEPLLDR 308
>gi|149031398|gb|EDL86388.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_c [Rattus norvegicus]
Length = 560
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 189/428 (44%), Positives = 258/428 (60%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD+S + LK LDEY+ +
Sbjct: 102 LPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRY 161
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + ED V
Sbjct: 162 LPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTV 221
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W SD + T P ++P MAG
Sbjct: 222 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPVSD-LGGADSATAPIRSPTMAG 279
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 280 GLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 339
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ K + + R+ELRK+L C SFK
Sbjct: 340 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSFGNISERVELRKKLGCQSFK 397
Query: 496 WYLTHVWPHH-------------FLPMDDK-----FFGRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +V+P F+ K GR+ H+QT+KC+ +A+G +Q
Sbjct: 398 WYLDNVYPEMQVSGPKARLQQPVFINRGPKRPRVLLRGRLYHLQTNKCL---VAQGRSSQ 454
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + TQ+++ +LI + +CLD+ E + PR+ SG
Sbjct: 455 KGGLVLLKACDYGDP-TQVWIYNEEHELILNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 511
Query: 598 QRWTYDKE 605
Q+WT+ K
Sbjct: 512 QQWTFGKN 519
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R +C+ K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 66 YQKHAFNMLISNRLGYHRDVPDTRNAECRGKSYPTD-LPTASVVICFYNEAFSALLRTVH 124
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDEY+ + L +VIR+ R GLI+ R++G
Sbjct: 125 SVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIG 184
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 185 AAHATGEVLVFLDSHCEVNVMWLQPLL 211
>gi|397513815|ref|XP_003827203.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Pan paniscus]
Length = 532
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 192/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 48 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 107
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SVI+R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 108 LRTIRSVINRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 161
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 162 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 221
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 222 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 280
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 281 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 340
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 341 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 399
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 400 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 452
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 453 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 483
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 54 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 112
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SVI+R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 113 SVINRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 166
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 167 ADIAQGTTLTFLDSHCE 183
>gi|193784963|dbj|BAG54116.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 194/437 (44%), Positives = 261/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGRAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/190 (47%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPVN 68
A D ID R GH L S + G+ N D Q+L YQ + FN+L+SDR+ +
Sbjct: 73 ANKIDDVIDSRVEDPEEGH-LKLSSELGMIFNERD-QELRDLGYQKHAFNMLISDRLGYH 130
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R +PD R CK K + + LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LV
Sbjct: 131 RDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILV 189
Query: 129 DDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
DD S + LK LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE
Sbjct: 190 DDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCE 249
Query: 188 CTLVFNEEFL 197
+++ + L
Sbjct: 250 VNVMWLQPLL 259
>gi|332870119|ref|XP_003318977.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Pan
troglodytes]
Length = 527
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 195/437 (44%), Positives = 260/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 60 CKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 180 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 237
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 238 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 297
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 298 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 355
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 356 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 413
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 414 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 470
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 471 KCHGSG-GSQQWTFGKN 486
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 33 YQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPD-LPAASIVICFYNEAFSALLRTVH 91
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 92 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 151
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 152 AAHATGEVLVFLDSHCEVNVMWLQPLL 178
>gi|242008519|ref|XP_002425051.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212508700|gb|EEB12313.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 657
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 181/415 (43%), Positives = 255/415 (61%), Gaps = 27/415 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VI FHNEAWS LLRTVHSV+ RSP ++LKEI+LVDD S LK L++Y++
Sbjct: 180 LPQTSVVICFHNEAWSVLLRTVHSVLDRSPPNLLKEIILVDDFSDMIHLKKQLEDYMSHY 239
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR+ R GLI+ARLLGA +A + FLD+HCECT+GWLE L+ R+A+D T VV
Sbjct: 240 P-KVKIIRASKREGLIRARLLGATRATAPVTTFLDSHCECTVGWLEPLLDRIAKDPTTVV 298
Query: 317 CPVIDIISDVTFAYV--RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
CPVID+I D T Y S ++ G F+W L F W+ + + K+ EP +P MA
Sbjct: 299 CPVIDVIDDTTLEYNFRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNTAEPVWSPTMA 357
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAID+ +F IG YD +WGGENLE+SF+ W CGG++EI PCSHV H+FR+ SPY
Sbjct: 358 GGLFAIDKNFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRRRSPY 417
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ GV+ VL N R+A VW+D++A++Y++ + + D V +R ELRK+L C SF
Sbjct: 418 KWRSGVN-VLKRNSVRLAEVWLDDYAKYYYQR--IGDDKGDFGDVSARKELRKRLNCKSF 474
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRH--VQTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
KWYL +++P F+P + G +R+ + C++ P K +++A G L PC
Sbjct: 475 KWYLDNIYPELFIPGEAVAGGEVRNKGLGGKTCLDSPARKADLHKAVG---LFPC----- 526
Query: 553 LTQMFVMKLPTDLIATDES-VCLDVPEYENDISPRVRILAC--SGFNRQRWTYDK 604
+ + ++ + S CLD P D+ V + C G N Q W K
Sbjct: 527 ------HRQGGNQVSNNWSGQCLDSPCKSEDMHKPVGLWPCHKQGGN-QYWMLSK 574
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 74/145 (51%), Positives = 97/145 (66%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+R LPD R CK + LP++S+VI FHNEAWS LLRTVHSV+
Sbjct: 146 NAFNQYVSDLISVHRKLPDPRDKWCKEPGRFLQDLPQTSVVICFHNEAWSVLLRTVHSVL 205
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++LKEI+LVDD S LK L++Y++ ++IR+ R GLI+ARLLGA +A
Sbjct: 206 DRSPPNLLKEIILVDDFSDMIHLKKQLEDYMSHYP-KVKIIRASKREGLIRARLLGATRA 264
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ FLD+HCECT+ + E L +
Sbjct: 265 TAPVTTFLDSHCECTVGWLEPLLDR 289
>gi|153792095|ref|NP_071370.2| polypeptide N-acetylgalactosaminyltransferase 11 [Homo sapiens]
gi|51316030|sp|Q8NCW6.2|GLT11_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|5630076|gb|AAD45821.1|AC006017_1 N-acetylgalactosaminyltransferase; similar to Q10473 (PID:g1709559)
[Homo sapiens]
gi|51105934|gb|EAL24518.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Homo
sapiens]
gi|119574361|gb|EAW53976.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Homo sapiens]
gi|189442406|gb|AAI67834.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[synthetic construct]
gi|345500003|emb|CAC79625.3| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 608
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 194/437 (44%), Positives = 261/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGRAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/190 (47%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPVN 68
A D ID R GH L S + G+ N D Q+L YQ + FN+L+SDR+ +
Sbjct: 73 ANKIDDVIDSRVEDPEEGH-LKFSSELGMIFNERD-QELRDLGYQKHAFNMLISDRLGYH 130
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R +PD R CK K + + LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LV
Sbjct: 131 RDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILV 189
Query: 129 DDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
DD S + LK LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE
Sbjct: 190 DDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCE 249
Query: 188 CTLVFNEEFL 197
+++ + L
Sbjct: 250 VNVMWLQPLL 259
>gi|354478256|ref|XP_003501331.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Cricetulus griseus]
gi|344235668|gb|EGV91771.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Cricetulus
griseus]
Length = 608
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 187/433 (43%), Positives = 258/433 (59%), Gaps = 39/433 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD+S + LK LDEY+ +
Sbjct: 150 LPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + ED V
Sbjct: 210 LPAKVKVIRNRKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRW-----YTYGSSDAIIKRKDFTEPFKT 370
VCPVIDIIS T AY S + G FNW LHF+W G +D T P ++
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPVSELGGADGA------TAPIRS 322
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK
Sbjct: 323 PTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRK 382
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELRK+L
Sbjct: 383 RRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSFGNISERVELRKKLG 440
Query: 491 CHSFKWYLTHVWPHHFLPMDDK------------------FFGRIRHVQTHKCVEKPLAK 532
C SFKWYL +++P +P + GR+ H+QT+KC+ +A+
Sbjct: 441 CQSFKWYLDNIYPEMQIPGPNAKPQQPIFINRGPKRPRVLLRGRLYHLQTNKCL---VAQ 497
Query: 533 GSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC 592
G +Q G L C + TQ+++ +LI + +CLD+ E + PR+
Sbjct: 498 GRSSQKGGLVLLKACDYGDP-TQIWIYNEEHELILNN-LLCLDMSETRSSDPPRLMKCHG 555
Query: 593 SGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 556 SG-GSQQWTFGKN 567
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R KC+ K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAKCRGKSYPAD-LPTASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDEY+ + L +VIR+ R GLI+ R++G
Sbjct: 173 SVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNRKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|109068965|ref|XP_001105286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
6 [Macaca mulatta]
gi|355561195|gb|EHH17881.1| hypothetical protein EGK_14364 [Macaca mulatta]
Length = 608
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 193/437 (44%), Positives = 261/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGEAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +++P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNIYPEMQISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYRRNVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNMMWLQPLL 259
>gi|397513813|ref|XP_003827202.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Pan paniscus]
Length = 552
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 192/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SVI+R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVINRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 503
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SVI+R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVINRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|328713087|ref|XP_001951943.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Acyrthosiphon pisum]
Length = 674
Score = 335 bits (860), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 163/355 (45%), Positives = 232/355 (65%), Gaps = 12/355 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+++ FHNEAWS LLRTVHS++ RSP +++EI+LVDD S LK+ L+EY
Sbjct: 192 LPQTSVIVCFHNEAWSVLLRTVHSILDRSPEHLIREIILVDDFSDMPHLKTQLEEYSENY 251
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++R+ R GLI+ARL+GAR A +L +LD+HCECT GWLE L+ R+A + + VV
Sbjct: 252 P-KIKIVRAKKREGLIRARLMGARYASAPVLTYLDSHCECTEGWLEPLLDRIAREASTVV 310
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
CPVID+I D T F Y + ++ G F+W L F W+ + + K+ EP +P MA
Sbjct: 311 CPVIDVIDDSTLEFHYRDAGGVNVGGFDWNLQFNWHVVPDKEKK-RHKNAAEPVWSPTMA 369
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAID+ +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SPY
Sbjct: 370 GGLFAIDKKFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPY 429
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ GV+ VL N R+A VWMD++A++Y++ D + SR +LR++LKC SF
Sbjct: 430 KWRTGVN-VLKKNSIRLAEVWMDDYAKYYYER--IGNDLGDYGDITSRKDLRRKLKCKSF 486
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPC 547
KWYL +++P F+P D G +R++ C++ P K +N+ PA L PC
Sbjct: 487 KWYLENIYPELFIPGDAVASGEVRNLGYGNKTCLDSPARKTDLNK---PAGLYPC 538
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/193 (39%), Positives = 111/193 (57%), Gaps = 23/193 (11%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FN SD I ++RTLPD R CK + LP++S+++ FHNEAWS LLRTVH
Sbjct: 155 WKNNAFNQYASDLISLHRTLPDPRDEWCKKPGRYLDNLPQTSVIVCFHNEAWSVLLRTVH 214
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S++ RSP +++EI+LVDD S LK+ L+EY +++R+ R GLI+ARL+GA
Sbjct: 215 SILDRSPEHLIREIILVDDFSDMPHLKTQLEEYSENYP-KIKIVRAKKREGLIRARLMGA 273
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLK 231
R A +L +LD+HCECT E W L + I+R +++
Sbjct: 274 RYASAPVLTYLDSHCECT-------------------EGW---LEPLLDRIAREASTVVC 311
Query: 232 EILLVDDASTREF 244
++ V D ST EF
Sbjct: 312 PVIDVIDDSTLEF 324
>gi|355748155|gb|EHH52652.1| hypothetical protein EGM_13122 [Macaca fascicularis]
Length = 608
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 193/437 (44%), Positives = 261/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGEAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +++P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNIYPEMQISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNMMWLQPLL 259
>gi|148671130|gb|EDL03077.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_a [Mus
musculus]
Length = 529
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 189/440 (42%), Positives = 261/440 (59%), Gaps = 29/440 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ EC LP +SIVI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD+S +
Sbjct: 59 NAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDD 118
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LK LDEY+ + L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+
Sbjct: 119 LKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 178
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+A + ED VVCPVIDIIS T AY S + G FNW LHF+W S+ +
Sbjct: 179 LLAIILEDPHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPVSE-LGGPDG 236
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P ++P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS
Sbjct: 237 ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILPCSR 296
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+
Sbjct: 297 VGHIFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KNKSFGNISERV 354
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKF------------------FGRIRHVQTHKC 525
ELRK+L C SFKWYL +++P +P + GR+ H+QT+KC
Sbjct: 355 ELRKKLGCQSFKWYLDNIYPEMQIPGPNAKPQQPVLINRGPKRPRVLQRGRLYHLQTNKC 414
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP 585
+ +A+G +Q G L C + TQ+++ +LI + +CLD+ E + P
Sbjct: 415 L---VAQGRSSQKGGLVLLKTCDYGDP-TQVWIYNEDHELILNN-LLCLDMSETRSSDPP 469
Query: 586 RVRILACSGFNRQRWTYDKE 605
R+ SG Q+WT+ K
Sbjct: 470 RLMKCHGSG-GSQQWTFGKN 488
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R +C+ K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 35 YQKHAFNMLISNRLGYHRDVPDTRNAECRRKSYPTD-LPTASIVICFYNEAFSALLRTVH 93
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDEY+ + L +VIR+ R GLI+ R++G
Sbjct: 94 SVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIG 153
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 154 AAHATGEVLVFLDSHCEVNVMWLQPLL 180
>gi|380786043|gb|AFE64897.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|383411811|gb|AFH29119.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
gi|384942402|gb|AFI34806.1| polypeptide N-acetylgalactosaminyltransferase 11 [Macaca mulatta]
Length = 608
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 193/437 (44%), Positives = 261/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGEAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +++P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNIYPEMQISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYRRDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNMMWLQPLL 259
>gi|397513817|ref|XP_003827204.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Pan paniscus]
Length = 517
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 192/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 33 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 92
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SVI+R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 93 LRTIRSVINRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 146
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 147 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 206
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 207 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 325
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 326 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 384
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 385 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 437
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 438 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 468
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 39 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 97
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SVI+R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 98 SVINRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 151
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 152 ADIAQGTTLTFLDSHCE 168
>gi|426335179|ref|XP_004029110.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Gorilla gorilla gorilla]
Length = 532
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 48 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 107
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 108 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 161
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 162 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 221
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 222 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 280
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 281 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 340
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 341 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 399
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 400 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQ-- 452
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 453 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 483
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 54 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 112
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 113 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 166
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 167 ADIAQGTTLTFLDSHCE 183
>gi|301614636|ref|XP_002936794.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Xenopus (Silurana) tropicalis]
Length = 625
Score = 335 bits (859), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 176/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS E+LK LDEYV +L
Sbjct: 176 LPTTSIIIVFHNEAWSTLLRTVYSVMHTSPAILLKEIILVDDASVDEYLKDELDEYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A G+ L FLDAHCEC GWLE L+A +AE+ T VV
Sbjct: 236 QI-VKVVRQKERKGLITARLLGASVATGDTLTFLDAHCECYYGWLEPLLASIAENYTSVV 294
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + G F+W L F W + SS+ +RKD T P KTP
Sbjct: 295 SPDITGIDLNTFQFSNPSPYGNNHNRGNFDWTLSFGWESLPSSEKT-RRKDETYPIKTPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG+YDE+M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKAYFEHIGSYDEQMEIWGGENIEMSFRVWQCGGQLEILPCSVVGHVFRSKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMD+ E +++ N EA K ++ + RL+LR +L
Sbjct: 414 PHTFPKG-TQVIVRNQVRLAEVWMDDLKEIFYRRNREAANIVKSKEYGDLSKRLDLRHRL 472
Query: 490 KCHSFKWYLTHVWPHHFLPMDDKFF-GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P G +++V C L G N P + C
Sbjct: 473 QCKNFTWYLNNIYPEMYVPERHPLIHGDLKNVGRDLC----LDVGGENHGDKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 79/143 (55%), Positives = 97/143 (67%), Gaps = 3/143 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN +DRI ++R L PD R P+C + F LP +SI+IVFHNEAWS LLRTV+SV+
Sbjct: 142 FNAFATDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSIIIVFHNEAWSTLLRTVYSVM 201
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+LK LDEYV +L + +V+R R GLI ARLLGA A
Sbjct: 202 HTSPAILLKEIILVDDASVDEYLKDELDEYVKQLQI-VKVVRQKERKGLITARLLGASVA 260
Query: 175 EGEILVFLDAHCECTLVFNEEFL 197
G+ L FLDAHCEC + E L
Sbjct: 261 TGDTLTFLDAHCECYYGWLEPLL 283
>gi|404434384|ref|NP_001258248.1| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|404501473|ref|NP_955425.2| polypeptide N-acetylgalactosaminyltransferase 11 [Rattus
norvegicus]
gi|149031397|gb|EDL86387.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11),
isoform CRA_b [Rattus norvegicus]
Length = 609
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 189/428 (44%), Positives = 258/428 (60%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD+S + LK LDEY+ +
Sbjct: 151 LPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRY 210
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + ED V
Sbjct: 211 LPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTV 270
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W SD + T P ++P MAG
Sbjct: 271 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPVSD-LGGADSATAPIRSPTMAG 328
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 329 GLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 388
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ K + + R+ELRK+L C SFK
Sbjct: 389 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSFGNISERVELRKKLGCQSFK 446
Query: 496 WYLTHVWPHH-------------FLPMDDK-----FFGRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +V+P F+ K GR+ H+QT+KC+ +A+G +Q
Sbjct: 447 WYLDNVYPEMQVSGPKARLQQPVFINRGPKRPRVLLRGRLYHLQTNKCL---VAQGRSSQ 503
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + TQ+++ +LI + +CLD+ E + PR+ SG
Sbjct: 504 KGGLVLLKACDYGDP-TQVWIYNEEHELILNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 560
Query: 598 QRWTYDKE 605
Q+WT+ K
Sbjct: 561 QQWTFGKN 568
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R +C+ K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 115 YQKHAFNMLISNRLGYHRDVPDTRNAECRGKSYPTD-LPTASVVICFYNEAFSALLRTVH 173
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDEY+ + L +VIR+ R GLI+ R++G
Sbjct: 174 SVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIG 233
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 234 AAHATGEVLVFLDSHCEVNVMWLQPLL 260
>gi|397469939|ref|XP_003806595.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Pan paniscus]
gi|397469941|ref|XP_003806596.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Pan paniscus]
Length = 608
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 195/437 (44%), Positives = 260/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 261 TIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPD-LPAASIVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|10437774|dbj|BAB15105.1| unnamed protein product [Homo sapiens]
Length = 608
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 194/437 (44%), Positives = 261/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGRAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLV 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/190 (47%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPVN 68
A D ID R GH L S + G+ N D Q+L YQ + FN+L+SDR+ +
Sbjct: 73 ANKIDDVIDSRVEDPEEGH-LKFSSELGMIFNERD-QELRDLGYQKHAFNMLISDRLGYH 130
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R +PD R CK K + + LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LV
Sbjct: 131 RDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILV 189
Query: 129 DDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
DD S + LK LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE
Sbjct: 190 DDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCE 249
Query: 188 CTLVFNEEFL 197
+++ + L
Sbjct: 250 VNVMWLQPLL 259
>gi|114616856|ref|XP_001143140.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Pan troglodytes]
gi|114616860|ref|XP_001143304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
4 [Pan troglodytes]
gi|410221964|gb|JAA08201.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410256658|gb|JAA16296.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301646|gb|JAA29423.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410301648|gb|JAA29424.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
gi|410348810|gb|JAA41009.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11) [Pan
troglodytes]
Length = 608
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 195/437 (44%), Positives = 260/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 90/190 (47%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPVN 68
A D ID R GH L S + G+ N D Q+L YQ + FN+L+S+R+ +
Sbjct: 73 ANKIDDVIDSRVEDPEEGH-LKFSSELGMIFNERD-QELRDLGYQKHAFNMLISNRLGYH 130
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R +PD R CK K + + LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+LV
Sbjct: 131 RDVPDTRNAACKEKFYPPD-LPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILV 189
Query: 129 DDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
DD S + LK LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE
Sbjct: 190 DDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCE 249
Query: 188 CTLVFNEEFL 197
+++ + L
Sbjct: 250 VNVMWLQPLL 259
>gi|1575723|gb|AAB09579.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase-T3 [Mus
musculus]
Length = 633
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++K+L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTIYPEAYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDDYLHEKLEEYIKQFSI-VKIVRQQERKGLITARLLGAAVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|426335177|ref|XP_004029109.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Gorilla gorilla gorilla]
Length = 552
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 503
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|162951828|ref|NP_056551.2| polypeptide N-acetylgalactosaminyltransferase 3 [Mus musculus]
gi|341941092|sp|P70419.3|GALT3_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|74183238|dbj|BAE22551.1| unnamed protein product [Mus musculus]
gi|148695061|gb|EDL27008.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++K+L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTIYPEAYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDDYLHEKLEEYIKQFSI-VKIVRQQERKGLITARLLGAAVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|47085989|ref|NP_998361.1| polypeptide N-acetylgalactosaminyltransferase 6 [Danio rerio]
gi|45501175|gb|AAH67340.1| Zgc:77836 [Danio rerio]
Length = 619
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 234/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP + LKEI++VDDAST E L L+EYV L
Sbjct: 171 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAAFLKEIIMVDDASTAEHLHGKLEEYVKAL 230
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R P R GLI ARLLGA +AEGEIL FLDAHCEC GWLE L+AR+ E+ T VV
Sbjct: 231 KI-VKVVRQPERKGLITARLLGASKAEGEILTFLDAHCECFHGWLEPLLARIVEEPTAVV 289
Query: 317 CPVIDIISDVTFAY---VRSFELH-WGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + V + H G F+W L F W + KRKD T P KTP
Sbjct: 290 SPEITTIDLNTFQFHKPVATARAHNRGNFDWSLTFGWEGIPDYENA-KRKDETYPVKTPT 348
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF IG YD++M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 349 FAGGLFSISKAYFEKIGTYDDKMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 408
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G +EV+ N R+A VWMD++ +++ + A K ++ + RL+LR+ L
Sbjct: 409 PHTFPKG-TEVITRNQVRLAEVWMDDYKLIFYRRSQSAAKMAKEKGFGDISDRLKLREDL 467
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL++V+P F+P + FG +++ +C L G N P + C
Sbjct: 468 QCKNFSWYLSNVYPEAFVPDLSPVKFGALKNRGAQQC----LDVGESNNGGKPVIMYTCH 523
Query: 549 HL 550
++
Sbjct: 524 NM 525
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/134 (58%), Positives = 96/134 (71%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++RTL D R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 137 FNQFASDRISLHRTLGDDTRPPECVDRKFRRCPALPTTSVIIVFHNEAWSTLLRTVYSVL 196
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP + LKEI++VDDAST E L L+EYV L + +V+R P R GLI ARLLGA +A
Sbjct: 197 HTSPAAFLKEIIMVDDASTAEHLHGKLEEYVKALKI-VKVVRQPERKGLITARLLGASKA 255
Query: 175 EGEILVFLDAHCEC 188
EGEIL FLDAHCEC
Sbjct: 256 EGEILTFLDAHCEC 269
>gi|51315700|sp|Q6P6V1.1|GLT11_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|38303875|gb|AAH62004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Rattus norvegicus]
Length = 608
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 189/428 (44%), Positives = 258/428 (60%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD+S + LK LDEY+ +
Sbjct: 150 LPTASVVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + ED V
Sbjct: 210 LPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAIILEDPHTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W SD + T P ++P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPVSD-LGGADSATAPIRSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ K + + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSFGNISERVELRKKLGCQSFK 445
Query: 496 WYLTHVWPHH-------------FLPMDDK-----FFGRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +V+P F+ K GR+ H+QT+KC+ +A+G +Q
Sbjct: 446 WYLDNVYPEMQVSGPKARLQQPVFINRGPKRPRVLLRGRLYHLQTNKCL---VAQGRSSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + TQ+++ +LI + +CLD+ E + PR+ SG
Sbjct: 503 KGGLVLLKACDYGDP-TQVWIYNEEHELILNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDKE 605
Q+WT+ K
Sbjct: 560 QQWTFGKN 567
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R +C+ K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAECRGKSYPTD-LPTASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDEY+ + L +VIR+ R GLI+ R++G
Sbjct: 173 SVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|27696612|gb|AAH43331.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Mus musculus]
Length = 633
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGNNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKKYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++K+L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTIYPEAYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDDYLHEKLEEYIKQFSI-VKIVRQQERKGLITARLLGAAVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|75832150|ref|NP_001015032.2| polypeptide N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
gi|74353669|gb|AAI01887.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Rattus
norvegicus]
gi|149022135|gb|EDL79029.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 [Rattus norvegicus]
Length = 633
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I R YF HIG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++K+L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGDKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDDYLHEKLEEYIKQFSI-VKIVRQQERKGLITARLLGAAVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|21450297|ref|NP_659157.1| polypeptide N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|51316059|sp|Q921L8.1|GLT11_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 11;
AltName: Full=Polypeptide GalNAc transferase 11;
Short=GalNAc-T11; Short=pp-GaNTase 11; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 11;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11
gi|15030306|gb|AAH11428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|18204499|gb|AAH21504.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mus musculus]
gi|21529335|emb|CAC79626.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Mus
musculus]
gi|21707973|gb|AAH34185.1| Galnt11 protein [Mus musculus]
gi|23274082|gb|AAH36143.1| Galnt11 protein [Mus musculus]
gi|23274085|gb|AAH36145.1| Galnt11 protein [Mus musculus]
gi|33321872|gb|AAQ06668.1| UDP-GalNAc:polypeptide N-Acetylgalactosaminyltransferase T11 [Mus
musculus]
gi|74149639|dbj|BAE36442.1| unnamed protein product [Mus musculus]
gi|148671131|gb|EDL03078.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11, isoform CRA_b [Mus
musculus]
Length = 608
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 189/440 (42%), Positives = 261/440 (59%), Gaps = 29/440 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ EC LP +SIVI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD+S +
Sbjct: 138 NAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDD 197
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LK LDEY+ + L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+
Sbjct: 198 LKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+A + ED VVCPVIDIIS T AY S + G FNW LHF+W S+ +
Sbjct: 258 LLAIILEDPHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPVSE-LGGPDG 315
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P ++P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS
Sbjct: 316 ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILPCSR 375
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+
Sbjct: 376 VGHIFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KNKSFGNISERV 433
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKF------------------FGRIRHVQTHKC 525
ELRK+L C SFKWYL +++P +P + GR+ H+QT+KC
Sbjct: 434 ELRKKLGCQSFKWYLDNIYPEMQIPGPNAKPQQPVLINRGPKRPRVLQRGRLYHLQTNKC 493
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP 585
+ +A+G +Q G L C + TQ+++ +LI + +CLD+ E + P
Sbjct: 494 L---VAQGRSSQKGGLVLLKTCDYGDP-TQVWIYNEDHELILNN-LLCLDMSETRSSDPP 548
Query: 586 RVRILACSGFNRQRWTYDKE 605
R+ SG Q+WT+ K
Sbjct: 549 RLMKCHGSG-GSQQWTFGKN 567
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R +C+ K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAECRRKSYPTD-LPTASIVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDEY+ + L +VIR+ R GLI+ R++G
Sbjct: 173 SVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|149634819|ref|XP_001513114.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Ornithorhynchus anatinus]
Length = 608
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 193/439 (43%), Positives = 260/439 (59%), Gaps = 31/439 (7%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC LP +S+VI F+NEA+SALLRT+HSV+ R+P +L EI+LVDD S + LK
Sbjct: 140 ECKEKSYPPHLPAASVVICFYNEAFSALLRTIHSVLDRTPAHLLHEIILVDDNSELDDLK 199
Query: 247 SSLDEYVAKLSVP--TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
S LDEY+ +L +P +VIR+ R GLI+ R++GA QA GE+LVFLD+HCE WL+ L
Sbjct: 200 SGLDEYI-RLHLPRNIQVIRNEKREGLIRGRMIGAAQATGEVLVFLDSHCEVNAMWLQPL 258
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ + EDR VVCPVIDII T AY S + G FNW LHF+W S+ +
Sbjct: 259 LVPIREDRRTVVCPVIDIIGADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPGRA 316
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V
Sbjct: 317 TAPIKSPTMAGGLFAMNREYFRELGQYDSGMDIWGGENLEISFRIWMCGGQLFIIPCSRV 376
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY PGG + + +L R+A VWMDE+ E YF PE + R + R+
Sbjct: 377 GHIFRKRRPYGSPGGQDTMAHNSL-RLAHVWMDEYKEQYFALRPEL-RLRSYGNISERVT 434
Query: 485 LRKQLKCHSFKWYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCV 526
LRK+L C SFKWYL V+P + P F GR+ H+QT+KC+
Sbjct: 435 LRKKLGCKSFKWYLDTVYPEMQISGPNARPQPPAFVNRGPKRPRILQRGRLYHLQTNKCL 494
Query: 527 EKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR 586
A+G +Q G L C + L Q+++ +LI + +CLD+ E + PR
Sbjct: 495 A---AQGHPSQKGGRVVLKECDY-GDLNQVWIYNEEHELILNN-LLCLDMSETRSSDPPR 549
Query: 587 VRILACSGFNRQRWTYDKE 605
+ SG Q+WT+ ++
Sbjct: 550 LMKCHGSG-GSQQWTFGRK 567
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 79/186 (42%), Positives = 118/186 (63%), Gaps = 19/186 (10%)
Query: 29 VGHYLGDSVDGGLHSNLSDAQQL---------------YQINRFNLLVSDRIPVNRTLPD 73
GH G++V+ + L + ++ YQ + FN+L+S+R+ +R +PD
Sbjct: 76 AGHGFGNAVEDPVRGGLEFSPEMGMIFNEHDQEVRDLGYQKHAFNVLISNRLGSHRDVPD 135
Query: 74 VRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 133
R +CK K + LP +S+VI F+NEA+SALLRT+HSV+ R+P +L EI+LVDD S
Sbjct: 136 TRDAECKEKSYPPH-LPAASVVICFYNEAFSALLRTIHSVLDRTPAHLLHEIILVDDNSE 194
Query: 134 REFLKSSLDEYVAKLSVP--TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLV 191
+ LKS LDEY+ +L +P +VIR+ R GLI+ R++GA QA GE+LVFLD+HCE +
Sbjct: 195 LDDLKSGLDEYI-RLHLPRNIQVIRNEKREGLIRGRMIGAAQATGEVLVFLDSHCEVNAM 253
Query: 192 FNEEFL 197
+ + L
Sbjct: 254 WLQPLL 259
>gi|332227141|ref|XP_003262749.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Nomascus leucogenys]
Length = 532
Score = 334 bits (857), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 48 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 107
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 108 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLVKLPKVKCLRNNERQGLVR 161
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 162 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 221
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 222 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 280
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 281 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 340
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 341 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 399
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 400 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 452
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 453 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 483
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 54 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 112
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 113 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLVKLPKVKCLRNNERQGLVRSRIRG 166
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 167 ADIAQGTTLTFLDSHCE 183
>gi|402865473|ref|XP_003896947.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Papio
anubis]
Length = 608
Score = 334 bits (857), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 193/437 (44%), Positives = 260/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNMMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +++P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNIYPEMQISGPHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNMMWLQPLL 259
>gi|390347269|ref|XP_781402.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 749
Score = 334 bits (857), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 178/423 (42%), Positives = 255/423 (60%), Gaps = 22/423 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP++S++I F E+WS LLRTVHSV++RSP ++ E+LLVDD S R++
Sbjct: 297 HNLCKTKEYSDDLPRTSVIICFTEESWSTLLRTVHSVLNRSPPELIAEVLLVDDFSQRDY 356
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK LDEY+ KL +V+R P R GLI+ARL+GA A+G +L FLD+H EC +GWLE L
Sbjct: 357 LKEPLDEYMKKLP-KVKVVRLPKREGLIRARLIGAEMAQGPVLTFLDSHVECNVGWLEPL 415
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ R+ +D T VVCP ID I +F Y S GAFNWE+ F W +A +R D
Sbjct: 416 LQRIHDDPTNVVCPAIDAIDATSFEYAGSGATIIGAFNWEMKFTWNGIPEYEAR-RRDDE 474
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+ P ++PAMAGGLF+ID+ +F+ IG YD +WG ENLE+SF++W CGGS+EI PCS V
Sbjct: 475 SWPIRSPAMAGGLFSIDKDFFYRIGTYDPGFDIWGAENLELSFKIWMCGGSLEIIPCSRV 534
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDE-WAEFYFKFNPEAEKQRDKQAVRSRL 483
AH+FRK PY FP G + N R+ VW+DE + + ++ P+ Q + V R+
Sbjct: 535 AHIFRKQQPYKFPDGNVKTFMRNTMRLVAVWVDEPYRDIFYSLKPQLMGQ-EYGDVSDRI 593
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGR--IRHVQTHKCVEKPLAKGSMNQASGP 541
+LR++LKCH F+WYL +V+P +P D K R +R+ T C++ + KG +
Sbjct: 594 KLREELKCHDFQWYLDNVYPALKVP-DTKVRARGDVRNAATSMCLDS-MGKGVL------ 645
Query: 542 ASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
+ PC H Q F + L ++ CL P+ + ++R+L C+ + R T
Sbjct: 646 -GMFPC-HGEGNNQAFTLTWDDQLKHKNK--CL--PKIK--TPGQLRMLGCNMASTARIT 697
Query: 602 YDK 604
+ K
Sbjct: 698 HIK 700
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/143 (51%), Positives = 103/143 (72%), Gaps = 2/143 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN VSD I V R++PDVR CKTK ++++ LP++S++I F E+WS LLRTVHSV++R
Sbjct: 278 FNEYVSDMISVERSVPDVRHNLCKTKEYSDD-LPRTSVIICFTEESWSTLLRTVHSVLNR 336
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP ++ E+LLVDD S R++LK LDEY+ KL +V+R P R GLI+ARL+GA A+G
Sbjct: 337 SPPELIAEVLLVDDFSQRDYLKEPLDEYMKKLP-KVKVVRLPKREGLIRARLIGAEMAQG 395
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
+L FLD+H EC + + E L +
Sbjct: 396 PVLTFLDSHVECNVGWLEPLLQR 418
>gi|1617312|emb|CAA63371.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
(GalNAc-T3) [Homo sapiens]
Length = 633
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 RCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|397507787|ref|XP_003824367.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
paniscus]
Length = 633
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQSLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|426335181|ref|XP_004029111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
3 [Gorilla gorilla gorilla]
Length = 517
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 33 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 92
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 93 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 146
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 147 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 206
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 207 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 325
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 326 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 384
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 385 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQILQ-- 437
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 438 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 468
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 39 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 97
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 98 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 151
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 152 ADIAQGTTLTFLDSHCE 168
>gi|296204662|ref|XP_002749425.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Callithrix jacchus]
Length = 633
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAVLLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGASVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDMNTFEFNKPSPYGSHHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKTFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGHPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDR+ ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRVSLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAVLLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGASVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|359465585|ref|NP_001240756.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 3 [Homo
sapiens]
gi|119620894|gb|EAX00489.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_d [Homo sapiens]
gi|193783719|dbj|BAG53701.1| unnamed protein product [Homo sapiens]
Length = 532
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 190/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 48 RVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 107
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 108 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 161
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 162 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 221
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 222 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 280
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 281 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 340
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +++P +P +
Sbjct: 341 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSI 399
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 400 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 452
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 453 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 483
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 54 YKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 112
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 113 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 166
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 167 ADIAQGTTLTFLDSHCE 183
>gi|153266878|ref|NP_004473.2| polypeptide N-acetylgalactosaminyltransferase 3 [Homo sapiens]
gi|209572629|sp|Q14435.2|GALT3_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 3;
AltName: Full=Polypeptide GalNAc transferase 3;
Short=GalNAc-T3; Short=pp-GaNTase 3; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 3;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 3
gi|62822129|gb|AAY14678.1| unknown [Homo sapiens]
gi|109731077|gb|AAI13568.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|109731742|gb|AAI13566.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Homo
sapiens]
gi|119631729|gb|EAX11324.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3), isoform
CRA_b [Homo sapiens]
gi|313883200|gb|ADR83086.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3)
[synthetic construct]
Length = 633
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|403258871|ref|XP_003921965.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Saimiri
boliviensis boliviensis]
Length = 633
Score = 334 bits (857), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAVLLKEIILVDDASVDDYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGASVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W T + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSHHNRGNFDWSLSFGWETLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAVLLKEIILVDDASVDDYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGASVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|297668747|ref|XP_002812581.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Pongo abelii]
gi|297668749|ref|XP_002812582.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Pongo abelii]
gi|297668751|ref|XP_002812583.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
3 [Pongo abelii]
Length = 633
Score = 334 bits (857), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|402888519|ref|XP_003907606.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Papio
anubis]
Length = 633
Score = 334 bits (856), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|114581503|ref|XP_515871.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Pan
troglodytes]
gi|410331347|gb|JAA34620.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Pan
troglodytes]
Length = 633
Score = 334 bits (856), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|426358557|ref|XP_004046575.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Gorilla gorilla gorilla]
Length = 527
Score = 334 bits (856), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 193/437 (44%), Positives = 260/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 60 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 180 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 237
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 238 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 297
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 298 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 355
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT++C+
Sbjct: 356 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNRCL-- 413
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 414 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 470
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 471 KCHGSG-GSQQWTFGKN 486
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 33 YQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVH 91
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 92 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 151
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 152 AAHATGEVLVFLDSHCEVNVMWLQPLL 178
>gi|109099998|ref|XP_001096023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Macaca mulatta]
gi|297264195|ref|XP_002798936.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
2 [Macaca mulatta]
gi|355564937|gb|EHH21426.1| hypothetical protein EGK_04492 [Macaca mulatta]
gi|355750584|gb|EHH54911.1| hypothetical protein EGM_04018 [Macaca fascicularis]
Length = 633
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|297265738|ref|XP_001104879.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
2 [Macaca mulatta]
Length = 532
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 48 RVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 107
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 108 LRTIRSVLNRTPMHLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVR 161
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 162 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 221
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 222 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 280
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 281 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 340
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 341 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 399
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 400 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 452
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 453 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 483
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 54 YKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 112
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 113 SVLNRTPMHLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRG 166
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 167 ADIAQGTTLTFLDSHCE 183
>gi|198426119|ref|XP_002128247.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
6 [Ciona intestinalis]
Length = 627
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 165/332 (49%), Positives = 225/332 (67%), Gaps = 8/332 (2%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + + LP +S++I+FHNEAW ALLRTVHSV+ SP+ +LKEI+LVDDAST L
Sbjct: 172 ECKVRKWRKPLPDTSVIIIFHNEAWCALLRTVHSVLENSPKILLKEIILVDDASTLSNLG 231
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L +YVAKL + ++IR P R GLI+ARL GA++A+G +L FLD+HCEC WLE ++
Sbjct: 232 KELTDYVAKLQI-VKIIRLPSRAGLIRARLAGAQEAQGSVLTFLDSHCECAPHWLEPMLE 290
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+AED TRVVCPVI++I TFA + + G +W L F W + + D
Sbjct: 291 RIAEDNTRVVCPVIEVIDADTFAMSLTTARSVQTGILSWSLGFNWAPRKINPGQPIKND- 349
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
E + MAGGLFA+ R YF+H+G+YD +M VWGGEN+EMS R+W CGGS+EI PCSHV
Sbjct: 350 -EALTSATMAGGLFAMSRKYFYHLGSYDNDMLVWGGENIEMSLRIWMCGGSLEIHPCSHV 408
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK +PYS PGG S+V+ N RVA VW+DE+ E Y+K P A + + + +R++
Sbjct: 409 GHVFRKRAPYSHPGG-SDVITHNNKRVAEVWLDEYKEQYYKRVPRA-RAVEAGDLTARIK 466
Query: 485 LRKQLKCHSFKWYLTHVWPHHF-LPMDDKFFG 515
LR LKC +F+WY+T+++P + P +D G
Sbjct: 467 LRHDLKCRNFQWYITNIYPALYATPKEDILKG 498
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/143 (50%), Positives = 101/143 (70%), Gaps = 2/143 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN V +I ++R++ D R +CK + + + LP +S++I+FHNEAW ALLRTVHSV+
Sbjct: 151 FNHYVCQKISLHRSVGDKRDQECKVRKWRKP-LPDTSVIIIFHNEAWCALLRTVHSVLEN 209
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP+ +LKEI+LVDDAST L L +YVAKL + ++IR P R GLI+ARL GA++A+G
Sbjct: 210 SPKILLKEIILVDDASTLSNLGKELTDYVAKLQI-VKIIRLPSRAGLIRARLAGAQEAQG 268
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
+L FLD+HCEC + E L +
Sbjct: 269 SVLTFLDSHCECAPHWLEPMLER 291
>gi|197099330|ref|NP_001124852.1| polypeptide N-acetylgalactosaminyltransferase 14 [Pongo abelii]
gi|55726129|emb|CAH89838.1| hypothetical protein [Pongo abelii]
Length = 552
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGHANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQSNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 503
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|332234083|ref|XP_003266237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Nomascus leucogenys]
Length = 633
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|332227139|ref|XP_003262748.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Nomascus leucogenys]
Length = 552
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLVKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 503
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLVKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|449270901|gb|EMC81545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Columba livia]
Length = 608
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 188/427 (44%), Positives = 256/427 (59%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP +S++I F+NEA SALLRTVHSV+ R+P +L EI+LVDD S LK LDEYV +
Sbjct: 150 LPSASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDNSELADLKKDLDEYVKTQ 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L T+++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+ + EDR V
Sbjct: 210 LPKTTKLVRNEKREGLIRGRMIGASHATGQVLVFLDSHCEVNEMWLQPLLTPIREDRRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLTYSSSPVVR-GGFNWGLHFKWDLVPLSE-LEGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ E YF PE + R+ + R+ELRK+L C SFK
Sbjct: 388 SPGGQDTMAHNSL-RLAHVWMDEYKEQYFALRPEL-RMRNYGNITDRVELRKRLNCKSFK 445
Query: 496 WYLTHVWPHH-------------FLPMDDKF-----FGRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P F+ K GR+ H+QT+KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQISGPNAKAPQPVFINRAQKRPKIIQRGRLYHLQTNKCL---VAQGHPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G + C + Q+++ +LI + +CLDV E + PR+ SG
Sbjct: 503 KGGLVVVRECDYNDP-NQVWIYNEDHELILNN-LLCLDVSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDK 604
Q+WT+ K
Sbjct: 560 QQWTFGK 566
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R KC+ K + + LP +S++I F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHREVPDTRDVKCREKSYPSD-LPSASVIICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S LK LDEYV +L T+++R+ R GLI+ R++G
Sbjct: 173 SVLDRTPAHLLHEIILVDDNSELADLKKDLDEYVKTQLPKTTKLVRNEKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A G++LVFLD+HCE ++ + L
Sbjct: 233 ASHATGQVLVFLDSHCEVNEMWLQPLL 259
>gi|334330196|ref|XP_003341314.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Monodelphis
domestica]
Length = 631
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 179/418 (42%), Positives = 255/418 (61%), Gaps = 23/418 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I+FHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L LDEY+ +
Sbjct: 182 LPTTSVIIIFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASEDDYLHDKLDEYIKQF 241
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +V+R R GLI ARLLGA A E L FLD+HCEC GWLE L++R+AE+ T VV
Sbjct: 242 QI-VKVVRQKERQGLINARLLGASVATAETLTFLDSHCECFYGWLEPLLSRIAENYTAVV 300
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P +TP
Sbjct: 301 SPDIASIDLTTFEFSKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIRTPT 359
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 360 FAGGLFSISKKYFEYIGTYDEEMKIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 419
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N EA KQ+ + RL++R +L
Sbjct: 420 PHTFPKG-TQVIARNQVRLAEVWMDEFKEIFYRRNTEAAKIVKQKAYGDISKRLDIRHRL 478
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+++ H C L G NQ P + C
Sbjct: 479 QCKNFTWYLNNIYPEIYVPDLNPVISGYIQNIGRHLC----LDVGENNQGGKPLIMYTC- 533
Query: 549 HLPVLTQMFVMKLPTDLI-ATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
H Q F ++ + + +CL + V++ ACS ++ +T D++
Sbjct: 534 HFLGGNQYFEXSEQHEIRHSIQKELCLHA------LQGPVQMKACSYKGQKTFTVDEQ 585
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN VSDRI ++R L PD R P+C + F LP +S++I+FHNEAWS LLRTVHSV+
Sbjct: 148 FNAFVSDRISLHRDLGPDTRPPECIEQKFKRCPSLPTTSVIIIFHNEAWSTLLRTVHSVL 207
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L LDEY+ + + +V+R R GLI ARLLGA A
Sbjct: 208 YSSPAILLKEIILVDDASEDDYLHDKLDEYIKQFQI-VKVVRQKERQGLINARLLGASVA 266
Query: 175 EGEILVFLDAHCEC 188
E L FLD+HCEC
Sbjct: 267 TAETLTFLDSHCEC 280
>gi|432865221|ref|XP_004070476.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Oryzias latipes]
Length = 621
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 232/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP S+LKEI+LVDDAS E LKS L+++V L
Sbjct: 173 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPASLLKEIILVDDASVAEHLKSQLEDFVKHL 232
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R P R GLI ARLLGA A E+L FLDAHCEC GWLE L+AR+ E+ T VV
Sbjct: 233 KI-VRVVRQPERKGLITARLLGASIATAEVLTFLDAHCECFHGWLEPLLARIVEEPTAVV 291
Query: 317 CPVIDIISDVTFAYVRSFELHW----GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I F + + + G F+W L F W +A RKD T P KTP
Sbjct: 292 SPEITTIDLNNFNFNKPIATNRAYNRGNFDWSLTFGWEAI-PEEARRLRKDETYPVKTPT 350
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG YD++M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 351 FAGGLFSISKKYFEHIGTYDDKMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 410
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA---EKQRDKQAVRSRLELRKQL 489
P++FP G +EV+ N R+A VWMD++ + Y++ N A +++ + RL+LR+ L
Sbjct: 411 PHTFPKG-TEVITRNQVRLAEVWMDDYKKIYYRRNKNAAIMAQEKKYGDISDRLKLREDL 469
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F WYL ++P F+P + + FG I+++ + C L G NQ P + C
Sbjct: 470 HCKNFSWYLNTIYPEIFVPDLTPEKFGAIKNLGSDTC----LDVGENNQGGKPVIMYMCH 525
Query: 549 HL 550
++
Sbjct: 526 NM 527
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/134 (57%), Positives = 95/134 (70%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R+L D R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 139 FNQFASDRISLSRSLGEDTRPPECVERKFPRCPALPTTSVIIVFHNEAWSTLLRTVYSVL 198
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP S+LKEI+LVDDAS E LKS L+++V L + RV+R P R GLI ARLLGA A
Sbjct: 199 HTSPASLLKEIILVDDASVAEHLKSQLEDFVKHLKI-VRVVRQPERKGLITARLLGASIA 257
Query: 175 EGEILVFLDAHCEC 188
E+L FLDAHCEC
Sbjct: 258 TAEVLTFLDAHCEC 271
>gi|443703000|gb|ELU00789.1| hypothetical protein CAPTEDRAFT_190622 [Capitella teleta]
Length = 507
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 185/432 (42%), Positives = 255/432 (59%), Gaps = 27/432 (6%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
+C+ V + +P +S+VI+FHNEAWS LLRTVHSV++RSP L E++L+DD S R L
Sbjct: 45 QCSRVSYPKVMPNASVVIIFHNEAWSPLLRTVHSVVNRSPPEYLHEVILLDDFSDRAGLG 104
Query: 247 SSLDEYVAKLSVP---TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LD Y+ K + P +V+R+P R GLI+AR+LGA+ A GE+LVFLD+HCEC + WLE
Sbjct: 105 EKLDGYI-KDTWPDGIVKVVRAPERQGLIRARVLGAKAATGEVLVFLDSHCECNVQWLEP 163
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
LVAR+ E R+ ++CP+ID+I +Y G F W LHF W + +RK
Sbjct: 164 LVARIKESRSALLCPMIDVIDAKAMSYNGIGAGSVGGFWWSLHFSWRPLPQRERK-RRKS 222
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
E ++P MAGGLFA DR YFF IG YD M VWGGENLE+SFRVW CGG++E PCS
Sbjct: 223 SVETIRSPTMAGGLFAADRKYFFEIGGYDPGMDVWGGENLEISFRVWMCGGTLEFVPCSR 282
Query: 424 VAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFK-------FNPEAEKQRD 475
V H+FR + PY+FPG ++ +G N R+A VWMD + ++ NP+ D
Sbjct: 283 VGHIFRSSHPYTFPG--NKDTHGLNSKRLAEVWMDGYKRLFYHHRRDLLVINPQFNA--D 338
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGS 534
RL+LR+ LKC SFKWYL +V+P F+P ++ +G +R+ ++ C++
Sbjct: 339 AGDFSDRLQLRRDLKCKSFKWYLENVYPEKFIPDENVIAYGMVRNPSSNLCLDTLSKDEK 398
Query: 535 MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG 594
M G L C Q+F + +L E C+D E SP V+++ C G
Sbjct: 399 MVFNLG---LYGCQGGVSSNQLFSLSQSNEL--RREESCMDSVGGEG--SP-VKLMPCHG 450
Query: 595 F-NRQRWTYDKE 605
Q WTY+ E
Sbjct: 451 SRGHQEWTYNLE 462
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 110/156 (70%), Gaps = 5/156 (3%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A +L++ FN++ SD I +NR++ D R P+C ++V + +P +S+VI+FHNEAWS L
Sbjct: 14 EADKLFKKEAFNIVASDMIALNRSVSDNRDPQC-SRVSYPKVMPNASVVIIFHNEAWSPL 72
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP---TRVIRSPGRVGL 163
LRTVHSV++RSP L E++L+DD S R L LD Y+ K + P +V+R+P R GL
Sbjct: 73 LRTVHSVVNRSPPEYLHEVILLDDFSDRAGLGEKLDGYI-KDTWPDGIVKVVRAPERQGL 131
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+AR+LGA+ A GE+LVFLD+HCEC + + E + +
Sbjct: 132 IRARVLGAKAATGEVLVFLDSHCECNVQWLEPLVAR 167
>gi|348574564|ref|XP_003473060.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Cavia porcellus]
Length = 552
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 193/452 (42%), Positives = 261/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHPRCTLLGYHTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPDDCKQLVRLPKVKCLRNGERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRMRGAEIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHFRW S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFRWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL LR+ L+CHSFKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIESRLNLRRNLQCHSFKWYLENVYPELSVPQDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ L PC L +Q + +I
Sbjct: 420 QKGNIR--QRQKCLE---SQKHNNQEIPNLRLSPCVKLKGEEAKSQGWAFTYTQQIIQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G RQ+W+
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDERQQWS 503
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 88/137 (64%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R P+C ++ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHPRCTLLGYHTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------NDPDDCKQLVRLPKVKCLRNGERQGLVRSRMRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 AEIAQGTTLTFLDSHCE 203
>gi|60498976|ref|NP_078848.2| polypeptide N-acetylgalactosaminyltransferase 14 isoform 1 [Homo
sapiens]
gi|51316071|sp|Q96FL9.1|GLT14_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
gi|14714999|gb|AAH10659.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Homo
sapiens]
gi|21749654|dbj|BAC03634.1| unnamed protein product [Homo sapiens]
gi|28268674|dbj|BAC56889.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Homo sapiens]
gi|37182635|gb|AAQ89118.1| RRLT2434 [Homo sapiens]
gi|119620891|gb|EAX00486.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_a [Homo sapiens]
gi|325463357|gb|ADZ15449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[synthetic construct]
gi|345500006|emb|CAA70505.4| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 14 [Homo
sapiens]
Length = 552
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 190/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +++P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 503
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|441661684|ref|XP_004091530.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Nomascus leucogenys]
Length = 535
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 51 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 110
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 111 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLVKLPKVKCLRNNERQGLVR 164
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 165 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 224
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 225 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 283
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 284 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 343
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 344 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 402
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 403 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 455
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 456 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 486
Score = 105 bits (261), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 57 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 115
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 116 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLVKLPKVKCLRNNERQGLVRSRIRG 169
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 170 ADIAQGTTLTFLDSHCE 186
>gi|312087698|ref|XP_003145574.1| glycosyl transferase [Loa loa]
gi|307759263|gb|EFO18497.1| glycosyl transferase [Loa loa]
Length = 520
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 174/383 (45%), Positives = 249/383 (65%), Gaps = 19/383 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C + + LP +S+VI+F +EAWS L+RTVHSVI+R+P +L+EI+LVDD S R+
Sbjct: 60 HEQCRKINYPDNLPVASVVIIFTDEAWSPLMRTVHSVINRTPFKLLQEIILVDDFSQRDD 119
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+ + R+IR+ R GLI+A+LLGA++A G++L+FLD+HCE + GWLE L
Sbjct: 120 LKGRLEEYIKRFGNKVRLIRARERQGLIRAKLLGAKEAIGDVLIFLDSHCEVSEGWLEPL 179
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKD 363
+AR+ E+R+ V+CP+ID IS T AY S L + G F W LHFRW I D
Sbjct: 180 LARIKENRSVVLCPIIDHISAETLAYSGSDRLANVGGFWWSLHFRWDPLPEEYYGI---D 236
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P ++P MAGGLFA+DR YFF +G YD +M +WGGENLE+SFRVW CGG IE PCSH
Sbjct: 237 PTKPIRSPTMAGGLFAVDRLYFFEVGGYDPKMDIWGGENLEISFRVWMCGGGIEFIPCSH 296
Query: 424 VAHLFRKASPYSFPG-GVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
V H+FR PY+ G G +E ++G N R+A VWMD++ FY+ + K+++ +
Sbjct: 297 VGHIFRAGHPYNMTGPGNNEDVHGTNSKRLAEVWMDDYKRFYYIHRSDL-KEKNVGDLSE 355
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF--FGRIRHVQTHKCVEKPLAKGSMNQAS 539
R LRK+LKC SFKWYL +V + F+ +D+ FG +R+ + C++ ++ Q
Sbjct: 356 RRALRKKLKCKSFKWYLENVAKNKFI-LDENVAAFGALRNPSSGFCLD------TLQQDE 408
Query: 540 GPA---SLLPCTHLPVLTQMFVM 559
A ++ PC + Q+F +
Sbjct: 409 KEAVSLAVFPCQNGKSEAQIFSL 431
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 69/143 (48%), Positives = 104/143 (72%), Gaps = 1/143 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
NL+ SD I ++R+LPD R +C+ K+ + LP +S+VI+F +EAWS L+RTVHSVI+R
Sbjct: 41 MNLVASDMISLDRSLPDHRHEQCR-KINYPDNLPVASVVIIFTDEAWSPLMRTVHSVINR 99
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P +L+EI+LVDD S R+ LK L+EY+ + R+IR+ R GLI+A+LLGA++A G
Sbjct: 100 TPFKLLQEIILVDDFSQRDDLKGRLEEYIKRFGNKVRLIRARERQGLIRAKLLGAKEAIG 159
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
++L+FLD+HCE + + E L +
Sbjct: 160 DVLIFLDSHCEVSEGWLEPLLAR 182
>gi|221042368|dbj|BAH12861.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 190/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 33 RVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 92
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 93 LRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 146
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 147 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 206
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 207 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 325
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +++P +P +
Sbjct: 326 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSI 384
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 385 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQKILQ-- 437
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 438 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 468
Score = 105 bits (263), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 39 YKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 97
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 98 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 151
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 152 ADIAQGTTLTFLDSHCE 168
>gi|109102562|ref|XP_001105195.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
5 [Macaca mulatta]
Length = 552
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 503
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|350593559|ref|XP_003133495.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Sus
scrofa]
Length = 633
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++++L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKRRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNNIYPEAYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|195380503|ref|XP_002049010.1| GJ21354 [Drosophila virilis]
gi|194143807|gb|EDW60203.1| GJ21354 [Drosophila virilis]
Length = 693
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 178/413 (43%), Positives = 249/413 (60%), Gaps = 25/413 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+ ++I FHNEAWS LLRTVHSV+ RSP ++ +I+LVDD S LK L++Y A
Sbjct: 218 LPKTDVIICFHNEAWSVLLRTVHSVLDRSPPELIGQIILVDDYSDMPHLKKQLEDYFASY 277
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +++R P R GLI+ARLLGA+ A+ ++ +LD+HCEC GWLE L+ R+A + T VV
Sbjct: 278 PM-VQIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPLLDRIARNSTTVV 336
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
CPVID+I D T F Y S ++ G F+W L F W+ + + + EP +P MA
Sbjct: 337 CPVIDVIDDTTLEFHYRDSSGVNVGGFDWNLQFSWHAVPEREKR-RHNNTAEPVYSPTMA 395
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+IDR +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SPY
Sbjct: 396 GGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPY 455
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ GV+ VL N R+A VWMD+++++Y + + D V R +LR+ L+C SF
Sbjct: 456 KWRTGVN-VLKKNSVRLAEVWMDDYSKYYLQR--IGMDKGDYGDVSERKKLREDLQCKSF 512
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
KWYL +++P F+P D G IR++ C++ P K +M + P L PC H
Sbjct: 513 KWYLDNIYPELFIPGDAVANGEIRNLGYGGRTCLDSPTGKRNMKK---PVGLYPC-HKQG 568
Query: 553 LTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG-FNRQRWTYDK 604
Q + + TD+ C+D P+ D SP V + C G Q W K
Sbjct: 569 GNQ--IKSINTDM-------CVDAPK-TGDESP-VGVYPCHGQGGHQYWMLSK 610
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/145 (47%), Positives = 94/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+R+LPD R CK LPK+ ++I FHNEAWS LLRTVHSV+
Sbjct: 184 NAFNQYVSDLISVHRSLPDPRDAWCKDSARYLSNLPKTDVIICFHNEAWSVLLRTVHSVL 243
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A + +++R P R GLI+ARLLGA+ A
Sbjct: 244 DRSPPELIGQIILVDDYSDMPHLKKQLEDYFASYPM-VQIVRGPQREGLIRARLLGAKYA 302
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ ++ +LD+HCEC + E L +
Sbjct: 303 KSPVITYLDSHCECAEGWLEPLLDR 327
>gi|109096689|ref|XP_001083664.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
mulatta]
Length = 641
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V +V+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CHSF WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHSFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V +V+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VKVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|297265736|ref|XP_002799240.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Macaca
mulatta]
Length = 517
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 264/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 33 RVGDDPYKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 92
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 93 LRTIRSVLNRTPMHLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVR 146
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 147 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 206
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 207 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 265
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 266 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 325
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 326 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 384
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 385 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 437
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 438 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 468
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 39 YKLYAFNQRESERISSNRAIPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 97
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 98 SVLNRTPMHLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRG 151
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 152 ADIAQGTTLTFLDSHCE 168
>gi|426220977|ref|XP_004004688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Ovis
aries]
Length = 633
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 230/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W T + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWETLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQKERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|395519661|ref|XP_003763961.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Sarcophilus harrisii]
Length = 631
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I+FHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 182 LPTTSVIIIFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASEDEYLHDKLDEYVKQF 241
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +++R R GLI ARLLGA A E L FLD+HCEC GWLE L+AR+AE+ T VV
Sbjct: 242 QI-VKIVRQKERQGLINARLLGASVATAETLTFLDSHCECFYGWLEPLLARIAENYTAVV 300
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + + G F+W L F W + + +RKD T P +TP
Sbjct: 301 SPDIASIDLNTFEFNKPSPYGYNHNRGNFDWSLSFGWESLPEHERQ-RRKDETYPIRTPT 359
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 360 FAGGLFSISKEYFEYIGTYDEEMKIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 419
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N EA KQ+ + RLE++ +L
Sbjct: 420 PHSFPKG-TQVIARNQVRLAEVWMDEFKEIFYRRNTEAAKIVKQKTFGDISKRLEIKHRL 478
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +V+P ++P ++ G I++ H C L G N P + C
Sbjct: 479 QCKNFTWYLNNVYPEIYVPDLNPVISGYIQNKGRHLC----LDVGENNLGGKPLIMYTCH 534
Query: 549 HL 550
L
Sbjct: 535 GL 536
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 79/152 (51%), Positives = 99/152 (65%), Gaps = 8/152 (5%)
Query: 44 NLSDAQQLYQINR-----FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVI 96
NLS +Q + N FN VSDRI ++R L PD R P+C + F LP +S++I
Sbjct: 130 NLSPEEQKEKENGEAKHCFNAFVSDRISLHRDLGPDTRPPECIEQKFKRCPSLPTTSVII 189
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
+FHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV + + +++R
Sbjct: 190 IFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASEDEYLHDKLDEYVKQFQI-VKIVR 248
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
R GLI ARLLGA A E L FLD+HCEC
Sbjct: 249 QKERQGLINARLLGASVATAETLTFLDSHCEC 280
>gi|426358553|ref|XP_004046573.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Gorilla gorilla gorilla]
gi|426358555|ref|XP_004046574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Gorilla gorilla gorilla]
Length = 608
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 193/437 (44%), Positives = 260/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT++C+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNRCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 89/190 (46%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPVN 68
A D ID R GH L S + G+ N D Q+L YQ + FN+L+S+R+ +
Sbjct: 73 ANKIDDVIDSRVEDPEEGH-LKFSSELGMIFNERD-QELRDLGYQKHAFNMLISNRLGYH 130
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R +PD R CK K + + LP +S+VI F+NEA+SALLRTVHSVI R+P +L EI+LV
Sbjct: 131 RDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILV 189
Query: 129 DDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
DD S + LK LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE
Sbjct: 190 DDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCE 249
Query: 188 CTLVFNEEFL 197
+++ + L
Sbjct: 250 VNVMWLQPLL 259
>gi|26352932|dbj|BAC40096.1| unnamed protein product [Mus musculus]
Length = 608
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 189/440 (42%), Positives = 261/440 (59%), Gaps = 29/440 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ EC LP +SIVI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD+S +
Sbjct: 138 NAECRRKSYPTDLPTASIVICFYNEAFSALLRTVHSVVDRTPAHLLHEIILVDDSSDFDD 197
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LK LDEY+ + L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+
Sbjct: 198 LKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQP 257
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+A + ED VVCPVIDIIS T AY S + G FNW LHF+W S+ +
Sbjct: 258 LLAIILEDPHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPVSE-LGGPDG 315
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P ++P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS
Sbjct: 316 ATAPIRSPTMAGGLFAMNRQYFNDLGQYDSGMDIWGGENLEISFRIWMCGGKLFILPCSR 375
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+
Sbjct: 376 VGHIFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KNKSFGNISERV 433
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKF------------------FGRIRHVQTHKC 525
ELRK+L C SFKWYL +++P +P + GR+ H+QT+KC
Sbjct: 434 ELRKKLGCQSFKWYLDNIYPEMQIPGPNAKPQQPVLINRGPKRPRVLQRGRLYHLQTNKC 493
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP 585
+ +A+G +Q G L C + TQ+++ +LI + +CLD+ E + P
Sbjct: 494 L---VAQGRSSQKGGLVLLKTCDYGDP-TQVWIYNEDHELILNN-LLCLDMSETRSLDPP 548
Query: 586 RVRILACSGFNRQRWTYDKE 605
R+ SG Q+WT+ K
Sbjct: 549 RLMKCHGSG-GSQQWTFGKN 567
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 105/147 (71%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R +C+ K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAECRRKSYPTD-LPTASIVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDEY+ + L +VIR+ R GLI+ R++G
Sbjct: 173 SVVDRTPAHLLHEIILVDDSSDFDDLKGELDEYIQRYLPAKVKVIRNMKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|355564239|gb|EHH20739.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
gi|355762987|gb|EHH62101.1| Polypeptide N-acetylgalactosaminyltransferase 6 [Macaca
fascicularis]
gi|380809242|gb|AFE76496.1| polypeptide N-acetylgalactosaminyltransferase 6 [Macaca mulatta]
Length = 622
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V +V+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CHSF WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHSFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V +V+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VKVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|402886019|ref|XP_003906439.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Papio anubis]
gi|402886021|ref|XP_003906440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Papio anubis]
Length = 622
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 178/362 (49%), Positives = 241/362 (66%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V +V+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VKVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + + +++ N +A K +++ + RL+LR+QL
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CHSF WYL +V+P F+P + F+G I+++ T++C L G N+ P + C
Sbjct: 473 HCHSFSWYLHNVYPEMFVPDLTPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 132 bits (333), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V +V+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VKVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|195171653|ref|XP_002026618.1| GL11821 [Drosophila persimilis]
gi|194111544|gb|EDW33587.1| GL11821 [Drosophila persimilis]
Length = 658
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 181/412 (43%), Positives = 249/412 (60%), Gaps = 24/412 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP + ++I FHNEAW+ LLRTVHSV+ RSP ++ I+LVDD S LK+ L++Y A
Sbjct: 216 LPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGRIILVDDYSDMPHLKTQLEDYFAAY 275
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR R GLI+ARLLGA+ A+ +L +LD+HCECT GWLE L+ R+A + T VV
Sbjct: 276 P-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVV 334
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT-EPFKTPAM 373
CPVID+ISD T + Y S ++ G F+W L F W+ + KR + T EP +P M
Sbjct: 335 CPVIDVISDDTLEYHYRDSSGVNVGGFDWNLQFSWHAVPEREK--KRHNSTAEPVYSPTM 392
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+IDR YF +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 393 AGGLFSIDREYFNRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 452
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y + GV+ VL N R+A VWMDE++++Y ++ + D V R +LR+ L+C S
Sbjct: 453 YKWRSGVN-VLRKNSVRLAEVWMDEYSQYY--YHRIGNDKGDWGDVSDRKKLREDLQCKS 509
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLP 551
FKWYL +++P F+P D G IR++ C++ P K +A G L PC H
Sbjct: 510 FKWYLDNIYPELFIPGDAVAHGEIRNLGYGGRTCLDSPTGKKHQKKAVG---LYPC-HRQ 565
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGFNRQRWTY 602
Q +++ ++ D CLD E V + +C G Q WTY
Sbjct: 566 GGNQYWMLSKVGEIRRDD--YCLDYAGKE------VILYSCHGGKGNQFWTY 609
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 92/145 (63%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK LP + ++I FHNEAW+ LLRTVHSV+
Sbjct: 182 NAFNQYVSDLISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVL 241
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ I+LVDD S LK+ L++Y A ++IR R GLI+ARLLGA+ A
Sbjct: 242 DRSPEHLIGRIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHA 300
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 301 KAPVLTYLDSHCECTEGWLEPLLDR 325
>gi|198461537|ref|XP_002139017.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
gi|198137372|gb|EDY69575.1| GA25136 [Drosophila pseudoobscura pseudoobscura]
Length = 658
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 181/412 (43%), Positives = 249/412 (60%), Gaps = 24/412 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP + ++I FHNEAW+ LLRTVHSV+ RSP ++ I+LVDD S LK+ L++Y A
Sbjct: 216 LPATDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGRIILVDDYSDMPHLKTQLEDYFAAY 275
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR R GLI+ARLLGA+ A+ +L +LD+HCECT GWLE L+ R+A + T VV
Sbjct: 276 P-KVQIIRGKKREGLIRARLLGAQHAKAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVV 334
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT-EPFKTPAM 373
CPVID+ISD T + Y S ++ G F+W L F W+ + KR + T EP +P M
Sbjct: 335 CPVIDVISDDTLEYHYRDSSGVNVGGFDWNLQFSWHAVPEREK--KRHNSTAEPVYSPTM 392
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+IDR YF +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 393 AGGLFSIDREYFNRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 452
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y + GV+ VL N R+A VWMDE++++Y ++ + D V R +LR+ L+C S
Sbjct: 453 YKWRSGVN-VLRKNSVRLAEVWMDEYSQYY--YHRIGNDKGDWGDVSDRKKLREDLQCKS 509
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLP 551
FKWYL +++P F+P D G IR++ C++ P K +A G L PC H
Sbjct: 510 FKWYLDNIYPELFIPGDAVAHGEIRNLGYGGRTCLDSPTGKKHQKKAVG---LYPC-HRQ 565
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGFNRQRWTY 602
Q +++ ++ D CLD E V + +C G Q WTY
Sbjct: 566 GGNQYWMLSKVGEIRRDD--YCLDYAGKE------VILYSCHGGKGNQFWTY 609
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 92/145 (63%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK LP + ++I FHNEAW+ LLRTVHSV+
Sbjct: 182 NAFNQYVSDLISVHRTLPDPRDAWCKDSAHYLSNLPATDVIICFHNEAWTVLLRTVHSVL 241
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ I+LVDD S LK+ L++Y A ++IR R GLI+ARLLGA+ A
Sbjct: 242 DRSPEHLIGRIILVDDYSDMPHLKTQLEDYFAAYP-KVQIIRGKKREGLIRARLLGAQHA 300
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 301 KAPVLTYLDSHCECTEGWLEPLLDR 325
>gi|189066640|dbj|BAG36187.1| unnamed protein product [Homo sapiens]
Length = 633
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L LDEYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEEQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A K ++A + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSAGQPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L LDEYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|348585909|ref|XP_003478713.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Cavia porcellus]
Length = 633
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 230/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+A++ T VV
Sbjct: 244 SI-VKIVRQKERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIADNYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGTNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N EA KQ+ + R +RK+L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTEAAKIVKQKTFGDLSKRFAIRKRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL V+P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTVYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDDYLHEKLEEYIKQFSI-VKIVRQKERKGLITARLLGAAVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|296224175|ref|XP_002757934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Callithrix jacchus]
Length = 552
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 263/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIREIILVDDFS------NDPDDCQQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLALCKNGDDRQQWT 503
Score = 105 bits (261), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIREIILVDDFS------NDPDDCQQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|291391661|ref|XP_002712292.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oryctolagus cuniculus]
Length = 633
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 230/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRT+HSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTIHSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDMNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKNRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL V+P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTVYPEVYVPELNPVISGYIKTVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRT+HSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTIHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|296210174|ref|XP_002751861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Callithrix jacchus]
Length = 607
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 191/437 (43%), Positives = 260/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRTVHSVI R+P +L E++LVDD S + LK
Sbjct: 140 CKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEVILVDDDSDFDDLKG 199
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L ++IR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 200 ELDEYVQKYLPGKIKIIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 259
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ ED+ VVCPVIDIIS T AY S + G FNW LHFRW S+ + + T
Sbjct: 260 AIREDQHTVVCPVIDIISADTLAYSSS-PIVRGGFNWGLHFRWDLVPLSE-LGGAEGATT 317
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 318 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 377
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 378 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERIELR 435
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +++P H P F GR+ H+QT+KC+
Sbjct: 436 KKLGCKSFKWYLDNIYPEMQTSGPHAKPQQPIFVNKGPKRPKVLQRGRLYHLQTNKCL-- 493
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 494 -VAQGRPSQKGGLVVLKACDYTDP-DQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 550
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 551 KCHGSG-GSQQWTFGKN 566
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 113 YQKHAFNMLISNRLGYHRDVPDTRNAACKEKFYPPD-LPAASVVICFYNEAFSALLRTVH 171
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L E++LVDD S + LK LDEYV K L ++IR+ R GLI+ R++G
Sbjct: 172 SVIDRTPAHLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKIIRNTKREGLIRGRMIG 231
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 232 AAHATGEVLVFLDSHCEVNVMWLQPLL 258
>gi|148664577|gb|EDK96993.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
gi|148664578|gb|EDK96994.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 1, isoform CRA_a [Mus
musculus]
Length = 400
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 165/392 (42%), Positives = 230/392 (58%), Gaps = 36/392 (9%)
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
R FLK L+ YV KL VP VIR R GLI+ARL GA + G+++ FLDAHCECT GWL
Sbjct: 1 RNFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWL 60
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
E L+AR+ DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++
Sbjct: 61 EPLLARIKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 120
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
D T P +TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI C
Sbjct: 121 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 180
Query: 422 SHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
SHV H+FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + S
Sbjct: 181 SHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISS 239
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------- 534
RL LR++L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 240 RLGLRRKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFN 299
Query: 535 --------------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIAT 568
+++ +GP ++L C HL + L
Sbjct: 300 CHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHV 359
Query: 569 DESVCLDVPEYENDISPRVRILACSGFNRQRW 600
+ + CLD E+ P +R C+G Q+W
Sbjct: 360 NSNQCLDKATEEDSQVPSIR--DCTGSRSQQW 389
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/66 (53%), Positives = 43/66 (65%)
Query: 134 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFN 193
R FLK L+ YV KL VP VIR R GLI+ARL GA + G+++ FLDAHCECT +
Sbjct: 1 RNFLKRPLESYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSRGQVITFLDAHCECTAGWL 60
Query: 194 EEFLPK 199
E L +
Sbjct: 61 EPLLAR 66
>gi|395838351|ref|XP_003792079.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Otolemur garnettii]
Length = 608
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 190/428 (44%), Positives = 259/428 (60%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSVI R+P +L E++LVDD S + LK LDEYV K
Sbjct: 150 LPVASVVICFYNEAFSALLRTVHSVIDRTPVHLLHEVILVDDDSDFDDLKGELDEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA QA GE+LVFLD+HCE + WL+ L+A + ED+ V
Sbjct: 210 LPGKIKVIRNTKREGLIRGRMIGAAQATGEVLVFLDSHCEVNVMWLQPLLAAIREDQQTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGEEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRQYFHDLGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ K + + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELRKKLGCKSFK 445
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P H P F GR+ H+QT+KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQISGPHAKPQQPIFINRGLSRPKVLQRGRLYHLQTNKCL---VAQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 503 KGGLVVLKACDYGDP-NQIWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDKE 605
Q+WT+ K
Sbjct: 560 QQWTFGKN 567
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/147 (50%), Positives = 104/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK + + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNVLISNRLGYHRDVPDTRNAACKEQSYPTD-LPVASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L E++LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPVHLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A QA GE+LVFLD+HCE +++ + L
Sbjct: 233 AAQATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|327263882|ref|XP_003216746.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Anolis carolinensis]
Length = 536
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 171/348 (49%), Positives = 233/348 (66%), Gaps = 17/348 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAST E+LK LD YV +L
Sbjct: 175 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDASTDEYLKDELDSYVKQL 234
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RVIR R GLI ARLLGA A G++L FLDAHCEC GWLE L++R+AE+ T VV
Sbjct: 235 QI-VRVIRQIERKGLITARLLGASVATGDVLTFLDAHCECFHGWLEPLLSRIAEEPTAVV 293
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + +RKD T P KTP
Sbjct: 294 SPDITTIDLNTFEFSKPIQYGKQHSRGNFDWSLTFGWEAIPQHEKE-RRKDETYPIKTPT 352
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLFAI +AYF H+G+YD++M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 353 FAGGLFAISKAYFEHVGSYDDQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRSKS 412
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMD++ E +++ N +A +++ + RL+L+++L
Sbjct: 413 PHTFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRNQQASQMAREKTYGDLSDRLDLKERL 471
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGR---IRHVQTHKCVEKPLAKG 533
C +F WYL ++P F+P + F+G + +V + KC+ AKG
Sbjct: 472 HCKNFTWYLQTIYPEIFIPDLTPAFYGADHLLINVASKKCLS---AKG 516
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 92/197 (46%), Positives = 116/197 (58%), Gaps = 25/197 (12%)
Query: 52 YQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRT 109
Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS LLRT
Sbjct: 136 YEKHCFNAYASDRISLQRALGPDSRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRT 195
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
V+SV+ SP +LKEI+LVDDAST E+LK LD YV +L + RVIR R GLI ARLL
Sbjct: 196 VYSVLYSSPAILLKEIILVDDASTDEYLKDELDSYVKQLQI-VRVIRQIERKGLITARLL 254
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSM 229
GA A G++L FLDAHCEC FH W L + S I+ P ++
Sbjct: 255 GASVATGDVLTFLDAHCEC-----------------FH--GW---LEPLLSRIAEEPTAV 292
Query: 230 LKEILLVDDASTREFLK 246
+ + D +T EF K
Sbjct: 293 VSPDITTIDLNTFEFSK 309
>gi|344288741|ref|XP_003416105.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Loxodonta africana]
Length = 552
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 192/452 (42%), Positives = 261/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C L+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCNLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S S D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------SDPDDCKLLIKLPKVKCVRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIQGAGIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDS 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
EM +WGGEN EMSFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 EMDIWGGENFEMSFRVWMCGGSLEIIPCSRVGHVFRKKHPYIFPDGNTNTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L+C SF+WYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIENRLSLRKNLQCESFQWYLKNVYPELSIPKDSLI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E K NQ L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLETQKRK---NQEIPNLKLSPCIKIKGEEAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V + +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITFFPG-APVVLVLCKNGDDRQQWT 503
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCNLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S S D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------SDPDDCKLLIKLPKVKCVRNNERQGLVRSRIQG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 AGIAQGTTLTFLDSHCE 203
>gi|351702714|gb|EHB05633.1| Polypeptide N-acetylgalactosaminyltransferase 14 [Heterocephalus
glaber]
Length = 553
Score = 332 bits (852), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 192/452 (42%), Positives = 258/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C L+ LP +SI+I FHNEA S L
Sbjct: 69 RVGDDPYKLYAFNQRESERISSHRAVPDTRHPRCMLLVYHTALPPTSIIITFHNEARSTL 128
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 129 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPDDCKQLVRLPKVKCLRNSERQGLVR 182
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WLE L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 183 SRMRGADIAQGATLTFLDSHCEVNRDWLEPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 242
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHFRW S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 243 ASELRGGFDWSLHFRWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 301
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 302 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 361
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL LR+ L+C SFKWYL +V+P +P D
Sbjct: 362 VWMDEYKQYYYAARPFA-LERPFGNIESRLNLRRNLQCQSFKWYLENVYPELSVPQDSSI 420
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ L PC L Q + +I
Sbjct: 421 QKGNIR--QRQKCLE---SQKQNNQEIPNLRLSPCVKLKGEEAKAQGWAFTYTQQII--Q 473
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G RQ+W+
Sbjct: 474 EELCLSVVTLFPG-APVVLVLCKNGDERQQWS 504
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 91/147 (61%), Gaps = 8/147 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI +R +PD R P+C V++ LP +SI+I FHNEA S LLRT+
Sbjct: 75 YKLYAFNQRESERISSHRAVPDTRHPRCMLLVYHTA-LPPTSIIITFHNEARSTLLRTIR 133
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 134 SVLNRTPMHLIQEIILVDDFS------NDPDDCKQLVRLPKVKCLRNSERQGLVRSRMRG 187
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A+G L FLD+HCE + E L
Sbjct: 188 ADIAQGATLTFLDSHCEVNRDWLEPLL 214
>gi|332243650|ref|XP_003270991.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Nomascus leucogenys]
Length = 527
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 192/437 (43%), Positives = 259/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRT HSVI R+P +L EI+LVDD S + LK
Sbjct: 60 CQEKFYPPDLPSASVVICFYNEAFSALLRTAHSVIDRTPAHLLHEIILVDDDSDFDDLKG 119
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 120 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 179
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ ED+ VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 180 AIREDQHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 237
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 238 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 297
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 298 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 355
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 356 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 413
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 414 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 470
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 471 KCHGSG-GSQQWTFGKN 486
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R C+ K + + LP +S+VI F+NEA+SALLRT H
Sbjct: 33 YQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPD-LPSASVVICFYNEAFSALLRTAH 91
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 92 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 151
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 152 AAHATGEVLVFLDSHCEVNVMWLQPLL 178
>gi|62148926|dbj|BAD93347.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-3 [Rattus
norvegicus]
Length = 633
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 173/362 (47%), Positives = 230/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP + KEI+LVDDAS ++L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILPKEIILVDDASVDDYLHEKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQQERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I R YF HIG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISRDYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRNKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++K+L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTIYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGDKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + FN LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFNGCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP + KEI+LVDDAS ++L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILPKEIILVDDASVDDYLHEKLEEYIKQFSI-VKIVRQQERKGLITARLLGAAVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|397513819|ref|XP_003827205.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Pan paniscus]
Length = 557
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 189/439 (43%), Positives = 260/439 (59%), Gaps = 23/439 (5%)
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 227
LL Q + VFL C TL+ LP +SI+I FHNEA S LLRT+ SVI+R+P
Sbjct: 88 LLQEAQFHLQTQVFLQVRC--TLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVINRTPT 145
Query: 228 SMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEI 286
+++EI+LVDD S + D+ + +P + +R+ R GL+++R+ GA A+G
Sbjct: 146 HLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTT 199
Query: 287 LVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELH 346
L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S G F+W LH
Sbjct: 200 LTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWSLH 259
Query: 347 FRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMS 406
F+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD +M +WGGEN E+S
Sbjct: 260 FQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEIS 318
Query: 407 FRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKF 466
FRVW CGGS+EI PCS V H+FRK PY FP G + N R A VWMDE+ ++Y+
Sbjct: 319 FRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAA 378
Query: 467 NPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKC 525
P A +R V SRL+LRK L+C SFKWYL +V+P +P + G IR Q KC
Sbjct: 379 RPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR--QRQKC 435
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYEND 582
+E ++ NQ + L PC + +Q++ ++ E +CL V
Sbjct: 436 LE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQIL--QEELCLSVITLFPG 490
Query: 583 ISPRVRILACSGFNRQRWT 601
+P V +L +G +RQ+WT
Sbjct: 491 -APVVLVLCKNGDDRQQWT 508
Score = 87.0 bits (214), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 47/111 (42%), Positives = 72/111 (64%), Gaps = 8/111 (7%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C V+ + LP +SI+I FHNEA S LLRT+ SVI+R+P +++EI+LVDD S
Sbjct: 105 RCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIRSVINRTPTHLIREIILVDDFS----- 158
Query: 138 KSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ D+ + +P + +R+ R GL+++R+ GA A+G L FLD+HCE
Sbjct: 159 -NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCE 208
>gi|403307061|ref|XP_003944030.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Saimiri boliviensis boliviensis]
Length = 552
Score = 332 bits (850), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 191/452 (42%), Positives = 262/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTELPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIREIILVDDFS------NDPDDCQQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFHWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +V+P +P +
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLALCKNGDDRQQWT 503
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ E LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCTE-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIREIILVDDFS------NDPDDCQQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|332243648|ref|XP_003270990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Nomascus leucogenys]
Length = 608
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 192/437 (43%), Positives = 259/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+VI F+NEA+SALLRT HSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CQEKFYPPDLPSASVVICFYNEAFSALLRTAHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ ED+ VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 261 AIREDQHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R C+ K + + LP +S+VI F+NEA+SALLRT H
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACQEKFYPPD-LPSASVVICFYNEAFSALLRTAH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|410899503|ref|XP_003963236.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6-like
[Takifugu rubripes]
Length = 618
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS LK L+E+V +
Sbjct: 170 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAVLLKEIILVDDASVAGHLKEQLEEFVLQF 229
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R P R GLI ARLLGA +A+GE+L FLDAHCEC GWLE L+AR+ E+ T VV
Sbjct: 230 KI-VRVLRQPERKGLITARLLGASEAQGEVLTFLDAHCECFHGWLEPLLARIVEEPTAVV 288
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I +F + + S + G F+W L F W + + RKD T P KTP
Sbjct: 289 SPEITTIDLESFQFNKPAPSSHAFNRGNFDWSLTFGWEQIPEAARKL-RKDETCPVKTPT 347
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF HIG YD++M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 348 FAGGLFSILKTYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 407
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA---EKQRDKQAVRSRLELRKQL 489
P++FP G +EV+ N R+A VWMD++ + +++ N A K+ + + RL LR++L
Sbjct: 408 PHTFPKG-TEVITRNQVRLAEVWMDDYKKIFYRRNKNAAKMAKENNYGDISERLNLRERL 466
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F WYL V+P F+P + FG I++ + C L G N P + C
Sbjct: 467 HCKNFSWYLNTVYPEAFVPDLTPDRFGAIKNQGSKTC----LDVGENNLGGKPVMMYTCH 522
Query: 549 HL 550
++
Sbjct: 523 NM 524
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 94/134 (70%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R+L D R +C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 136 FNQFASDRISLSRSLGEDTRPRECVERKFPRCPALPTTSVIIVFHNEAWSTLLRTVYSVL 195
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS LK L+E+V + + RV+R P R GLI ARLLGA +A
Sbjct: 196 HTSPAVLLKEIILVDDASVAGHLKEQLEEFVLQFKI-VRVLRQPERKGLITARLLGASEA 254
Query: 175 EGEILVFLDAHCEC 188
+GE+L FLDAHCEC
Sbjct: 255 QGEVLTFLDAHCEC 268
>gi|426335183|ref|XP_004029112.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
4 [Gorilla gorilla gorilla]
Length = 557
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 188/439 (42%), Positives = 260/439 (59%), Gaps = 23/439 (5%)
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 227
LL Q + VFL C TL+ LP +SI+I FHNEA S LLRT+ SV++R+P
Sbjct: 88 LLQEAQFHLQTQVFLQVRC--TLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPT 145
Query: 228 SMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEI 286
+++EI+LVDD S + D+ + +P + +R+ R GL+++R+ GA A+G
Sbjct: 146 HLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTT 199
Query: 287 LVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELH 346
L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S G F+W LH
Sbjct: 200 LTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWSLH 259
Query: 347 FRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMS 406
F+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD +M +WGGEN E+S
Sbjct: 260 FQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEIS 318
Query: 407 FRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKF 466
FRVW CGGS+EI PCS V H+FRK PY FP G + N R A VWMDE+ ++Y+
Sbjct: 319 FRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAA 378
Query: 467 NPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKC 525
P A +R V SRL+LRK L+C SFKWYL +V+P +P + G IR Q KC
Sbjct: 379 RPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR--QRQKC 435
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYEND 582
+E ++ NQ + L PC + +Q++ ++ E +CL V
Sbjct: 436 LE---SQRQNNQETPNLKLSPCAKVKGEDAKSQIWAFTYTQQIL--QEELCLSVITLFPG 490
Query: 583 ISPRVRILACSGFNRQRWT 601
+P V +L +G +RQ+WT
Sbjct: 491 -APVVLVLCKNGDDRQQWT 508
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 72/111 (64%), Gaps = 8/111 (7%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C V+ + LP +SI+I FHNEA S LLRT+ SV++R+P +++EI+LVDD S
Sbjct: 105 RCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFS----- 158
Query: 138 KSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ D+ + +P + +R+ R GL+++R+ GA A+G L FLD+HCE
Sbjct: 159 -NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCE 208
>gi|195584006|ref|XP_002081807.1| GD25523 [Drosophila simulans]
gi|194193816|gb|EDX07392.1| GD25523 [Drosophila simulans]
Length = 650
Score = 331 bits (849), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 196/494 (39%), Positives = 275/494 (55%), Gaps = 53/494 (10%)
Query: 116 RSPRSMLKEIL-LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R P+ M +E+ VDD T K++ ++YV+ L R + P
Sbjct: 154 RLPKEMSEEMKKAVDDGWT----KNAFNQYVSDLISVHRTLPDP---------------- 193
Query: 175 EGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEIL 234
DA C+ + + LPK+ ++I FHNEAW+ LLRTVHSV+ RSP ++ +I+
Sbjct: 194 -------RDAWCKDEARYLTD-LPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKII 245
Query: 235 LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC 294
LVDD S LK L++Y A ++IR R GLI+AR+LGA A+ +L +LD+HC
Sbjct: 246 LVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHC 304
Query: 295 ECTLGWLENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTY 352
ECT GWLE L+ R+A + T VVCPVID+ISD T + Y S ++ G F+W L F W+
Sbjct: 305 ECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPV 364
Query: 353 GSSDAIIKRKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQ 411
+ KR + T EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+ W
Sbjct: 365 PERER--KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWM 422
Query: 412 CGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE 471
CGG++EI PCSHV H+FRK SPY + GV+ VL N R+A VWMDE++++Y ++
Sbjct: 423 CGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYY--YHRIGN 479
Query: 472 KQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKP 529
+ D V R +LR LKC SFKWYL +++P F+P D G IR++ C++ P
Sbjct: 480 DKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIRNLGYGGRTCLDAP 539
Query: 530 LAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRI 589
K +A G PC H Q +++ ++ D CLD + V +
Sbjct: 540 AGKKHQKKAVG---TYPC-HRQGGNQYWMLSKAGEIRRDDS--CLDYAGKD------VTL 587
Query: 590 LAC-SGFNRQRWTY 602
C G Q WTY
Sbjct: 588 FGCHGGKGNQFWTY 601
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 93/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK + LPK+ ++I FHNEAW+ LLRTVHSV+
Sbjct: 174 NAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVL 233
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 234 DRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 292
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 293 KSPVLTYLDSHCECTEGWLEPLLDR 317
>gi|355565588|gb|EHH22017.1| hypothetical protein EGK_05198 [Macaca mulatta]
Length = 557
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 188/439 (42%), Positives = 260/439 (59%), Gaps = 23/439 (5%)
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 227
LL Q + VFL C TL+ LP +SI+I FHNEA S LLRT+ SV++R+P
Sbjct: 88 LLQEAQFHLQTQVFLQVRC--TLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPM 145
Query: 228 SMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEI 286
+++EI+LVDD S + D+ + +P + +R+ R GL+++R+ GA A+G
Sbjct: 146 HLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTT 199
Query: 287 LVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELH 346
L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S G F+W LH
Sbjct: 200 LTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWSLH 259
Query: 347 FRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMS 406
F+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD +M +WGGEN E+S
Sbjct: 260 FQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEIS 318
Query: 407 FRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKF 466
FRVW CGGS+EI PCS V H+FRK PY FP G + N R A VWMDE+ ++Y+
Sbjct: 319 FRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAA 378
Query: 467 NPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKC 525
P A +R V SRL+LRK L+C SFKWYL +V+P +P + G IR Q KC
Sbjct: 379 RPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR--QRQKC 435
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYEND 582
+E ++ NQ + L PC + +Q++ ++ E +CL V
Sbjct: 436 LE---SQRQNNQETANLKLSPCAKVKGEDAKSQVWAFTYTQQIL--QEELCLSVITLFPG 490
Query: 583 ISPRVRILACSGFNRQRWT 601
+P V +L +G +RQ+WT
Sbjct: 491 -APVVLVLCKNGDDRQQWT 508
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 72/111 (64%), Gaps = 8/111 (7%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C V+ + LP +SI+I FHNEA S LLRT+ SV++R+P +++EI+LVDD S
Sbjct: 105 RCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFS----- 158
Query: 138 KSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ D+ + +P + +R+ R GL+++R+ GA A+G L FLD+HCE
Sbjct: 159 -NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCE 208
>gi|345484986|ref|XP_003425168.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Nasonia vitripennis]
Length = 610
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 180/419 (42%), Positives = 253/419 (60%), Gaps = 34/419 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S LK L++Y+ +
Sbjct: 170 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYSDMPHLKRQLEDYM--M 227
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P +++R+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A ++T V
Sbjct: 228 NYPKVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGWLEPLLDRIARNQTTV 287
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 288 VCPVIDVIDDTTLEY------HWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEP 340
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLFAIDR +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 341 VWSPTMAGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 400
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK SPY + GV+ VL N R++ VW+DE+A++Y++ + + V R LRK
Sbjct: 401 FRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQR--IGHDKGNYGDVSDRKALRK 457
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
L C SFKWYL +++P F+P + G IRH+ + C++ P ++QA G C
Sbjct: 458 NLGCKSFKWYLDNIYPELFIPGEAVASGEIRHLASRLCIDSPGNPEDLHQAVG---FYEC 514
Query: 548 THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
H Q +++ T I DES CLD + V + C G Q+WTY+ +
Sbjct: 515 -HNQGGNQYWMLS-KTGEIRRDES-CLDYSGTD------VILYPCHGSKGNQQWTYNTQ 564
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 98/144 (68%), Gaps = 3/144 (2%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+R+LPD R P CK ++ LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 136 NAFNQYASDLISVHRSLPDPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVL 195
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S LK L++Y+ ++ P +++R+ R GLI+ARLLGA
Sbjct: 196 DRSPDHLIQEIILVDDYSDMPHLKRQLEDYM--MNYPKVKILRASKREGLIRARLLGAAM 253
Query: 174 AEGEILVFLDAHCECTLVFNEEFL 197
A+ +L +LD+HCECT + E L
Sbjct: 254 AKAPVLTYLDSHCECTEGWLEPLL 277
>gi|195124241|ref|XP_002006602.1| GI18492 [Drosophila mojavensis]
gi|193911670|gb|EDW10537.1| GI18492 [Drosophila mojavensis]
Length = 670
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 185/427 (43%), Positives = 257/427 (60%), Gaps = 27/427 (6%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 242
DA C+ + ++ LPK+ ++I FHNEAWS L+RTVHSV+ RSP ++ EI+LVDD S
Sbjct: 215 DAWCKDSALYLSN-LPKTDVIICFHNEAWSVLIRTVHSVLDRSPPELIGEIILVDDFSDM 273
Query: 243 EFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
LK L++Y A S P +++R P R GLI+ARLLGA A+ ++ +LD+HCEC GWL
Sbjct: 274 PHLKKQLEDYFA--SYPKVKIVRGPQREGLIRARLLGAEYAKSPVITYLDSHCECAEGWL 331
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+A + T VVCPVID+I D T F Y S ++ G F+W L F W+ +
Sbjct: 332 EPLLDRIARNSTTVVCPVIDVIDDTTLEFHYRDSSGVNVGGFDWNLQFSWHAVPEREK-- 389
Query: 360 KRKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEI 418
KR + T EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+ W CGG++EI
Sbjct: 390 KRHNSTSEPVYSPTMAGGLFSIDRKFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEI 449
Query: 419 APCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCSHV H+FRK SPY + GV+ VL N R+A VWMD++A++Y++ + D
Sbjct: 450 VPCSHVGHIFRKRSPYKWRTGVN-VLKKNSVRLAEVWMDDYAKYYYQRI--GMDKGDFGD 506
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMN 536
V R +LR+ L+C SFKWYL +V+P F+P D G +R++ C++ P K +
Sbjct: 507 VSERKKLREDLQCKSFKWYLDNVYPELFIPGDAVANGEMRNLGYGGRTCLDSPSGKRYLK 566
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF- 595
+ P L PC H Q ++ T I D++ CLD + V + C G
Sbjct: 567 K---PVGLYPC-HRQGGNQYWMFS-KTGEIRRDQA-CLDYAGKD------VILFGCHGSK 614
Query: 596 NRQRWTY 602
Q WTY
Sbjct: 615 GNQFWTY 621
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 93/146 (63%), Gaps = 3/146 (2%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V R+LPD R CK LPK+ ++I FHNEAWS L+RTVHSV+
Sbjct: 194 NAFNQYVSDLISVRRSLPDPRDAWCKDSALYLSNLPKTDVIICFHNEAWSVLIRTVHSVL 253
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP ++ EI+LVDD S LK L++Y A S P +++R P R GLI+ARLLGA
Sbjct: 254 DRSPPELIGEIILVDDFSDMPHLKKQLEDYFA--SYPKVKIVRGPQREGLIRARLLGAEY 311
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPK 199
A+ ++ +LD+HCEC + E L +
Sbjct: 312 AKSPVITYLDSHCECAEGWLEPLLDR 337
>gi|432096766|gb|ELK27344.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Myotis
davidii]
Length = 507
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 189/451 (41%), Positives = 263/451 (58%), Gaps = 24/451 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 23 RVGDDPYKLHAFNQRESERISSNRAIPDTRHLRCTLLMYCRDLPPTSIIITFHNEARSTL 82
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 274
LRT+ SV++R+P +++KEI+LVDD S +E + V + +R+ R GL+++
Sbjct: 83 LRTIRSVLNRTPMNLIKEIILVDDFSND---PGDCEELIKLPKV--KCLRNDQREGLVRS 137
Query: 275 RLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSF 334
R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF+Y+ S
Sbjct: 138 RIRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFSYIESA 197
Query: 335 ELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEE 394
G F+W LHF+W S + +R D +EP +TP +AGGLF +D+++F +G YD +
Sbjct: 198 TELRGGFDWSLHFQWEQL-SPEQKAQRLDPSEPIRTPIIAGGLFVMDKSWFNFLGKYDMD 256
Query: 395 MQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALV 454
M +WGGEN EMSFRVW CGGS+EI PCS V H+FRK PY FP G + N R A V
Sbjct: 257 MDIWGGENFEMSFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEV 316
Query: 455 WMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF- 513
WMDE+ ++++ P A +R + SRL+LR++L+C SFKWYL +V+P +P D
Sbjct: 317 WMDEYKQYFYAARPFA-LERPFGDIESRLDLRRKLRCQSFKWYLENVYPELRVPKDSSIQ 375
Query: 514 FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDE 570
G IR Q KC+E K NQ L PC + +Q++ +I E
Sbjct: 376 KGPIR--QRQKCLESQRQK---NQEVSNLKLRPCVKIKGEDAKSQIWAFTYTQQIIQ--E 428
Query: 571 SVCLDVPEYENDISPRVRILACSGFNRQRWT 601
+CL V + +P V +L +G ++Q+WT
Sbjct: 429 ELCLSVITFFPG-APVVLVLCKNGDDKQQWT 458
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 57/136 (41%), Positives = 87/136 (63%), Gaps = 6/136 (4%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+++ FN S+RI NR +PD R +C ++ + LP +SI+I FHNEA S LLRT+
Sbjct: 29 YKLHAFNQRESERISSNRAIPDTRHLRCTLLMYCRD-LPPTSIIITFHNEARSTLLRTIR 87
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV++R+P +++KEI+LVDD S +E + V + +R+ R GL+++R+ GA
Sbjct: 88 SVLNRTPMNLIKEIILVDDFSND---PGDCEELIKLPKV--KCLRNDQREGLVRSRIRGA 142
Query: 172 RQAEGEILVFLDAHCE 187
A+G L FLD+HCE
Sbjct: 143 DVAQGTTLTFLDSHCE 158
>gi|62630154|gb|AAX88899.1| unknown [Homo sapiens]
Length = 452
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 181/419 (43%), Positives = 253/419 (60%), Gaps = 21/419 (5%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
CTL+ LP +SI+I FHNEA S LLRT+ SV++R+P +++EI+LVDD S +
Sbjct: 1 CTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFS------N 54
Query: 248 SLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
D+ + +P + +R+ R GL+++R+ GA A+G L FLD+HCE WL+ L+
Sbjct: 55 DPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLH 114
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
RV ED TRVVCPVIDII+ TF Y+ S G F+W LHF+W S + +R D TE
Sbjct: 115 RVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQL-SPEQKARRLDPTE 173
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P +TP +AGGLF ID+A+F ++G YD +M +WGGEN E+SFRVW CGGS+EI PCS V H
Sbjct: 174 PIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGH 233
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY FP G + N R A VWMDE+ ++Y+ P A +R V SRL+LR
Sbjct: 234 VFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAARPFA-LERPFGNVESRLDLR 292
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
K L+C SFKWYL +++P +P + G IR Q KC+E ++ NQ + L
Sbjct: 293 KNLRCQSFKWYLENIYPELSIPKESSIQKGNIR--QRQKCLE---SQRQNNQETPNLKLS 347
Query: 546 PCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
PC + +Q++ ++ E +CL V +P V +L +G +RQ+WT
Sbjct: 348 PCAKVKGEDAKSQVWAFTYTQQILQ--EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 403
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 67/100 (67%), Gaps = 7/100 (7%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP +SI+I FHNEA S LLRT+ SV++R+P +++EI+LVDD S + D+ +
Sbjct: 10 LPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFS------NDPDDCKQLI 63
Query: 149 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+P + +R+ R GL+++R+ GA A+G L FLD+HCE
Sbjct: 64 KLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCE 103
>gi|24654219|ref|NP_725602.1| CG30463, isoform A [Drosophila melanogaster]
gi|161077158|ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster]
gi|51316018|sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9
gi|21627105|gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster]
gi|157400367|gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster]
Length = 650
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 198/496 (39%), Positives = 274/496 (55%), Gaps = 57/496 (11%)
Query: 116 RSPRSMLKEIL-LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R P+ M E+ VDD T K++ ++YV+ L R + P
Sbjct: 154 RLPKEMSDEMKKAVDDGWT----KNAFNQYVSDLISVHRTLPDP---------------- 193
Query: 175 EGEILVFLDAHC--ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 232
DA C E + N LPK+ ++I FHNEAW+ LLRTVHSV+ RSP ++ +
Sbjct: 194 -------RDAWCKDEARYLTN---LPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGK 243
Query: 233 ILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A+ +L +LD+
Sbjct: 244 IILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDS 302
Query: 293 HCECTLGWLENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWY 350
HCECT GWLE L+ R+A + T VVCPVID+ISD T + Y S ++ G F+W L F W+
Sbjct: 303 HCECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWH 362
Query: 351 TYGSSDAIIKRKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRV 409
+ KR + T EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+
Sbjct: 363 PVPERER--KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKT 420
Query: 410 WQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE 469
W CGG++EI PCSHV H+FRK SPY + GV+ VL N R+A VWMDE++++Y ++
Sbjct: 421 WMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYY--YHRI 477
Query: 470 AEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVE 527
+ D V R +LR LKC SFKWYL +++P F+P D G IR++ C++
Sbjct: 478 GNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIRNLGYGGRTCLD 537
Query: 528 KPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRV 587
P K +A G PC H Q +++ ++ D CLD + V
Sbjct: 538 APAGKKHQKKAVG---TYPC-HRQGGNQYWMLSKAGEIRRDDS--CLDYAGKD------V 585
Query: 588 RILAC-SGFNRQRWTY 602
+ C G Q WTY
Sbjct: 586 TLFGCHGGKGNQFWTY 601
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 93/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK + LPK+ ++I FHNEAW+ LLRTVHSV+
Sbjct: 174 NAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVL 233
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 234 DRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 292
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 293 KSPVLTYLDSHCECTEGWLEPLLDR 317
>gi|74004468|ref|XP_535940.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 isoform
1 [Canis lupus familiaris]
Length = 632
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 229/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 242
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 243 SI-VKIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 301
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 302 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQ-RRKDETYPIKTPT 360
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 361 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 420
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 421 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 479
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ + C L G NQ P L C
Sbjct: 480 QCKNFTWYLNTIYPEAYVPDLNPVISGYIKSIGQPLC----LDVGENNQGGKPLILYTCH 535
Query: 549 HL 550
L
Sbjct: 536 GL 537
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 149 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 208
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 209 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQKERKGLITARLLGATVA 267
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 268 TAETLTFLDAHCEC 281
>gi|71896101|ref|NP_001026749.1| polypeptide N-acetylgalactosaminyltransferase 6 [Gallus gallus]
gi|60098353|emb|CAH65007.1| hypothetical protein RCJMB04_1b1 [Gallus gallus]
Length = 621
Score = 331 bits (848), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 174/359 (48%), Positives = 236/359 (65%), Gaps = 15/359 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVFHNEAWS LLRTV+SV+ SP ++L+EI+LVDDAST E+LK LD YV +L
Sbjct: 175 LPTTSVVIVFHNEAWSTLLRTVYSVLHASPAALLREIILVDDASTDEYLKDELDRYVKQL 234
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R R GLI ARLLGA A GE+L FLDAHCEC GWLE L++R+AE+ T VV
Sbjct: 235 QI-VRVVRQAERKGLITARLLGASVASGEVLTFLDAHCECFHGWLEPLLSRIAEEPTAVV 293
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + +RKD T P K+P
Sbjct: 294 SPDITTIDLNTFEFSKPVQYGKQHSRGNFDWSLTFGWEVVPPRERQ-RRKDETVPIKSPT 352
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLFAI R+YF HIG+YD++M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 353 FAGGLFAISRSYFEHIGSYDDQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRSKS 412
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA---EKQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMD++ E +++ N +A +++ + R LR++L
Sbjct: 413 PHTFPKG-TQVISRNQVRLAEVWMDDYKEIFYRRNQQAAQMAREKTYGDITERRRLRERL 471
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
C +F WYL +V+P F+P ++ +G I++ T+ C L G N P + PC
Sbjct: 472 HCKNFTWYLQNVYPEMFVPDLNPTSYGAIKNEGTNSC----LDVGENNHGGKPLIMYPC 526
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 92/202 (45%), Positives = 120/202 (59%), Gaps = 25/202 (12%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S+VIVFHNEAWS
Sbjct: 131 EKERGYEKHCFNAFASDRISLQRALGPDSRPPECIDQKFKRCPPLPTTSVVIVFHNEAWS 190
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP ++L+EI+LVDDAST E+LK LD YV +L + RV+R R GLI
Sbjct: 191 TLLRTVYSVLHASPAALLREIILVDDASTDEYLKDELDRYVKQLQI-VRVVRQAERKGLI 249
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 224
ARLLGA A GE+L FLDAHCEC FH W L + S I+
Sbjct: 250 TARLLGASVASGEVLTFLDAHCEC-----------------FH--GW---LEPLLSRIAE 287
Query: 225 SPRSMLKEILLVDDASTREFLK 246
P +++ + D +T EF K
Sbjct: 288 EPTAVVSPDITTIDLNTFEFSK 309
>gi|405975887|gb|EKC40420.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 653
Score = 330 bits (847), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 175/359 (48%), Positives = 229/359 (63%), Gaps = 15/359 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +SI+I FHNEAWS LLRTVHS+++RSP ++ EILLVDD S EFLK+ LDEY+A
Sbjct: 237 ENLPSTSIIICFHNEAWSVLLRTVHSILNRSPLHLIHEILLVDDFSDLEFLKTELDEYLA 296
Query: 255 KLSV-PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
+ RV+R+P R GLI+ARLLG +A GEIL FLD+H EC GWLE L+ARVAE+ T
Sbjct: 297 ENGKGKVRVVRTPKREGLIRARLLGYGEATGEILTFLDSHIECFPGWLEPLLARVAENHT 356
Query: 314 RVVCPVIDIISDVTFAYVRSFELHWGAFN-WELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
RVV PVID+ISD + A + + G F + F W T ++ + +EP+KTP
Sbjct: 357 RVVAPVIDMISDRSLACGGNEIGNLGTFEIANMGFNWLTLNKTEKA--KHGQSEPWKTPT 414
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
+AGGLF+I+RAYF +G YD M +WGGENLE+SFRVW CGGS+EI PCSHVAHLFR S
Sbjct: 415 IAGGLFSINRAYFTKMGTYDHGMDIWGGENLEISFRVWMCGGSLEIHPCSHVAHLFRSMS 474
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELRKQLKC 491
PY + ++L N R A VWMDE+ Y+ + N + D V R +LR +L C
Sbjct: 475 PYKWGKSFRDILRKNAVRTAEVWMDEYKHIYYERLNYDLGDYGD---VSERKDLRNRLGC 531
Query: 492 HSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPA-SLLPCTH 549
SF WYL + P LP + G +R+++ C++ +M +G +PC H
Sbjct: 532 KSFGWYLKTMLPDMKLPETALYSGEVRNMEKGMCLD------TMGTTAGNKFQAIPCHH 584
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 80/140 (57%), Positives = 102/140 (72%), Gaps = 2/140 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+L++ N FN SD I V+R LPD+R P CK + + E LP +SI+I FHNEAWS LLRT
Sbjct: 201 ELFKRNSFNQYASDMISVHRRLPDIRNPSCKQETYPEN-LPSTSIIICFHNEAWSVLLRT 259
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSV-PTRVIRSPGRVGLIKARL 168
VHS+++RSP ++ EILLVDD S EFLK+ LDEY+A+ RV+R+P R GLI+ARL
Sbjct: 260 VHSILNRSPLHLIHEILLVDDFSDLEFLKTELDEYLAENGKGKVRVVRTPKREGLIRARL 319
Query: 169 LGARQAEGEILVFLDAHCEC 188
LG +A GEIL FLD+H EC
Sbjct: 320 LGYGEATGEILTFLDSHIEC 339
>gi|395844920|ref|XP_003795196.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Otolemur garnettii]
Length = 633
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 169/362 (46%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L++Y+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLEKYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGGNHNRGNFDWSLSFGWESLPDQEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG+YD+EM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKKYFEYIGSYDDEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ + C L G NQ P + C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSIGKPLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L++Y+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEKYIKQFSI-VKIVRQKERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|109102570|ref|XP_001104659.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 isoform
1 [Macaca mulatta]
Length = 557
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 188/439 (42%), Positives = 260/439 (59%), Gaps = 23/439 (5%)
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 227
LL Q + VFL C TL+ LP +SI+I FHNEA S LLRT+ SV++R+P
Sbjct: 88 LLQEAQFHLQTQVFLQVRC--TLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPM 145
Query: 228 SMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEI 286
+++EI+LVDD S + D+ + +P + +R+ R GL+++R+ GA A+G
Sbjct: 146 HLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTT 199
Query: 287 LVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELH 346
L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S G F+W LH
Sbjct: 200 LTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWSLH 259
Query: 347 FRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMS 406
F+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD +M +WGGEN E+S
Sbjct: 260 FQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEIS 318
Query: 407 FRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKF 466
FRVW CGGS+EI PCS V H+FRK PY FP G + N R A VWMDE+ ++Y+
Sbjct: 319 FRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAA 378
Query: 467 NPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKC 525
P A +R V SRL+LRK L+C SFKWYL +V+P +P + G IR Q KC
Sbjct: 379 RPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR--QRQKC 435
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYEND 582
+E ++ NQ + L PC + +Q++ ++ E +CL V
Sbjct: 436 LE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQIL--QEELCLSVITLFPG 490
Query: 583 ISPRVRILACSGFNRQRWT 601
+P V +L +G +RQ+WT
Sbjct: 491 -APVVLVLCKNGDDRQQWT 508
Score = 85.5 bits (210), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 72/111 (64%), Gaps = 8/111 (7%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C V+ + LP +SI+I FHNEA S LLRT+ SV++R+P +++EI+LVDD S
Sbjct: 105 RCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFS----- 158
Query: 138 KSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ D+ + +P + +R+ R GL+++R+ GA A+G L FLD+HCE
Sbjct: 159 -NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCE 208
>gi|296490594|tpg|DAA32707.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 3 (GalNAc-T3) [Bos
taurus]
gi|440907905|gb|ELR57989.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Bos grunniens
mutus]
Length = 633
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 230/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W T + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWETLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGRPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVYSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQKERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|301783121|ref|XP_002926975.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like
[Ailuropoda melanoleuca]
gi|281344477|gb|EFB20061.1| hypothetical protein PANDA_016676 [Ailuropoda melanoleuca]
Length = 632
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 229/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 183 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 242
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 243 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 301
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 302 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQ-RRKDETYPIKTPT 360
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 361 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 420
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 421 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 479
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 480 QCKNFTWYLNTIYPEAYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 535
Query: 549 HL 550
L
Sbjct: 536 GL 537
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 149 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 208
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 209 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQRERKGLITARLLGATVA 267
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 268 TAETLTFLDAHCEC 281
>gi|355751232|gb|EHH55487.1| hypothetical protein EGM_04701, partial [Macaca fascicularis]
Length = 516
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 188/439 (42%), Positives = 260/439 (59%), Gaps = 23/439 (5%)
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 227
LL Q + VFL C TL+ LP +SI+I FHNEA S LLRT+ SV++R+P
Sbjct: 47 LLQEAQFHLQTQVFLQVRC--TLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPM 104
Query: 228 SMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEI 286
+++EI+LVDD S + D+ + +P + +R+ R GL+++R+ GA A+G
Sbjct: 105 HLIREIILVDDFS------NDPDDCKQLIRLPKVKCLRNNERQGLVRSRIRGADIAQGTT 158
Query: 287 LVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELH 346
L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S G F+W LH
Sbjct: 159 LTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWSLH 218
Query: 347 FRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMS 406
F+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD +M +WGGEN E+S
Sbjct: 219 FQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEIS 277
Query: 407 FRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKF 466
FRVW CGGS+EI PCS V H+FRK PY FP G + N R A VWMDE+ ++Y+
Sbjct: 278 FRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAA 337
Query: 467 NPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKC 525
P A +R V SRL+LRK L+C SFKWYL +V+P +P + G IR Q KC
Sbjct: 338 RPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENVYPELSIPKESSIQKGNIR--QRQKC 394
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYEND 582
+E ++ NQ + L PC + +Q++ ++ E +CL V
Sbjct: 395 LE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQIL--QEELCLSVITLFPG 449
Query: 583 ISPRVRILACSGFNRQRWT 601
+P V +L +G +RQ+WT
Sbjct: 450 -APVVLVLCKNGDDRQQWT 467
Score = 85.5 bits (210), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 44/100 (44%), Positives = 67/100 (67%), Gaps = 7/100 (7%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP +SI+I FHNEA S LLRT+ SV++R+P +++EI+LVDD S + D+ +
Sbjct: 74 LPPTSIIITFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFS------NDPDDCKQLI 127
Query: 149 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+P + +R+ R GL+++R+ GA A+G L FLD+HCE
Sbjct: 128 RLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCE 167
>gi|195057673|ref|XP_001995302.1| GH22705 [Drosophila grimshawi]
gi|193899508|gb|EDV98374.1| GH22705 [Drosophila grimshawi]
Length = 693
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 178/414 (42%), Positives = 251/414 (60%), Gaps = 26/414 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+ ++I FHNEAWS LLRTVHSV+ RSP ++ EI+LVDD S LK L++Y A
Sbjct: 217 LPKTDVIICFHNEAWSVLLRTVHSVLDRSPSELIGEIILVDDYSDMTHLKKKLEDYFADY 276
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +++R P R GLI+ARLLGA+ A+ ++ +LD+HCEC GWLE L+ R+A + T VV
Sbjct: 277 PM-VKIVRGPQREGLIRARLLGAKYAKSPVITYLDSHCECAEGWLEPLLDRIARNSTTVV 335
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT-EPFKTPAM 373
CPVID+I D T F Y S ++ G F+W L F W++ + KR + T EP +P M
Sbjct: 336 CPVIDVIDDATLEFHYRDSSGVNVGGFDWNLQFSWHSVPEREK--KRHNSTSEPVYSPTM 393
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+IDR +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 394 AGGLFSIDREFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 453
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y + GV+ VL N R+A VWMD+++++Y++ + D V R +LR+ L+C S
Sbjct: 454 YKWRTGVN-VLKKNSVRLAEVWMDDYSKYYYQ--RIGMDKGDFGDVSDRKKLREDLQCKS 510
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLP 551
F+WYL ++P F+P + G IR++ C++ P K ++ +A G L PC H
Sbjct: 511 FQWYLDTIYPELFIPGNAVANGEIRNLGYGGRTCLDSPSGKRNLKKAVG---LYPC-HKQ 566
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG-FNRQRWTYDK 604
Q+ + + ++CLD D SP V + C G Q W K
Sbjct: 567 GGNQIRNI---------NTNMCLDAMLKNEDESP-VGVYECHGQGGHQYWMLSK 610
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 94/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+R+LPD R CK LPK+ ++I FHNEAWS LLRTVHSV+
Sbjct: 183 NAFNQYVSDLISVHRSLPDPRDAWCKDSARYLSNLPKTDVIICFHNEAWSVLLRTVHSVL 242
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ EI+LVDD S LK L++Y A + +++R P R GLI+ARLLGA+ A
Sbjct: 243 DRSPSELIGEIILVDDYSDMTHLKKKLEDYFADYPM-VKIVRGPQREGLIRARLLGAKYA 301
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ ++ +LD+HCEC + E L +
Sbjct: 302 KSPVITYLDSHCECAEGWLEPLLDR 326
>gi|359465583|ref|NP_001240755.1| polypeptide N-acetylgalactosaminyltransferase 14 isoform 2 [Homo
sapiens]
gi|10434341|dbj|BAB14227.1| unnamed protein product [Homo sapiens]
gi|119620892|gb|EAX00487.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_b [Homo sapiens]
Length = 557
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 187/439 (42%), Positives = 260/439 (59%), Gaps = 23/439 (5%)
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 227
LL Q + VFL C TL+ LP +SI+I FHNEA S LLRT+ SV++R+P
Sbjct: 88 LLQEAQFHLQTQVFLQVRC--TLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPT 145
Query: 228 SMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEI 286
+++EI+LVDD S + D+ + +P + +R+ R GL+++R+ GA A+G
Sbjct: 146 HLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTT 199
Query: 287 LVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELH 346
L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S G F+W LH
Sbjct: 200 LTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTYIESASELRGGFDWSLH 259
Query: 347 FRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMS 406
F+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD +M +WGGEN E+S
Sbjct: 260 FQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEIS 318
Query: 407 FRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKF 466
FRVW CGGS+EI PCS V H+FRK PY FP G + N R A VWMDE+ ++Y+
Sbjct: 319 FRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAEVWMDEYKQYYYAA 378
Query: 467 NPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKC 525
P A +R V SRL+LRK L+C SFKWYL +++P +P + G IR Q KC
Sbjct: 379 RPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKESSIQKGNIR--QRQKC 435
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYEND 582
+E ++ NQ + L PC + +Q++ ++ E +CL V
Sbjct: 436 LE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQIL--QEELCLSVITLFPG 490
Query: 583 ISPRVRILACSGFNRQRWT 601
+P V +L +G +RQ+WT
Sbjct: 491 -APVVLVLCKNGDDRQQWT 508
Score = 85.9 bits (211), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/111 (41%), Positives = 72/111 (64%), Gaps = 8/111 (7%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C V+ + LP +SI+I FHNEA S LLRT+ SV++R+P +++EI+LVDD S
Sbjct: 105 RCTLLVYCTD-LPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFS----- 158
Query: 138 KSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ D+ + +P + +R+ R GL+++R+ GA A+G L FLD+HCE
Sbjct: 159 -NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCE 208
>gi|300797404|ref|NP_001179787.1| polypeptide N-acetylgalactosaminyltransferase 3 [Bos taurus]
Length = 633
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 230/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQKERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W T + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWETLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKDYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGRPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVYSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQKERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|118085566|ref|XP_418541.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Gallus
gallus]
Length = 608
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 188/427 (44%), Positives = 256/427 (59%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP +S++I F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK L EYV +
Sbjct: 150 LPFASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDNSELDDLKKDLVEYVKTR 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L T+++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+ + EDR V
Sbjct: 210 LPKTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHCEVNEMWLQPLLTPIKEDRRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLTYSSSPVVR-GGFNWGLHFKWDLVPLSE-LEGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ E YF PE + R+ + R+ELRK+L C SFK
Sbjct: 388 SPGGQDTMAHNSL-RLAHVWMDEYKEQYFALRPEL-RTRNYGNITDRVELRKRLNCKSFK 445
Query: 496 WYLTHVWPHH-------------FLPMDDKF-----FGRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P F+ K GR+ H+QT+KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQVSGPNAKAPQPVFINRAQKRPKIIQRGRLYHLQTNKCL---VAQGHPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G + C + Q++V +LI + +CLDV E + PR+ SG
Sbjct: 503 KGGLVVVRECDYNDQ-NQVWVYNEDHELILNN-LLCLDVSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDK 604
Q+WT+ K
Sbjct: 560 QQWTFGK 566
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 111/171 (64%), Gaps = 13/171 (7%)
Query: 29 VGHYLGDSVDG--------GLHSNLSDAQQL---YQINRFNLLVSDRIPVNRTLPDVRKP 77
+G+++ D V G G+ N D + YQ + FN+L+S+R+ +R +PD R
Sbjct: 80 IGNHVQDPVKGERDFSPEMGMIFNEEDQEVRDLGYQKHAFNMLISNRLGYHREVPDTRDA 139
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
KC+ K + + LP +S++I F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + L
Sbjct: 140 KCREKSYPSD-LPFASVIICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDNSELDDL 198
Query: 138 KSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
K L EYV +L T+++R+ R GLI+ R++GA A G++LVFLD+HCE
Sbjct: 199 KKDLVEYVKTRLPKTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHCE 249
>gi|417403505|gb|JAA48553.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 633
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 229/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKE++LVDDAS E+L L+EYV +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVYSVLYSSPAVLLKEVILVDDASVDEYLHDKLEEYVKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRKRKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + +RKD T P KTP
Sbjct: 303 SPDIASIDMNTFEFNKPSPYGINHNRGNFDWSLSFGWEALPDHERQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNNIYPEAYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVYSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKE++LVDDAS E+L L+EYV + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAVLLKEVILVDDASVDEYLHDKLEEYVKQFSI-VKIVRQRKRKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|195488108|ref|XP_002092174.1| GE14045 [Drosophila yakuba]
gi|194178275|gb|EDW91886.1| GE14045 [Drosophila yakuba]
Length = 684
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 198/498 (39%), Positives = 275/498 (55%), Gaps = 59/498 (11%)
Query: 116 RSPRSMLKEIL-LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R P+ M +E+ VDD T K++ ++YV+ L R + P
Sbjct: 154 RLPKEMSEEMKKAVDDGWT----KNAFNQYVSDLISVHRTLPDP---------------- 193
Query: 175 EGEILVFLDAHC--ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 232
DA C E + N LPK+ ++I FHNEAW+ LLRTVHSV+ RSP ++ +
Sbjct: 194 -------RDAWCKDEARYLTN---LPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGK 243
Query: 233 ILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A+ +L +LD+
Sbjct: 244 IILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDS 302
Query: 293 HCECTLGWLENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWY 350
HCECT GWLE L+ R+A + + VVCPVID+I+D T + Y S ++ G F+W L F W+
Sbjct: 303 HCECTEGWLEPLLDRIARNSSTVVCPVIDVINDETLEYHYRDSGGVNVGGFDWNLQFSWH 362
Query: 351 TYGSSDAIIKRKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRV 409
+ KR + T EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+
Sbjct: 363 PVPERER--KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKT 420
Query: 410 WQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE 469
W CGG++EI PCSHV H+FRK SPY + GV+ VL N R+A VWMDE++++Y ++
Sbjct: 421 WMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYY--YHRI 477
Query: 470 AEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVE 527
+ D V R +LR LKC SFKWYL +++P F+P D G IR++ C++
Sbjct: 478 GNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIRNLGYGGRTCLD 537
Query: 528 KPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRV 587
P K +A G PC H Q+ M+ +CLD E + +P V
Sbjct: 538 APAGKKHQKKAVG---TYPC-HRQGGNQIANMQ---------HGMCLDAKEKSEEETP-V 583
Query: 588 RILACSG-FNRQRWTYDK 604
I C G Q W K
Sbjct: 584 SIYECHGQGGNQYWMLSK 601
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 93/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK + LPK+ ++I FHNEAW+ LLRTVHSV+
Sbjct: 174 NAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVL 233
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 234 DRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 292
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 293 KSPVLTYLDSHCECTEGWLEPLLDR 317
>gi|410968769|ref|XP_003990872.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3 [Felis catus]
Length = 633
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 229/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL ++P ++P ++ G I+ + C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTIYPEAYVPDLNPVISGYIKSIGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|391346483|ref|XP_003747502.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Metaseiulus occidentalis]
Length = 514
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 183/426 (42%), Positives = 256/426 (60%), Gaps = 23/426 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H EC LP +SI++ FHNEAWS L+RTVHS+++RSP ++ +I+LVDD S +
Sbjct: 59 HIECRDQVYSSKLPSTSIIVCFHNEAWSVLIRTVHSILNRSPAHLIHDIILVDDFSDLQL 118
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+ Y++ R++R+ R GLI+ARLLGA + +L FLD+H ECT GWLE L
Sbjct: 119 LKDPLERYLSAFP-KVRIVRAEKREGLIRARLLGASHSTAPVLTFLDSHVECTQGWLEPL 177
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
+ R+A + T VV PVIDII+D T Y S +++ G F+W L F W++ + I+K
Sbjct: 178 LDRIAVNSTNVVSPVIDIIADDTLEYNAKESADVNVGGFDWSLQFSWHSI--PERILKSG 235
Query: 363 -DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
+P +TP MAGGLF+IDR +F +G YD +WGGENLE+SF+ W CGG +EI PC
Sbjct: 236 YKRWQPVETPTMAGGLFSIDRKFFERLGMYDPGFDIWGGENLELSFKTWMCGGRLEIIPC 295
Query: 422 SHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
SHV H+FRK SPY + GV+ VL N R+A VWMDE+A +YF+ D +
Sbjct: 296 SHVGHIFRKRSPYKWRSGVN-VLRRNSIRLAKVWMDEYANYYFE--RLGNDLGDYGDISD 352
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK-CVEKPLAKGSMNQASG 540
R+ LR +LKCHSFKWY+ V+P F+P D G +R++ + C++ P K S+++A G
Sbjct: 353 RIALRDKLKCHSFKWYIDEVYPELFVPGDAIGSGEMRNLGSGGMCLDSPAGKSSLHKAVG 412
Query: 541 PASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQR 599
L PC H Q ++ + I DE+ CLD + V + C G Q
Sbjct: 413 ---LYPC-HGQGGNQYWLYSKNGE-IRRDEA-CLDYAGTD------VILYPCHGSKGNQY 460
Query: 600 WTYDKE 605
W YD++
Sbjct: 461 WIYDQQ 466
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 68/154 (44%), Positives = 106/154 (68%), Gaps = 2/154 (1%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++ N FN VSD I VNR+LPD+R +C+ +V++ + LP +SI++ FHNEAWS L+R
Sbjct: 32 REGWKQNAFNSYVSDLISVNRSLPDMRHIECRDQVYSSK-LPSTSIIVCFHNEAWSVLIR 90
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHS+++RSP ++ +I+LVDD S + LK L+ Y++ R++R+ R GLI+ARL
Sbjct: 91 TVHSILNRSPAHLIHDIILVDDFSDLQLLKDPLERYLSAFP-KVRIVRAEKREGLIRARL 149
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSSI 202
LGA + +L FLD+H ECT + E L + ++
Sbjct: 150 LGASHSTAPVLTFLDSHVECTQGWLEPLLDRIAV 183
>gi|444724231|gb|ELW64842.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Tupaia chinensis]
Length = 654
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 189/425 (44%), Positives = 255/425 (60%), Gaps = 29/425 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSVI R+P +L E++LVDD S + LK LDEYV K
Sbjct: 150 LPVASVVICFYNEAFSALLRTVHSVIDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 210 LPGKIKVIRNKKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPAVR-GGFNWGLHFKWDLVPLSE-LAGAGGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRQYFSELGQYDSGMDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ K R + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTRSYGNISERVELRKRLGCKSFK 445
Query: 496 WYLTHVWPHHFLPMDDK------------------FFGRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +V+P +P + GR+ H+QT +C+ +A+G +Q
Sbjct: 446 WYLDNVYPEMQIPGPNARPQQPVFVHRGPKRPRVLLRGRLYHLQTSRCL---VAQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q++V +L+ + +CLD+ E + PR+ SG
Sbjct: 503 KGGLVVLKACDYGDP-NQVWVYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTY 602
Q+WT+
Sbjct: 560 QQWTF 564
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/147 (50%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNVLISNRLGYHRDVPDTRSAACKGKSYPAD-LPVASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L E++LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVIDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNKKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVLWLQPLL 259
>gi|297682043|ref|XP_002818744.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11,
partial [Pongo abelii]
Length = 587
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 192/437 (43%), Positives = 259/437 (59%), Gaps = 29/437 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C F LP +S+V+ F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK
Sbjct: 141 CKEKFYPPDLPAASVVVCFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKG 200
Query: 248 SLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A
Sbjct: 201 ELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLA 260
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ EDR VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T
Sbjct: 261 AIREDRHTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LRGAEGATA 318
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H
Sbjct: 319 PIKSPTMAGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGH 378
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P G + + +L R+A VW+DE+ E YF + K + + R+ELR
Sbjct: 379 IFRKRRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRSDL-KTKSYGNISERVELR 436
Query: 487 KQLKCHSFKWYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEK 528
K+L C SFKWYL +V+P H P F GR+ H+QT+KC+
Sbjct: 437 KKLGCKSFKWYLDNVYPEMQISGSHAKPQQPIFVNRGPKRPKVLQRGRLCHLQTNKCL-- 494
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
+A+G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 495 -VAQGRPSQKGGLVVLKACDYSDP-NQIWIYNEEHELV-LNSLLCLDMSETRSSDPPRLM 551
Query: 589 ILACSGFNRQRWTYDKE 605
SG Q+WT+ K
Sbjct: 552 KCHGSG-GSQQWTFGKN 567
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 88/190 (46%), Positives = 120/190 (63%), Gaps = 8/190 (4%)
Query: 13 ARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPVN 68
A D ID R GH L S + G+ N D Q+L YQ + FN+L+S+R+ +
Sbjct: 73 ANKIDDVIDSRVEDPEEGH-LKFSSELGMIFNERD-QELRDLGYQKHAFNMLISNRLGYH 130
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R +PD R CK K + + LP +S+V+ F+NEA+SALLRTVHSVI R+P +L EI+LV
Sbjct: 131 RDVPDTRNAACKEKFYPPD-LPAASVVVCFYNEAFSALLRTVHSVIDRTPAHLLHEIILV 189
Query: 129 DDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
DD S + LK LDEYV K L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE
Sbjct: 190 DDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCE 249
Query: 188 CTLVFNEEFL 197
+++ + L
Sbjct: 250 VNVMWLQPLL 259
>gi|194222233|ref|XP_001490001.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13 [Equus
caballus]
Length = 539
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 176/447 (39%), Positives = 259/447 (57%), Gaps = 58/447 (12%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTV+SVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVYSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ YV L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENYVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP- 284
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+G + A+ ++ ++ + +WQCGGS+EI CSHV H+F
Sbjct: 285 -VSCFSGNMTALPTGLLYNSCSFSQ---------------IWQCGGSLEIVTCSHVGHVF 328
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RKA+PY+FPGG V+ N R+A VWMDE+ +F++ +P K D V R LR+
Sbjct: 329 RKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKSLREN 387
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS-------------- 534
LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 388 LKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGN 447
Query: 535 -------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVC 573
+++ +GP +L C H+ L + +L + +++ C
Sbjct: 448 QVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--C 505
Query: 574 LDVPEYENDISPRVRILACSGFNRQRW 600
LD P ++ + P ++ CSG Q+W
Sbjct: 506 LDEPSEDDKMVPTMQ--DCSGSRSQQW 530
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SVI+RSP +L E++LVDDAS R+FLK +L+ YV L VP ++IR R GLI+ARL
Sbjct: 134 TVYSVINRSPHYLLSEVILVDDASERDFLKLTLENYVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|194882445|ref|XP_001975321.1| GG22251 [Drosophila erecta]
gi|190658508|gb|EDV55721.1| GG22251 [Drosophila erecta]
Length = 721
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 177/403 (43%), Positives = 244/403 (60%), Gaps = 25/403 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+ ++I FHNEAW+ LLRTVHSV+ RSP ++ +I+LVDD S LK L++Y A
Sbjct: 207 LPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAY 266
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR R GLI+AR+LGA A+ +L +LD+HCECT GWLE L+ R+A + T VV
Sbjct: 267 P-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHCECTEGWLEPLLDRIARNSTTVV 325
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT-EPFKTPAM 373
CPVID+ISD T + Y S ++ G F+W L F W+ + KR + T EP +P M
Sbjct: 326 CPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPVPERER--KRHNSTAEPVYSPTM 383
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+IDR +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 384 AGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 443
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y + GV+ VL N R+A VWMDE++++Y ++ + D V R +LR LKC S
Sbjct: 444 YKWRSGVN-VLKKNSVRLAEVWMDEYSQYY--YHRIGNDKGDWGDVSDRRKLRTDLKCKS 500
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLP 551
FKWYL +++P F+P D G IR++ C++ P K +A G PC H
Sbjct: 501 FKWYLDNIYPELFIPGDSVAHGEIRNLGYGGRTCLDAPAGKKHQKKAVG---TYPC-HRQ 556
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG 594
Q + +P + +CLD E + +P V + C G
Sbjct: 557 GGNQ--IANVP-------KGMCLDAKEKSEEETP-VSVYECHG 589
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 93/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK + LPK+ ++I FHNEAW+ LLRTVHSV+
Sbjct: 173 NAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVL 232
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 233 DRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 291
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 292 KSPVLTYLDSHCECTEGWLEPLLDR 316
>gi|440895697|gb|ELR47827.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Bos grunniens
mutus]
Length = 606
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 187/428 (43%), Positives = 259/428 (60%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEY+ K
Sbjct: 148 LPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKY 207
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+P R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 208 LPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRQTV 267
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 268 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPTMAG 325
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 326 GLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 385
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + R+ + R+ELRK+L C SFK
Sbjct: 386 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTRNYGNISERVELRKKLDCKSFK 443
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ +A+G ++
Sbjct: 444 WYLDNIYPEMQISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCL---VAQGRPSE 500
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 501 KGGLVVLKACDYSDP-NQVWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 557
Query: 598 QRWTYDKE 605
Q+WT+ +
Sbjct: 558 QQWTFGRN 565
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/147 (50%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 112 YQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPAD-LPVASVVICFYNEALSALLRTVH 170
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LDEY+ K L +VIR+P R GLI+ R++G
Sbjct: 171 SVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIG 230
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 231 AAHATGEVLVFLDSHCEVNVLWLQPLL 257
>gi|195335001|ref|XP_002034165.1| GM20039 [Drosophila sechellia]
gi|194126135|gb|EDW48178.1| GM20039 [Drosophila sechellia]
Length = 650
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 195/494 (39%), Positives = 275/494 (55%), Gaps = 53/494 (10%)
Query: 116 RSPRSMLKEIL-LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R P+ M +E+ VDD T K++ ++YV+ L R + P
Sbjct: 154 RLPKEMSEEMKKAVDDGWT----KNAFNQYVSDLISVHRTLPDP---------------- 193
Query: 175 EGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEIL 234
DA C+ + + LPK+ ++I FHNEAW+ LLRTVHSV+ RSP ++ +I+
Sbjct: 194 -------RDAWCKDEARYLTD-LPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKII 245
Query: 235 LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC 294
LVDD S LK L++Y A ++IR R GLI+AR+LGA A+ +L +LD+HC
Sbjct: 246 LVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDSHC 304
Query: 295 ECTLGWLENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTY 352
ECT GWLE L+ R+A + T VVCPVID+ISD T + Y S ++ G F+W L F W+
Sbjct: 305 ECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWHPV 364
Query: 353 GSSDAIIKRKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQ 411
+ KR + T EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+ W
Sbjct: 365 PERER--KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKTWM 422
Query: 412 CGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE 471
CGG++EI PCSHV H+FRK SPY + GV+ VL N R+A VWMDE++++Y ++
Sbjct: 423 CGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYY--YHRIGN 479
Query: 472 KQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKP 529
+ + V R +LR LKC SFKWYL +++P F+P D G IR++ C++ P
Sbjct: 480 DKGNWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIRNLGYGGRTCLDAP 539
Query: 530 LAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRI 589
K +A G PC H Q +++ ++ D CLD + V +
Sbjct: 540 AGKKHQKKAVG---TYPC-HRQGGNQYWMLSKAGEIRRDDS--CLDYAGKD------VTL 587
Query: 590 LAC-SGFNRQRWTY 602
C G Q WTY
Sbjct: 588 FGCHGGKGNQFWTY 601
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 93/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK + LPK+ ++I FHNEAW+ LLRTVHSV+
Sbjct: 174 NAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTDLPKTDVIICFHNEAWTVLLRTVHSVL 233
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 234 DRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 292
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 293 KSPVLTYLDSHCECTEGWLEPLLDR 317
>gi|358412070|ref|XP_870404.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
3 [Bos taurus]
gi|359064998|ref|XP_002687097.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Bos
taurus]
Length = 606
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 188/428 (43%), Positives = 259/428 (60%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEY+ K
Sbjct: 148 LPVASIVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKY 207
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+P R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 208 LPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTV 267
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 268 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPTMAG 325
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 326 GLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 385
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + R+ + R+ELRK+L C SFK
Sbjct: 386 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTRNYGNISERVELRKKLDCKSFK 443
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ +A+G ++
Sbjct: 444 WYLDNIYPEMQISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCL---VAQGRPSE 500
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 501 KGGLVVLKACDYSDP-NQVWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 557
Query: 598 QRWTYDKE 605
Q+WT+ +
Sbjct: 558 QQWTFGRN 565
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +SIVI F+NEA SALLRTVH
Sbjct: 112 YQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPAD-LPVASIVICFYNEALSALLRTVH 170
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LDEY+ K L +VIR+P R GLI+ R++G
Sbjct: 171 SVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIG 230
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 231 AAHATGEVLVFLDSHCEVNVLWLQPLL 257
>gi|443704818|gb|ELU01679.1| hypothetical protein CAPTEDRAFT_140956 [Capitella teleta]
Length = 550
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 177/430 (41%), Positives = 254/430 (59%), Gaps = 27/430 (6%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + + +PK+S++++FHNEAWS LLRTV+S++ RSP L+E++LVDD S +E L
Sbjct: 88 ECRAILHSSKMPKASVIVIFHNEAWSVLLRTVYSILERSPPRFLEEVILVDDYSDQEHLH 147
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LDE+VA R++RS R GLI+ARL+GA A+G++LVFLD+HCECT GWLE ++
Sbjct: 148 DQLDEFVATQQ-KVRLVRSEKREGLIRARLIGAEAAKGQVLVFLDSHCECTPGWLEPMLD 206
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY-----VRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
R+ +D + VV P+ID+I D T Y R F + G F+W + F W+ + + +R
Sbjct: 207 RIGQDWSHVVTPIIDVIDDKTLMYNFNPLSRGFSV--GGFDWAMGFTWHALPNHEKE-RR 263
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
K ++P ++P MAGGLFAIDR YF+HIG+YD M++WGGENLEMSFR+W CGG++E PC
Sbjct: 264 KKISDPARSPTMAGGLFAIDREYFYHIGSYDPGMEIWGGENLEMSFRIWMCGGTLETLPC 323
Query: 422 SHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
SHV H+FRK +P + + N R A VWMDE+ Y ++ D V
Sbjct: 324 SHVGHIFRKRNP-NHSAKHGNFVQRNSVRTAEVWMDEYK--YLYYDRIGNHIGDFGDVSD 380
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIR---HVQ--THKCVEKPLAKGSMN 536
R LR++LKC SFKWYL ++P F+P D + G +R H + C++ +
Sbjct: 381 RRALREELKCKSFKWYLDTIYPTLFVPSDAEASGEVRCKAHFPKVSQVCLDSADIDPETS 440
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGF 595
PC H QM+++ ++ + CLD ND ++RI C S
Sbjct: 441 ANGKEVQTWPC-HGQGGNQMWMLSQNGEI--RKDKGCLDY----ND--GKLRIYPCHSSK 491
Query: 596 NRQRWTYDKE 605
Q W Y+ +
Sbjct: 492 GPQDWKYNDD 501
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 74/148 (50%), Positives = 105/148 (70%), Gaps = 2/148 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FN SDRI V+RTLPD R +C+ + + +PK+S++++FHNEAWS LLRTV+
Sbjct: 62 FKRNAFNQYASDRISVHRTLPDYRDVECRA-ILHSSKMPKASVIVIFHNEAWSVLLRTVY 120
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S++ RSP L+E++LVDD S +E L LDE+VA R++RS R GLI+ARL+GA
Sbjct: 121 SILERSPPRFLEEVILVDDYSDQEHLHDQLDEFVA-TQQKVRLVRSEKREGLIRARLIGA 179
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
A+G++LVFLD+HCECT + E L +
Sbjct: 180 EAAKGQVLVFLDSHCECTPGWLEPMLDR 207
>gi|268572569|ref|XP_002641355.1| C. briggsae CBR-GLY-9 protein [Caenorhabditis briggsae]
Length = 579
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 183/418 (43%), Positives = 254/418 (60%), Gaps = 24/418 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S++I+F +EAW+ LLRTVHSVI+RSP +L+EI+L+DD S R+ L+ LDE++ +
Sbjct: 133 LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEIILLDDNSKRQELQEPLDEHIKRF 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR R GLI+A+L GAR+A G+I+VFLD+HCE GWLE +V R++++RT +V
Sbjct: 193 GGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTAIV 252
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTEPFKTPAMAG 375
CP+ID ISD T AY + L G F+W LHF W G D +KR+ T+ ++P MAG
Sbjct: 253 CPMIDSISDSTLAYHGDWSLSVGGFSWALHFTW--EGLPDEELKRRTKVTDYIRSPTMAG 310
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GL A +R YFF +G YDEEM +WGGENLE+SFR W CGGSIE PCSHV H+FR PY+
Sbjct: 311 GLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPCSHVGHIFRAGHPYN 370
Query: 436 FPG--GVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
G +V N R+A VWMD++ Y+ + E + +D + +R ELRK+L C S
Sbjct: 371 MTGRNNNKDVHGTNSKRLAEVWMDDYKRLYY-MHREDLRTKDVGDLTARHELRKRLNCKS 429
Query: 494 FKWYLTHVWPHHFLPMDDKF-FGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
FKW+L ++ F+ +D +G + V T C + M+Q G + C
Sbjct: 430 FKWFLDNIAKGKFIMDEDVLAYGALHTVVSGTRMCTDTLQRDEKMSQLLG---VFHCQGK 486
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG---FNRQRWTYDKE 605
Q+ M L + E+ C E N VR+ ACS FN +RW Y+K+
Sbjct: 487 GSSPQL--MSLSKEGYLRRENTCA-AEENGN-----VRMKACSKRAQFN-ERWAYEKQ 535
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 65/133 (48%), Positives = 95/133 (71%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+ SD+I ++R +PD R CK ++ LPK+S++I+F +EAW+ LLRTVHSVI+R
Sbjct: 101 MNVHASDKISLDRDVPDPRIQACKDIKYDYATLPKTSVIIIFTDEAWTPLLRTVHSVINR 160
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+EI+L+DD S R+ L+ LDE++ + R+IR R GLI+A+L GAR+A G
Sbjct: 161 SPPELLQEIILLDDNSKRQELQEPLDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVG 220
Query: 177 EILVFLDAHCECT 189
+I+VFLD+HCE
Sbjct: 221 DIIVFLDSHCEAN 233
>gi|410955524|ref|XP_003984401.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Felis
catus]
Length = 552
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 190/452 (42%), Positives = 261/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
R+G +L Q E E + A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RIGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCADLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P ++++EI+LVDD S + D+ + +P + IR+ R GL++
Sbjct: 128 LRTIRSVLNRTPMNLIQEIILVDDFS------NDPDDCSQLIKLPKVKCIRNTERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDIIS F Y+ S
Sbjct: 182 SRIRGASVAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF +D+++F ++G YD
Sbjct: 242 AAELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFEYLGKYDT 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R V SRLELRK L C SFKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNVESRLELRKNLHCQSFKWYLENVYPELRVPNDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ N L PC + +Q++ ++
Sbjct: 420 QKGTIR--QRQKCLE---SQRQRNTEIYNLRLSPCVKIKGEDAKSQIWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVVTIFPG-APVVLVLCKNGDDRQQWT 503
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 88/137 (64%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHLRCTLLVYCAD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P ++++EI+LVDD S + D+ + +P + IR+ R GL+++R+ G
Sbjct: 133 SVLNRTPMNLIQEIILVDDFS------NDPDDCSQLIKLPKVKCIRNTERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ASVAQGTTLTFLDSHCE 203
>gi|194756744|ref|XP_001960635.1| GF13455 [Drosophila ananassae]
gi|190621933|gb|EDV37457.1| GF13455 [Drosophila ananassae]
Length = 688
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 181/415 (43%), Positives = 245/415 (59%), Gaps = 26/415 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP + ++I FHNEAW+ LLRTVHSV+ RSP ++ +I+LVDD S LK L++Y A
Sbjct: 210 LPTTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGKIILVDDYSDMPHLKKQLEDYFAAY 269
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR R GLI+AR+LGA A+ +L +LD+HCECT GWLE L+ R+A + T VV
Sbjct: 270 P-KVQIIRGQKREGLIRARILGANHAKSAVLTYLDSHCECTEGWLEPLLDRIARNSTTVV 328
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
CPVID+ISD T + Y S ++ G F+W L F W++ + + + EP +P MA
Sbjct: 329 CPVIDVISDDTLEYHYRDSSGVNVGGFDWNLQFSWHSVPERERK-RHNNSAEPVYSPTMA 387
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAIDR +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SPY
Sbjct: 388 GGLFAIDREFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPY 447
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ GV+ VL N R+A VWMD++A++Y ++ + D V R +LR LKC SF
Sbjct: 448 KWRSGVN-VLRKNSVRLAEVWMDDYAQYY--YHRIGNDKGDWGDVTDRKKLRADLKCKSF 504
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
KWYL +++P F+P D G IR++ C++ P K +A G PC H
Sbjct: 505 KWYLDNIYPELFIPGDSVAHGEIRNLGYGGRTCLDAPSGKKHQKKAVG---TYPC-HRQG 560
Query: 553 LTQMFVMKLPTDLIATDESVCLDVPEY--ENDISPRVRILACSG-FNRQRWTYDK 604
Q + LPT +CLD E E D + V I C G Q W K
Sbjct: 561 GNQ--IANLPT-------GMCLDAKELSTEGDDT-SVSIYECHGQGGNQYWMLSK 605
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 91/145 (62%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD + V+R+LPD R CK LP + ++I FHNEAW+ LLRTVHSV+
Sbjct: 176 NAFNQYVSDLVSVHRSLPDPRDAWCKDSTQYLTNLPTTDVIICFHNEAWTVLLRTVHSVL 235
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 236 DRSPEHLIGKIILVDDYSDMPHLKKQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 294
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 295 KSAVLTYLDSHCECTEGWLEPLLDR 319
>gi|426228257|ref|XP_004008230.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Ovis
aries]
Length = 606
Score = 328 bits (841), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 187/427 (43%), Positives = 259/427 (60%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEY+ K
Sbjct: 148 LPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKY 207
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+P R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 208 LPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRAV 267
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 268 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPTMAG 325
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 326 GLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 385
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + R+ + R+ELRK+L C SFK
Sbjct: 386 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTRNYGNISERVELRKKLDCKSFK 443
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ +A+G ++
Sbjct: 444 WYLDNIYPEMQISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCL---VAQGRPSE 500
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 501 KGGLVVLKACDYSDP-NQVWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 557
Query: 598 QRWTYDK 604
Q+WT+ +
Sbjct: 558 QQWTFGR 564
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/147 (50%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 112 YQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPVD-LPVASVVICFYNEALSALLRTVH 170
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LDEY+ K L +VIR+P R GLI+ R++G
Sbjct: 171 SVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIG 230
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 231 AAHATGEVLVFLDSHCEVNVLWLQPLL 257
>gi|71993511|ref|NP_001022850.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
gi|51316068|sp|Q95ZJ1.2|GALT5_CAEEL RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
Short=pp-GaNTase 5; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 5; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|5824785|emb|CAB54435.1| Protein GLY-5, isoform a [Caenorhabditis elegans]
Length = 626
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 179/433 (41%), Positives = 262/433 (60%), Gaps = 31/433 (7%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S++I FHNEAWS LLRTVHSV+ R+P +L+E++LVDD S
Sbjct: 161 IDAECK-TEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSD 218
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 219 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWM 278
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 279 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 335
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 336 -RKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 394
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 395 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERIN---NQLGD 450
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH--VQTHKCVEKPLAKG 533
+ SR +LR+ L C SFKWYL +++P F+P + G +R+ VQ +C++ + +
Sbjct: 451 FGDISSRKKLREDLGCKSFKWYLDNIYPELFVPGESVAKGEVRNSAVQPARCLDCMVGRH 510
Query: 534 SMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS 593
N+ P C H Q +++ + I DES C+D V + C
Sbjct: 511 EKNR---PVGTYQC-HGQGGNQYWMLSKDGE-IRRDES-CVDY------AGSDVMVFPCH 558
Query: 594 GF-NRQRWTYDKE 605
G Q W Y+ +
Sbjct: 559 GMKGNQEWRYNHD 571
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 98/145 (67%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S++I FHNEAWS LLRTVHSV+
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVL 199
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+E++LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 200 ERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVA 259
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
GE+L +LD+HCEC + E L +
Sbjct: 260 TGEVLTYLDSHCECMEGWMEPLLDR 284
>gi|170582702|ref|XP_001896248.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158596593|gb|EDP34915.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 520
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 174/416 (41%), Positives = 253/416 (60%), Gaps = 26/416 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C + + LP +S+VI+F +EAWS L+RTVHSVI+R+P +L+EI+LVDD S R+
Sbjct: 60 HKQCHKISYSDDLPVASVVIIFTDEAWSPLMRTVHSVINRTPLKLLQEIILVDDFSQRDE 119
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+ + R++R+ R GLI+A+LLGA++A G++LVFLD+HCE GWLE L
Sbjct: 120 LKEKLEEYIKRFGNKVRLVRALERQGLIRAKLLGAKEAVGDVLVFLDSHCEVGEGWLEPL 179
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKR-- 361
+AR+ + R+ V+CP+I+ IS T Y + G F+W LHF W D + K
Sbjct: 180 LARIKDKRSAVLCPIINHISAETLTYSANDRPTNVGGFSWSLHFLW------DPMPKEYF 233
Query: 362 -KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
D TEP ++P MAGGL A+DR+YFF +G YD +M +WGGENLEMSFRVW CGGSIE P
Sbjct: 234 DADPTEPIRSPTMAGGLLAVDRSYFFEVGGYDPKMDIWGGENLEMSFRVWMCGGSIEFIP 293
Query: 421 CSHVAHLFRKASPYSF--PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
CSHV H+FR PY+ PG +V N R+A VWMD++ +FY+ + K +D
Sbjct: 294 CSHVGHIFRDGHPYNMIGPGDNKDVHGTNSKRLAEVWMDDYKKFYYIHRLDL-KGKDVGD 352
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF--FGRIRHVQTHKCVEK-------- 528
+ R LR++L+C SFKWYL +V + F+ +D+ FG +R+ + C++
Sbjct: 353 LSERRALRQKLRCKSFKWYLQNVAKNKFV-LDENVAAFGALRNPSSGLCLDTLQRNEDEV 411
Query: 529 -PLAKGSMNQASGPASLLPCTHLPVL-TQMFVMKLPTDLIATDESVCLDVPEYEND 582
PL S + T+ +L ++ K+ D + T+++ V EN+
Sbjct: 412 IPLCVFSCQNGKSQTQIFSLTNDGILRRELTCAKIDQDTVGTNKTTVQLVHCGENE 467
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 66/131 (50%), Positives = 99/131 (75%), Gaps = 1/131 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
NL+ SD I ++R+LPD R +C ++++ LP +S+VI+F +EAWS L+RTVHSVI+R
Sbjct: 41 MNLVASDLISLDRSLPDHRHKQCHKISYSDD-LPVASVVIIFTDEAWSPLMRTVHSVINR 99
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P +L+EI+LVDD S R+ LK L+EY+ + R++R+ R GLI+A+LLGA++A G
Sbjct: 100 TPLKLLQEIILVDDFSQRDELKEKLEEYIKRFGNKVRLVRALERQGLIRAKLLGAKEAVG 159
Query: 177 EILVFLDAHCE 187
++LVFLD+HCE
Sbjct: 160 DVLVFLDSHCE 170
>gi|149730677|ref|XP_001496099.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3 [Equus
caballus]
Length = 633
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 171/362 (47%), Positives = 228/362 (62%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHGKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGAAVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDMNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHERQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R ++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFAIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ C L G NQ P L C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSFGQSLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHGKLEEYIKQFSI-VKIVRQRERKGLITARLLGAAVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|189217666|ref|NP_001121278.1| uncharacterized protein LOC100158361 [Xenopus laevis]
gi|115528277|gb|AAI24896.1| LOC100158361 protein [Xenopus laevis]
Length = 600
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 187/427 (43%), Positives = 258/427 (60%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK LD Y+ +
Sbjct: 142 LPLASIVICFYNEASSALLRTVHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDYYMQEN 201
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
LS +++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+A + E+ V
Sbjct: 202 LSKKVKLVRNKRREGLIRGRMVGASHATGDVLVFLDSHCEVNEMWLQPLLAPIRENPKTV 261
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y +S + G FNW LHF+W S+ + + FT PF++P MAG
Sbjct: 262 VCPVIDIISADTLIYSQSPVVR-GGFNWGLHFKWDPVPLSE-LGGPEGFTAPFRSPTMAG 319
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD M +WGGENLE+SFR+W CGGS+ I PCS V H+FRK PY
Sbjct: 320 GLFAMDREYFNTLGQYDSGMDIWGGENLEISFRIWMCGGSLLIVPCSRVGHIFRKRRPYG 379
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ + YF PE + RD +R RL LRK+L C SFK
Sbjct: 380 SPGGHDTMAHNSL-RLAHVWMDEYKDQYFALRPEL-RNRDFGDIRDRLTLRKRLNCKSFK 437
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ ++QT+KC+ +A+G +Q
Sbjct: 438 WYLDNIYPEMQVSGPNAKPQPPVFINKGQKRPKILQRGRLINMQTNKCL---VAQGHPSQ 494
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G + C Q++ +LI ++ +CLD+ E + PR+ SG
Sbjct: 495 KGGLVVVKDCD-FNDSEQVWSYNEEHELILSN-LLCLDMSETRSSDPPRLMKCHGSG-GS 551
Query: 598 QRWTYDK 604
Q+W + K
Sbjct: 552 QQWVFGK 558
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 101/147 (68%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R LPD R KC K + + LP +SIVI F+NEA SALLRTVH
Sbjct: 106 YQKHAFNLLISNRLGYHRDLPDTRDSKCSKKTYPAD-LPLASIVICFYNEASSALLRTVH 164
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LD Y+ + LS +++R+ R GLI+ R++G
Sbjct: 165 SVLDRTPAQLLHEIILVDDNSELDDLKKDLDYYMQENLSKKVKLVRNKRREGLIRGRMVG 224
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A G++LVFLD+HCE ++ + L
Sbjct: 225 ASHATGDVLVFLDSHCEVNEMWLQPLL 251
>gi|3047193|gb|AAC13672.1| GLY5b [Caenorhabditis elegans]
Length = 626
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 179/433 (41%), Positives = 262/433 (60%), Gaps = 31/433 (7%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S++I FHNEAWS LLRTVHSV+ R+P +L+E++LVDD S
Sbjct: 161 IDAECK-TEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSD 218
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 219 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWM 278
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 279 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 335
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 336 -RKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 394
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 395 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERIN---NQLGD 450
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH--VQTHKCVEKPLAKG 533
+ SR +LR+ L C SFKWYL +++P F+P + G +R+ VQ +C++ + +
Sbjct: 451 FGDISSRKKLREDLGCKSFKWYLDNIYPELFVPGESVAKGEVRNSAVQPARCLDCMVGRH 510
Query: 534 SMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS 593
N+ P C H Q +++ + I DES C+D V + C
Sbjct: 511 EKNR---PVGTYQC-HGQGGNQYWMLSKDGE-IRRDES-CVDY------AGSDVMVFPCH 558
Query: 594 GF-NRQRWTYDKE 605
G Q W Y+ +
Sbjct: 559 GMKGNQEWRYNHD 571
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 98/145 (67%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S++I FHNEAWS LLRTVHSV+
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVL 199
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+E++LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 200 ERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVA 259
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
GE+L +LD+HCEC + E L +
Sbjct: 260 TGEVLTYLDSHCECMEGWMEPLLDR 284
>gi|326911650|ref|XP_003202170.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Meleagris gallopavo]
Length = 579
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 180/402 (44%), Positives = 250/402 (62%), Gaps = 15/402 (3%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T +N LP +S+VI F+NEAWS LLRTVHSV+ SP +LKEI+LVDD S + +LK+
Sbjct: 127 TKSYNYRKLPTTSVVIAFYNEAWSTLLRTVHSVLETSPSVLLKEIILVDDLSDKVYLKTD 186
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L++Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+
Sbjct: 187 LEKYISSLKR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLERI 245
Query: 309 AEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
AE+ T V+CPVID I TF Y ++S E G F+W L F+W++ + + +RK T+P
Sbjct: 246 AENETVVICPVIDTIDWNTFEYYMQSAEPMIGGFDWRLTFQWHSVPKHERL-RRKSETDP 304
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
++P MAGGLFA+ + YF ++G YD M VWGGENLE+SFRVWQCGG +EI PCSHV H+
Sbjct: 305 IRSPTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPCSHVGHV 364
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
F K +PY+ P + N AR A VWMDE+ E ++ NP A K+ + + R LR+
Sbjct: 365 FPKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKE-NYGDISERKLLRE 418
Query: 488 QLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLL 545
+LKC SF WYL +V+ +P D + G +R V + +C++ L + N SL
Sbjct: 419 RLKCKSFNWYLRNVFSELHVPEDRPGWHGAVRSVGISSECLDYVLPE--HNPTGAHLSLF 476
Query: 546 PCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDISPR 586
C H Q F + + +C +VPE E+ I R
Sbjct: 477 GC-HGQGGNQFFEYTSNKEFRFNSVTELCAEVPEQEDFIGMR 517
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 72/153 (47%), Positives = 105/153 (68%), Gaps = 3/153 (1%)
Query: 49 QQLYQINRF--NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
Q+ I+++ N+ +SD+I ++R + D R CKTK +N LP +S+VI F+NEAWS L
Sbjct: 93 QEAELIDKYAINIYLSDKISLHRHIEDNRMSGCKTKSYNYRKLPTTSVVIAFYNEAWSTL 152
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LRTVHSV+ SP +LKEI+LVDD S + +LK+ L++Y++ L R+IR+ R GL++A
Sbjct: 153 LRTVHSVLETSPSVLLKEIILVDDLSDKVYLKTDLEKYISSLKR-VRLIRTNKREGLVRA 211
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RL+GA A G++L FLD HCEC + E L +
Sbjct: 212 RLIGATFATGDVLTFLDCHCECVSGWLEPLLER 244
>gi|268576200|ref|XP_002643080.1| C. briggsae CBR-GLY-5 protein [Caenorhabditis briggsae]
Length = 630
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 179/433 (41%), Positives = 262/433 (60%), Gaps = 31/433 (7%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S+++ FHNEAWS LLRTVHSV+ R+P +L+EI+LVDD S
Sbjct: 165 IDAECK-TEKYNEN-LPRTSVIVCFHNEAWSVLLRTVHSVLERTPEHLLEEIVLVDDFSD 222
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 223 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECMEGWI 282
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 283 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 339
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 340 -RKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 398
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 399 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERINNQLGDFGD 457
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH--VQTHKCVEKPLAKG 533
V +R +LR L C SFKWYL +++P F+P + G +R+ VQ +C++ + +
Sbjct: 458 ---VSARKKLRSDLGCKSFKWYLDNIFPELFVPGESVAKGEVRNSAVQPARCLDCMVGRH 514
Query: 534 SMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS 593
N+ P C H Q +++ + I DES C+D + V + C
Sbjct: 515 EKNR---PVGTYQC-HGQGGNQYWMLSKDGE-IRRDES-CVDYAGSD------VMVFPCH 562
Query: 594 GF-NRQRWTYDKE 605
G Q W Y+ +
Sbjct: 563 GMKGNQEWRYNHD 575
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 98/145 (67%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S+++ FHNEAWS LLRTVHSV+
Sbjct: 145 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIVCFHNEAWSVLLRTVHSVL 203
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+EI+LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 204 ERTPEHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIA 263
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
GE+L +LD+HCEC + E L +
Sbjct: 264 TGEVLTYLDSHCECMEGWIEPLLDR 288
>gi|432882423|ref|XP_004074023.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryzias latipes]
Length = 584
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 159/329 (48%), Positives = 227/329 (68%), Gaps = 10/329 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I F+NEAWS LLRT+HSV+ +P +LKEI+L+DD S R +LKS L E
Sbjct: 135 FDYRHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILIDDYSDRGYLKSQLAE 194
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GW+E L+ R+AE+
Sbjct: 195 YISNLQR-VRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPLLERIAEN 253
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ +VCPVID I +F Y+++ E G F+W L F+W++ S+ +RK T+PF++
Sbjct: 254 ASTIVCPVIDTIDWNSFEFYMQTGEPMIGGFDWRLTFQWHSVPESERK-RRKSRTDPFRS 312
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGGS+EI PCSHV H+F K
Sbjct: 313 PTMAGGLFAVSKVYFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGHVFPK 372
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N R A VWMD + ++ NP A+K+ + + RL+LR++LK
Sbjct: 373 KAPYARPNFLQ-----NTVRAAEVWMDSYKHHFYNRNPPAKKE-NYGDITERLQLRERLK 426
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIR 518
C+SF WYL +++P +P D + + G IR
Sbjct: 427 CNSFDWYLKNIYPELHVPEDREGWHGAIR 455
Score = 141 bits (356), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 68/142 (47%), Positives = 98/142 (69%), Gaps = 1/142 (0%)
Query: 58 NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 117
N+ VSD+I ++R + D R +C+ K F+ LP +S++I F+NEAWS LLRT+HSV+ +
Sbjct: 109 NIFVSDKISLHRHIQDNRMEECRNKKFDYRHLPTTSVIIAFYNEAWSTLLRTIHSVLETT 168
Query: 118 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 177
P +LKEI+L+DD S R +LKS L EY++ L R+IR+ R GL++ARL+GA A G+
Sbjct: 169 PAILLKEIILIDDYSDRGYLKSQLAEYISNLQR-VRLIRTNKREGLVRARLIGATYATGD 227
Query: 178 ILVFLDAHCECTLVFNEEFLPK 199
+L FLD HCEC + E L +
Sbjct: 228 VLTFLDCHCECVPGWIEPLLER 249
>gi|431894865|gb|ELK04658.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Pteropus alecto]
Length = 633
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 170/362 (46%), Positives = 229/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDMNTFEFNKPSPYGNNHNRGNFDWSLSFGWESLPDHERQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YD+EM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGTYDDEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P+SFP G ++V+ N R+A VWMD++ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHSFPKG-TQVIARNQVRLAEVWMDDYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ C L G NQ P L C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSFGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHDKLEEYIKQFSI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|324507488|gb|ADY43175.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Ascaris suum]
Length = 632
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 178/428 (41%), Positives = 251/428 (58%), Gaps = 23/428 (5%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+D C+ ++ LP +S++I FHNEAWS LLRTVHSVI R+P +L E++LVDD S
Sbjct: 168 IDEECKTEKYLDD--LPSTSVIICFHNEAWSVLLRTVHSVIERTPEHLLTEVILVDDFSD 225
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ LK L+EY++ L R++R R GLI+ARL GA ++G ++ FLD+HCEC GW+
Sbjct: 226 MDHLKKPLEEYMSALK-KVRIVRMDKREGLIRARLKGAAVSKGAVVTFLDSHCECMEGWI 284
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ + + VVCPVID+I D TF Y +++ + G F+W L F W+ D
Sbjct: 285 EPLLDRIKRNSSTVVCPVIDVIDDETFEYHYSKAYFTNVGGFDWSLQFNWHAIPERDRK- 343
Query: 360 KRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
RK +P ++P MAGGLF+IDRAYF +G YD +WGGENLE+SF++W CGG++EI
Sbjct: 344 NRKRHIDPVRSPTMAGGLFSIDRAYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIV 403
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQA 478
PCSHV H+FRK SPY + GV+ VL N R+A VW+DE+ +Y+ + N + D
Sbjct: 404 PCSHVGHVFRKRSPYKWRTGVN-VLKKNSVRLAEVWLDEYKVYYYERINNQTGDYGD--- 459
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSM--N 536
V R LR++LKC SFKWYL +++P F+P D G +R+ + P S+
Sbjct: 460 VSDRKALRERLKCKSFKWYLDNIYPELFVPGDSVAKGEVRNYGYKEGGGAPQCLDSVVGE 519
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF- 595
+ PC H Q +++ + I DES C+D V I C G
Sbjct: 520 DVHKDVTPYPC-HGQGGNQYWMLSKDGE-IRRDES-CIDY------AGANVMIFPCHGMK 570
Query: 596 NRQRWTYD 603
Q W Y+
Sbjct: 571 GNQEWRYN 578
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/148 (46%), Positives = 100/148 (67%), Gaps = 2/148 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FN VSD I ++R+LP +CKT+ + ++ LP +S++I FHNEAWS LLRTVH
Sbjct: 145 FRNNAFNQYVSDMISIHRSLPSTIDEECKTEKYLDD-LPSTSVIICFHNEAWSVLLRTVH 203
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SVI R+P +L E++LVDD S + LK L+EY++ L R++R R GLI+ARL GA
Sbjct: 204 SVIERTPEHLLTEVILVDDFSDMDHLKKPLEEYMSALK-KVRIVRMDKREGLIRARLKGA 262
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
++G ++ FLD+HCEC + E L +
Sbjct: 263 AVSKGAVVTFLDSHCECMEGWIEPLLDR 290
>gi|148230993|ref|NP_001087490.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus laevis]
gi|51261644|gb|AAH80006.1| MGC81846 protein [Xenopus laevis]
Length = 603
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 185/427 (43%), Positives = 257/427 (60%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD S + LK LD Y+ +
Sbjct: 145 LPHASIVICFYNEAFSALLRTVHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDNYMQEN 204
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
LS +++R+ R GLI+ R++GA +A G++LVFLD+HCE WL+ L+A + E+ V
Sbjct: 205 LSEKVKLVRNKQREGLIRGRMVGASRATGDVLVFLDSHCEVNEMWLQPLLAPIRENPKTV 264
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W S+ + + +T PF++P MAG
Sbjct: 265 VCPVIDIISSDTLIYSSSPVVR-GGFNWGLHFKWDPVPLSE-LGGPEGYTAPFRSPTMAG 322
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF +DR YF +G YD M +WGGENLE+SFR+W CGGS+ I PCS V H+FRK PY
Sbjct: 323 GLFVMDREYFNTLGHYDSGMDIWGGENLEISFRIWMCGGSLLIVPCSRVGHIFRKRRPYG 382
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + Y +L R+A VWMDE+ + YF PE + +D + RL LRK+LKC SFK
Sbjct: 383 SPGGHDTMAYNSL-RLAHVWMDEYKDQYFALRPEL-RNKDYGDISERLALRKRLKCKSFK 440
Query: 496 WYLTHVWPHHFLPMDDK------FF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P LP + F GR+ ++ T+KC+ +A+G +Q
Sbjct: 441 WYLDNIYPEMQLPGPNAKPQPPVFINKGQKRPKILQRGRLINMPTNKCL---VAQGHPSQ 497
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G C Q++ +LI ++ +CLD+ E + PR+ SG
Sbjct: 498 KGGLVVTKECD-FNDSEQVWSYNEEHELILSN-LLCLDMSEIRSSDPPRLMKCHGSG-GS 554
Query: 598 QRWTYDK 604
Q+W + K
Sbjct: 555 QQWVFGK 561
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 71/147 (48%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R +PD R KC K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 109 YQKHAFNLLISNRLGYHRDVPDTRDSKCSKKTYPAD-LPHASIVICFYNEAFSALLRTVH 167
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LD Y+ + LS +++R+ R GLI+ R++G
Sbjct: 168 SVLDRTPAQLLHEIILVDDNSELDDLKKDLDNYMQENLSEKVKLVRNKQREGLIRGRMVG 227
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A +A G++LVFLD+HCE ++ + L
Sbjct: 228 ASRATGDVLVFLDSHCEVNEMWLQPLL 254
>gi|291230380|ref|XP_002735141.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Saccoglossus kowalevskii]
Length = 510
Score = 327 bits (839), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 181/428 (42%), Positives = 244/428 (57%), Gaps = 41/428 (9%)
Query: 193 NEEF---LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
N E+ L +S+VIVFHNEAW+ LLRTVHSVI+RSPR +L EI+LVDD S R
Sbjct: 55 NREYPGVLQTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLTEIILVDDYSNR------- 107
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
VP V R GL +ARL+GA A GE++ FLD+HCECT GWLE L+AR+A
Sbjct: 108 --------VPVMVHHCQQREGLTRARLIGAAMATGEVVTFLDSHCECTRGWLEPLLARIA 159
Query: 310 EDRTRVVCPVIDIISDVTFAYVR-SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
ED+T VVCPVI+IISD TF ++ S G F+W L F W+ + + D T P
Sbjct: 160 EDKTNVVCPVINIISDTTFEFINGSDATQVGGFDWRLIFNWHVVPHRELQRIKFDRTSPV 219
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLF+I + +F +G YD VWG ENLE+SF+ W CGG++E PCSHV H+F
Sbjct: 220 RSPTMAGGLFSIHKEFFTRLGTYDPGFDVWGAENLELSFKTWMCGGTLEFVPCSHVGHVF 279
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RK SP+ FP V+ N R+A VW+DE+ Y+ +PE K D + RL LR++
Sbjct: 280 RKRSPHRFPPTTHNVMQRNNRRLAEVWLDEYKYLYYNAHPEILKT-DPGDISERLALRER 338
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFG--RIRHVQTHKCVE--------KPLAKGSMNQA 538
L+C SFKWYL +V+P + P+ F+G I+H+ + C++ K K +
Sbjct: 339 LQCKSFKWYLENVYPENVFPI--HFYGVVTIKHIISGNCLDYGNLKMRGKQPTKAGKTDS 396
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG-FNR 597
L C PV T ++ L E CLD + I+ + + C G
Sbjct: 397 GQKVELWKCHGGPVQTFIYTKAKEIRL----EKECLDY----SAITGSLTLYPCHGQGGN 448
Query: 598 QRWTYDKE 605
Q W Y+K+
Sbjct: 449 QVWGYNKK 456
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/172 (44%), Positives = 106/172 (61%), Gaps = 21/172 (12%)
Query: 33 LGDSVDGGLHSNLSDAQ-----QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEE 87
+G + + G ++D+Q QL+ +N+FN++ SD I +NR+LPD+R C+ + +
Sbjct: 2 IGAAGEMGKPVFIADSQKEKMNQLFPLNQFNVMASDMIALNRSLPDIRPRGCQNREY-PG 60
Query: 88 FLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK 147
L +S+VIVFHNEAW+ LLRTVHSVI+RSPR +L EI+LVDD S R
Sbjct: 61 VLQTTSVVIVFHNEAWTTLLRTVHSVINRSPRHLLTEIILVDDYSNR------------- 107
Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
VP V R GL +ARL+GA A GE++ FLD+HCECT + E L +
Sbjct: 108 --VPVMVHHCQQREGLTRARLIGAAMATGEVVTFLDSHCECTRGWLEPLLAR 157
>gi|341878756|gb|EGT34691.1| CBN-GLY-9 protein [Caenorhabditis brenneri]
Length = 579
Score = 327 bits (839), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 178/425 (41%), Positives = 256/425 (60%), Gaps = 24/425 (5%)
Query: 190 LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
+ ++ LPK+S++I+F +EAW+ LLRTVHSVI+RSP +L+E++L+DD S R+ L+ L
Sbjct: 126 IKYDYSSLPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPL 185
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
DE++ + ++IR R GLI+A+L GAR+A G+I+VFLD+HCE GWLE +V R++
Sbjct: 186 DEHIKRFGGKVKLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRIS 245
Query: 310 EDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFK 369
++RT +VCP+ID ISD T AY + L G F+W LHF W D +RK T+ +
Sbjct: 246 DERTAIVCPMIDSISDSTLAYHGDWSLSVGGFSWALHFTWEGI-PEDEQKRRKKPTDYIR 304
Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
+P MAGGL A +R YFF +G YDEEM +WGGENLE+SFR W CGGSIE PCSHV H+FR
Sbjct: 305 SPTMAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRNWMCGGSIEFIPCSHVGHIFR 364
Query: 430 KASPYSFPG--GVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
PY+ G +V N R+A VWMD++ Y+ + E + +D + SR ELRK
Sbjct: 365 AGHPYNMTGRNNNKDVHGTNSKRLAEVWMDDYKRLYY-MHREDLRTKDVGDLTSRHELRK 423
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKF--FGRIRHV--QTHKCVEKPLAKGSMNQASGPAS 543
+L C SFKW+L ++ F+ MD+ +G + V T C + M+Q G
Sbjct: 424 RLNCKSFKWFLDNIAKGKFI-MDEDVVAYGALHTVVSGTRMCTDTLQRDEKMSQLLG--- 479
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG---FNRQRW 600
+ C Q+ + +L E+ C ++ + VR+ CS FN +RW
Sbjct: 480 VFHCQGKGSSPQLMSLSREGNL--RRENTCA------SEENGNVRMKTCSKKAQFN-ERW 530
Query: 601 TYDKE 605
Y+K+
Sbjct: 531 AYEKQ 535
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 95/133 (71%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+ SD+I ++R +PD R CK ++ LPK+S++I+F +EAW+ LLRTVHSVI+R
Sbjct: 101 MNVHASDKISLDRDVPDPRIQACKDIKYDYSSLPKTSVIIIFTDEAWTPLLRTVHSVINR 160
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+E++L+DD S R+ L+ LDE++ + ++IR R GLI+A+L GAR+A G
Sbjct: 161 SPPELLQEVILLDDNSKRQELQEPLDEHIKRFGGKVKLIRKHVRHGLIRAKLAGAREAVG 220
Query: 177 EILVFLDAHCECT 189
+I+VFLD+HCE
Sbjct: 221 DIIVFLDSHCEAN 233
>gi|326434666|gb|EGD80236.1| polypeptide N-acetylgalactosaminyltransferase 13 [Salpingoeca sp.
ATCC 50818]
Length = 641
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 181/425 (42%), Positives = 251/425 (59%), Gaps = 23/425 (5%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC L ++ +PK+S++I + NEAWS LLRTV SV++RSP +++EI+L+DDAS E+L
Sbjct: 190 ECLPLKWDTSKMPKTSVIICYVNEAWSTLLRTVWSVLNRSPPELIEEIILLDDASDAEWL 249
Query: 246 KSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LD YV + R++RSP R+GLI+ARLLGA+ A+G ++ FLD+HCE GWLE +
Sbjct: 250 GEKLDTYVREHFPSHVRIVRSPDRLGLIRARLLGAKHAKGPVMTFLDSHCEANQGWLEPI 309
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVR--SFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
+ +A +RT VV PVID I T Y + S G F+W L F W S + +
Sbjct: 310 LDIIATNRTTVVTPVIDTIDHRTMEYAKWTSNIPSVGTFDWTLDFNW----KSGVLRPGQ 365
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
T+P +P MAGGLFAIDR YF+ IG+YDE+M WGGEN+EMSFR+WQCGG + APCS
Sbjct: 366 KLTDPIDSPTMAGGLFAIDRDYFYEIGSYDEDMDGWGGENVEMSFRIWQCGGRLVTAPCS 425
Query: 423 HVAHLFRKASPYSFPG-GVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
HV H+FR PY PG G+ N R+A VWMD++ +F++ P+ E D +
Sbjct: 426 HVGHIFRDTHPYKVPGKGIHHTFMKNSMRLAEVWMDDYKQFFYDTKPKRENI-DIGDLTK 484
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFF--GRIRHVQTHKCVEKPLAKGSMNQAS 539
R LR++LKC FKWYL HV P F+P + G +R C++K ++A
Sbjct: 485 RKALRERLKCKPFKWYLKHVLPDLFVPDSEHVLHKGALR-AGNGLCLDK-----MGHRAG 538
Query: 540 GPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGFNRQ 598
G A + C H Q ++ + D I T +S+CLDV Y + + + C Q
Sbjct: 539 GQAGVFSC-HGEGGNQGWMYTV-NDEIRTADSLCLDV--YSSKFPAPIHLQRCHQKQGNQ 594
Query: 599 RWTYD 603
W Y+
Sbjct: 595 AWKYE 599
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 65/142 (45%), Positives = 96/142 (67%), Gaps = 1/142 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN S +P++R +PD R +C ++ +PK+S++I + NEAWS LLRTV SV++R
Sbjct: 169 FNEYKSSLLPLDRAIPDTRIKECLPLKWDTSKMPKTSVIICYVNEAWSTLLRTVWSVLNR 228
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAE 175
SP +++EI+L+DDAS E+L LD YV + R++RSP R+GLI+ARLLGA+ A+
Sbjct: 229 SPPELIEEIILLDDASDAEWLGEKLDTYVREHFPSHVRIVRSPDRLGLIRARLLGAKHAK 288
Query: 176 GEILVFLDAHCECTLVFNEEFL 197
G ++ FLD+HCE + E L
Sbjct: 289 GPVMTFLDSHCEANQGWLEPIL 310
>gi|432098371|gb|ELK28171.1| Polypeptide N-acetylgalactosaminyltransferase 3 [Myotis davidii]
Length = 633
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 170/362 (46%), Positives = 228/362 (62%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ +P +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSTPAILLKEIILVDDASVAEYLHDKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 PI-VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + +RKD T P KTP
Sbjct: 303 SPDIASIDMNTFEFNKPSPYGSNHNRGNFDWSLSFGWEALPDHERQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++ +L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL +++P ++P ++ G I+ C L G NQ S P L C
Sbjct: 481 QCKNFTWYLNNIYPEVYVPDLNPVISGYIKSFGQPLC----LDVGENNQGSKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 72/134 (53%), Positives = 91/134 (67%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
+P +LKEI+LVDDAS E+L L+EY+ + + +++R R GLI ARLLGA A
Sbjct: 210 YSTPAILLKEIILVDDASVAEYLHDKLEEYIKQFPI-VKIVRQRERKGLITARLLGATVA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|224044641|ref|XP_002188932.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Taeniopygia guttata]
Length = 608
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 187/427 (43%), Positives = 254/427 (59%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S LK L EYV +
Sbjct: 150 LPSASVVICFYNEALSALLRTVHSVLDRTPAHLLHEIILVDDNSELADLKKDLSEYVKTQ 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L T+++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+A + ED V
Sbjct: 210 LPRTTKLVRNEKREGLIRGRMIGASHATGKVLVFLDSHCEVNEMWLQPLLAPIREDPRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W ++ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLTYSSSPVVR-GGFNWGLHFKWDLVPLAE-LEGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGRLLIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ E YF PE + R + R+ELRK+L C SFK
Sbjct: 388 SPGGQDTMAHNSL-RLAHVWMDEYKEQYFALRPEL-RTRSYGNITDRVELRKRLNCKSFK 445
Query: 496 WYLTHVWPHH-------------FLPMDDKF-----FGRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P F+ K GR+ H+QT+KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQISGPNAKAPQPVFINRAQKRPKIIQRGRLYHLQTNKCL---VAQGHPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G + C + Q+++ +LI + +CLDV E + PR+ SG
Sbjct: 503 KGGLVVVRECDYNDQ-NQVWIYNEDHELILNN-LLCLDVSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDK 604
Q+WT+ K
Sbjct: 560 QQWTFGK 566
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 79/199 (39%), Positives = 118/199 (59%), Gaps = 24/199 (12%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R KC+ K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHREVPDTRDAKCREKSYPAD-LPSASVVICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S LK L EYV +L T+++R+ R GLI+ R++G
Sbjct: 173 SVLDRTPAHLLHEIILVDDNSELADLKKDLSEYVKTQLPRTTKLVRNEKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSML 230
A A G++LVFLD+HCE NE W L+ + + I PR+++
Sbjct: 233 ASHATGKVLVFLDSHCEV-------------------NEMW---LQPLLAPIREDPRTVV 270
Query: 231 KEILLVDDASTREFLKSSL 249
++ + A T + S +
Sbjct: 271 CPVIDIISADTLTYSSSPV 289
>gi|161077154|ref|NP_725603.2| CG30463, isoform B [Drosophila melanogaster]
gi|161077156|ref|NP_001097341.1| CG30463, isoform C [Drosophila melanogaster]
gi|157400365|gb|AAF57964.3| CG30463, isoform B [Drosophila melanogaster]
gi|157400366|gb|ABV53822.1| CG30463, isoform C [Drosophila melanogaster]
Length = 647
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 196/494 (39%), Positives = 271/494 (54%), Gaps = 56/494 (11%)
Query: 116 RSPRSMLKEIL-LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R P+ M E+ VDD T K++ ++YV+ L R + P
Sbjct: 154 RLPKEMSDEMKKAVDDGWT----KNAFNQYVSDLISVHRTLPDP---------------- 193
Query: 175 EGEILVFLDAHC--ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 232
DA C E + N LPK+ ++I FHNEAW+ LLRTVHSV+ RSP ++ +
Sbjct: 194 -------RDAWCKDEARYLTN---LPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGK 243
Query: 233 ILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A+ +L +LD+
Sbjct: 244 IILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDS 302
Query: 293 HCECTLGWLENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWY 350
HCECT GWLE L+ R+A + T VVCPVID+ISD T + Y S ++ G F+W L F W+
Sbjct: 303 HCECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWH 362
Query: 351 TYGSSDAIIKRKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRV 409
+ KR + T EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+
Sbjct: 363 PVPERER--KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKT 420
Query: 410 WQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE 469
W CGG++EI PCSHV H+FRK SPY + GV+ VL N R+A VWMDE++++Y ++
Sbjct: 421 WMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VLKKNSVRLAEVWMDEYSQYY--YHRI 477
Query: 470 AEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKP 529
+ D V R +LR LKC SFKWYL +++P F+P D G I +V C
Sbjct: 478 GNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIANVPNGMC---- 533
Query: 530 LAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRI 589
L ++ P S+ C H Q +++ ++ D CLD + V +
Sbjct: 534 LDAKEKSEEETPVSIYEC-HGQGGNQYWMLSKAGEIRRDDS--CLDYAGKD------VTL 584
Query: 590 LAC-SGFNRQRWTY 602
C G Q WTY
Sbjct: 585 FGCHGGKGNQFWTY 598
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 93/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK + LPK+ ++I FHNEAW+ LLRTVHSV+
Sbjct: 174 NAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVL 233
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 234 DRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 292
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 293 KSPVLTYLDSHCECTEGWLEPLLDR 317
>gi|427794265|gb|JAA62584.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Rhipicephalus pulchellus]
Length = 591
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 211/313 (67%), Gaps = 7/313 (2%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F E+ LP +S+V+ F+NEAWSAL+RTVHS++ R+P ++L E++LVDD ST L L
Sbjct: 120 FQEQSLPTASVVVCFYNEAWSALVRTVHSILERTPAALLHELILVDDNSTLPELGLQLSR 179
Query: 252 YVA-KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAE 310
YVA +L R+IR+P R GLI+AR+ GA A G++LVFLD+HCE +GWLE ++AR+
Sbjct: 180 YVASELPSHVRLIRTPAREGLIRARMYGAHNASGQVLVFLDSHCEVNVGWLEPMLARIGA 239
Query: 311 DRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+RT V CPVIDII+ TF Y S + G FNW LHF+W S + + +P +
Sbjct: 240 NRTTVTCPVIDIINADTFEYSASPIVR-GGFNWGLHFKWE---SPPRLRGPQQAIDPIPS 295
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+DR YF +G YD+ M +WGGENLE+SFR+W CGG +EI PCS V H+FR+
Sbjct: 296 PTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISFRIWMCGGRLEILPCSRVGHVFRR 355
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY P G + L N RVA VWMDE+ +Y + +A Q V +R ELRK+LK
Sbjct: 356 RRPYGSPSG-EDTLTKNSLRVAHVWMDEYKTYYLQTRRDARNQWYGD-VSARKELRKRLK 413
Query: 491 CHSFKWYLTHVWP 503
CHSF WY+ HV+P
Sbjct: 414 CHSFDWYMHHVYP 426
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 72/149 (48%), Positives = 104/149 (69%), Gaps = 1/149 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN+L+S+R+ R+LPD R P C+ + F E+ LP +S+V+ F+NEAWSAL+RTVH
Sbjct: 88 YHQHAFNVLISNRLGKVRSLPDTRNPLCRQQEFQEQSLPTASVVVCFYNEAWSALVRTVH 147
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA-KLSVPTRVIRSPGRVGLIKARLLG 170
S++ R+P ++L E++LVDD ST L L YVA +L R+IR+P R GLI+AR+ G
Sbjct: 148 SILERTPAALLHELILVDDNSTLPELGLQLSRYVASELPSHVRLIRTPAREGLIRARMYG 207
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPK 199
A A G++LVFLD+HCE + + E L +
Sbjct: 208 AHNASGQVLVFLDSHCEVNVGWLEPMLAR 236
>gi|390350617|ref|XP_784979.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Strongylocentrotus purpuratus]
Length = 647
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 160/346 (46%), Positives = 224/346 (64%), Gaps = 8/346 (2%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
+C + E LP +S++I+FHNEA+SALLRTVHSVI+RSPR +LKEI+LVDDAST+E LK
Sbjct: 281 QCKSLVYPEVLPTTSVIIIFHNEAFSALLRTVHSVINRSPRHLLKEIILVDDASTQEHLK 340
Query: 247 SSLDEYVAK---LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LD+Y+++ S R+ R P R GLI+AR+ GA A G+IL FLD+HCE +GWLE
Sbjct: 341 VKLDDYISRHFHSSARVRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEP 400
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRK 362
L+A + +DR VV P ID+I D AY S +L G+F W + FRW + D ++
Sbjct: 401 LLAVIDKDRRNVVTPTIDVIDDNDLAYKGSDQLPQVGSFGWTMAFRWTAIQTMDLEEAKR 460
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
+ T P ++P MAGGLF+ID+ YF +G YD Q+WG EN+E+SF+ W CGGS+ CS
Sbjct: 461 NPTLPIRSPTMAGGLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYTMACS 520
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSR 482
HV H+FRK +PYS G+ + N R+ VW+ + FY+K +P+ + D ++ +
Sbjct: 521 HVGHIFRKFAPYS---GMGSYFHRNNKRLIEVWLGDARAFYYKLHPDVLRI-DAGDIQDQ 576
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEK 528
+ LRK+L C SF WYL +V+P P FG IR+ + C++K
Sbjct: 577 INLRKKLDCKSFDWYLDNVFPESPWPRKGSIFGFIRNPDFNICLDK 622
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 87/153 (56%), Positives = 111/153 (72%), Gaps = 4/153 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
A LY N FNLL SD I NR+LPDVR +CK+ V+ E LP +S++I+FHNEA+SALL
Sbjct: 251 ADALYHKNAFNLLASDMIAFNRSLPDVRPQQCKSLVY-PEVLPTTSVIIIFHNEAFSALL 309
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK---LSVPTRVIRSPGRVGLI 164
RTVHSVI+RSPR +LKEI+LVDDAST+E LK LD+Y+++ S R+ R P R GLI
Sbjct: 310 RTVHSVINRSPRHLLKEIILVDDASTQEHLKVKLDDYISRHFHSSARVRIERLPTRSGLI 369
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+AR+ GA A G+IL FLD+HCE + + E L
Sbjct: 370 RARIHGALNAIGDILTFLDSHCEVNVGWLEPLL 402
>gi|348519859|ref|XP_003447447.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Oreochromis niloticus]
Length = 624
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 174/362 (48%), Positives = 237/362 (65%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS + LK LD+Y+ +L
Sbjct: 178 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASVDDELKDKLDDYLKQL 237
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++ RV+R R GLI ARLLGA A G+ L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 238 NI-VRVMRQRERKGLITARLLGASVATGDTLTFLDAHCECFNGWLEPLLARIAENYTAVV 296
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF +++ + G F+W L F W + + +RKD T P KTP
Sbjct: 297 SPDITTIDLNTFEFMKPSPYGQNHNRGNFDWSLSFGWESLPDHEKR-RRKDETYPIKTPT 355
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF+ IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 356 FAGGLFSISKEYFYRIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIIPCSIVGHVFRTKS 415
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G ++V+ N R+A VWMD++ E +++ N +A + A + R+ELR++L
Sbjct: 416 PHTFPKG-TQVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKDGAFGDISKRVELREKL 474
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C SF WYL +V+P F+P ++ FG +++V C L G N+ + PC
Sbjct: 475 QCKSFSWYLQNVYPEVFMPDLNPLRFGSVKNVGKDSC----LDAGENNEGGKQLIMYPCH 530
Query: 549 HL 550
L
Sbjct: 531 GL 532
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 93/134 (69%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FNL SD I ++R L D R P+C + F LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 144 FNLYASDHISLSRDLGADTRPPECIEQTFKRCPPLPTTSVIIVFHNEAWSTLLRTVYSVL 203
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS + LK LD+Y+ +L++ RV+R R GLI ARLLGA A
Sbjct: 204 HTSPAILLKEIILVDDASVDDELKDKLDDYLKQLNI-VRVMRQRERKGLITARLLGASVA 262
Query: 175 EGEILVFLDAHCEC 188
G+ L FLDAHCEC
Sbjct: 263 TGDTLTFLDAHCEC 276
>gi|157113705|ref|XP_001652065.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108877647|gb|EAT41872.1| AAEL006558-PA [Aedes aegypti]
Length = 368
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 157/323 (48%), Positives = 209/323 (64%), Gaps = 14/323 (4%)
Query: 284 GEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNW 343
G+++ FLDAHCECT GWLE L+AR+ DR VVCP+ID+ISD TF YV + + WG FNW
Sbjct: 2 GQVITFLDAHCECTEGWLEPLLARIVLDRKTVVCPIIDVISDETFEYVTASDQTWGGFNW 61
Query: 344 ELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENL 403
+L+FRWY + + + D T P +TP MAGGLF+IDR YF+ IG+YDE M +WGGENL
Sbjct: 62 KLNFRWYRVPAREMQRRNHDRTAPLRTPTMAGGLFSIDRDYFYEIGSYDEGMDIWGGENL 121
Query: 404 EMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFY 463
EMSFR+WQCGG +EIAPCSHV H+FR SPY+FPGGV+ ++ N ARVA VW+DEW EFY
Sbjct: 122 EMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTFPGGVANIVLKNAARVAEVWLDEWKEFY 181
Query: 464 FKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTH 523
++ +P A K V R ELR++LKC SF+WYL +++P +P+D F G IR+V+T
Sbjct: 182 YQMSPGARKA-SAGDVSERKELRERLKCKSFRWYLENIYPESQMPLDYYFLGEIRNVETG 240
Query: 524 KCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDI 583
C++ +M + S H Q+F +++ D CLD N +
Sbjct: 241 NCLD------TMGRKSNEKIGSSYCHGLGGNQVFAYTKRHQVMSDDN--CLDA---SNAL 289
Query: 584 SPRVRILACSGF-NRQRWTYDKE 605
P V ++ C G Q W YD+E
Sbjct: 290 GP-VNLVRCHGMGGNQEWVYDEE 311
>gi|311275138|ref|XP_003134591.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Sus
scrofa]
Length = 608
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 188/428 (43%), Positives = 257/428 (60%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S++I F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEY+ K
Sbjct: 150 LPVASVIICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L+ +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 210 LTGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHFRW S+ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLAYSASPVVR-GGFNWGLHFRWDLVPLSE-LEGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + R + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTRSYGNISERVELRKKLDCKSFK 445
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ A+G +Q
Sbjct: 446 WYLDNIYPEMQVSGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCLA---AQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +LI + +CLD+ E + PR+ SG
Sbjct: 503 KGGLVVLKACDYGDP-DQIWIYNEEHELILNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDKE 605
Q+WT+ +
Sbjct: 560 QQWTFGRN 567
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S++I F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYRTD-LPVASVIICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LDEY+ K L+ +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLTGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVLWLQPLL 259
>gi|73979014|ref|XP_539924.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Canis
lupus familiaris]
Length = 608
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 187/427 (43%), Positives = 258/427 (60%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK L+EYV K
Sbjct: 150 LPAASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + ED+ V
Sbjct: 210 LPGKIKVIRNIKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIQEDQQTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + + + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTKSYGNISERVELRKKLGCKSFK 445
Query: 496 WYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQISGPNAKPQQPIFINRGPKRPKILQRGRLYHLQTNKCL---VAQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + TQ+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 503 KGGLVVLKACDYSDP-TQIWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDK 604
Q+WT+ K
Sbjct: 560 QQWTFGK 566
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R C+ K F + LP +S+VI F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACRDKSFPAD-LPAASVVICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK L+EYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNIKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|21464370|gb|AAM51988.1| RE10344p [Drosophila melanogaster]
Length = 650
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 197/496 (39%), Positives = 272/496 (54%), Gaps = 57/496 (11%)
Query: 116 RSPRSMLKEIL-LVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R P+ M E+ VDD T K++ ++YV+ L R + P
Sbjct: 154 RLPKEMSDEMKKAVDDGWT----KNAFNQYVSDLISVHRTLPDP---------------- 193
Query: 175 EGEILVFLDAHC--ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 232
DA C E + N LPK+ ++I FHNEAW+ LLRTVHSV+ RSP ++ +
Sbjct: 194 -------RDAWCKDEARYLTN---LPKTDVIICFHNEAWTVLLRTVHSVLDRSPEHLIGK 243
Query: 233 ILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDA 292
I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A+ +L +LD+
Sbjct: 244 IILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHAKSPVLTYLDS 302
Query: 293 HCECTLGWLENLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWY 350
HCECT GWLE L+ R+A + T VVCPVID+ISD T + Y S ++ G F+W L F W+
Sbjct: 303 HCECTEGWLEPLLDRIARNSTTVVCPVIDVISDETLEYHYRDSGGVNVGGFDWNLQFSWH 362
Query: 351 TYGSSDAIIKRKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRV 409
+ KR + T EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+
Sbjct: 363 PVPERER--KRHNSTAEPVYSPTMAGGLFSIDREFFDRLGTYDSGFDIWGGENLELSFKT 420
Query: 410 WQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE 469
W CGG++EI PCSHV H+FRK SPY + GV+ V N R+A VWMDE+++ Y ++
Sbjct: 421 WMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VPKKNSVRLAEVWMDEYSQCY--YHRI 477
Query: 470 AEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVE 527
+ D V R +LR LKC SFKWYL +++P F+P D G IR++ C++
Sbjct: 478 GNDKGDWGDVSDRRKLRNDLKCKSFKWYLDNIYPELFIPGDSVAHGEIRNLGYGGRTCLD 537
Query: 528 KPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRV 587
P K +A G PC H Q +++ ++ D CLD + V
Sbjct: 538 APAGKKHQKKAVG---TYPC-HRQGGNQYWMLSKAGEIRRDDS--CLDYAGKD------V 585
Query: 588 RILAC-SGFNRQRWTY 602
+ C G Q WTY
Sbjct: 586 TLFGCHGGKGNQFWTY 601
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 93/145 (64%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R CK + LPK+ ++I FHNEAW+ LLRTVHSV+
Sbjct: 174 NAFNQYVSDLISVHRTLPDPRDAWCKDEARYLTNLPKTDVIICFHNEAWTVLLRTVHSVL 233
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +I+LVDD S LK L++Y A ++IR R GLI+AR+LGA A
Sbjct: 234 DRSPEHLIGKIILVDDYSDMPHLKRQLEDYFAAYP-KVQIIRGQKREGLIRARILGANHA 292
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 293 KSPVLTYLDSHCECTEGWLEPLLDR 317
>gi|189236651|ref|XP_969621.2| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270005204|gb|EFA01652.1| hypothetical protein TcasGA2_TC007223 [Tribolium castaneum]
Length = 564
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 178/405 (43%), Positives = 242/405 (59%), Gaps = 19/405 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S E +AK+
Sbjct: 130 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSD----NPEDGEELAKI 185
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A +L FLD+HCEC + WLE L+ RVAED TRVV
Sbjct: 186 Q-KVRVLRNDKREGLMRSRVRGADAATASVLTFLDSHCECNVNWLEPLLERVAEDPTRVV 244
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W G ++ +++D T+ +TP +AGG
Sbjct: 245 CPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLGYAERESRQRDPTQAIRTPMIAGG 304
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 305 LFVINKAYFEKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTF 364
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ FY+ P A K + RLELR+ L+C FKW
Sbjct: 365 PGGSGNVFARNTRRAAEVWMDDYKHFYYAAVPLA-KNIPFGDISERLELRRNLQCKPFKW 423
Query: 497 YLTHVWPHHFLPM-DDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
YL HV+P +P G +R Q C++ G + G +L C H +
Sbjct: 424 YLQHVYPELAIPQATSAHVGELR--QGMYCLD---TMGHL--IDGTVALYQCHHTGGNQE 476
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
L + + +CL + +Y + +RI C G + Q+W
Sbjct: 477 ---WGLTSGGLIKHHDLCLTLDDYMKGVQVVMRI--CDGSDSQKW 516
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 67/157 (42%), Positives = 95/157 (60%), Gaps = 6/157 (3%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
L + Y NRFN SD +P NR +PD R C+ K++ + LP +S++I FHNEA
Sbjct: 85 GGLRQGEDPYIRNRFNQEASDNLPSNREIPDTRNAMCRRKLWRTD-LPPTSVIITFHNEA 143
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
S LLRTV SV++RSP ++KEI+LVDD S E +AK+ RV+R+ R G
Sbjct: 144 RSTLLRTVVSVLNRSPEHLIKEIILVDDFSD----NPEDGEELAKIQ-KVRVLRNDKREG 198
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L+++R+ GA A +L FLD+HCEC + + E L +
Sbjct: 199 LMRSRVRGADAATASVLTFLDSHCECNVNWLEPLLER 235
>gi|345782166|ref|XP_540140.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Canis
lupus familiaris]
Length = 552
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 186/452 (41%), Positives = 260/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CT++ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSSRAVPDTRHLRCTMLVYCADLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P ++++EI+LVDD S + D+ + + +P + IR+ R GL++
Sbjct: 128 LRTIRSVLNRTPMNLIQEIILVDDFS------NDPDDCLQLIKLPKVKCIRNSERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDIIS F Y+ S
Sbjct: 182 SRIRGANVAKGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIISLDNFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D EP +TP +AGGLF +D+++F ++G YD
Sbjct: 242 AAELRGGFDWSLHFQWEQL-SPEQKARRLDPAEPIRTPIIAGGLFVMDKSWFNYLGKYDT 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL+LRK L+C SFKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIESRLDLRKNLQCQSFKWYLENVYPELRIPNDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E K N L PC +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLESQRQK---NTEIYDLRLSPCVKTKGKDAKSQIWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V ++ +G ++Q+WT
Sbjct: 473 EELCLSVVTVFPG-APVVLVVCKNGDDKQQWT 503
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 89/137 (64%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI +R +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSSRAVPDTRHLRCTMLVYCAD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P ++++EI+LVDD S + D+ + + +P + IR+ R GL+++R+ G
Sbjct: 133 SVLNRTPMNLIQEIILVDDFS------NDPDDCLQLIKLPKVKCIRNSERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ANVAKGTTLTFLDSHCE 203
>gi|426223372|ref|XP_004005849.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Ovis
aries]
Length = 552
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 186/452 (41%), Positives = 261/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEIL----VFLDAHC-ECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + V D CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 128 LRTIRSILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + + +R D TEP +TP +AGGLF +D+++F+++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-TPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDT 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL LRK L+C SFKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYASRPFA-LERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSI 419
Query: 514 F-GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E A+ +Q L PC +Q++ ++
Sbjct: 420 HKGSIR--QRQKCLE---AQKQKDQEISSLKLSPCVKTEGKDAKSQIWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDKRQQWT 503
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/137 (40%), Positives = 88/137 (64%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCAD-LPPTSIIIAFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 133 SILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|440907821|gb|ELR57918.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Bos
grunniens mutus]
Length = 509
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 186/452 (41%), Positives = 261/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEIL----VFLDAHC-ECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + V D CTL+ LP +SI+I FHNEA S L
Sbjct: 25 RVGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTL 84
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 85 LRTIRSILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 138
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII TF Y+ S
Sbjct: 139 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNYIES 198
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + + +R D TEP +TP +AGGLF +D+++F+++G YD
Sbjct: 199 ASELRGGFDWSLHFQWEQL-TPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDT 257
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 258 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYIFPDGNANTYIKNTKRTAE 317
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL LRK L+C SFKWYL +V+P +P D
Sbjct: 318 VWMDEYKQYYYASRPFA-LERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSI 376
Query: 514 F-GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E A+ +Q L PC +Q++ ++
Sbjct: 377 HKGSIR--QRQKCLE---AQKQKDQEISNLKLSPCVKTEGKDAKSQIWAFTYTQQILQ-- 429
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G RQ+WT
Sbjct: 430 EELCLSVITLFPG-APVVLVLCKNGDKRQQWT 460
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 55/137 (40%), Positives = 88/137 (64%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 31 YKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCAD-LPPTSIIIAFHNEARSTLLRTIR 89
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 90 SILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRIRG 143
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 144 ADIAQGTTLTFLDSHCE 160
>gi|291397404|ref|XP_002715111.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Oryctolagus cuniculus]
Length = 608
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 190/428 (44%), Positives = 256/428 (59%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEYV K
Sbjct: 150 LPVASVVICFYNEAFSALLRTVHSVLDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 210 LPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRHTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSEQG-GAEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRLYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ K + + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELRKKLGCQSFK 445
Query: 496 WYLTHVWPHHFLPMDDK------FF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P +P + FF GR+ H QT+KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQVPGPNAKAQQPVFFNRGPKRPKVLRRGRLYHFQTNKCL---VAQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q++ +L+ + +CLDV E + PR+ SG
Sbjct: 503 KGGLVVLKACDYGDP-DQVWFYNEEHELVLHN-LLCLDVSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDKE 605
Q+W + K
Sbjct: 560 QQWAFGKN 567
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNLLISNRLGYHRDVPDTRNAACKDKSYPAD-LPVASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVLWLQPLL 259
>gi|443720284|gb|ELU10082.1| hypothetical protein CAPTEDRAFT_93071, partial [Capitella teleta]
Length = 518
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 176/414 (42%), Positives = 252/414 (60%), Gaps = 26/414 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+++FHNEAWS LLRTV S + RSP ++KEI+LVDD S E L++ L E+ A
Sbjct: 63 LPDTSIIVIFHNEAWSVLLRTVFSCLDRSPGHLVKEIILVDDFSDFEHLQAPLQEF-ADS 121
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R++R+ R GLI+ARLLGA A+G +L FLD+HCECT+GWLE L+ R++++++ VV
Sbjct: 122 QEKVRLVRAKKREGLIRARLLGASVAQGNVLTFLDSHCECTMGWLEPLLDRISQNKSNVV 181
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
PVID+I+D T + Y + G F+W L F W+ + +RK +P ++P MA
Sbjct: 182 TPVIDVINDDTIQYQYSSAKSTSVGGFDWNLQFNWHGIPDHEKK-RRKSDVDPVRSPTMA 240
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+I R YF ++G YD M +WGGENLE+SFR+W CGGS++IAPCSHV H+FRK SPY
Sbjct: 241 GGLFSISREYFEYLGTYDPGMDIWGGENLELSFRIWMCGGSLDIAPCSHVGHIFRKRSPY 300
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
S+ GV+ V+ N R+A VW+DE++++Y+ +FN + D V +R LR++L C S
Sbjct: 301 SWKTGVN-VVKKNSIRLAEVWLDEFSKYYYERFNYDLGDYGD---VSARKALRERLHCKS 356
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK----CVEKPLAKGSMNQASGPASLLPCTH 549
FKWYL +++P F+P + G + V + C++ K + N+A L PC +
Sbjct: 357 FKWYLDNIYPDLFIPGESLASGEVNGVFNSQSQPACLDSAADKKAYNKA---IKLWPCHN 413
Query: 550 LPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTY 602
+ + M + I DE C D V I C Q W Y
Sbjct: 414 MG--GNQYWMLSKSGEIRRDEG-CFDYA------GQFVMIYPCHAMKGNQEWIY 458
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 76/157 (48%), Positives = 107/157 (68%), Gaps = 2/157 (1%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++ YQ N FN SD++ ++RTLPDVR +C+ + + E LP +SI+++FHNEAWS
Sbjct: 20 LKKYEKGYQRNAFNQYASDQMSLHRTLPDVRDKECRDRNYATE-LPDTSIIVIFHNEAWS 78
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV S + RSP ++KEI+LVDD S E L++ L E+ A R++R+ R GLI
Sbjct: 79 VLLRTVFSCLDRSPGHLVKEIILVDDFSDFEHLQAPLQEF-ADSQEKVRLVRAKKREGLI 137
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ARLLGA A+G +L FLD+HCECT+ + E L + S
Sbjct: 138 RARLLGASVAQGNVLTFLDSHCECTMGWLEPLLDRIS 174
>gi|126341064|ref|XP_001364304.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11
[Monodelphis domestica]
Length = 609
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 190/439 (43%), Positives = 253/439 (57%), Gaps = 29/439 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ EC LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S +
Sbjct: 139 NAECKEKSYPSDLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDD 198
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LK LD+YV K L +V+R+ R GLI+ R++GA A GE+LVFLD+HCE WL+
Sbjct: 199 LKGELDKYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQP 258
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ + EDR VVCPVIDIIS T Y S + G FNW LHF+W S+ + +
Sbjct: 259 LLVPIQEDRRTVVCPVIDIISADTLMYSSS-PIVRGGFNWGLHFKWDLVPFSE-LEGPEG 316
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
P K+P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS
Sbjct: 317 AIAPIKSPTMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSR 376
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY P G + Y +L R+A VW+DE+ E YF PE K + + R+
Sbjct: 377 VGHIFRKRRPYGSPEGQDTMTYNSL-RLAHVWLDEYKEQYFSLRPEL-KLKSYGNISERI 434
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKC 525
LRK+L C SFKWYL +++P L P F GR+ H+QT+KC
Sbjct: 435 ALRKKLGCKSFKWYLDNIYPEMQLSGPNAKPQQPVFINRGPKRPKILQRGRLYHLQTNKC 494
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP 585
+ A+G +Q G L C + Q+++ +LI + +CLD+ E + P
Sbjct: 495 LA---AQGHPSQKGGLVVLRVCDYSDP-NQVWIYNEEHELILNN-LLCLDMSETRSSDPP 549
Query: 586 RVRILACSGFNRQRWTYDK 604
R+ SG Q+W K
Sbjct: 550 RLMKCHGSG-GSQQWILGK 567
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R +PD R +CK K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 115 YQKHAFNLLISNRLGYHRDVPDTRNAECKEKSYPSD-LPAASIVICFYNEAFSALLRTVH 173
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LD+YV K L +V+R+ R GLI+ R++G
Sbjct: 174 SVIDRTPAHLLHEIILVDDNSEFDDLKGELDKYVQKYLPGKIQVVRNEKREGLIRGRMIG 233
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE ++ + L
Sbjct: 234 AAHATGEVLVFLDSHCEVNKMWLQPLL 260
>gi|449276238|gb|EMC84873.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Columba livia]
Length = 522
Score = 325 bits (833), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 174/399 (43%), Positives = 250/399 (62%), Gaps = 15/399 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
+N LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S + +LK+ L++
Sbjct: 73 YNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPSVLLKEIILVDDLSDKVYLKTDLEK 132
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+AE+
Sbjct: 133 YISSLKR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVSGWLEPLLERIAEN 191
Query: 312 RTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T +VCPVID I TF Y +++ E G F+W L F+W++ + + +RK T+P ++
Sbjct: 192 ETVIVCPVIDTIDWKTFEYYMQTAEPMIGGFDWRLTFQWHSVPKHERL-RRKSETDPIRS 250
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 251 PTMAGGLFAVSKKYFEYLGTYDTGMDVWGGENLELSFRVWQCGGMLEIHPCSHVGHVFPK 310
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ + + R LR++LK
Sbjct: 311 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPSARKE-NYGDLSERKILRERLK 364
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +++ +P D + G IR +C++ L + N +G L
Sbjct: 365 CKSFNWYLKNIFAELHVPEDRPGWHGAIRSAGIASECLDYALPE---NHPTGAHLSLFGC 421
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDISPR 586
H Q F ++ + +C +VPE+E+ I+ R
Sbjct: 422 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEHEDFITMR 460
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 67/142 (47%), Positives = 98/142 (69%), Gaps = 1/142 (0%)
Query: 58 NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 117
N+ +SD+I ++R + D R CK K +N LP +S++I F+NEAWS LLRT+HSV+ S
Sbjct: 47 NIYLSDKISLHRHIEDNRLSGCKAKSYNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETS 106
Query: 118 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 177
P +LKEI+LVDD S + +LK+ L++Y++ L R+IR+ R GL++ARL+GA A G+
Sbjct: 107 PSVLLKEIILVDDLSDKVYLKTDLEKYISSLKR-VRLIRTNKREGLVRARLIGATFATGD 165
Query: 178 ILVFLDAHCECTLVFNEEFLPK 199
+L FLD HCEC + E L +
Sbjct: 166 VLTFLDCHCECVSGWLEPLLER 187
>gi|300794826|ref|NP_001179661.1| polypeptide N-acetylgalactosaminyltransferase 14 [Bos taurus]
gi|296482443|tpg|DAA24558.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14) [Bos
taurus]
Length = 552
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 186/452 (41%), Positives = 261/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEIL----VFLDAHC-ECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + V D CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCADLPPTSIIIAFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 128 LRTIRSILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII TF Y+ S
Sbjct: 182 SRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + + +R D TEP +TP +AGGLF +D+++F+++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-TPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFYYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYIFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL LRK L+C SFKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYASRPFA-LERPFGNIESRLNLRKNLQCQSFKWYLENVYPELRVPKDSSI 419
Query: 514 F-GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E A+ +Q L PC +Q++ ++
Sbjct: 420 HKGSIR--QRQKCLE---AQKQKDQEISNLKLSPCVKTEGKDAKSQIWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDKRQQWT 503
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 55/137 (40%), Positives = 88/137 (64%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERIASNRVVPDTRLFRCTLLVYCAD-LPPTSIIIAFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 133 SILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|296488074|tpg|DAA30187.1| TPA: polypeptide N-acetylgalactosaminyltransferase 11-like [Bos
taurus]
Length = 605
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 187/428 (43%), Positives = 259/428 (60%), Gaps = 30/428 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEY+ K
Sbjct: 148 LPVASIVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKY 207
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+P R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 208 LPGKIKVIRNPKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDRRTV 267
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 268 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPTMAG 325
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 326 GLFAMNRNYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 385
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ + YF P+ + R+ + R+ELRK+L C SFK
Sbjct: 386 SPEGQDTMTHNSL-RLAHVWLDEYKQ-YFSLRPDL-RTRNYGNISERVELRKKLDCKSFK 442
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ +A+G ++
Sbjct: 443 WYLDNIYPEMQISGPNVKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCL---VAQGRPSE 499
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 500 KGGLVVLKACDYSDP-NQVWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 556
Query: 598 QRWTYDKE 605
Q+WT+ +
Sbjct: 557 QQWTFGRN 564
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 75/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +SIVI F+NEA SALLRTVH
Sbjct: 112 YQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPAD-LPVASIVICFYNEALSALLRTVH 170
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LDEY+ K L +VIR+P R GLI+ R++G
Sbjct: 171 SVLDRTPARLLHEIILVDDDSDFDDLKGELDEYIQKYLPGKIKVIRNPKREGLIRGRMIG 230
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 231 AAHATGEVLVFLDSHCEVNVLWLQPLL 257
>gi|195425498|ref|XP_002061038.1| GK10725 [Drosophila willistoni]
gi|194157123|gb|EDW72024.1| GK10725 [Drosophila willistoni]
Length = 644
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 176/426 (41%), Positives = 256/426 (60%), Gaps = 25/426 (5%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 242
DA C+ T + + LPK+ ++I FHNEAWS LLRTVHSV+ RSP ++ +++LVDD S
Sbjct: 189 DAWCKDTARYLTD-LPKTDVIICFHNEAWSVLLRTVHSVLDRSPEHLIGKVILVDDYSDM 247
Query: 243 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
LK L++Y +++R R GLI+AR+LGA+ A+ +L +LD+HCECT GWLE
Sbjct: 248 PHLKKQLEDYFTAYP-KVQIVRGAKREGLIRARILGAQYAKSPVLTYLDSHCECTEGWLE 306
Query: 303 NLVARVAEDRTRVVCPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIK 360
L+ R+A + T VVCPVID+I+D T + Y S ++ G F+W L F W+ + K
Sbjct: 307 PLLDRIARNSTTVVCPVIDVINDDTLEYHYRDSTGVNVGGFDWNLQFSWHAVPEREK--K 364
Query: 361 RKDFT-EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
R + + EP +P MAGGLF+IDR +F +G YD +WGGENLE+SF+ W CGG++EI
Sbjct: 365 RHNSSAEPVYSPTMAGGLFSIDRDFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIV 424
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
PCSHV H+FRK SPY + GV+ VL N R+A VWMD++A++Y ++ + D V
Sbjct: 425 PCSHVGHIFRKRSPYKWRSGVN-VLRKNSVRLAEVWMDDYAQYY--YHRIGNDKGDWGDV 481
Query: 480 RSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQ 537
R +LR+ L+C SF+WYL +++P F+P D G I+++ C++ P K + +
Sbjct: 482 SDRKKLREDLQCKSFRWYLDNIYPELFIPGDAVAHGEIKNLGYGGRTCMDAPAGKKHLKK 541
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-N 596
+ G PC H Q +++ ++ D CLD + V + AC G
Sbjct: 542 SVG---TYPC-HRQGGNQYWMLSKAGEIRRDDS--CLDYAGKD------VTLYACHGSKG 589
Query: 597 RQRWTY 602
Q WTY
Sbjct: 590 NQFWTY 595
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 67/145 (46%), Positives = 91/145 (62%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLPD R CK LPK+ ++I FHNEAWS LLRTVHSV+
Sbjct: 168 NAFNQYASDLISVHRTLPDPRDAWCKDTARYLTDLPKTDVIICFHNEAWSVLLRTVHSVL 227
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++ +++LVDD S LK L++Y +++R R GLI+AR+LGA+ A
Sbjct: 228 DRSPEHLIGKVILVDDYSDMPHLKKQLEDYFTAYP-KVQIVRGAKREGLIRARILGAQYA 286
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+ +L +LD+HCECT + E L +
Sbjct: 287 KSPVLTYLDSHCECTEGWLEPLLDR 311
>gi|291243600|ref|XP_002741689.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Saccoglossus kowalevskii]
Length = 524
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 154/325 (47%), Positives = 216/325 (66%), Gaps = 6/325 (1%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + LP +SI+I F E+WS L+R+VHSVI+RSP ++KEI+LVDD S+RE+LK
Sbjct: 66 ECQDKLYSDSLPSTSIIICFTEESWSTLVRSVHSVINRSPPQLIKEIILVDDFSSREYLK 125
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
+ LD+Y+ + +++R R GLI+ RL G A+GE+L FLD+H EC +GWLE ++
Sbjct: 126 APLDKYMKRFP-QVKILRLENREGLIRGRLRGTEIAQGEVLTFLDSHIECGVGWLEPMLQ 184
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+ EDR VV P+ID I F+Y S L G F+WE+ F+W + + +RKD T
Sbjct: 185 RIKEDRRNVVAPMIDGIDATKFSYAAS-NLIRGGFSWEMQFKWKPIPDYE-MKRRKDETW 242
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++P MAGGLFAID++YF IG YD +++WG ENLE+SF++W CGG++E+ PCSHV H
Sbjct: 243 PIRSPTMAGGLFAIDKSYFLEIGTYDPGLEIWGAENLELSFKIWMCGGNLEMIPCSHVGH 302
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FR + PY FP G + N RVA VWMDE+ + ++ P+ K D V R ELR
Sbjct: 303 VFRASQPYKFPEGNIKTFMRNNMRVAEVWMDEYKDIFYALKPQL-KGEDYGDVTERKELR 361
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDD 511
+L+CH FKWYL +++P LP+ D
Sbjct: 362 DRLQCHDFKWYLQNIYPE--LPIPD 384
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/143 (47%), Positives = 100/143 (69%), Gaps = 2/143 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN VSD I V R +PDVR +C+ K++++ LP +SI+I F E+WS L+R+VHSVI+R
Sbjct: 45 FNEFVSDMISVERAIPDVRPEECQDKLYSDS-LPSTSIIICFTEESWSTLVRSVHSVINR 103
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP ++KEI+LVDD S+RE+LK+ LD+Y+ + +++R R GLI+ RL G A+G
Sbjct: 104 SPPQLIKEIILVDDFSSREYLKAPLDKYMKRFP-QVKILRLENREGLIRGRLRGTEIAQG 162
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
E+L FLD+H EC + + E L +
Sbjct: 163 EVLTFLDSHIECGVGWLEPMLQR 185
>gi|241998138|ref|XP_002433712.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215495471|gb|EEC05112.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 653
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 194/501 (38%), Positives = 273/501 (54%), Gaps = 51/501 (10%)
Query: 33 LGDSVDGGLHSN------LSDAQQL---------------YQINRFNLLVSDRIPVNRTL 71
LG S DGG ++DA +L Y + FNLL+S+R+ R+L
Sbjct: 28 LGGSADGGPEKAADGSVVVADASRLGVIRSPKDQQVKVAGYHQHAFNLLISNRLGFYRSL 87
Query: 72 PDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDA 131
PD R P ++ L + ++V W L +V P ++V DA
Sbjct: 88 PDTRNP-----LYRPFLLGIGATLLV-----WCLLGGSVDG----GPEKAADGSVVVADA 133
Query: 132 STREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLV 191
S ++S D+ V L+ + LG ++ + L C
Sbjct: 134 SRLGVIRSPKDQQVKVAGYHQHAFN------LLISNRLGFYRSLPDTRNPL-----CRSE 182
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
+ LP +S+V+ F+NEAWS LLRTVH+V+ R+PR +L E++LVDD ST+ L L E
Sbjct: 183 EHGAELPTASVVVCFYNEAWSTLLRTVHTVLGRTPRHLLHEVILVDDNSTQVDLGPQLAE 242
Query: 252 YVA-KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAE 310
YV+ +L R+IR+ R GLI+AR+ GAR A GE+LVFLD+HCE +GWLE L+ R+
Sbjct: 243 YVSSQLPSHVRLIRTRDREGLIRARMFGARNASGEVLVFLDSHCEVNVGWLEPLLERIRA 302
Query: 311 DRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+R V CP+IDII+ TF Y S + G FNW LHF+W + + A K + P +
Sbjct: 303 NRATVTCPIIDIINADTFEYTAS-PIVRGGFNWGLHFKWESPPAGLAR-KGRGAIAPIPS 360
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+DR +F +G YD+ M +WGGENLE+SFR+W CGG +EI PCS V H+FR+
Sbjct: 361 PTMAGGLFAMDRKFFHRLGEYDDGMDIWGGENLEISFRIWMCGGQLEIIPCSRVGHVFRR 420
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY P G + L N RVA VWMD++ ++YF+ + + + SR+ LRK+L
Sbjct: 421 RRPYGSPNG-EDTLTKNSLRVAHVWMDDYKKYYFQTRSDVVG-KPYGDITSRVALRKRLG 478
Query: 491 CHSFKWYLTHVWPHHFLPMDD 511
C SF WY+ V+P P D
Sbjct: 479 CRSFDWYMKTVYPELQPPSKD 499
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/198 (43%), Positives = 121/198 (61%), Gaps = 23/198 (11%)
Query: 24 GVHVVVGHYLGDSVDGGLHSN------LSDAQQL---------------YQINRFNLLVS 62
G ++V LG SVDGG ++DA +L Y + FNLL+S
Sbjct: 103 GATLLVWCLLGGSVDGGPEKAADGSVVVADASRLGVIRSPKDQQVKVAGYHQHAFNLLIS 162
Query: 63 DRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSML 122
+R+ R+LPD R P C+++ E LP +S+V+ F+NEAWS LLRTVH+V+ R+PR +L
Sbjct: 163 NRLGFYRSLPDTRNPLCRSEEHGAE-LPTASVVVCFYNEAWSTLLRTVHTVLGRTPRHLL 221
Query: 123 KEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVF 181
E++LVDD ST+ L L EYV ++L R+IR+ R GLI+AR+ GAR A GE+LVF
Sbjct: 222 HEVILVDDNSTQVDLGPQLAEYVSSQLPSHVRLIRTRDREGLIRARMFGARNASGEVLVF 281
Query: 182 LDAHCECTLVFNEEFLPK 199
LD+HCE + + E L +
Sbjct: 282 LDSHCEVNVGWLEPLLER 299
>gi|327274386|ref|XP_003221958.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Anolis carolinensis]
Length = 608
Score = 324 bits (831), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 184/428 (42%), Positives = 252/428 (58%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SI+I F+NEA+SALLRTVHSV+ R+P +L EI+LVDD S LK LD Y+ K
Sbjct: 150 LPSASIIICFYNEAFSALLRTVHSVLDRTPSHLLHEIILVDDNSELVDLKEDLDVYLRKN 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+ + E R V
Sbjct: 210 LPNNVKLVRNGKREGLIRGRMIGASHATGKVLVFLDSHCEVNELWLQPLLTPIRESRKTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLTYSSSPVVR-GGFNWGLHFKWDLVPLSE-LEGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMDREYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLLIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ + YF PE + R+ + R+ELRK+L C SFK
Sbjct: 388 SPGGQDTMAHNSL-RLAHVWMDEYKDQYFALRPEL-RMRNYGNITDRVELRKKLNCKSFK 445
Query: 496 WYLTHVWPHHFLPMDDK------FF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + + FF GR+RH+Q+ KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQISGSNAKVQPPLFFNKGQKRPKTLQRGRLRHLQSDKCL---VAQGHPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G + C + Q+++ +LI + +CLDV E PR+ SG
Sbjct: 503 KGGLVVVRECDY-SDQNQVWLYNEDHELILNN-LLCLDVSETRTSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDKE 605
Q+W K
Sbjct: 560 QQWVLGKN 567
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 101/147 (68%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R KCK K + + LP +SI+I F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRDAKCKGKKYPLD-LPSASIIICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S LK LD Y+ K L +++R+ R GLI+ R++G
Sbjct: 173 SVLDRTPSHLLHEIILVDDNSELVDLKEDLDVYLRKNLPNNVKLVRNGKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A G++LVFLD+HCE ++ + L
Sbjct: 233 ASHATGKVLVFLDSHCEVNELWLQPLL 259
>gi|157114750|ref|XP_001652403.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108883556|gb|EAT47781.1| AAEL001121-PA [Aedes aegypti]
Length = 647
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 173/411 (42%), Positives = 246/411 (59%), Gaps = 22/411 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEAWS LLRTVHSV+ RSP ++KE++LVDD S + L++Y
Sbjct: 205 LPATSVIICFHNEAWSVLLRTVHSVLDRSPEHLVKEVILVDDFSDMPHTQKQLEDYFEAY 264
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR+P R GLI+ARLLGAR A +L +LD+HCECT GWLE L+ R+A + T VV
Sbjct: 265 PR-VKIIRAPKREGLIRARLLGARYATAPVLTYLDSHCECTTGWLEPLLDRIARNSTTVV 323
Query: 317 CPVIDIISDVT--FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
CPVID+I D T + Y S ++ G F+W L F W+ + + K EP +P MA
Sbjct: 324 CPVIDVIDDNTMEYHYRDSGGVNVGGFDWNLQFNWHAVPDREKK-RHKSTAEPVFSPTMA 382
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+ +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SPY
Sbjct: 383 GGLFSIDKEFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPY 442
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ GV+ V+ N R+A VW+DE+A++Y++ + D V R +LR+ L C F
Sbjct: 443 KWRTGVN-VIKRNSVRLAEVWLDEYAKYYYQR--IGNDKGDYGDVSERKQLRENLGCKPF 499
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
+WYL +++P F+P + G +R++ C++ P K ++ + P L PC H
Sbjct: 500 RWYLDNIFPELFIPGEAVASGEVRNMGYGNRTCLDAPGGKKNLRK---PVGLYPC-HNQG 555
Query: 553 LTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTY 602
Q +++ T I DE+ CLD + V + C G Q W Y
Sbjct: 556 GNQYWMLS-KTGEIRRDEA-CLDYAGQD------VILYPCHGSKGNQYWNY 598
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/145 (47%), Positives = 92/145 (63%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN +D I + R+LPD R P CK LP +S++I FHNEAWS LLRTVHSV+
Sbjct: 171 NAFNQYAADLISIRRSLPDPRDPWCKEPGRYGTDLPATSVIICFHNEAWSVLLRTVHSVL 230
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
RSP ++KE++LVDD S + L++Y ++IR+P R GLI+ARLLGAR A
Sbjct: 231 DRSPEHLVKEVILVDDFSDMPHTQKQLEDYFEAYPR-VKIIRAPKREGLIRARLLGARYA 289
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+L +LD+HCECT + E L +
Sbjct: 290 TAPVLTYLDSHCECTTGWLEPLLDR 314
>gi|345488662|ref|XP_003425959.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Nasonia vitripennis]
Length = 572
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 180/416 (43%), Positives = 242/416 (58%), Gaps = 19/416 (4%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C L + LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S
Sbjct: 129 CRLKQWRQDLPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGD 188
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L + K+ RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ R
Sbjct: 189 ELSR-IHKV----RVIRNEKREGLMRSRVRGADAATANVLTFLDSHCECNADWLEPLLER 243
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VAED +RVVCPVID+IS F Y+ + G F+W L F+W S+ ++KD T+
Sbjct: 244 VAEDPSRVVCPVIDVISMDNFQYIGASADLRGGFDWSLVFKWEYLSQSERQARQKDPTQA 303
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP +AGGLF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+
Sbjct: 304 IRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHV 363
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK PYSFPGG V N R A VWMD++ +FY+ P A + ++ R+EL++
Sbjct: 364 FRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKR 422
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLP 546
+L C F WYL HV+P +P + G Q C++ SM G L P
Sbjct: 423 KLHCKPFSWYLKHVYPELIIPTSEGGPGGSLK-QGTACLD------SMGHLLDGNVGLYP 475
Query: 547 CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
C H Q + M D + +CL +P Y S ++I C G Q+W +
Sbjct: 476 C-HDTGGNQEWGMT--NDGLIKHHDLCLTLPVYAKGTSLLMQI--CDGSENQKWRH 526
Score = 118 bits (296), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/161 (42%), Positives = 96/161 (59%), Gaps = 10/161 (6%)
Query: 39 GGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVF 98
GGL + + Y N+FN SD +P NR +PD R C+ K + ++ LP +S++I F
Sbjct: 93 GGLRA----GEDPYARNKFNQEASDGLPSNRDIPDTRSAMCRLKQWRQD-LPPTSVIITF 147
Query: 99 HNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSP 158
HNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+ RVIR+
Sbjct: 148 HNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGDELSR-IHKV----RVIRNE 202
Query: 159 GRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
R GL+++R+ GA A +L FLD+HCEC + E L +
Sbjct: 203 KREGLMRSRVRGADAATANVLTFLDSHCECNADWLEPLLER 243
>gi|345484988|ref|XP_001605337.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Nasonia vitripennis]
Length = 646
Score = 324 bits (831), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 175/415 (42%), Positives = 250/415 (60%), Gaps = 27/415 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S LK L++Y+ +
Sbjct: 169 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDYSDMPHLKRQLEDYM--M 226
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P +++R+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A ++T V
Sbjct: 227 NYPKVKILRASKREGLIRARLLGAAMAKAPVLTYLDSHCECTEGWLEPLLDRIARNQTTV 286
Query: 316 VCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VCPVID+I D T Y S ++ G F+W L F W+ + + K+ EP +P M
Sbjct: 287 VCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEPVWSPTM 345
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLFAIDR +F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 346 AGGLFAIDRLFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 405
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y + GV+ VL N R++ VW+DE+A++Y++ + + V R LRK L C S
Sbjct: 406 YKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQR--IGHDKGNYGDVSDRKALRKNLGCKS 462
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHLP 551
FKWYL +++P F+P + G +R++ + C++ P K +++ PA L PC H
Sbjct: 463 FKWYLDNIYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGLYPC-HRQ 518
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFNRQRWTYDK 604
Q+ + +C+D P D+ V C G N Q W K
Sbjct: 519 GGNQIRHLA---------SRLCIDSPGNPEDLHQAVGFYECHNQGGN-QYWMLSK 563
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 98/144 (68%), Gaps = 3/144 (2%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+R+LPD R P CK ++ LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 135 NAFNQYASDLISVHRSLPDPRDPWCKEPGRYQKDLPPTAVIICFHNEAWSVLLRTVHSVL 194
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S LK L++Y+ ++ P +++R+ R GLI+ARLLGA
Sbjct: 195 DRSPDHLIQEIILVDDYSDMPHLKRQLEDYM--MNYPKVKILRASKREGLIRARLLGAAM 252
Query: 174 AEGEILVFLDAHCECTLVFNEEFL 197
A+ +L +LD+HCECT + E L
Sbjct: 253 AKAPVLTYLDSHCECTEGWLEPLL 276
>gi|156397428|ref|XP_001637893.1| predicted protein [Nematostella vectensis]
gi|156225009|gb|EDO45830.1| predicted protein [Nematostella vectensis]
Length = 398
Score = 324 bits (831), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 170/357 (47%), Positives = 238/357 (66%), Gaps = 17/357 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYV 253
LP +S++I+FHNEAWS LLRTVHSV++RSP +L+EI+LVDD S T L S L+ Y+
Sbjct: 46 LPTASVIIIFHNEAWSTLLRTVHSVLARSPPYLLREIVLVDDHSRLDTYGHLGSKLESYI 105
Query: 254 AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
++ + ++IR+P R GLI+ARL+GA+QA+GE+LVFLD+HCE LGWLE L+AR+ E+R+
Sbjct: 106 SQFT-KVQLIRAPKREGLIRARLIGAKQAKGEVLVFLDSHCEANLGWLEPLLARIGENRS 164
Query: 314 RVVCPVIDIISDVTFAYVRSFELH-WGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
VV P I++I TF Y + G FNWEL F+W + +RK ++P ++P
Sbjct: 165 IVVTPDIEVIDLRTFGYTHEHGANNRGIFNWELTFKWRGIPEYERR-RRKSDSDPIRSPT 223
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLFAID++YF+ IG+YD EM WGGEN+E+SFR+W CGGS+EI PCS V H+FR++
Sbjct: 224 MAGGLFAIDKSYFYEIGSYDTEMSFWGGENVEISFRIWMCGGSLEIIPCSKVGHVFRESQ 283
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
PY G + N R+A VWMD++ + ++ P+ K +D V R LR++L C
Sbjct: 284 PYKIGEGAIDR---NNMRLAEVWMDDYKKIFYAMRPQL-KGKDYGDVSGRKALRERLMCK 339
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGR--IRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
SFKWYL +V +P D GR IR++ T+ C++ LAK N+A G + C
Sbjct: 340 SFKWYLDNVISELAIP-DLYPIGRGEIRNLGTNTCLDT-LAK---NEAGGEPGMYMC 391
Score = 141 bits (356), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 71/154 (46%), Positives = 111/154 (72%), Gaps = 5/154 (3%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ +++ +FN L S ++ ++R++PD R C + + + LP +S++I+FHNEAWS LLR
Sbjct: 7 KERFRLYQFNELASSKVALDRSIPDNRPQSCLSLSYPTK-LPTASVIIIFHNEAWSTLLR 65
Query: 109 TVHSVISRSPRSMLKEILLVDDAS---TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
TVHSV++RSP +L+EI+LVDD S T L S L+ Y+++ + ++IR+P R GLI+
Sbjct: 66 TVHSVLARSPPYLLREIVLVDDHSRLDTYGHLGSKLESYISQFT-KVQLIRAPKREGLIR 124
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
ARL+GA+QA+GE+LVFLD+HCE L + E L +
Sbjct: 125 ARLIGAKQAKGEVLVFLDSHCEANLGWLEPLLAR 158
>gi|383847543|ref|XP_003699412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Megachile rotundata]
Length = 571
Score = 324 bits (831), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 177/407 (43%), Positives = 239/407 (58%), Gaps = 19/407 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 137 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 195
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED TRVV
Sbjct: 196 ----RVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAEDPTRVV 251
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W S+ + ++KD T+ +TP +AGG
Sbjct: 252 CPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQSERLARQKDPTQAIRTPMIAGG 311
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 312 LFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYSF 371
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ +FY+ P A + ++ R+EL+++L C F W
Sbjct: 372 PGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKRKLHCKPFSW 430
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLPCTHLPVLTQ 555
YL +V+P +P + G + P SM G L PC H Q
Sbjct: 431 YLKNVYPELVIPTSEGGPG-------GSLKQGPACLDSMGHLLDGNVGLYPC-HDTGGNQ 482
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ L D + +CL +P Y + ++I C G Q+W +
Sbjct: 483 EW--GLTKDGLIKHHDLCLTLPVYAKGTTLLMQI--CDGSENQKWRH 525
>gi|449507774|ref|XP_004186276.1| PREDICTED: LOW QUALITY PROTEIN:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 13 (GalNAc-T13),
partial [Taeniopygia guttata]
Length = 402
Score = 324 bits (830), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 169/368 (45%), Positives = 234/368 (63%), Gaps = 18/368 (4%)
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
FLK+SL+ YV KL VP R++R R GLI+ARL GA ++G+++ FLDAHCECTLGWLE
Sbjct: 1 FLKASLENYVKKLGVPIRILRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEP 60
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L++R+ EDR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D
Sbjct: 61 LLSRIKEDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMERRKGD 120
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP MAGGLF+IDR+YF IG YD M +WGGENLEMSFR+WQCGGS+EI CSH
Sbjct: 121 RTLPVRTPTMAGGLFSIDRSYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSH 180
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEK-QRDKQAVRSR 482
V H+FRKA+PY+FPGG V+ N R+A VWMD++ +F++ +P A + DK+ +
Sbjct: 181 VGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDDFKDFFYIISPGAPRFVWDKRILYGI 240
Query: 483 L----ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQA 538
+ L+ ++KC F WYL +V+P +P G IR+V+T++C++ K N+
Sbjct: 241 VPWCGTLKIRMKCQPFSWYLENVYPDSQIPRRYYSLGEIRNVETNQCLDNMGRK--ENEK 298
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NR 597
G C H Q+F + I TD+ +CLDV ++ V +L C
Sbjct: 299 VG---FFNC-HGMGGNQVFSYTADKE-IRTDD-LCLDVSR----LNGPVLMLKCHHLRGN 348
Query: 598 QRWTYDKE 605
Q W YD E
Sbjct: 349 QLWEYDAE 356
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/64 (54%), Positives = 47/64 (73%)
Query: 136 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEE 195
FLK+SL+ YV KL VP R++R R GLI+ARL GA ++G+++ FLDAHCECTL + E
Sbjct: 1 FLKASLENYVKKLGVPIRILRMEQRSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEP 60
Query: 196 FLPK 199
L +
Sbjct: 61 LLSR 64
>gi|417403257|gb|JAA48441.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 608
Score = 324 bits (830), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 186/430 (43%), Positives = 257/430 (59%), Gaps = 29/430 (6%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV- 253
E LP +S+VI F+NEA SALLRTVHSV+ R+P +L+E++LVDD S + LK LDE+V
Sbjct: 148 EDLPVASVVICFYNEALSALLRTVHSVLDRTPAQLLREVILVDDDSDFDDLKGQLDEFVQ 207
Query: 254 AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
+L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE WL+ L+A + EDR
Sbjct: 208 TQLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNTMWLQPLLATIQEDRR 267
Query: 314 RVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VVCPVIDIIS T AY S + G FNW LHF+W S+ + T P K+P M
Sbjct: 268 TVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLIPPSE-LGGPGGATAPIKSPTM 325
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK P
Sbjct: 326 AGGLFAMNRDYFDELGRYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRP 385
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y P G + + +L R+A VW+DE+ E YF P+ + R + R+ELR++L C S
Sbjct: 386 YGSPEGRDTMAHNSL-RLAHVWLDEYKEQYFSLRPDL-RTRSYGNISERVELRRKLGCKS 443
Query: 494 FKWYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCVEKPLAKGSM 535
FKWYL +++P + P F GR+ H+QT KC+ +A+G
Sbjct: 444 FKWYLDNIYPEMQISGPNAKPQQPLFINRGPKRPKVLQRGRLYHLQTGKCL---VAQGRP 500
Query: 536 NQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF 595
+Q G L C + Q+++ +L+ ++ +CLD+ E + PR+ SG
Sbjct: 501 SQKGGLVVLKACDYSDP-NQVWIYNEEHELVLSN-LLCLDMSETRSSDPPRLMKCHGSG- 557
Query: 596 NRQRWTYDKE 605
Q+WT+ K
Sbjct: 558 GSQQWTFGKN 567
Score = 135 bits (339), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R +PD R CK + + E+ LP +S+VI F+NEA SALLRTVH
Sbjct: 114 YQKHAFNLLISNRLGYHRDVPDTRSAACKDETYPED-LPVASVVICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L+E++LVDD S + LK LDE+V +L +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPAQLLREVILVDDDSDFDDLKGQLDEFVQTQLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE ++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNTMWLQPLL 259
>gi|350582569|ref|XP_003481303.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Sus scrofa]
Length = 552
Score = 324 bits (830), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 185/452 (40%), Positives = 263/452 (58%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEIL----VFLDAHC-ECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + V D CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERVASNRVVPDTRLFRCTLLVYCADLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRTV S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 128 LRTVRSILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII TF Y+ S
Sbjct: 182 SRIRGADAAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIIHLDTFDYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + + +R D TEP +TP +AGGLF +D+++F ++G YD
Sbjct: 242 ATELRGGFDWSLHFQWEQL-TPEQKARRLDPTEPIRTPIIAGGLFVMDKSWFDYLGKYDT 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL+LR+ L+C SFKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYASRPFA-LERPFGNIESRLDLRRNLQCQSFKWYLENVYPELRIPKDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ +Q L PC + +Q++ ++
Sbjct: 420 QKGNIR--QRQKCLE---SQKQKDQEISNLRLSPCVKIEGKDAKSQIWAFTYTQQILQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 473 EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 503
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/137 (40%), Positives = 88/137 (64%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+R+ NR +PD R +C V+ + LP +SI+I FHNEA S LLRTV
Sbjct: 74 YKLYAFNQRESERVASNRVVPDTRLFRCTLLVYCAD-LPPTSIIITFHNEARSTLLRTVR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
S+++R+P ++++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 133 SILNRTPMNLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADAAQGTTLTFLDSHCE 203
>gi|405959954|gb|EKC25926.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Crassostrea gigas]
Length = 569
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 196/533 (36%), Positives = 290/533 (54%), Gaps = 50/533 (9%)
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT----RVIRSPG 159
+A++ +V S R M K L DAS E L S+L E+ P+ +++PG
Sbjct: 6 AAVIDSVRSYWREKKRVMRKSDL---DASNIEALGSALSEFDEDDWKPSARAKTDLKAPG 62
Query: 160 RVG----LIKARLLGARQAEGEI-------------LVFLDAHC------ECTLVFNEEF 196
+G K++L + E E + L EC + E
Sbjct: 63 ELGSPYIFNKSQLTSKEKLEYETGWKKNNFNEFASNRISLQRSLKDPRDKECHNLTYSEN 122
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP+ SI++ FHNEAWS L+R+V+S+++R+P S+LKE++LVDD S+ E LK LD+++ +
Sbjct: 123 LPEVSIIVTFHNEAWSVLIRSVYSILNRTPDSLLKEVILVDDFSSLEHLKEPLDQFMEQF 182
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++R+ R GLI+ARL G R+A G++LVFLD+H EC GW E L+ +A + + V+
Sbjct: 183 Q-KVKIVRATERQGLIRARLRGYREAVGDVLVFLDSHIECAEGWFEPLIDPIARNWSTVM 241
Query: 317 CPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
PVID+I TF Y + + G F+W L F W+ ++ ++ P ++P MA
Sbjct: 242 TPVIDVIDKETFQYGFQAASATNVGGFDWSLMFTWHFVPETEQKRRQNKHYLPVRSPTMA 301
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAI R YF HIG YDE M +WGGENLE+SFR+W CGG++ APCSHV H+FR PY
Sbjct: 302 GGLFAISRKYFEHIGTYDEGMDIWGGENLELSFRIWMCGGTLLTAPCSHVGHVFRHTPPY 361
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFK-FNPEAEKQRDKQAVRSRLELRKQLKCHS 493
SF G V+ NL R+A VW+D++ +Y++ N D V +R LR L+CHS
Sbjct: 362 SF-GPKKNVVKNNLVRMAEVWLDDFKYYYYQHINYTLGNYGD---VSARRALRANLQCHS 417
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVL 553
F WYL +V+P +P + + G IR C+E P G +N+ P ++ C H
Sbjct: 418 FDWYLVNVYPELLIPAEALYSGEIRSKAEPLCLESPYRFGKINK---PLTVFHC-HGQKG 473
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
Q ++ + I D C+D D V + +C G Q+WTY ++
Sbjct: 474 NQYWLYTQKGE-IRHDLYGCMD------DAGSTVYVNSCHGLGGNQKWTYRED 519
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 110/173 (63%), Gaps = 9/173 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FN S+RI + R+L D R +C ++E LP+ SI++ FHNEAWS L+R+V+
Sbjct: 87 WKKNNFNEFASNRISLQRSLKDPRDKECHNLTYSEN-LPEVSIIVTFHNEAWSVLIRSVY 145
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+++R+P S+LKE++LVDD S+ E LK LD+++ + +++R+ R GLI+ARL G
Sbjct: 146 SILNRTPDSLLKEVILVDDFSSLEHLKEPLDQFMEQFQ-KVKIVRATERQGLIRARLRGY 204
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 224
R+A G++LVFLD+H EC + E + + WS ++ V VI +
Sbjct: 205 REAVGDVLVFLDSHIECAEGWFEPLIDPIA-------RNWSTVMTPVIDVIDK 250
>gi|3047191|gb|AAC13671.1| GLY5a [Caenorhabditis elegans]
Length = 623
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 176/433 (40%), Positives = 259/433 (59%), Gaps = 34/433 (7%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S++I FHNEAWS LLRTVHSV+ R+P +L+E++LVDD S
Sbjct: 161 IDAECK-TEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSD 218
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 219 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWM 278
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 279 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 335
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 336 -RKNRTRPIDPVRSPTMAGGLFSIDKEYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 394
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 395 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERIN---NQLGD 450
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKG 533
+ SR +LR+ L C SFKWYL +++P F+P + G +R+ + +C++ + G
Sbjct: 451 FGDISSRKKLREDLGCKSFKWYLDNIYPELFVPGESVAKGEMRNAGGKNRQCIDYKPSGG 510
Query: 534 SMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS 593
+ C H Q +++ + I DES C+D V + C
Sbjct: 511 KT------VGMYQC-HNQGGNQYWMLSKDGE-IRRDES-CVDY------AGSDVMVFPCH 555
Query: 594 GF-NRQRWTYDKE 605
G Q W Y+ +
Sbjct: 556 GMKGNQEWRYNHD 568
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 98/145 (67%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S++I FHNEAWS LLRTVHSV+
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVL 199
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+E++LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 200 ERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVA 259
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
GE+L +LD+HCEC + E L +
Sbjct: 260 TGEVLTYLDSHCECMEGWMEPLLDR 284
>gi|71993513|ref|NP_001022851.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
gi|14530626|emb|CAC42368.1| Protein GLY-5, isoform b [Caenorhabditis elegans]
Length = 623
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 176/433 (40%), Positives = 259/433 (59%), Gaps = 34/433 (7%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S++I FHNEAWS LLRTVHSV+ R+P +L+E++LVDD S
Sbjct: 161 IDAECK-TEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVLERTPDHLLEEVVLVDDFSD 218
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 219 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWM 278
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 279 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 335
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 336 -RKNRTRPIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 394
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 395 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERIN---NQLGD 450
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKG 533
+ SR +LR+ L C SFKWYL +++P F+P + G +R+ + +C++ + G
Sbjct: 451 FGDISSRKKLREDLGCKSFKWYLDNIYPELFVPGESVAKGEMRNAGGKNRQCIDYKPSGG 510
Query: 534 SMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS 593
+ C H Q +++ + I DES C+D V + C
Sbjct: 511 KT------VGMYQC-HNQGGNQYWMLSKDGE-IRRDES-CVDY------AGSDVMVFPCH 555
Query: 594 GF-NRQRWTYDKE 605
G Q W Y+ +
Sbjct: 556 GMKGNQEWRYNHD 568
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 98/145 (67%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S++I FHNEAWS LLRTVHSV+
Sbjct: 141 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIICFHNEAWSVLLRTVHSVL 199
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+E++LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 200 ERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAVA 259
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
GE+L +LD+HCEC + E L +
Sbjct: 260 TGEVLTYLDSHCECMEGWMEPLLDR 284
>gi|395828928|ref|XP_003787614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Otolemur garnettii]
Length = 678
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 175/395 (44%), Positives = 238/395 (60%), Gaps = 20/395 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E A H CTL+ LP +SI+I FHNEA S L
Sbjct: 68 RVGEDPYKLYAFNQRESERTPSNRAVPDTRHSRCTLLVYYTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ R+ ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRIRGADVAQGTTLTFLDSHCEVNRDWLQPLLHRIKEDYTRVVCPVIDIINLDTFTYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDM 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SRL+LRK L+C SFKWYL +++P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIESRLDLRKNLRCQSFKWYLENIYPELSVPKDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
G IR Q KC+E ++ N+ + L PC
Sbjct: 420 QKGNIR--QRQKCLE---SQRQNNRDAPNLKLSPC 449
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+R P NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERTPSNRAVPDTRHSRCTLLVYYTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADVAQGTTLTFLDSHCE 203
>gi|440896822|gb|ELR48646.1| Polypeptide N-acetylgalactosaminyltransferase 4, partial [Bos
grunniens mutus]
Length = 566
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 175/397 (44%), Positives = 244/397 (61%), Gaps = 15/397 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 117 FNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 176
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
YV+ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ +D
Sbjct: 177 YVSNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIRKD 235
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T V+CPVID I TF Y+++ E G F+W L F+W++ + +RK EPF++
Sbjct: 236 ETVVICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIEPFRS 294
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 295 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 354
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++L+
Sbjct: 355 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRERLR 408
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ +P D + G IR + + +C++ N S SL C
Sbjct: 409 CKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIGISSECLD--YNAPDNNPTSANLSLFGC- 465
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
H Q F ++ + +C +VPE + +
Sbjct: 466 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPELKKHVG 502
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 105/157 (66%), Gaps = 1/157 (0%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
S L ++L + N+ +SDRI ++R + D R +CK+K FN LP +S++I F+NEA
Sbjct: 76 SELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRLPTTSVIIAFYNEA 135
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ YV+ L R+IR+ R G
Sbjct: 136 WSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYVSNLDR-VRLIRTNKREG 194
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L++ARL+GA A G++L FLD HCEC + E L +
Sbjct: 195 LVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 231
>gi|312083982|ref|XP_003144087.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
gi|307760750|gb|EFO19984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Loa loa]
Length = 682
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 185/415 (44%), Positives = 249/415 (60%), Gaps = 24/415 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEAWS LLRTVHSV+ R+P +LKEI+LVDD S + LK L++Y+++
Sbjct: 230 LPDTSVIICFHNEAWSVLLRTVHSVLERTPDHLLKEIILVDDFSDFDHLKKPLEDYMSQF 289
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR R+GLI+ARL GA A G++L +LD+HCEC WLE L+ R+A++ T VV
Sbjct: 290 G-KVRIIRLENRMGLIRARLKGASVATGKVLTYLDSHCECMNRWLEPLLDRIAQNSTNVV 348
Query: 317 CPVIDIISDVTFAYVRSFE--LHWGAFNWELHFRWYTYGSSD--AIIKRKDFTEPFKTPA 372
PVID I+ T Y S L G FNW L F W+ D A+ R D P +P
Sbjct: 349 TPVIDTINLETLQYHLSSHRRLSVGGFNWGLVFNWHILPDRDYQAMKSRID---PIPSPT 405
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLF+IDR YF +G YD +WG ENLE+SF++W CGG +E+ PCSHV H+FRK S
Sbjct: 406 MAGGLFSIDRGYFEKLGGYDPGFDIWGSENLEISFKIWMCGGRLEVVPCSHVGHIFRKKS 465
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
PY + G++ VL N R+A VW+D++ E Y +N K D V R +LR+ LKCH
Sbjct: 466 PYKWRKGIN-VLQRNNIRLAEVWLDDYKEIY--YNRINHKLVDFGDVSERKKLREHLKCH 522
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCTHLP 551
SFKWYL +V+P FLP + G IR++ K CV+ + + ++N + P PC HL
Sbjct: 523 SFKWYLDNVFPDLFLPSEAIASGEIRNLGNQKYCVDHDVGRNAVNDSVIP---YPC-HLQ 578
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
Q +++ + I DE C+D Y SP C G Q W Y+ E
Sbjct: 579 GGNQFWMLSKSGE-IRRDE-YCID---YTGRGSPVT--YECHGSKGNQLWDYNHE 626
Score = 132 bits (333), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 65/137 (47%), Positives = 95/137 (69%), Gaps = 2/137 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FN VS+ I ++R+LP+ C+ + + LP +S++I FHNEAWS LLRTVH
Sbjct: 194 FKRNSFNEYVSNMISIHRSLPNNTDELCQKASYRND-LPDTSVIICFHNEAWSVLLRTVH 252
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV+ R+P +LKEI+LVDD S + LK L++Y+++ R+IR R+GLI+ARL GA
Sbjct: 253 SVLERTPDHLLKEIILVDDFSDFDHLKKPLEDYMSQFG-KVRIIRLENRMGLIRARLKGA 311
Query: 172 RQAEGEILVFLDAHCEC 188
A G++L +LD+HCEC
Sbjct: 312 SVATGKVLTYLDSHCEC 328
>gi|34042906|gb|AAQ56699.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 601
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 182/433 (42%), Positives = 254/433 (58%), Gaps = 33/433 (7%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ E LP +S+VI+F NE +S LLRTVHS +S LKEI+LVDD S L + LD
Sbjct: 143 FDSESLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 252 YVAKLSVPT---RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
YV + +P+ ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 203 YV-RTRIPSGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRI 261
Query: 309 AEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 262 KESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRREC 319
Query: 365 TE-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+ P +P MAGGLFAIDR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE
Sbjct: 320 KQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 379
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D
Sbjct: 380 PCSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGD 437
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQ 537
V R+ LRK+L+C SF+WYL +++P F+P D + +G++ V ++ C++ L N+
Sbjct: 438 VTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVHAVNSNICLDDLLQN---NE 494
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP--RVRILAC--- 592
A L PC + +Q+F + +E C V E SP RV ++ C
Sbjct: 495 KPYNAGLYPCGKVLQKSQLF--SFTNTNVLRNELSCATVQHSE---SPPYRVVMVPCMEN 549
Query: 593 SGFNRQRWTYDKE 605
FN Q W Y+ +
Sbjct: 550 DEFNEQ-WRYEHQ 561
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 97/155 (62%), Gaps = 4/155 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
++Y+ N +S+++ NR++ D R P C + F+ E LP +S+VI+F NE +S LL
Sbjct: 107 GDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQRFDSESLPTASVVIIFFNEPYSVLL 166
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT---RVIRSPGRVGLI 164
RTVHS +S LKEI+LVDD S L + LD YV + +P+ ++R R+GLI
Sbjct: 167 RTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDYYV-RTRIPSGKVTILRLKNRLGLI 225
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 226 RARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQR 260
>gi|443721252|gb|ELU10645.1| hypothetical protein CAPTEDRAFT_228331 [Capitella teleta]
Length = 512
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 181/417 (43%), Positives = 247/417 (59%), Gaps = 18/417 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C+ + + LPK+S++I FHNEA S LLRTV SV++RSP +++EI+LVDD S +F
Sbjct: 64 HSRCSALRWRKNLPKTSVIITFHNEARSTLLRTVVSVLNRSPEELIQEIILVDDFS--DF 121
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
+ E +AK+ +V+R+ R GLI++R+ GA A +L FLD+HCEC + WLE L
Sbjct: 122 PEDG--EELAKID-KVKVLRNDQRQGLIRSRIRGADAAVAPVLTFLDSHCECNVHWLEPL 178
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ RVAED TRVV P+ID+I+ F YV + G FNW L F+W + + +R +
Sbjct: 179 LERVAEDPTRVVSPIIDVINMDNFQYVGASSNLKGGFNWNLVFKWDSLTPEEVTQRRGNP 238
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P KTP +AGGLF ID+ F IG YD M VWGGENLE+SFRVWQC GS+EI PCS V
Sbjct: 239 TAPIKTPMIAGGLFVIDKERFEEIGKYDMMMDVWGGENLEISFRVWQCHGSLEIIPCSRV 298
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY+FPGG V N R A VWMDE+ +Y+ P A K + +RSRLE
Sbjct: 299 GHVFRKQHPYTFPGGSGNVFARNTRRAAEVWMDEYKSYYYAEVPSA-KSVNFGDIRSRLE 357
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LR++LKC F W+L +V+P +P + D FG I+ Q CV+ N G
Sbjct: 358 LREKLKCRPFSWFLQNVYPSLIVPSEQDVQFGYIQ--QGSMCVDT-----LSNALGGKVG 410
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
+ C H Q +V+ + D +C+ +P ++ R+ A +G Q W
Sbjct: 411 MFQC-HNTGGNQEWVLTKNQKIKHLD--LCITMPASARHMALRLETCA-NGSRNQNW 463
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 99/158 (62%), Gaps = 6/158 (3%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++ L YQ ++FN SDR+ NR +PD R +C + + LPK+S++I FHNE
Sbjct: 30 NTRLKSTDDAYQRHKFNQASSDRLRSNRDIPDSRHSRCSALRWRKN-LPKTSVIITFHNE 88
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
A S LLRTV SV++RSP +++EI+LVDD S +F + E +AK+ +V+R+ R
Sbjct: 89 ARSTLLRTVVSVLNRSPEELIQEIILVDDFS--DFPEDG--EELAKID-KVKVLRNDQRQ 143
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GLI++R+ GA A +L FLD+HCEC + + E L +
Sbjct: 144 GLIRSRIRGADAAVAPVLTFLDSHCECNVHWLEPLLER 181
>gi|338721407|ref|XP_001494570.3| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4 [Equus caballus]
Length = 703
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 176/397 (44%), Positives = 242/397 (60%), Gaps = 15/397 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S+VI F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 254 FNYRKLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 313
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+++D
Sbjct: 314 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISKD 372
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T VVCPVID I TF Y+++ E G F+W L F+W++ + +RK +P +
Sbjct: 373 ETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPISS 431
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 432 PTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 491
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P N AR A VWMDE+ E ++ NP A K+ +L LRK+LK
Sbjct: 492 RAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRKRLK 545
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQT-HKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ + +P D + G IR + +C++ N SL C
Sbjct: 546 CKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMGIPSECLDYNAPDN--NPTGANLSLFGC- 602
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
H Q F ++ + +C +VPE +N +
Sbjct: 603 HGQGGNQFFEYTSKKEIRFNSVTELCAEVPEQKNHVG 639
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 73/157 (46%), Positives = 105/157 (66%), Gaps = 1/157 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S+VI F+NEAWS
Sbjct: 215 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRKLPTTSVVIAFYNEAWS 274
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 275 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 333
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 334 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIS 370
>gi|387017710|gb|AFJ50973.1| Polypeptide N-acetylgalactosaminyltransferase 11-like [Crotalus
adamanteus]
Length = 608
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 183/428 (42%), Positives = 251/428 (58%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SI+I F+NEA+SALLRT+HSV+ R+P +L EI+LVDD S LK LD Y+ K
Sbjct: 150 LPSASIIICFYNEAFSALLRTIHSVLDRTPSHLLHEIILVDDRSELADLKEDLDIYLTKD 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+ + E R V
Sbjct: 210 LPNKVKLVRNENREGLIRGRMVGASHATGKVLVFLDSHCEVNEMWLQPLLTPIQESRRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W + + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLTYSSSPVVR-GGFNWGLHFKWDLVPLLE-MEGPEQATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMDREYFNALGQYDSGMDIWGGENLEISFRIWMCGGKLVIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ E YF PE + R+ + R+ELRK+L C SFK
Sbjct: 388 SPGGQDTMAHNSL-RLAHVWMDEYKEQYFALRPEL-RTRNYGNITDRVELRKKLNCKSFK 445
Query: 496 WYLTHVWPHHFLPMDDK------FF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +V+P + + FF GR+ H+QT+KC+ +A+ + +Q
Sbjct: 446 WYLDNVYPEMQISGPNAKVQPPIFFNKGQKRPKLLQQGRLYHLQTNKCL---VAQSNPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G + C + Q+++ +LI + +CLDV E PR+ SG
Sbjct: 503 KGGLVVVKECDY-SNKNQIWMYNEDHELILNN-LLCLDVSETRTSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDKE 605
Q+W K
Sbjct: 560 QQWLLGKN 567
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/147 (46%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R KCK K++ + LP +SI+I F+NEA+SALLRT+H
Sbjct: 114 YQKHAFNVLISNRLGYHRDVPDTRDRKCKEKIYPHD-LPSASIIICFYNEAFSALLRTIH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S LK LD Y+ K L +++R+ R GLI+ R++G
Sbjct: 173 SVLDRTPSHLLHEIILVDDRSELADLKEDLDIYLTKDLPNKVKLVRNENREGLIRGRMVG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A G++LVFLD+HCE ++ + L
Sbjct: 233 ASHATGKVLVFLDSHCEVNEMWLQPLL 259
>gi|3047207|gb|AAC13679.1| GLY9 [Caenorhabditis elegans]
Length = 579
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 175/418 (41%), Positives = 251/418 (60%), Gaps = 24/418 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S++I+F +EAW+ LLRTVHSVI+RSP +L+E++L+DD S R+ L+ LDE++ +
Sbjct: 133 LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIKRF 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR R GLI+A+L GAR+A G+I+VFLD+HCE GWLE +V R++++RT +V
Sbjct: 193 GGKVRLIRKHDRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTAIV 252
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID ISD T AY + L G F+W LHF W S + +R T+ ++P MAGG
Sbjct: 253 CPMIDSISDNTLAYHGDWSLSTGGFSWALHFTWEGL-SEEEQKRRTKPTDYIRSPTMAGG 311
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
L A +R YFF +G YDEEM +WGGENLE+SFR W CGGSIE PCSHV H+FR PY+
Sbjct: 312 LLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGHPYNM 371
Query: 437 PG--GVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
G +V N R+A VWMD++ Y+ + E + +D + +R ELRK+L C F
Sbjct: 372 TGRNNNKDVHGTNSKRLAEVWMDDYKRLYY-MHREDLRTKDVGDLTARHELRKRLNCKPF 430
Query: 495 KWYLTHVWPHHFLPMDDKF--FGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
KW+L ++ F+ MD+ +G + V T C + M+Q G + C
Sbjct: 431 KWFLDNIAKGKFI-MDEDVVAYGALHTVVSGTRMCTDTLQRDEKMSQLLG---VFHCQGK 486
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG---FNRQRWTYDKE 605
Q+ M L + E+ C ++ + +R+ CS FN +RW Y+ +
Sbjct: 487 GSSPQL--MSLSKEGNLRRENTCA------SEENGNIRMKTCSKKAQFN-ERWAYENK 535
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 64/133 (48%), Positives = 95/133 (71%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+ SD+I ++R +PD R CK ++ LPK+S++I+F +EAW+ LLRTVHSVI+R
Sbjct: 101 MNVHASDKISLDRDVPDPRIQACKDIKYDYAALPKTSVIIIFTDEAWTPLLRTVHSVINR 160
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+E++L+DD S R+ L+ LDE++ + R+IR R GLI+A+L GAR+A G
Sbjct: 161 SPPELLQEVILLDDNSKRQELQEPLDEHIKRFGGKVRLIRKHDRHGLIRAKLAGAREAVG 220
Query: 177 EILVFLDAHCECT 189
+I+VFLD+HCE
Sbjct: 221 DIIVFLDSHCEAN 233
>gi|350584686|ref|XP_003481803.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
2 [Sus scrofa]
Length = 578
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 174/397 (43%), Positives = 243/397 (61%), Gaps = 15/397 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S+VI F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+AED
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIAED 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T +VCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPIRS 306
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 307 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 366
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P N AR A VWMDE+ E ++ NP A K+ +L LR++L
Sbjct: 367 RAPYARPN-----FLQNTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRERLG 420
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ + +P D + G IR + + +C++ N SL C
Sbjct: 421 CKSFDWYLKNVFSNLHVPEDRPGWHGAIRSIGISSECLD--YNSPENNPTGANLSLFGC- 477
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
H Q F ++ + +C +VPE +N +
Sbjct: 478 HGQGGNQFFEYTSNREIRFNSVTELCAEVPEQKNHVG 514
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK+K F+ LP +S+VI F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFDYRRLPTTSVVIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 243
>gi|62751482|ref|NP_001015534.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
gi|75057892|sp|Q5EA41.1|GALT6_BOVIN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 6;
AltName: Full=Polypeptide GalNAc transferase 6;
Short=GalNAc-T6; Short=pp-GaNTase 6; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 6;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 6
gi|59857821|gb|AAX08745.1| polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 234/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L+ YV +L
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V V + R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 236 QVVQVVRQQE-RKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDETVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE----LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPNIVTIDLNTFEFSKPVQRGRVQSRGNFDWSLTFGWEVLPAREKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G++ V+ N R+A VWMD + E +++ N +A + +++ + RL+LR++L
Sbjct: 414 PHTFPKGIN-VIARNQVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F W+L +V+P F+P + FFG ++++ C L G N P L C
Sbjct: 473 NCHNFSWFLDNVYPEMFVPDLKPTFFGALKNLGVDHC----LDVGENNNGGKPLILYTCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 98/144 (68%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPATSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E+LK L+ YV +L V V + R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERYVKQLQVVQVVRQQE-RKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|148878418|gb|AAI46056.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 6 (GalNAc-T6) [Bos
taurus]
gi|296487792|tpg|DAA29905.1| TPA: polypeptide N-acetylgalactosaminyltransferase 6 [Bos taurus]
Length = 622
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 234/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L+ YV +L
Sbjct: 176 LPATSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V V + R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 236 QVVQVVRQQE-RKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDETVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE----LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P K+P
Sbjct: 295 SPNIVTIDLNTFEFSKPVQRGRIQSRGNFDWSLTFGWEVLPAREKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G++ V+ N R+A VWMD + E +++ N +A + +++ + RL+LR++L
Sbjct: 414 PHTFPKGIN-VIARNQVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
CH+F W+L +V+P F+P + FFG ++++ C L G N P L C
Sbjct: 473 NCHNFSWFLDNVYPEMFVPDLKPTFFGALKNLGVDHC----LDVGENNNGGKPLILYTCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 76/144 (52%), Positives = 98/144 (68%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPATSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E+LK L+ YV +L V V + R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERYVKQLQVVQVVRQQE-RKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|350584684|ref|XP_003481802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Sus scrofa]
gi|350596113|ref|XP_003360781.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Sus scrofa]
Length = 582
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 174/397 (43%), Positives = 244/397 (61%), Gaps = 15/397 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S+VI F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 133 FDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 192
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+AED
Sbjct: 193 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIAED 251
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T +VCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 252 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPIRS 310
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 311 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 370
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++L
Sbjct: 371 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRERLG 424
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ + +P D + G IR + + +C++ N SL C
Sbjct: 425 CKSFDWYLKNVFSNLHVPEDRPGWHGAIRSIGISSECLD--YNSPENNPTGANLSLFGC- 481
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
H Q F ++ + +C +VPE +N +
Sbjct: 482 HGQGGNQFFEYTSNREIRFNSVTELCAEVPEQKNHVG 518
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK+K F+ LP +S+VI F+NEAWS
Sbjct: 94 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFDYRRLPTTSVVIAFYNEAWS 153
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 154 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 212
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 213 RARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 247
>gi|291386971|ref|XP_002709979.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oryctolagus cuniculus]
Length = 551
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 187/452 (41%), Positives = 258/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEIL----VFLDA-HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + V D H C L+ + LP +SI+I FHNEA S L
Sbjct: 67 RVGDDPYKLHAFNQRESERIPSNRVVPDTRHNRCALLVYCKDLPPTSIIITFHNEARSTL 126
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRTV S+++R+P +++EI+LVDD S S D+ + +P + +R+ R GL++
Sbjct: 127 LRTVRSILNRTPMHLIQEIILVDDFS------SDPDDCNQLIKLPKVKCLRNNERQGLVR 180
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 181 SRIRGADIAQGATLTFLDSHCEVNKDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 240
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF W S + +R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 241 ASELRGGFDWSLHFHWEQL-SPEQKARRLDPTEPIRTPVIAGGLFVIDKAWFDYLGKYDT 299
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW C GS+EI PCS V H+FRK PY+FP G + N R A
Sbjct: 300 DMDIWGGENFEISFRVWMCRGSLEIIPCSRVGHVFRKKHPYAFPNGNTNTYIKNTKRTAE 359
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMD++ ++Y+ P A +R +RSR+ LR L+C FKWYL +V+P +P D
Sbjct: 360 VWMDDYKQYYYAARPFA-LERPFGNIRSRVMLRANLQCQDFKWYLENVYPELRIPKDSSI 418
Query: 514 F-GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q HKC+ ++ NQ S L PC Q++ +I
Sbjct: 419 LKGSIR--QRHKCLA---SQKQNNQGSPNLKLRPCVKFKGEESKAQVWAFTYTQQIIQ-- 471
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P + + +G +Q+WT
Sbjct: 472 EELCLSVVTLFPG-APVILAVCKNGDEKQQWT 502
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/137 (43%), Positives = 90/137 (65%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+++ FN S+RIP NR +PD R +C V+ ++ LP +SI+I FHNEA S LLRTV
Sbjct: 73 YKLHAFNQRESERIPSNRVVPDTRHNRCALLVYCKD-LPPTSIIITFHNEARSTLLRTVR 131
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
S+++R+P +++EI+LVDD S S D+ + +P + +R+ R GL+++R+ G
Sbjct: 132 SILNRTPMHLIQEIILVDDFS------SDPDDCNQLIKLPKVKCLRNNERQGLVRSRIRG 185
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 186 ADIAQGATLTFLDSHCE 202
>gi|157074156|ref|NP_001096791.1| polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
gi|154426082|gb|AAI51594.1| GALNT4 protein [Bos taurus]
gi|296487968|tpg|DAA30081.1| TPA: polypeptide N-acetylgalactosaminyltransferase 4 [Bos taurus]
Length = 578
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 175/397 (44%), Positives = 244/397 (61%), Gaps = 15/397 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
YV+ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ +D
Sbjct: 189 YVSNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIRKD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T V+CPVID I TF Y+++ E G F+W L F+W++ + +RK EPF++
Sbjct: 248 ETVVICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIEPFRS 306
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 307 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 366
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++L+
Sbjct: 367 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRERLR 420
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ +P D + G IR + + +C++ N S SL C
Sbjct: 421 CKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIGISSECLDYNAPDN--NPTSANLSLFGC- 477
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
H Q F ++ + +C +VPE + +
Sbjct: 478 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPELKKHVG 514
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 105/157 (66%), Gaps = 1/157 (0%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
S L ++L + N+ +SDRI ++R + D R +CK+K FN LP +S++I F+NEA
Sbjct: 88 SELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRLPTTSVIIAFYNEA 147
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ YV+ L R+IR+ R G
Sbjct: 148 WSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYVSNLDR-VRLIRTNKREG 206
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L++ARL+GA A G++L FLD HCEC + E L +
Sbjct: 207 LVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 243
>gi|426224267|ref|XP_004006295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Ovis
aries]
Length = 582
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 175/397 (44%), Positives = 244/397 (61%), Gaps = 15/397 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 133 FNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLEA 192
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
YV+ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ +D
Sbjct: 193 YVSNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIHKD 251
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T V+CPVID I TF Y+++ E G F+W L F+W++ + +RK EPF++
Sbjct: 252 ETVVICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIEPFRS 310
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 311 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 370
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++L+
Sbjct: 371 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRERLR 424
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ +P D + G IR + + +C++ N S SL C
Sbjct: 425 CKSFDWYLKNVFSTLHVPEDRPGWHGAIRSIGISSECLDYNAPDN--NPTSANLSLFGC- 481
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
H Q F ++ + +C +VPE + +
Sbjct: 482 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPELKKHVG 518
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/157 (47%), Positives = 105/157 (66%), Gaps = 1/157 (0%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
S L ++L + N+ +SDRI ++R + D R +CK+K FN LP +S++I F+NEA
Sbjct: 92 SELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRLPTTSVIIAFYNEA 151
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ YV+ L R+IR+ R G
Sbjct: 152 WSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLEAYVSNLDR-VRLIRTNKREG 210
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L++ARL+GA A G++L FLD HCEC + E L +
Sbjct: 211 LVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLER 247
>gi|291389706|ref|XP_002711427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Oryctolagus cuniculus]
Length = 579
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 176/403 (43%), Positives = 245/403 (60%), Gaps = 16/403 (3%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +L
Sbjct: 124 ECKSKTFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRAYL 183
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K+ L+ Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+
Sbjct: 184 KTQLETYISNLDR-VRLIRTKKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLL 242
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ D T VVCPVID I TF Y+++ E G F+W L F+W++ + +RK
Sbjct: 243 ERIERDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSR 301
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV
Sbjct: 302 IDPIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 361
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P + N AR A VWMD++ E ++ NP A K+ D + R
Sbjct: 362 GHVFPKRAPYARPNFLQ-----NTARAAEVWMDDYKEHFYNRNPPARKE-DYGDISERKL 415
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPA 542
LR++LKC SF WYL +V+ +P D + G IR + +C++ N
Sbjct: 416 LRERLKCKSFDWYLKNVFSSLHVPEDRPGWHGAIRSKGISSECLD--YNSPDNNPTGANL 473
Query: 543 SLLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
SL C H Q F ++ + +C +VPE +N +
Sbjct: 474 SLFGC-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEKKNHVG 515
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK+K FN LP +S++I F+NEAWS
Sbjct: 91 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKTFNYRRLPTTSVIIAFYNEAWS 150
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 151 TLLRTIHSVLETSPAVLLKEIILVDDLSDRAYLKTQLETYISNLDR-VRLIRTKKREGLV 209
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 210 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 244
>gi|410910794|ref|XP_003968875.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Takifugu rubripes]
Length = 583
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/337 (46%), Positives = 223/337 (66%), Gaps = 11/337 (3%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + FN LP +S++I F+NEAWS LLRT+HSV+ +P +LKEI+L+DD S R +L
Sbjct: 129 ECRSKTFNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILIDDFSDRAYL 188
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
KS L +Y++ L R+IR+ R GL++ARL+GA A GE+L FLD HCEC GW+E L+
Sbjct: 189 KSQLADYISNLER-VRLIRTKKREGLVRARLIGATYATGEVLTFLDCHCECVPGWIEPLL 247
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ E+ + +VCPVID I TF Y+++ E G F+W L F+W++ + +RK
Sbjct: 248 ERIGENSSTIVCPVIDTIDWNTFEFYMQTEEPMIGGFDWRLTFQWHSVPERERK-RRKSP 306
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+++ +F ++G YD M+VWGGENLE+SFRVWQCGGS+EI PCSHV
Sbjct: 307 VDPIRSPTMAGGLFAVNKNFFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCSHV 366
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P N R A VWMD + + ++ NP A K+ + RL
Sbjct: 367 GHVFPKKAPYARPN-----FLQNTVRAAEVWMDSYKQHFYNRNPPARKE-TYGDISGRLL 420
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDK-FFGRIRHV 520
LR +LKC SF WYL +++P +P D + G +RH+
Sbjct: 421 LRDKLKCQSFNWYLKNIYPDLHIPEDRAGWHGAVRHL 457
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 76/163 (46%), Positives = 108/163 (66%), Gaps = 6/163 (3%)
Query: 42 HSNLS---DAQQLYQINRF--NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
H NLS Q+ I R+ N+ VSD+I ++R + D R +C++K FN LP +S++I
Sbjct: 88 HLNLSPDEKKQEEDSIERYAINIFVSDKISLHRHIQDHRMKECRSKTFNYRRLPTTSVII 147
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
F+NEAWS LLRT+HSV+ +P +LKEI+L+DD S R +LKS L +Y++ L R+IR
Sbjct: 148 AFYNEAWSTLLRTIHSVLETTPAILLKEIILIDDFSDRAYLKSQLADYISNLER-VRLIR 206
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ R GL++ARL+GA A GE+L FLD HCEC + E L +
Sbjct: 207 TKKREGLVRARLIGATYATGEVLTFLDCHCECVPGWIEPLLER 249
>gi|58865788|ref|NP_001012109.1| polypeptide N-acetylgalactosaminyltransferase 14 [Rattus
norvegicus]
gi|50926091|gb|AAH79128.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14)
[Rattus norvegicus]
gi|149050682|gb|EDM02855.1| rCG61782, isoform CRA_b [Rattus norvegicus]
Length = 552
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 186/452 (41%), Positives = 258/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNSERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQLSVEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGG +EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L C +FKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ + L PC + +Q++ +I
Sbjct: 420 QKGNIR--QRQKCLE---SQKQKNQETPHLRLSPCAKVKGDRAKSQVWAFTYTQQIIQ-- 472
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G RQ+WT
Sbjct: 473 EELCLSVVTLFPG-APVVLVLCKNGDERQQWT 503
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNSERQGLVRSRMRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|225007540|ref|NP_001070030.2| polypeptide N-acetylgalactosaminyltransferase 11 [Danio rerio]
Length = 590
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 184/436 (42%), Positives = 250/436 (57%), Gaps = 49/436 (11%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F NEA+SALLRTVHSV+ R+P +L EI+LVDD S + LK LD YV +
Sbjct: 136 LPTASIVICFFNEAFSALLRTVHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQH 195
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +V+R+ R GLI+ R++GA A GE+LVFLD+HCE WL+ L+ + E+R V
Sbjct: 196 LQKKVKVVRNEKREGLIRGRMIGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTV 255
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W S+ + D ++P MAG
Sbjct: 256 VCPVIDIISADTLVYTPS-PIVRGGFNWGLHFKWDPVPMSE--LNSPDGA--IRSPTMAG 310
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF+ +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 311 GLFAMDRNYFYELGQYDRGMDIWGGENLEISFRIWMCGGQLLIVPCSRVGHIFRKRRPYG 370
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMD++ E YF PE + RD + R+ +RK+L+CHSFK
Sbjct: 371 SPGGQDTMAHNSL-RLAHVWMDDYKEQYFALRPEL-RNRDYGDISERVSIRKRLQCHSFK 428
Query: 496 WYLTHVWPHHFL-----PMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQA 538
WYL +++P + P F GR+R++ KC+ +A+G +Q
Sbjct: 429 WYLDNIYPEMQVSSPHKPQQPVFINKGLKRPKVLQRGRLRNLLADKCL---VAQGRPSQK 485
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDES--------VCLDVPEYENDISPRVRIL 590
G + C + P A DE +CLD+ E + P R++
Sbjct: 486 GGAVVVKDCDP----------QDPEQEWAYDEEHELVLAGLLCLDMSEIRSSDPP--RLM 533
Query: 591 ACSGF-NRQRWTYDKE 605
C G Q+WT K+
Sbjct: 534 KCHGSGGSQQWTLGKK 549
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 69/137 (50%), Positives = 96/137 (70%), Gaps = 2/137 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN+L+S+R+ +R +PD R KC+ + ++ LP +SIVI F NEA+SALLRTVH
Sbjct: 100 YHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVS-LPTASIVICFFNEAFSALLRTVH 158
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LD YV + L +V+R+ R GLI+ R++G
Sbjct: 159 SVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRMIG 218
Query: 171 ARQAEGEILVFLDAHCE 187
A A GE+LVFLD+HCE
Sbjct: 219 ASHATGEVLVFLDSHCE 235
>gi|395820104|ref|XP_003783415.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Otolemur garnettii]
Length = 582
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 174/397 (43%), Positives = 242/397 (60%), Gaps = 15/397 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 133 FNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 192
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 193 YISNLER-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 251
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T VVCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 252 ETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPIRS 310
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 311 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 370
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++L+
Sbjct: 371 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKETYGDISERKL-LRQRLR 424
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL V+P+ +P D + G IR + +C++ N SL C
Sbjct: 425 CKSFDWYLKTVFPNLHVPEDRPGWHGAIRSSGISSECLD--YNSPDNNPTGANLSLFGC- 481
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
H Q F ++ + +C +VPE +N +
Sbjct: 482 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKNYVG 518
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK+K FN LP +S++I F+NEAWS
Sbjct: 94 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRRLPTTSVIIAFYNEAWS 153
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 154 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLER-VRLIRTNKREGLV 212
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 213 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 247
>gi|115313271|gb|AAI24298.1| Zgc:153274 [Danio rerio]
Length = 590
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 184/436 (42%), Positives = 250/436 (57%), Gaps = 49/436 (11%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F NEA+SALLRTVHSV+ R+P +L EI+LVDD S + LK LD YV +
Sbjct: 136 LPTASIVICFFNEAFSALLRTVHSVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQH 195
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +V+R+ R GLI+ R++GA A GE+LVFLD+HCE WL+ L+ + E+R V
Sbjct: 196 LQKKVKVVRNEKREGLIRGRMIGASHATGEVLVFLDSHCEVNEAWLQPLLTPIKENRKTV 255
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W S+ + D ++P MAG
Sbjct: 256 VCPVIDIISADTLVYTPS-PIVRGGFNWGLHFKWDPVPMSE--LNSPDGA--IRSPTMAG 310
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF+ +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 311 GLFAMDRNYFYELGQYDRGMDIWGGENLEISFRIWMCGGQLLIVPCSRVGHIFRKRRPYG 370
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMD++ E YF PE + RD + R+ +RK+L+CHSFK
Sbjct: 371 SPGGQDTMAHNSL-RLAHVWMDDYKEQYFALRPEL-RNRDYGDISERVSIRKRLQCHSFK 428
Query: 496 WYLTHVWPHHFL-----PMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQA 538
WYL +++P + P F GR+R++ KC+ +A+G +Q
Sbjct: 429 WYLDNIYPEMQVSSPHKPQQPVFINKGLKRPKVLQRGRLRNLLADKCL---VAQGRPSQK 485
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDES--------VCLDVPEYENDISPRVRIL 590
G + C + P A DE +CLD+ E + P R++
Sbjct: 486 GGAVVVKDCDP----------QDPEQEWAYDEEHELVLAGLLCLDMSEIRSSDPP--RLM 533
Query: 591 ACSGF-NRQRWTYDKE 605
C G Q+WT K+
Sbjct: 534 KCHGSGGSQQWTLGKK 549
Score = 135 bits (340), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 69/137 (50%), Positives = 96/137 (70%), Gaps = 2/137 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN+L+S+R+ +R +PD R KC+ + ++ LP +SIVI F NEA+SALLRTVH
Sbjct: 100 YHKHAFNVLISNRLGYHRDVPDTRTDKCRDRAYSVS-LPTASIVICFFNEAFSALLRTVH 158
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LD YV + L +V+R+ R GLI+ R++G
Sbjct: 159 SVLDRTPNYLLHEIILVDDHSELDDLKEDLDSYVQQHLQKKVKVVRNEKREGLIRGRMIG 218
Query: 171 ARQAEGEILVFLDAHCE 187
A A GE+LVFLD+HCE
Sbjct: 219 ASHATGEVLVFLDSHCE 235
>gi|19922324|ref|NP_611043.1| GalNAc-T1, isoform A [Drosophila melanogaster]
gi|24653878|ref|NP_725472.1| GalNAc-T1, isoform B [Drosophila melanogaster]
gi|51315876|sp|Q6WV20.2|GALT1_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 1;
Short=pp-GaNTase 1; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 1; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 1
gi|10121393|gb|AAG13184.1|AF218236_1 polypeptide N-acetylgalactosaminyltransferase [Drosophila
melanogaster]
gi|7303062|gb|AAF58130.1| GalNAc-T1, isoform B [Drosophila melanogaster]
gi|21064373|gb|AAM29416.1| RE14585p [Drosophila melanogaster]
gi|21645385|gb|AAM70974.1| GalNAc-T1, isoform A [Drosophila melanogaster]
gi|220947986|gb|ACL86536.1| GalNAc-T1-PA [synthetic construct]
Length = 601
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 181/433 (41%), Positives = 254/433 (58%), Gaps = 33/433 (7%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ + LP +S+VI+F NE +S LLRTVHS +S LKEI+LVDD S L + LD
Sbjct: 143 FDSDSLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 252 YVAKLSVPT---RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
YV + +P+ ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 203 YV-RTRIPSGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRI 261
Query: 309 AEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 262 KESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRREC 319
Query: 365 TE-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+ P +P MAGGLFAIDR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE
Sbjct: 320 KQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 379
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D
Sbjct: 380 PCSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGD 437
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQ 537
V R+ LRK+L+C SF+WYL +++P F+P D + +G++ V ++ C++ L N+
Sbjct: 438 VTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVHAVNSNICLDDLLQN---NE 494
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP--RVRILAC--- 592
A L PC + +Q+F + +E C V E SP RV ++ C
Sbjct: 495 KPYNAGLYPCGKVLQKSQLF--SFTNTNVLRNELSCATVQHSE---SPPYRVVMVPCMEN 549
Query: 593 SGFNRQRWTYDKE 605
FN Q W Y+ +
Sbjct: 550 DEFNEQ-WRYEHQ 561
Score = 121 bits (304), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 97/155 (62%), Gaps = 4/155 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
++Y+ N +S+++ NR++ D R P C + F+ + LP +S+VI+F NE +S LL
Sbjct: 107 GDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQRFDSDSLPTASVVIIFFNEPYSVLL 166
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT---RVIRSPGRVGLI 164
RTVHS +S LKEI+LVDD S L + LD YV + +P+ ++R R+GLI
Sbjct: 167 RTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDYYV-RTRIPSGKVTILRLKNRLGLI 225
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 226 RARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQR 260
>gi|348585731|ref|XP_003478624.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Cavia porcellus]
Length = 937
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 159/344 (46%), Positives = 220/344 (63%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N+ LP +SI++ F +E WS LLR+VHSV++RSP+ ++KEILLVDD ST++
Sbjct: 481 AGCAEQLVHNQ--LPTTSIIMCFVDEVWSTLLRSVHSVLNRSPQHLIKEILLVDDFSTKD 538
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 539 YLKDKLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 597
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 598 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGVFVWPMNFGWRTIPPEVVAKNRIK 657
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 658 ETDVIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIVPCSR 717
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR-DKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E ++ QR D + +
Sbjct: 718 VGHIFRNDNPYSFPKDRLKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQRLDAGNLTQQ 777
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + V KC+
Sbjct: 778 RELRKKLKCKSFKWYLDNVFPDLKAPI-VRAGGVLVSVALSKCI 820
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 107/153 (69%), Gaps = 2/153 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+AQ+ ++ FN+ +SD IPV+R + D R C ++ + + LP +SI++ F +E WS L
Sbjct: 451 EAQKRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNQ-LPTTSIIMCFVDEVWSTL 509
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP+ ++KEILLVDD ST+++LK LD+Y+++ R++R R GLI+A
Sbjct: 510 LRSVHSVLNRSPQHLIKEILLVDDFSTKDYLKDKLDKYMSQFP-KVRILRLKERHGLIRA 568
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RL GA+ A G++L FLD+H EC + + E L +
Sbjct: 569 RLAGAQNATGDVLTFLDSHVECNVGWLEPLLER 601
>gi|71994065|ref|NP_001022876.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
gi|51316113|sp|Q9U2C4.1|GALT9_CAEEL RecName: Full=Probable N-acetylgalactosaminyltransferase 9;
AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 9; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9
gi|6018409|emb|CAB57897.1| Protein GLY-9, isoform a [Caenorhabditis elegans]
Length = 579
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 175/418 (41%), Positives = 251/418 (60%), Gaps = 24/418 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S++I+F +EAW+ LLRTVHSVI+RSP +L+E++L+DD S R+ L+ LDE++ +
Sbjct: 133 LPKTSVIIIFTDEAWTPLLRTVHSVINRSPPELLQEVILLDDNSKRQELQEPLDEHIKRF 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR R GLI+A+L GAR+A G+I+VFLD+HCE GWLE +V R++++RT +V
Sbjct: 193 GGKVRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTAIV 252
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID ISD T AY + L G F+W LHF W S + +R T+ ++P MAGG
Sbjct: 253 CPMIDSISDNTLAYHGDWSLSTGGFSWALHFTWEGL-SEEEQKRRTKPTDYIRSPTMAGG 311
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
L A +R YFF +G YDEEM +WGGENLE+SFR W CGGSIE PCSHV H+FR PY+
Sbjct: 312 LLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGHPYNM 371
Query: 437 PG--GVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
G +V N R+A VWMD++ Y+ + E + +D + +R ELRK+L C F
Sbjct: 372 TGRNNNKDVHGTNSKRLAEVWMDDYKRLYY-MHREDLRTKDVGDLTARHELRKRLNCKPF 430
Query: 495 KWYLTHVWPHHFLPMDDKF--FGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
KW+L ++ F+ MD+ +G + V T C + M+Q G + C
Sbjct: 431 KWFLDNIAKGKFI-MDEDVVAYGALHTVVSGTRMCTDTLQRDEKMSQLLG---VFHCQGK 486
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG---FNRQRWTYDKE 605
Q+ M L + E+ C ++ + +R+ CS FN +RW Y+ +
Sbjct: 487 GSSPQL--MSLSKEGNLRRENTCA------SEENGNIRMKTCSKKAQFN-ERWAYENK 535
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 64/133 (48%), Positives = 95/133 (71%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+ SD+I ++R +PD R CK ++ LPK+S++I+F +EAW+ LLRTVHSVI+R
Sbjct: 101 MNVHASDKISLDRDVPDPRIQACKDIKYDYAALPKTSVIIIFTDEAWTPLLRTVHSVINR 160
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+E++L+DD S R+ L+ LDE++ + R+IR R GLI+A+L GAR+A G
Sbjct: 161 SPPELLQEVILLDDNSKRQELQEPLDEHIKRFGGKVRLIRKHVRHGLIRAKLAGAREAVG 220
Query: 177 EILVFLDAHCECT 189
+I+VFLD+HCE
Sbjct: 221 DIIVFLDSHCEAN 233
>gi|395539756|ref|XP_003771832.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11 [Sarcophilus
harrisii]
Length = 970
Score = 322 bits (824), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 187/428 (43%), Positives = 250/428 (58%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S + LK LD+YV K
Sbjct: 512 LPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDDLKGELDDYVQKY 571
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +V+R+ R GLI+ R++GA A GE+LVFLD+HCE WL+ L+ + ED V
Sbjct: 572 LPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSHCEVNKMWLQPLLVPIHEDHRTV 631
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W S+ + + P K+P MAG
Sbjct: 632 VCPVIDIISADTLMYSSS-PIVRGGFNWGLHFKWDLVPFSE-LGGPEGAIAPIKSPTMAG 689
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 690 GLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 749
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF PE K + + R+ELRK+L C SFK
Sbjct: 750 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPEL-KLKSYGNISERVELRKKLGCKSFK 807
Query: 496 WYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P L P F GR+ H+QT+KC+ A+G +Q
Sbjct: 808 WYLDNIYPEMQLSGPNAKPQQPVFINRGPKRPKILQRGRLYHLQTNKCLA---AQGHPSQ 864
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +LI + +CLD+ E + PR+ SG
Sbjct: 865 KGGLVVLKVCDYSDP-NQVWIYNEEHELILNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 921
Query: 598 QRWTYDKE 605
Q+W K
Sbjct: 922 QQWILGKN 929
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 172/380 (45%), Positives = 222/380 (58%), Gaps = 36/380 (9%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ EC LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+LVDD S +
Sbjct: 138 NAECKEKSYPTGLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDNSEFDD 197
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LK LD+YV K L +V+R+ GLI R++GA GE+LVFLD+HCE WL+
Sbjct: 198 LKGELDDYVQKYLPGKIQVVRNEKGEGLIXGRMIGAAHGTGEVLVFLDSHCEVNKMWLQP 257
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRW-----YTYGSSDAI 358
L+ + ED VVCPVIDIIS T Y S + G FNW+LHF+W G +
Sbjct: 258 LLVPIHEDHRTVVCPVIDIISADTLMYSSS-PIVCGGFNWDLHFKWDLVPFSKLGGPEGA 316
Query: 359 IKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEI 418
I P K+PAMAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I
Sbjct: 317 IA------PIKSPAMAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFI 370
Query: 419 APCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FRK PY P G + + N R+A VW+DE+ E YF PE K +
Sbjct: 371 IPCSRVGHIFRKRRPYGSPEG-QDTMTNNSLRMAHVWLDEYKEQYFSLRPEL-KLKSYGN 428
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFL------PMDDKFFGR------------IRHV 520
+ R+ELRK+L C SFKWYL +++P L P F R + H+
Sbjct: 429 ISERVELRKKLGCKSFKWYLDNIYPEMQLSGPNAKPQQPVFINRGPKRPKILQRGWLYHL 488
Query: 521 QTHKCVEKPLAKGSMNQASG 540
QT+KC+ A+G +Q G
Sbjct: 489 QTNKCLA---AQGHPSQKRG 505
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 99/147 (67%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R +PD R +CK K + LP +SIVI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNLLISNRLGYHRDVPDTRNAECKEKSY-PTGLPAASIVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SVI R+P +L EI+LVDD S + LK LD+YV K L +V+R+ GLI R++G
Sbjct: 173 SVIDRTPAHLLHEIILVDDNSEFDDLKGELDDYVQKYLPGKIQVVRNEKGEGLIXGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A GE+LVFLD+HCE ++ + L
Sbjct: 233 AAHGTGEVLVFLDSHCEVNKMWLQPLL 259
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/132 (48%), Positives = 85/132 (64%), Gaps = 1/132 (0%)
Query: 67 VNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEIL 126
N+ L P K + LP +SIVI F+NEA+SALLRTVHSVI R+P +L EI+
Sbjct: 490 TNKCLAAQGHPSQKRGLSYPTGLPAASIVICFYNEAFSALLRTVHSVIDRTPAHLLHEII 549
Query: 127 LVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAH 185
LVDD S + LK LD+YV K L +V+R+ R GLI+ R++GA A GE+LVFLD+H
Sbjct: 550 LVDDNSEFDDLKGELDDYVQKYLPGKIQVVRNEKREGLIRGRMIGAAHATGEVLVFLDSH 609
Query: 186 CECTLVFNEEFL 197
CE ++ + L
Sbjct: 610 CEVNKMWLQPLL 621
>gi|354468358|ref|XP_003496633.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Cricetulus griseus]
Length = 541
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 186/451 (41%), Positives = 258/451 (57%), Gaps = 26/451 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 57 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTL 116
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 117 LRTIRSVLNRTPTHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 170
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 171 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 230
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 231 ASELRGGFDWSLHFQWEQLSPEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 289
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R A
Sbjct: 290 DMDIWGGENFEISFRVWMCGGSLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 349
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L C +FKWYL +V+P +P D
Sbjct: 350 VWMDEYKQYYYAARPFA-LERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSI 408
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ Q + L PCT + +Q++ +I
Sbjct: 409 QKGNIR--QRQKCLE---SQKQKKQETPHLRLSPCTKVKGEEAKSQVWAFTYTQQIIQ-- 461
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRW 600
E +CL V +P V +L +G RQ+W
Sbjct: 462 EELCLSVVTLFPG-APVVLVLCKNGDERQQW 491
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 63 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 121
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 122 SVLNRTPTHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRMRG 175
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 176 ADIAQGTTLTFLDSHCE 192
>gi|321476751|gb|EFX87711.1| hypothetical protein DAPPUDRAFT_306553 [Daphnia pulex]
Length = 626
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 181/426 (42%), Positives = 254/426 (59%), Gaps = 25/426 (5%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + + LP +S++I+F NEAWS L+RT+ SVI+RSPR LKEILL+DD S R L+
Sbjct: 166 CKGLTYDTILPSASVIIIFTNEAWSPLIRTIWSVINRSPRKFLKEILLIDDFSDRVELQG 225
Query: 248 SLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L+ Y+ +L R++R R GLI+ARL GA++A GE+++FLD+HCE TLGWLE L+
Sbjct: 226 KLERYIETQLPSIVRLVRLKERQGLIRARLAGAKEATGEVIIFLDSHCEATLGWLEPLLQ 285
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYV----RSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
R+ ED+ V+ P+ID+I D T Y SF++ G+F W HF W + I +R
Sbjct: 286 RIKEDKRAVLVPIIDVIDDKTLEYYHGSPESFQI--GSFTWSGHFTWMDIPKRE-IKRRG 342
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
P +P MAGGLFAIDR YF+ +G+YDE M VWGGENLEMSFR+W CGGS+E PCS
Sbjct: 343 SRVGPTNSPTMAGGLFAIDRQYFWDLGSYDEGMDVWGGENLEMSFRIWMCGGSLETIPCS 402
Query: 423 HVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
V H+FR PY+FPG ++ +G N ARV VWMD++ E ++ + K D +
Sbjct: 403 RVGHIFRSFHPYTFPG--NKDTHGINTARVVEVWMDDYKELFYMHRGDL-KTIDIGDTSA 459
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASG 540
R +LRK LKC SFKWYL +V P F+ + +GR+ + K L ++ +
Sbjct: 460 RKKLRKDLKCKSFKWYLENVLPDKFIMTEHSLGYGRV----MNDAFGKQLCLDNLQRNED 515
Query: 541 -PASL--LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGF 595
P +L PC ++Q+F + L E C +V + + +P V++L C
Sbjct: 516 QPYNLGQYPCHAQMAMSQVFALSKLGQL--RREESCAEVQDNVSAEAP-VKMLGCQVDPP 572
Query: 596 NRQRWT 601
Q+WT
Sbjct: 573 EDQKWT 578
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 92/240 (38%), Positives = 134/240 (55%), Gaps = 35/240 (14%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
+ L +A+++ + FNL +S+RI NRTLPDVR CK ++ LP +S++I+F NEA
Sbjct: 130 AELQEAEEIMKKEAFNLFISNRISYNRTLPDVRDSMCKGLTYDT-ILPSASVIIIFTNEA 188
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRV 161
WS L+RT+ SVI+RSPR LKEILL+DD S R L+ L+ Y+ +L R++R R
Sbjct: 189 WSPLIRTIWSVINRSPRKFLKEILLIDDFSDRVELQGKLERYIETQLPSIVRLVRLKERQ 248
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSV 221
GLI+ARL GA++A GE+++FLD+HCE TL + E L +
Sbjct: 249 GLIRARLAGAKEATGEVIIFLDSHCEATLGWLEPLLQR---------------------- 286
Query: 222 ISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK----------LSVPTRVIRSPG-RVG 270
I R++L I+ V D T E+ S + + + +P R I+ G RVG
Sbjct: 287 IKEDKRAVLVPIIDVIDDKTLEYYHGSPESFQIGSFTWSGHFTWMDIPKREIKRRGSRVG 346
>gi|395834931|ref|XP_003790440.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
1 [Otolemur garnettii]
gi|395834933|ref|XP_003790441.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
2 [Otolemur garnettii]
Length = 622
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 234/362 (64%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L E +
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEKL-EQYVQQ 234
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 235 LQVVRVVRQVERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSF---ELHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + +H G F+W L F W T + + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFSKPIPRGRVHSRGNFDWSLTFGWETLPTHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQMEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G S V+ N R+A VWMD + +++ N +A K +++ + RL+LR++L
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKMIFYRRNQQAAKMAQEKSFGDISERLQLRERL 472
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F W+L +V+P F+P + F+G I+++ ++C L G N P + C
Sbjct: 473 HCRNFSWFLNNVYPEMFVPDLMPTFYGAIKNLGINQC----LDVGENNHGEKPLIMYSCH 528
Query: 549 HL 550
L
Sbjct: 529 GL 530
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/144 (52%), Positives = 96/144 (66%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDRKFRRCPALPTTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E+LK L E + RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEYLKEKL-EQYVQQLQVVRVVRQVERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|410914790|ref|XP_003970870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Takifugu rubripes]
Length = 552
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 162/357 (45%), Positives = 236/357 (66%), Gaps = 14/357 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ E++LVDD S +E
Sbjct: 79 HPNCKQKLYAEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEVILVDDFSDKEH 138
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK LDEY+ +L R++R+ R GLI+ RLLGA +A+GE++ FLD+HCE + WL L
Sbjct: 139 LKVPLDEYMVRLP-KVRILRTKKREGLIRTRLLGAARAKGEVITFLDSHCEANVNWLPPL 197
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A++R +VCP+ID+I F Y ++ + GAF+WE++++ ++++D
Sbjct: 198 LDRIAQNRKTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYKRIPIPLE---LQKED 254
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 255 PSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDTPCSR 314
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY PGGVS L NL RVA VWMDE+AE+ ++ PE + + +
Sbjct: 315 VGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLAAGDMAVQK 371
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE-KPLAKGS 534
+LR QL C SFKW++T V W P H+ P++ +G IR+V + C+E K A GS
Sbjct: 372 DLRSQLNCKSFKWFMTKVAWDLPKHYPPVEPPAAAWGEIRNVASGMCLETKHFASGS 428
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 75/141 (53%), Positives = 107/141 (75%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ VSDRI +NR++PD+R P CK K++ E+ LP +S++I FHNE WS+LLRT
Sbjct: 53 QAYRENGFNIYVSDRISLNRSVPDIRHPNCKQKLYAEK-LPNTSVIIPFHNEGWSSLLRT 111
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSV++RSP ++ E++LVDD S +E LK LDEY+ +L R++R+ R GLI+ RLL
Sbjct: 112 VHSVLNRSPPQLIAEVILVDDFSDKEHLKVPLDEYMVRLP-KVRILRTKKREGLIRTRLL 170
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA +A+GE++ FLD+HCE +
Sbjct: 171 GAARAKGEVITFLDSHCEANV 191
>gi|47221376|emb|CAF97294.1| unnamed protein product [Tetraodon nigroviridis]
Length = 675
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 179/436 (41%), Positives = 259/436 (59%), Gaps = 30/436 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 204 HPNCKQKLYAEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDREH 263
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+ +L R++R+ R GLI+ RLLGA A+GE++ FLD+HCE + WL L
Sbjct: 264 LKQPLEEYMVRLP-KVRILRTKKREGLIRTRLLGATAAKGEVITFLDSHCEANVNWLPPL 322
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A++R +VCP+ID+I F Y ++ + GAF+WE++++ ++++D
Sbjct: 323 LDRIAQNRKTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKED 379
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 380 PSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGCMEDIPCSR 439
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY PGGVS L NL RVA VWMDE+AE+ ++ PE + ++
Sbjct: 440 VGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDTAAQK 496
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE-KPLAKGS--- 534
+LR QL C SFKW++T V W H+ P++ +G IR+V + C+E K GS
Sbjct: 497 DLRSQLNCKSFKWFMTKVAWDLSKHYPPVEPPAAAWGEIRNVGSSMCLETKHFVSGSPVW 556
Query: 535 MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATD----ESVCLDVPEYENDISPRVRIL 590
M + H Q+F D+ D + VC D ++ SP V +
Sbjct: 557 MESCLKGRGEVGWNH----GQVFTFGWREDIRVGDPMHTKKVCFDA---VSNNSP-VTLY 608
Query: 591 ACSGF-NRQRWTYDKE 605
C G Q W Y K+
Sbjct: 609 DCHGMKGNQLWRYRKD 624
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 76/141 (53%), Positives = 106/141 (75%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ VSDRI +NR++PD+R P CK K++ E+ LP +S++I FHNE WS+LLRT
Sbjct: 178 QAYRENGFNIYVSDRISLNRSIPDIRHPNCKQKLYAEK-LPNTSVIIPFHNEGWSSLLRT 236
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSV++RSP ++ EI+LVDD S RE LK L+EY+ +L R++R+ R GLI+ RLL
Sbjct: 237 VHSVLNRSPPQLIAEIILVDDFSDREHLKQPLEEYMVRLP-KVRILRTKKREGLIRTRLL 295
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA A+GE++ FLD+HCE +
Sbjct: 296 GATAAKGEVITFLDSHCEANV 316
>gi|340712798|ref|XP_003394942.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus terrestris]
Length = 571
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 177/407 (43%), Positives = 240/407 (58%), Gaps = 19/407 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 137 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 195
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED TRVV
Sbjct: 196 ----RVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAEDPTRVV 251
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W ++ ++KD T+ +TP +AGG
Sbjct: 252 CPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTERQARQKDPTQAIRTPMIAGG 311
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 312 LFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYSF 371
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ +FY+ P A + ++ R+EL+++L C F W
Sbjct: 372 PGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKRKLHCKPFSW 430
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLPCTHLPVLTQ 555
YL +V+P +P + G Q C++ SM G L PC H Q
Sbjct: 431 YLKNVYPELVIPTSEGGPGGSLK-QGTACLD------SMGHLLDGNVGLYPC-HDTGGNQ 482
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ L D + +CL +P Y + ++I C G Q+W +
Sbjct: 483 EW--GLTKDGLIKHHDLCLTLPMYAKGTTLLMQI--CDGSENQKWRH 525
Score = 118 bits (296), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 28/205 (13%)
Query: 13 ARNQQDYIDRRGVHVVVGHYL------------------GDSVDGGLHSNLSDAQQLYQI 54
A N +Y+D G +V+ L G GGL + + Y
Sbjct: 48 AANSGNYVDPDGTAIVMSSELLPAPTPDPRVTWNYFDEQGYVSRGGLRA----GEDPYAR 103
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N+FN SD +P NR +PD R C+ K + + LP +S++I FHNEA S LLRTV SV+
Sbjct: 104 NKFNQEASDGLPSNRDIPDTRSAMCRMKQWRRD-LPPTSVIITFHNEARSTLLRTVVSVL 162
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
+RSP ++KEI+LVDD S L + K+ RVIR+ R GL+++R+ GA A
Sbjct: 163 NRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV----RVIRNEKREGLMRSRVRGADAA 217
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+L FLD+HCEC + E L +
Sbjct: 218 TASVLTFLDSHCECNADWLEPLLER 242
>gi|66507571|ref|XP_394527.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis mellifera]
gi|380015445|ref|XP_003691712.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Apis florea]
Length = 571
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 177/407 (43%), Positives = 240/407 (58%), Gaps = 19/407 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 137 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 195
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED TRVV
Sbjct: 196 ----RVIRNEKREGLMRSRVRGADAATATVLTFLDSHCECNADWLEPLLERVAEDPTRVV 251
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W ++ ++KD T+ +TP +AGG
Sbjct: 252 CPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTERQARQKDPTQAIRTPMIAGG 311
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 312 LFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYSF 371
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ +FY+ P A + ++ R+EL+++L C F W
Sbjct: 372 PGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKRKLHCKPFSW 430
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLPCTHLPVLTQ 555
YL +V+P +P + G Q C++ SM G L PC H Q
Sbjct: 431 YLKNVYPELVIPTSEGGPGGSLK-QGSACLD------SMGHLLDGNVGLYPC-HDTGGNQ 482
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ L D + +CL +P Y + ++I C G Q+W +
Sbjct: 483 EW--GLTKDGLIRHHGLCLTLPVYAKGTTLLMQI--CDGSENQKWRH 525
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 28/205 (13%)
Query: 13 ARNQQDYIDRRGVHVVVGHYL------------------GDSVDGGLHSNLSDAQQLYQI 54
A N +Y+D G +V+ L G GGL + + Y
Sbjct: 48 AGNSGNYVDPDGTAIVMSSELLPAPTPDPRVTWNYFDEQGYVSRGGLRA----GEDPYAR 103
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N+FN SD +P NR +PD R C+ K + + LP +S++I FHNEA S LLRTV SV+
Sbjct: 104 NKFNQEASDGLPSNRDIPDTRSAMCRMKQWRRD-LPPTSVIITFHNEARSTLLRTVVSVL 162
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
+RSP ++KEI+LVDD S L + K+ RVIR+ R GL+++R+ GA A
Sbjct: 163 NRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV----RVIRNEKREGLMRSRVRGADAA 217
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+L FLD+HCEC + E L +
Sbjct: 218 TATVLTFLDSHCECNADWLEPLLER 242
>gi|432934600|ref|XP_004081948.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 3-like [Oryzias
latipes]
Length = 600
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 170/365 (46%), Positives = 234/365 (64%), Gaps = 20/365 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD---EYV 253
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS K+S+ Y+
Sbjct: 153 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASVDG--KNSMGPFRTYL 210
Query: 254 AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
KLS+ RV+R R GLI ARLLGA A G+ L FLDAHCEC GWLE L+AR+A++ T
Sbjct: 211 KKLSI-VRVVRQRERKGLITARLLGASVATGDTLTFLDAHCECFNGWLEPLLARIAQNYT 269
Query: 314 RVVCPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFK 369
VV P I I TF +++ + G F+W L F W + + +RKD T P K
Sbjct: 270 AVVSPDISTIDLNTFEFMKPSPYGQNHNRGNFDWGLSFGWESLPDHEKQ-RRKDETYPIK 328
Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
TP AGGLF+I + YF+ IG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR
Sbjct: 329 TPTFAGGLFSISKEYFYQIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIIPCSVVGHVFR 388
Query: 430 KASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA---EKQRDKQAVRSRLELR 486
SP++FP G ++V+ N R+A VWMD++ E +++ N +A K+ + R +LR
Sbjct: 389 TKSPHTFPKG-TQVIARNQVRLAEVWMDDYKEIFYRRNQQAAQIAKEETFGDISKRKDLR 447
Query: 487 KQLKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
++L+C +F WYL +++P F+P ++ FG +++V C L G N+ +
Sbjct: 448 ERLQCKNFSWYLKNIYPEIFMPDLNPLLFGSVKNVGKASC----LDAGENNEGGKELIMY 503
Query: 546 PCTHL 550
PC L
Sbjct: 504 PCHGL 508
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/103 (59%), Positives = 74/103 (71%), Gaps = 6/103 (5%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD---EYV 145
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS K+S+ Y+
Sbjct: 153 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASVDG--KNSMGPFRTYL 210
Query: 146 AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
KLS+ RV+R R GLI ARLLGA A G+ L FLDAHCEC
Sbjct: 211 KKLSI-VRVVRQRERKGLITARLLGASVATGDTLTFLDAHCEC 252
>gi|417411769|gb|JAA52311.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 582
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 175/403 (43%), Positives = 247/403 (61%), Gaps = 16/403 (3%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +L
Sbjct: 127 ECKSKTFNYRQLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYL 186
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K+ L+ YV+ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+
Sbjct: 187 KTQLETYVSNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLL 245
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R++ED T ++CPVID I TF Y+++ E G F+W L F+W++ + +RK
Sbjct: 246 ERISEDETVIICPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSR 304
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV
Sbjct: 305 IDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 364
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L
Sbjct: 365 GHVFPKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL- 418
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPA 542
LR++LKC SF WYL +V+ + +P D + G IR + + +C++ N
Sbjct: 419 LRERLKCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMGISSECLD--YNSPDNNPTGANL 476
Query: 543 SLLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
SL C H Q F ++ + +C +VPE + +
Sbjct: 477 SLFGC-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKKHVG 518
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/159 (45%), Positives = 106/159 (66%), Gaps = 1/159 (0%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
+ L ++L + N+ +SD+I ++R + D R +CK+K FN LP +S++I F+NEA
Sbjct: 92 AELKQQEELIERYAINIYLSDKISLHRHIEDKRMYECKSKTFNYRQLPTTSVIIAFYNEA 151
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ YV+ L R+IR+ R G
Sbjct: 152 WSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYVSNLDR-VRLIRTNKREG 210
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
L++ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 211 LVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIS 249
>gi|350409232|ref|XP_003488663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Bombus impatiens]
Length = 571
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 177/407 (43%), Positives = 240/407 (58%), Gaps = 19/407 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 137 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 195
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED TRVV
Sbjct: 196 ----RVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAEDPTRVV 251
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W ++ ++KD T+ +TP +AGG
Sbjct: 252 CPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTERQARQKDPTQAIRTPMIAGG 311
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 312 LFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYSF 371
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ +FY+ P A + ++ R+EL+++L C F W
Sbjct: 372 PGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKRKLHCKPFSW 430
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLPCTHLPVLTQ 555
YL +V+P +P + G Q C++ SM G L PC H Q
Sbjct: 431 YLKNVYPELVIPTSEGGPGGSLK-QGTACLD------SMGHLLDGNVGLYPC-HDTGGNQ 482
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ L D + +CL +P Y + ++I C G Q+W +
Sbjct: 483 EW--GLTKDGLIKHHDLCLTLPVYAKGTTLLMQI--CDGSENQKWRH 525
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 28/205 (13%)
Query: 13 ARNQQDYIDRRGVHVVVGHYL------------------GDSVDGGLHSNLSDAQQLYQI 54
A N +Y+D G +V+ L G GGL + + Y
Sbjct: 48 AANSGNYVDPDGTAIVMSSELLPAPTPDPRVTWNYFDEQGYVSRGGLRA----GEDPYAR 103
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N+FN SD +P NR +PD R C+ K + + LP +S++I FHNEA S LLRTV SV+
Sbjct: 104 NKFNQEASDGLPSNRDIPDTRSAMCRMKQWRRD-LPPTSVIITFHNEARSTLLRTVVSVL 162
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
+RSP ++KEI+LVDD S L + K+ RVIR+ R GL+++R+ GA A
Sbjct: 163 NRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV----RVIRNEKREGLMRSRVRGADAA 217
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
+L FLD+HCEC + E L +
Sbjct: 218 TASVLTFLDSHCECNADWLEPLLER 242
>gi|410916145|ref|XP_003971547.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Takifugu rubripes]
Length = 579
Score = 321 bits (823), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 171/364 (46%), Positives = 226/364 (62%), Gaps = 13/364 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + + LP +SI+I FHNEA S LLRTV SV++R+P ++ EI+LVDD S E
Sbjct: 125 HFRCATIRYDSDLPPTSIIITFHNEARSTLLRTVRSVLNRTPVHLIHEIILVDDFSDDE- 183
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
S + + KL R +R+P R GLI++R+ GA A+ +L FLD+HCE WL L
Sbjct: 184 ---SDCQLLIKLP-KVRCVRNPQREGLIRSRVRGADSAKAAVLTFLDSHCEVNKDWLPPL 239
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ R+ +D TRVV PVIDII+ TFAYV + G F+W LHF+W S + +R D
Sbjct: 240 LQRIKQDPTRVVSPVIDIINMDTFAYVAASADLRGGFDWSLHFKWEQL-SPEQRARRTDP 298
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P KTP +AGGLF IDR++F H+G YD M +WGGEN E+SFRVWQCGGS+EI PCS V
Sbjct: 299 AQPIKTPIIAGGLFVIDRSWFNHLGKYDTAMDIWGGENFEISFRVWQCGGSLEILPCSRV 358
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY FP G + N R A VWMD+++ FY+ P A + + +R RLE
Sbjct: 359 GHVFRKKHPYVFPEGNANTYIKNTRRTAEVWMDDFSLFYYSARP-AARGKSYGDIRGRLE 417
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LRK+LKC +FKWYL +V+P +P D D G I+ Q C+E +G Q +
Sbjct: 418 LRKKLKCKTFKWYLDNVYPELKVPDDSDSKSGVIK--QRQNCLESQRVEG---QELPVLT 472
Query: 544 LLPC 547
L PC
Sbjct: 473 LAPC 476
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 66/147 (44%), Positives = 93/147 (63%), Gaps = 10/147 (6%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN S+RIP NR L D R +C T ++ + LP +SI+I FHNEA S LLRTV
Sbjct: 101 YTLYAFNQRESERIPSNRALRDTRHFRCATIRYDSD-LPPTSIIITFHNEARSTLLRTVR 159
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV++R+P ++ EI+LVDD S E S + + KL R +R+P R GLI++R+ GA
Sbjct: 160 SVLNRTPVHLIHEIILVDDFSDDE----SDCQLLIKLP-KVRCVRNPQREGLIRSRVRGA 214
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLP 198
A+ +L FLD+HCE N+++LP
Sbjct: 215 DSAKAAVLTFLDSHCE----VNKDWLP 237
>gi|351695439|gb|EHA98357.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
glaber]
Length = 608
Score = 321 bits (823), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 187/433 (43%), Positives = 255/433 (58%), Gaps = 39/433 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEY+ K
Sbjct: 150 LPVASVVICFYNEAFSALLRTVHSVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYIQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L ++IR+P R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A V D V
Sbjct: 210 LPAKIKLIRNPRREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAVVHGDPHTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRW-----YTYGSSDAIIKRKDFTEPFKT 370
VCPVIDIIS T AY S + G FNW LHF+W G +D+ T P K+
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSELGGADSA------TAPIKS 322
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK
Sbjct: 323 PTMAGGLFAMNRQYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRK 382
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY P G + + +L R+A VW+DE+ E YF P+ K + + R+ELRK+L
Sbjct: 383 RRPYGSPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTKSYGNISERVELRKKLG 440
Query: 491 CHSFKWYLTHVWPHHFL----------------PMDDKFF--GRIRHVQTHKCVEKPLAK 532
C SFKWYL +++P + P + GR+ H QT+KC+ +A+
Sbjct: 441 CQSFKWYLDNIYPEMQILGPNAKAQQPVFVNRGPKRPRVLQRGRLYHFQTNKCL---VAQ 497
Query: 533 GSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC 592
G +Q G L C + Q+++ +L+ + +CLD+ E + PR+
Sbjct: 498 GRPSQKGGLVVLKACDYEDP-AQVWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHG 555
Query: 593 SGFNRQRWTYDKE 605
SG Q+WT K
Sbjct: 556 SG-GSQQWTLGKN 567
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 104/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAVCKEKSYPTD-LPVASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LDEY+ K L ++IR+P R GLI+ R++G
Sbjct: 173 SVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYIQKYLPAKIKLIRNPRREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|410909548|ref|XP_003968252.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Takifugu rubripes]
Length = 580
Score = 321 bits (822), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 188/435 (43%), Positives = 251/435 (57%), Gaps = 46/435 (10%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP +S+VI F NEA SALLRTVHSV+ R+ +L EI+LVDD S E LK LD YV A+
Sbjct: 123 LPPASVVICFFNEALSALLRTVHSVLDRTAPFLLHEIILVDDYSELEELKGDLDRYVQAE 182
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +V+R+ R GLI+ R++GA A G++LVFLD+HCE WLE L+A + EDR V
Sbjct: 183 LQGKVKVLRNQRREGLIRGRMIGAAHASGQVLVFLDSHCEVNQMWLEPLLASIHEDRRTV 242
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T +Y S + G FNW LHF+W S+ + K +P ++P MAG
Sbjct: 243 VCPVIDIISADTLSYSPS-PIVRGGFNWGLHFKWDPVPPSE-LKSPKGPVDPIRSPTMAG 300
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAI+R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 301 GLFAINRKYFNEMGQYDAGMDIWGGENLEISFRIWMCGGQLLIIPCSRVGHIFRKRRPYG 360
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ E Y PE ++RD + R+ LR++L+C SF+
Sbjct: 361 SPGGQDTMAHNSL-RLAHVWMDEYKEQYLSMRPEL-RERDYGDISDRVALRERLQCRSFR 418
Query: 496 WYLTHVWPHH------------FLPMDDKF-----FGRIRHVQTHKCVEKPLAKGSMNQA 538
WYL +V+P F+ D K GR+ + T++C+ +A+G +Q
Sbjct: 419 WYLDNVYPEMQTVSNGNKHPPLFINKDLKRPKVLQRGRLHNRATNRCL---VAQGRASQK 475
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDES--------VCLDVPEYENDISPRVRIL 590
G L PC + P A DE +CLDV E P R++
Sbjct: 476 GGAVVLRPCDP----------QDPEQEWAYDEEGQLVLAGLLCLDVSEVRTFDPP--RLM 523
Query: 591 ACSGF-NRQRWTYDK 604
C G Q+W+ K
Sbjct: 524 KCHGSGGSQQWSMGK 538
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 119/191 (62%), Gaps = 10/191 (5%)
Query: 12 LARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPV 67
LA+NQ + +R H S + G+ N +D QQL Y + FNLL+S R+
Sbjct: 47 LAQNQDN---QRAAHKASNKRSDLSAEMGMIFNEAD-QQLRDSGYHRHAFNLLISTRLGP 102
Query: 68 NRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILL 127
+R LPD R P+C+ +++ + LP +S+VI F NEA SALLRTVHSV+ R+ +L EI+L
Sbjct: 103 HRDLPDTRDPQCRDRIYPRD-LPPASVVICFFNEALSALLRTVHSVLDRTAPFLLHEIIL 161
Query: 128 VDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC 186
VDD S E LK LD YV A+L +V+R+ R GLI+ R++GA A G++LVFLD+HC
Sbjct: 162 VDDYSELEELKGDLDRYVQAELQGKVKVLRNQRREGLIRGRMIGAAHASGQVLVFLDSHC 221
Query: 187 ECTLVFNEEFL 197
E ++ E L
Sbjct: 222 EVNQMWLEPLL 232
>gi|194210168|ref|XP_001915003.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 [Equus
caballus]
Length = 609
Score = 321 bits (822), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 186/431 (43%), Positives = 255/431 (59%), Gaps = 35/431 (8%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L E++LVDD S + LK LDEYV K
Sbjct: 151 LPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKY 210
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 211 LPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAVIQEDRRMV 270
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 271 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPTMAG 328
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+ R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 329 GLFAMSRRYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 388
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + + + R+ELRK+L C SFK
Sbjct: 389 SPEGQDTMTHNSL-RLAYVWLDEYKEQYFSLRPDL-RTKSYGNISERVELRKKLGCKSFK 446
Query: 496 WYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCV---EKPLAKGS 534
WYL +++P + P F GR+ H+QT+KC+ +P KGS
Sbjct: 447 WYLDNIYPEMQISGPNAKPQQPIFINRGPKRPKVLQRGRLCHLQTNKCLVAQSRPSQKGS 506
Query: 535 MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG 594
+ +L Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 507 L-------VVLKACDYGDPNQVWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG 558
Query: 595 FNRQRWTYDKE 605
Q+WT+ K+
Sbjct: 559 -GSQQWTFGKK 568
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 115 YQKHAFNMLISNRLGYHREVPDTRNAACKDKSYPTD-LPVASVVICFYNEALSALLRTVH 173
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L E++LVDD S + LK LDEYV K L +VIR+ R GLI+ R++G
Sbjct: 174 SVLDRTPARLLHEVILVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIG 233
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 234 AAHATGEVLVFLDSHCEVNVMWLQPLL 260
>gi|158299131|ref|XP_319236.4| AGAP010078-PA [Anopheles gambiae str. PEST]
gi|157014221|gb|EAA14535.4| AGAP010078-PA [Anopheles gambiae str. PEST]
Length = 504
Score = 321 bits (822), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 178/408 (43%), Positives = 252/408 (61%), Gaps = 23/408 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++ EI+LVDD S +F + + +AK+
Sbjct: 68 LPATSVIITFHNEARSTLLRTVVSVLNRSPERLIHEIILVDDYS--DFPEDG--QELAKI 123
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL+++R+ GA A ++L FLD+HCEC + WLE L+ARVAED TRVV
Sbjct: 124 Q-KVRLIRNSKREGLVRSRVTGAAAATAKVLTFLDSHCECNVNWLEPLLARVAEDPTRVV 182
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W +++ +++D T P +TP +AGG
Sbjct: 183 CPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSNAERKARQRDPTAPIRTPMIAGG 242
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF ID+AYF +G YD +M +WGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 243 LFVIDKAYFERLGTYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTF 302
Query: 437 PGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG S ++ N R A VWMDE+ ++Y+ P A + RL+LRK+L+C F+
Sbjct: 303 PGGGSGNIFAKNTRRAAEVWMDEYKKYYYAAVPLATNIPFGD-IDDRLQLRKELQCKPFR 361
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
WYL HV+P +P + + G IR Q C++ S+ +G L H Q
Sbjct: 362 WYLEHVYPQLGIP-ERRNNGSIR--QGVYCLD------SLGNVAGAVVGLYSCHGNGGNQ 412
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPR---VRILACSGFNRQRW 600
+++ +L D +CL + ++ IS R V + C Q+W
Sbjct: 413 NWILNRKGELKHHD--LCLTLVKFT--ISARYNSVLMKYCDDSENQQW 456
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 65/161 (40%), Positives = 96/161 (59%), Gaps = 9/161 (5%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNE----EFLPKSSIVIVF 98
L + Y NRFN SD + NR LPD R C+ +++ LP +S++I F
Sbjct: 18 GGLQRGEDPYLRNRFNQQASDGLKSNRELPDTRNAMCRRSSWSDLSTIAHLPATSVIITF 77
Query: 99 HNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSP 158
HNEA S LLRTV SV++RSP ++ EI+LVDD S +F + + +AK+ R+IR+
Sbjct: 78 HNEARSTLLRTVVSVLNRSPERLIHEIILVDDYS--DFPEDG--QELAKIQ-KVRLIRNS 132
Query: 159 GRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
R GL+++R+ GA A ++L FLD+HCEC + + E L +
Sbjct: 133 KREGLVRSRVTGAAAATAKVLTFLDSHCECNVNWLEPLLAR 173
>gi|431895736|gb|ELK05155.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Pteropus alecto]
Length = 608
Score = 321 bits (822), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 186/428 (43%), Positives = 254/428 (59%), Gaps = 29/428 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L E++LVDD S + LK LD +V K
Sbjct: 150 LPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEVILVDDDSDFDDLKGELDAFVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 210 LPGKIKVIRNRKREGLIRGRMIGASHATGEVLVFLDSHCEVNVMWLQPLLAAIQEDRRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLPEPG-GPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRDYFSELGQYDRGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + R + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTRSYGNISERVELRKKLGCKSFK 445
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQVSGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTGKCL---VAQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +LI ++ +CLD+ E + PR+ SG
Sbjct: 503 KGGLVVLKACDYSDP-NQIWIYNEEHELILSN-LLCLDMSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDKE 605
Q+WT+ K
Sbjct: 560 QQWTFGKN 567
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/147 (48%), Positives = 101/147 (68%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+SDR+ +R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISDRLGYHRDVPDTRNAACKDKTYPAD-LPVASVVICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L E++LVDD S + LK LD +V K L +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPAQLLHEVILVDDDSDFDDLKGELDAFVQKYLPGKIKVIRNRKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 ASHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|296212534|ref|XP_002752871.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Callithrix jacchus]
Length = 578
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 176/396 (44%), Positives = 241/396 (60%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T +VCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
LKC SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LKCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C +VPE +N
Sbjct: 477 C-HGQGGNQFFEYTSKKEIRFNSVTELCAEVPEQKN 511
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/155 (46%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK+K FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|443683118|gb|ELT87486.1| hypothetical protein CAPTEDRAFT_155466 [Capitella teleta]
Length = 644
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 153/338 (45%), Positives = 222/338 (65%), Gaps = 4/338 (1%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
N+ L ++++I FHNEAWS LLRT+HSVI+RSP ++ EI+LVDDAST ++L L+
Sbjct: 198 INQSTLSPTTVIICFHNEAWSTLLRTLHSVINRSPSHLIMEIILVDDASTFDYLGEPLEN 257
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++++L ++R+ R GLI+ARLLG A+G++LVFLD+HCEC GWL L+ + D
Sbjct: 258 HLSQLE-NVYLLRTKIREGLIRARLLGVSYAKGDVLVFLDSHCECAEGWLPPLLLAIEAD 316
Query: 312 RTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
RT++VCP++D+I TF Y + E GAF+W L F W + + +R + + P
Sbjct: 317 RTKIVCPLVDVIEFQTFEYRAAKEELHGAFDWNLQFIWKDLPEHE-MKRRTSPADNIRAP 375
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
+ GGLFA+DR YF IG+YD M +WG ENLE+SFRVW CGGS+EI+PCS V H+FR
Sbjct: 376 TIIGGLFAVDRLYFKRIGSYDSGMDIWGSENLELSFRVWMCGGSLEISPCSRVGHVFRTR 435
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKC 491
PY FP G + N R A VW+D++ +F++ + + V R++LR++LKC
Sbjct: 436 IPYGFPNGGKRTIRNNAMRAAEVWLDDYKKFFYASQNITRRLTTVEDVVVRVDLRRKLKC 495
Query: 492 HSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQT-HKCVE 527
SF+WYL +V P LP D D++FG+I+ + + KC+E
Sbjct: 496 KSFQWYLDNVIPEAVLPEDEDEYFGQIQSLASPSKCLE 533
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 65/142 (45%), Positives = 100/142 (70%), Gaps = 3/142 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
D + Q + FN+ S+ +P++R +PD R +C + N+ L ++++I FHNEAWS L
Sbjct: 163 DLESGMQRHSFNVRASELLPLDRPIPDYRPTQCPS--INQSTLSPTTVIICFHNEAWSTL 220
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LRT+HSVI+RSP ++ EI+LVDDAST ++L L+ ++++L ++R+ R GLI+A
Sbjct: 221 LRTLHSVINRSPSHLIMEIILVDDASTFDYLGEPLENHLSQLE-NVYLLRTKIREGLIRA 279
Query: 167 RLLGARQAEGEILVFLDAHCEC 188
RLLG A+G++LVFLD+HCEC
Sbjct: 280 RLLGVSYAKGDVLVFLDSHCEC 301
>gi|332839987|ref|XP_003313889.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
troglodytes]
gi|397505857|ref|XP_003823459.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pan
paniscus]
gi|410207422|gb|JAA00930.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252142|gb|JAA14038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252144|gb|JAA14039.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252146|gb|JAA14040.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252148|gb|JAA14041.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410252150|gb|JAA14042.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410289758|gb|JAA23479.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355493|gb|JAA44350.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
gi|410355495|gb|JAA44351.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Pan
troglodytes]
Length = 578
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 176/396 (44%), Positives = 241/396 (60%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T VVCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKQERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C +VPE +N
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 511
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|332020473|gb|EGI60888.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Acromyrmex
echinatior]
Length = 442
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 177/407 (43%), Positives = 240/407 (58%), Gaps = 19/407 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 8 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 66
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED TRVV
Sbjct: 67 ----RVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAEDPTRVV 122
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W ++ ++KD T+ +TP +AGG
Sbjct: 123 CPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQTERQARQKDPTQAIRTPMIAGG 182
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 183 LFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYSF 242
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ +FY+ P A + ++ R+EL+++L C F W
Sbjct: 243 PGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKRKLHCKPFSW 301
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLPCTHLPVLTQ 555
YL +V+P +P + G Q C++ SM G L PC H Q
Sbjct: 302 YLKNVYPELVIPTSEGGPGGSLK-QGTACLD------SMGHLLDGNVGLYPC-HDTGGNQ 353
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ L D + +CL +P Y + ++I C G Q+W +
Sbjct: 354 EW--GLTKDGLIKHHDLCLTLPVYAKGTTLLMQI--CDGSENQKWRH 396
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 70/111 (63%), Gaps = 5/111 (4%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 8 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 66
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RVIR+ R GL+++R+ GA A +L FLD+HCEC + E L +
Sbjct: 67 ----RVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLER 113
>gi|348513278|ref|XP_003444169.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4
[Oreochromis niloticus]
Length = 584
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 158/339 (46%), Positives = 229/339 (67%), Gaps = 11/339 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I F+NEAWS LLRT+HSV+ +P +LKEI+LVDD S R +LKS L +
Sbjct: 135 FDYRHLPTTSVIIAFYNEAWSTLLRTIHSVLETTPAILLKEIILVDDFSDRGYLKSKLAD 194
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GW+E L+ R++E+
Sbjct: 195 YISDLQR-VRLIRTNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPLLERISEN 253
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ +VCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 254 ASTIVCPVIDTIDWNTFEFYMQTDEPMIGGFDWRLTFQWHSVPEMERK-RRKSRIDPIRS 312
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ +AYF ++G YD M VWGGENLE+SFRVWQCGGS+EI PCSHV H+F K
Sbjct: 313 PTMAGGLFAVSKAYFEYLGTYDMGMDVWGGENLELSFRVWQCGGSLEIHPCSHVGHVFPK 372
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N R A VWMD + + ++ NP A K++ + RL LR++LK
Sbjct: 373 KAPYARPNFLQ-----NTVRAAEVWMDSYKKHFYNRNPPARKEKYGN-ISERLLLREKLK 426
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTH-KCVE 527
C+SF+WYL +++P +P D + + G +R H +C++
Sbjct: 427 CNSFEWYLKNIYPELHVPEDREGWHGAVRSSGIHSECLD 465
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 75/165 (45%), Positives = 109/165 (66%), Gaps = 6/165 (3%)
Query: 42 HSNLS---DAQQLYQINRF--NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
H NLS Q+ + R+ N+ VSD+I ++R + D R +C++K F+ LP +S++I
Sbjct: 88 HLNLSPEEKKQEQDSVERYAINIYVSDKISLHRHIQDHRMKECRSKKFDYRHLPTTSVII 147
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
F+NEAWS LLRT+HSV+ +P +LKEI+LVDD S R +LKS L +Y++ L R+IR
Sbjct: 148 AFYNEAWSTLLRTIHSVLETTPAILLKEIILVDDFSDRGYLKSKLADYISDLQR-VRLIR 206
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ R GL++ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 207 TNKREGLVRARLIGATYATGDVLTFLDCHCECVPGWIEPLLERIS 251
>gi|403272081|ref|XP_003927917.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Saimiri
boliviensis boliviensis]
Length = 578
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 176/399 (44%), Positives = 242/399 (60%), Gaps = 19/399 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGS--SDAIIKRKDFTEPF 368
T +VCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKYERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
LKC SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LKCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
C H Q F ++ + +C +VPE +N +
Sbjct: 477 C-HGQGGNQFFEYTSKKEIRFNSVTELCAEVPEQKNYVG 514
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 111/173 (64%), Gaps = 5/173 (2%)
Query: 31 HYLGD-SVDGGLHSN---LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNE 86
H LG+ LH N L ++L + N+ +SDRI ++R + D R +CK+K FN
Sbjct: 72 HALGEWGKASKLHLNEGELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSKKFNY 131
Query: 87 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 146
LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++
Sbjct: 132 RTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYIS 191
Query: 147 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L R+IR+ R GL++ARL+GA A G++L FLD HCEC + E L +
Sbjct: 192 NLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|195334637|ref|XP_002033984.1| GM21620 [Drosophila sechellia]
gi|194125954|gb|EDW47997.1| GM21620 [Drosophila sechellia]
Length = 601
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 184/433 (42%), Positives = 253/433 (58%), Gaps = 33/433 (7%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ E LP +S+VI+F NE +S LLRTVHS +S LKEI+LVDD S L + LD
Sbjct: 143 FDSESLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 252 YVAKLSVPTR---VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
YV + +P R ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 203 YV-RTRIPARKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRI 261
Query: 309 AEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 262 KESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRREC 319
Query: 365 TE-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+ P +P MAGGLFAIDR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE
Sbjct: 320 RQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 379
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D
Sbjct: 380 PCSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGD 437
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQ 537
V R+ LRK+L+C SF+WYL +++P F+P D + +G++ V + C++ L N+
Sbjct: 438 VTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVHAVNANLCLDDLLQN---NE 494
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP--RVRILAC--- 592
A L PC + +Q+F L +E C V E SP RV ++ C
Sbjct: 495 KPYNAGLYPCGKVLQKSQLFSFTNTNAL--RNELSCATVQHSE---SPPYRVVMVPCMEN 549
Query: 593 SGFNRQRWTYDKE 605
FN Q W Y+ +
Sbjct: 550 DEFNEQ-WRYEHQ 561
Score = 125 bits (314), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 97/155 (62%), Gaps = 4/155 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
++Y+ N +S+++ NR++ D R P C + F+ E LP +S+VI+F NE +S LL
Sbjct: 107 GDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQRFDSESLPTASVVIIFFNEPYSVLL 166
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR---VIRSPGRVGLI 164
RTVHS +S LKEI+LVDD S L + LD YV + +P R ++R R+GLI
Sbjct: 167 RTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDYYV-RTRIPARKVTILRLKNRLGLI 225
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 226 RARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQR 260
>gi|302565702|ref|NP_001181690.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
gi|380817542|gb|AFE80645.1| polypeptide N-acetylgalactosaminyltransferase 4 [Macaca mulatta]
Length = 578
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 175/396 (44%), Positives = 242/396 (61%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLES 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGS--SDAIIKRKDFTEPF 368
T +VCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N ARVA VWMDE+ E ++ NP A K+ +L +R++
Sbjct: 365 PKRAPYARPNFLQ-----NTARVAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-IRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C +VPE +N
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 511
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLESYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|194220840|ref|XP_001500424.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Equus
caballus]
Length = 539
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 183/452 (40%), Positives = 258/452 (57%), Gaps = 26/452 (5%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H CT + LP +SI+I FHNEA S L
Sbjct: 55 RVGDDPYKLYAFNQRESERISSNRAVPDTRHLRCTTLVYCTDLPPTSIIITFHNEARSTL 114
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++KEI+LVDD S + D+ + +P + +R+ R GL++
Sbjct: 115 LRTIRSVLNRTPMNLIKEIILVDDFS------NDPDDCNQLIKLPKVKCLRNENRQGLVR 168
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA AEG IL F+D+HCE WL+ L+ RV ED TRVVCPVIDII+ F Y+ S
Sbjct: 169 SRIRGADFAEGAILTFMDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDNFNYIES 228
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W S + +R D EP +TP +AGGLF +++++F ++G YD
Sbjct: 229 ATELRGGFDWSLHFQWEQL-SPEQKAQRLDPAEPIRTPVIAGGLFVMNKSWFDYLGKYDM 287
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R
Sbjct: 288 DMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTVE 347
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + SR++LR L C SFKWYL +V+P +P D
Sbjct: 348 VWMDEYKQYYYAARPFA-LERPFGNIDSRVDLRSTLLCQSFKWYLENVYPELRIPKDSSI 406
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E ++ NQ L PC ++Q++ +I
Sbjct: 407 QKGNIR--QRQKCLE---SQKQDNQKISNVKLSPCVKSKGEDTMSQIWAFTYTQQIIQ-- 459
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L + ++Q+WT
Sbjct: 460 EELCLSVITVFPG-APVVLVLCKNEDDKQQWT 490
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/137 (43%), Positives = 90/137 (65%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C T V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 61 YKLYAFNQRESERISSNRAVPDTRHLRCTTLVYCTD-LPPTSIIITFHNEARSTLLRTIR 119
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++KEI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 120 SVLNRTPMNLIKEIILVDDFS------NDPDDCNQLIKLPKVKCLRNENRQGLVRSRIRG 173
Query: 171 ARQAEGEILVFLDAHCE 187
A AEG IL F+D+HCE
Sbjct: 174 ADFAEGAILTFMDSHCE 190
>gi|297692565|ref|XP_002823614.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Pongo
abelii]
Length = 578
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 175/396 (44%), Positives = 241/396 (60%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T +VCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKQKRDRQISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C +VPE +N
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 511
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|189053556|dbj|BAG35722.1| unnamed protein product [Homo sapiens]
Length = 578
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 175/396 (44%), Positives = 241/396 (60%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T VVCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKQERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C ++PE +N
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEIPEQKN 511
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|426373643|ref|XP_004053705.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Gorilla
gorilla gorilla]
Length = 578
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 176/396 (44%), Positives = 241/396 (60%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T VVCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKQERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLR-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C +VPE +N
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 511
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|167526997|ref|XP_001747831.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773580|gb|EDQ87218.1| predicted protein [Monosiga brevicollis MX1]
Length = 658
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 176/418 (42%), Positives = 249/418 (59%), Gaps = 30/418 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L K+SI+I F NEAWS LLRTVHSV++RSP ++ EI+L+DD+S +L L Y+ +
Sbjct: 215 LLKASIIICFVNEAWSTLLRTVHSVLNRSPADLVHEIILLDDSSDAAWLGDKLTNYI-RD 273
Query: 257 SVPTRV--IRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
++P +V +R+ R GLI+ARL+GA A G++L+FLD+HCE L WLE ++A + EDR
Sbjct: 274 NLPDKVKYVRTQHRSGLIRARLVGAEHATGDVLLFLDSHCEANLNWLEPIMALITEDRRT 333
Query: 315 VVCPVIDIISDVTFAYVRSFE--LHWGAFNWELHFRWYTYGSSDAIIKRK--DFTEPFKT 370
VV PVID I T Y ++ + G F+W + F W A ++R D T+P +
Sbjct: 334 VVTPVIDSIDHHTMEYSKATQDVPAVGTFDWTMDFNW------KAGVRRAGADATDPVDS 387
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+++ YF+ +G+YDE+M WGGENLEMSFR+WQCGG + APCSHV H+FR
Sbjct: 388 PTMAGGLFAMEKNYFYELGSYDEKMDGWGGENLEMSFRIWQCGGRLVTAPCSHVGHIFRD 447
Query: 431 ASPYSFPGG-VSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQL 489
+ PY+ PGG + + N RVA VWMD + +++ P + D V R ELR++L
Sbjct: 448 SHPYTVPGGSIHDTFLRNSMRVAEVWMDHYKQYFLDTRP-GQNIIDAGDVSERKELRQRL 506
Query: 490 KCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKP--LAKGSMNQASGP-ASLLP 546
+CH FKWYL V P F+P H+Q + P + M Q +G A + P
Sbjct: 507 QCHDFKWYLNTVLPDLFIP-------DANHIQHQGTLHTPDNICVDKMGQRNGGVAGVYP 559
Query: 547 CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
C H Q ++ + T+ I T +S+CLD + + + V + C G Q W YD
Sbjct: 560 C-HGQGTNQAWMYSI-TNEIRTHDSLCLDA--WGSTLPSPVHLGRCHGMRGNQEWRYD 613
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 120/204 (58%), Gaps = 25/204 (12%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+ YQ + FN S +P++R +PDVR P CK K + L K+SI+I F NEAWS LLR
Sbjct: 175 QESYQNHFFNEYKSSLLPLDRPVPDVRPPACKAKQWPTANLLKASIIICFVNEAWSTLLR 234
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRV--IRSPGRVGLIKA 166
TVHSV++RSP ++ EI+L+DD+S +L L Y+ + ++P +V +R+ R GLI+A
Sbjct: 235 TVHSVLNRSPADLVHEIILLDDSSDAAWLGDKLTNYI-RDNLPDKVKYVRTQHRSGLIRA 293
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSP 226
RL+GA A G++L+FLD+HCE L + L + ++I+
Sbjct: 294 RLVGAEHATGDVLLFLDSHCEANLNW----------------------LEPIMALITEDR 331
Query: 227 RSMLKEILLVDDASTREFLKSSLD 250
R+++ ++ D T E+ K++ D
Sbjct: 332 RTVVTPVIDSIDHHTMEYSKATQD 355
>gi|410965222|ref|XP_003989149.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Felis
catus]
Length = 582
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 172/394 (43%), Positives = 243/394 (61%), Gaps = 15/394 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 133 FNYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 192
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ +D
Sbjct: 193 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGKD 251
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T +VCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 252 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPIRS 310
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 311 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 370
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMD++ E ++ NP A K+ +L LR++LK
Sbjct: 371 RAPYARPNFLQ-----NTARAAEVWMDQYKEHFYNRNPPARKEAYGDISERKL-LRERLK 424
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ + +P D + G IR + + +C++ N SL C
Sbjct: 425 CKSFDWYLKNVFSNLHVPEDRPGWHGAIRSMGISSECLD--YNSPDSNPTGANLSLFGC- 481
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
H Q F ++ + +C +VPE +N
Sbjct: 482 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 515
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 71/158 (44%), Positives = 104/158 (65%), Gaps = 1/158 (0%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NE
Sbjct: 91 QDELKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRRLPTTSVIIAFYNE 150
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
AWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R
Sbjct: 151 AWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKRE 209
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GL++ARL+GA A G++L FLD HCEC + E L +
Sbjct: 210 GLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 247
>gi|402887191|ref|XP_003906986.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 [Papio
anubis]
Length = 578
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 175/399 (43%), Positives = 242/399 (60%), Gaps = 19/399 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGS--SDAIIKRKDFTEPF 368
T +VCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSKGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
C H Q F ++ + +C +VPE +N +
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKNYVG 514
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|301759363|ref|XP_002915525.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Ailuropoda melanoleuca]
gi|281339844|gb|EFB15428.1| hypothetical protein PANDA_003531 [Ailuropoda melanoleuca]
Length = 608
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 184/427 (43%), Positives = 256/427 (59%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK L+EYV K
Sbjct: 150 LPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + +D+ V
Sbjct: 210 LPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIQQDQRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + + + R+ELR++L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTKSYGNISERVELRRKLGCKSFK 445
Query: 496 WYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTDKCL---VAQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 503 KGGLVVLKACDYSDP-GQIWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDK 604
Q+WT+ K
Sbjct: 560 QQWTFGK 566
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 102/147 (69%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPVD-LPVASVVICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK L+EYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVMWLQPLL 259
>gi|449493914|ref|XP_004175359.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12 [Taeniopygia
guttata]
Length = 594
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/325 (50%), Positives = 220/325 (67%), Gaps = 10/325 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S+VI F+NEAWS LLRTVHSV+ SP +L+EI+LVDD S +E LK +L+ YVA L
Sbjct: 146 LPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLKETLENYVAGL 205
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A+G+IL FLD HCEC GWLE L+AR+AE+ T VV
Sbjct: 206 R-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLLARIAEEETAVV 264
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ + E G F+ L F W++ + +RK T+ ++P MAG
Sbjct: 265 CPVIDVIDWNTFEYLGNAGEPQIGGFDXRLVFTWHSTPEREQK-RRKSKTDVIRSPTMAG 323
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++ + YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K +PYS
Sbjct: 324 GLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPKQAPYS 383
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMDE+ + Y+ NP A + RL LR++LKC FK
Sbjct: 384 RAKALA-----NSVRAAEVWMDEYKQLYYHRNPHARLEPYGDVTERRL-LREKLKCKDFK 437
Query: 496 WYLTHVWPHHFLPMD-DKFFGRIRH 519
W+L +V+P +P D FFG +++
Sbjct: 438 WFLENVYPELHVPEDRPGFFGMLKN 462
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 72/134 (53%), Positives = 98/134 (73%), Gaps = 1/134 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
++ N+ +SDRI ++R LP+ P C+ K ++ LPK+S+VI F+NEAWS LLRTVHSV+
Sbjct: 112 HQINIYLSDRISLHRRLPERWHPLCREKKYDYYNLPKTSVVIAFYNEAWSTLLRTVHSVL 171
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +L+EI+LVDD S +E LK +L+ YVA L R+IR+ R GL++ARLLGA A
Sbjct: 172 ETSPDILLEEIILVDDYSDKEHLKETLENYVAGLR-KVRLIRANKREGLVRARLLGASVA 230
Query: 175 EGEILVFLDAHCEC 188
+G+IL FLD HCEC
Sbjct: 231 KGDILTFLDCHCEC 244
>gi|315221121|ref|NP_001186710.1| POC1B-GALNT4 protein isoform 1 [Homo sapiens]
Length = 575
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 178/402 (44%), Positives = 243/402 (60%), Gaps = 20/402 (4%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +L
Sbjct: 120 ECKSQKFNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYL 179
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K+ L+ Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+
Sbjct: 180 KTQLETYISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLL 238
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRK 362
R+ D T VVCPVID I TF Y++ E G F+W L F+W++ D I R
Sbjct: 239 ERIGRDETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPKQERDRRISRI 298
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
D P ++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCS
Sbjct: 299 D---PIRSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCS 355
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSR 482
HV H+F K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +
Sbjct: 356 HVGHVFPKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERK 410
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASG 540
L LR++L+C SF WYL +V+P+ +P D + G IR + +C++ N
Sbjct: 411 L-LRERLRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGA 467
Query: 541 PASLLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
SL C H Q F ++ + +C +VPE +N
Sbjct: 468 NLSLFGC-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 508
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 87 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 146
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 147 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 205
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 206 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 240
>gi|348518337|ref|XP_003446688.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Oreochromis niloticus]
Length = 598
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 174/378 (46%), Positives = 231/378 (61%), Gaps = 16/378 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT + + LP +SI+I FHNEA S LLRT+ SV++R+P ++ EI+LVDD S E
Sbjct: 144 HYRCTTLHYDSELPSTSIIITFHNEARSTLLRTIKSVLNRTPVHLIYEIILVDDFSDDE- 202
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
S + + KL + R+ R GLI++R+ G A ++L FLD+HCE WL L
Sbjct: 203 ---SDCQLLTKLP-KVKCFRNNKREGLIRSRVRGTDAARAKVLTFLDSHCEVNKDWLPPL 258
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ R+ ED +RVV PVIDII+ TFAYV + G F+W LHF+W S + +R D
Sbjct: 259 LQRIKEDPSRVVSPVIDIINMDTFAYVAASADLRGGFDWSLHFKWEQL-SPEQRARRTDP 317
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+P KTP +AGGLF IDRA+F H+G YD M +WGGEN E+SFRVWQCGGS+EI PCS V
Sbjct: 318 TQPIKTPIIAGGLFVIDRAWFNHLGKYDTAMDIWGGENFEISFRVWQCGGSLEILPCSRV 377
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY FP G + N R A VWMD++ FY+ P A + + +RSR+E
Sbjct: 378 GHVFRKKHPYVFPEGNANTYIKNTRRTAEVWMDDFRLFYYSARP-AARGKSYGDIRSRVE 436
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LRK+L C SFKWYL +V+P +P D D G I+ Q C+E +G Q +
Sbjct: 437 LRKKLNCKSFKWYLDNVYPELKVPDDSDSQSGVIK--QRQNCLESRKVEG---QEMPVLT 491
Query: 544 LLPCT---HLPVLTQMFV 558
L PCT +P + Q +V
Sbjct: 492 LAPCTGTEGVPAINQEWV 509
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/147 (42%), Positives = 90/147 (61%), Gaps = 10/147 (6%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN S+RIP +R L D R +C T ++ E LP +SI+I FHNEA S LLRT+
Sbjct: 120 YTLYAFNQRESERIPSDRALRDTRHYRCTTLHYDSE-LPSTSIIITFHNEARSTLLRTIK 178
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV++R+P ++ EI+LVDD S E S + + KL + R+ R GLI++R+ G
Sbjct: 179 SVLNRTPVHLIYEIILVDDFSDDE----SDCQLLTKLP-KVKCFRNNKREGLIRSRVRGT 233
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLP 198
A ++L FLD+HCE N+++LP
Sbjct: 234 DAARAKVLTFLDSHCE----VNKDWLP 256
>gi|195028169|ref|XP_001986949.1| GH20244 [Drosophila grimshawi]
gi|193902949|gb|EDW01816.1| GH20244 [Drosophila grimshawi]
Length = 599
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 178/432 (41%), Positives = 248/432 (57%), Gaps = 33/432 (7%)
Query: 193 NEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEY 252
N LP +S++++F+NE +S LLRTVHS ++ LKEI+LVDD S L LD Y
Sbjct: 141 NPATLPTASVIVIFYNEPYSVLLRTVHSTLNTCNEQALKEIILVDDGSDNAELGGKLDHY 200
Query: 253 VAKLSVP---TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
V K P V+R R+GLI+ARL GAR A G++L+FLDAHCE GW E L+ R+
Sbjct: 201 V-KTRFPIGKVTVLRLNNRLGLIRARLAGARIATGDVLIFLDAHCEANEGWCEPLLQRIK 259
Query: 310 EDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
+ RT V+ P+ID+I V F Y +SF++ G F W HF W + + + ++
Sbjct: 260 DSRTSVLVPIIDVIDSVDFQYSTNGYKSFQV--GGFQWNGHFDWVNLPEREKLRQSRECN 317
Query: 366 EPFK-----TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
+P + +P MAGGLFA+DR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE P
Sbjct: 318 QPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIP 377
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
CS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D V
Sbjct: 378 CSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINVFFLNRPDLKFHPDIGDV 435
Query: 480 RSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQA 538
R+ LRK+L+C SF WYL +V+P F+P + K +GR+R V + C++ L + N+
Sbjct: 436 THRVVLRKKLRCKSFDWYLQNVYPEKFVPNKNVKAWGRVRSVHDNLCIDDLL---NNNEK 492
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRI-----LACS 593
L PC +Q+F + +E C V ++ SP RI L
Sbjct: 493 PYNLGLYPCGKTLQHSQLF--SFTNSQVLRNELSCATV---QHSSSPPYRIVMVPCLEND 547
Query: 594 GFNRQRWTYDKE 605
+N Q W Y+ +
Sbjct: 548 DYNEQ-WKYENQ 558
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 93/156 (59%), Gaps = 5/156 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVF-NEEFLPKSSIVIVFHNEAWSAL 106
++Y+ N +S+++ NRT+ D R P C + + N LP +S++++F+NE +S L
Sbjct: 103 GDKIYKKIALNEELSEQLSYNRTVGDHRNPLCLNQRYDNPATLPTASVIVIFYNEPYSVL 162
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP---TRVIRSPGRVGL 163
LRTVHS ++ LKEI+LVDD S L LD YV K P V+R R+GL
Sbjct: 163 LRTVHSTLNTCNEQALKEIILVDDGSDNAELGGKLDHYV-KTRFPIGKVTVLRLNNRLGL 221
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARL GAR A G++L+FLDAHCE + E L +
Sbjct: 222 IRARLAGARIATGDVLIFLDAHCEANEGWCEPLLQR 257
>gi|195455372|ref|XP_002074693.1| GK23025 [Drosophila willistoni]
gi|194170778|gb|EDW85679.1| GK23025 [Drosophila willistoni]
Length = 599
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 176/431 (40%), Positives = 249/431 (57%), Gaps = 29/431 (6%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I+F NE +S LLRTVHS +S LKEI+LVDD S L LD
Sbjct: 140 FDTNSLPSASVIIIFFNEPYSVLLRTVHSTLSTCNEKSLKEIILVDDGSDNVELGGKLDH 199
Query: 252 YVAKLSVPT---RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
Y+ + P V+R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 200 YI-RTRFPAGKVTVLRLKNRLGLIRARLAGARMATGDVLIFLDAHCEGNVGWCEPLLQRI 258
Query: 309 AEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
E RT V+ P+ID+I F Y ++F++ G F W HF W + +R++
Sbjct: 259 KESRTSVLVPIIDVIDANDFQYSTNGYKAFQV--GGFQWNGHFDWVNLPEREKQRQRREC 316
Query: 365 TE-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+ P +P MAGGLFAIDR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE
Sbjct: 317 DQAREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 376
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FR PY FP +G N AR+ALVWMD++ +F P+ + D
Sbjct: 377 PCSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDDYINIFFLNRPDLKFHADIGD 434
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQ 537
V R+ LRK+L+C SF WYL +V+P F+P + +++G++R V + C++ L N+
Sbjct: 435 VTHRVMLRKKLRCKSFDWYLKNVYPEKFVPNKNVQYWGKVRAVNANLCLDDLLQN---NE 491
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS---G 594
L PC +Q+F + +E C V ++ + RV ++ CS
Sbjct: 492 KPFNLGLYPCGKTLQKSQLF--SYTNSQVLRNELSCATV-QHSDSPPRRVVMVPCSESDK 548
Query: 595 FNRQRWTYDKE 605
FN Q W Y+ +
Sbjct: 549 FNDQ-WKYEDQ 558
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 95/155 (61%), Gaps = 4/155 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
+++Y+ N +S+++ NRT+ D R P C ++ F+ LP +S++I+F NE +S LL
Sbjct: 104 GEKIYKKIALNEELSEQLSYNRTVGDHRNPLCASQRFDTNSLPSASVIIIFFNEPYSVLL 163
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT---RVIRSPGRVGLI 164
RTVHS +S LKEI+LVDD S L LD Y+ + P V+R R+GLI
Sbjct: 164 RTVHSTLSTCNEKSLKEIILVDDGSDNVELGGKLDHYI-RTRFPAGKVTVLRLKNRLGLI 222
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 223 RARLAGARMATGDVLIFLDAHCEGNVGWCEPLLQR 257
>gi|13929126|ref|NP_113984.1| polypeptide N-acetylgalactosaminyltransferase 5 [Rattus norvegicus]
gi|51315691|sp|O88422.1|GALT5_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|3510639|gb|AAC69708.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T5 [Rattus
norvegicus]
gi|149047792|gb|EDM00408.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
gi|149047793|gb|EDM00409.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5, isoform CRA_a
[Rattus norvegicus]
Length = 930
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 167/384 (43%), Positives = 238/384 (61%), Gaps = 26/384 (6%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
+++ R + +PG+ G K + R EG V+L A C
Sbjct: 418 VTLSPRDLNAPGQFGRPVVVPPGKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 477
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N+ LP +SI++ F +E WSALLR+VHSV++RSP ++KEILLVDD ST+++LK+
Sbjct: 478 EQLVHND--LPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKA 535
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 536 NLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLER 594
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTE 366
V +R +V CPVI++I+D +Y+ G F W ++F W T D I K T+
Sbjct: 595 VYLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFGWRTI-PPDVIAKNGIKETD 653
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
+ P MAGGLF+ID++YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H
Sbjct: 654 IIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGH 713
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLEL 485
+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + + EL
Sbjct: 714 IFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQREL 773
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPM 509
RK+LKC SFKWYL +V+P P+
Sbjct: 774 RKKLKCQSFKWYLDNVFPDLKAPV 797
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 69/151 (45%), Positives = 108/151 (71%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A+Q ++ FN+ +SD IPV+R + D R C ++ + + LP +SI++ F +E WSAL
Sbjct: 444 EAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHND-LPTTSIIMCFVDEVWSAL 502
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK++LD+Y+++ R++R R GLI+A
Sbjct: 503 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKANLDKYMSQFP-KVRILRLKERHGLIRA 561
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 562 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 592
>gi|348519900|ref|XP_003447467.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oreochromis niloticus]
Length = 777
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 157/331 (47%), Positives = 217/331 (65%), Gaps = 14/331 (4%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
CE +LV ++ LP +S++ F +E WS LLR+VHSV++RSP +LKEI+LVDD ST+++L
Sbjct: 325 CEQSLVHDD--LPSTSVIFCFVDEVWSTLLRSVHSVLNRSPPHLLKEIILVDDFSTKDYL 382
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K LD+Y+A+ R++R R GLI+ARL GA A+GE+L FLD+H EC +GWLE L+
Sbjct: 383 KKQLDDYMAQFP-KVRIVRLKERQGLIRARLAGAAVAKGEVLTFLDSHIECNVGWLEPLL 441
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
RV DR +V CPVI++ISD +Y+ G F W L F W D IK+ + T
Sbjct: 442 ERVYLDRKKVPCPVIEVISDKDMSYMMVDNFQRGIFKWPLVFGWSAVPPED--IKKFNLT 499
Query: 366 --EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+P + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF++W CGG IEI PCS
Sbjct: 500 ISDPIRCPVMAGGLFSIDKQYFFELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSR 559
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR--- 480
V H+FR +PY FP + + NLARVA VW+DE+ + ++ DK+ +
Sbjct: 560 VGHIFRGQNPYKFPKDRQKTVERNLARVAEVWLDEYKDLFYGHG--YHHLLDKKLINIGN 617
Query: 481 --SRLELRKQLKCHSFKWYLTHVWPHHFLPM 509
++ELRK+LKC SFKWYL +V+P P+
Sbjct: 618 LTDQIELRKKLKCKSFKWYLDNVYPDMVAPL 648
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 106/152 (69%), Gaps = 2/152 (1%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
++ ++ + FN+ +SD+IPV+R +PD R C+ + +++ LP +S++ F +E WS
Sbjct: 292 AEVKKRWDEGHFNVYLSDKIPVDRAIPDTRPQMCEQSLVHDD-LPSTSVIFCFVDEVWST 350
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLR+VHSV++RSP +LKEI+LVDD ST+++LK LD+Y+A+ R++R R GLI+
Sbjct: 351 LLRSVHSVLNRSPPHLLKEIILVDDFSTKDYLKKQLDDYMAQFP-KVRIVRLKERQGLIR 409
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
ARL GA A+GE+L FLD+H EC + + E L
Sbjct: 410 ARLAGAAVAKGEVLTFLDSHIECNVGWLEPLL 441
>gi|332221068|ref|XP_003259680.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4 isoform
1 [Nomascus leucogenys]
Length = 578
Score = 319 bits (818), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 173/396 (43%), Positives = 240/396 (60%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T +VCPVID I TF Y+++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKQERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQ--THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + H + + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIHSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C +VPE +N
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 511
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|5834643|emb|CAB55352.1| N-acetylgalactosaminyltransferase T-6 [Mus musculus]
Length = 623
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 171/364 (46%), Positives = 236/364 (64%), Gaps = 18/364 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKERLEQYVQQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGAR--QAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
+ RV+R R GLI ARLLGA QA+G + + + GWLE L+AR+AED+T
Sbjct: 236 QI-VRVVRQRERKGLITARLLGASVAQAKGAHVSWTPT-VSVSHGWLEPLLARIAEDKTP 293
Query: 315 VVCPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
VV P I I TF + R + G F+W L F W + +RKD T P K+
Sbjct: 294 VVSPDIVTIDLNTFQFSRPVQRGKAHSRGNFDWSLTFGWEMLPQHEKQ-RRKDETYPIKS 352
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P AGGLF+I +AYF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR
Sbjct: 353 PTFAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRT 412
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEK---QRDKQAVRSRLELRK 487
SP++FP G S V+ N R+A VWMD++ + +++ N +A K + + + RL+LR+
Sbjct: 413 KSPHTFPKGTS-VIARNQVRLAEVWMDDYKKIFYRRNLQAAKMVQENNFGDISERLQLRE 471
Query: 488 QLKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLP 546
QL+CH+F WYL +V+P F+P ++ F+G I+++ T++C L G N+ P +
Sbjct: 472 QLRCHNFSWYLHNVYPEMFVPDLNPTFYGAIKNLGTNQC----LDVGENNRGGKPLIMYV 527
Query: 547 CTHL 550
C +L
Sbjct: 528 CHNL 531
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 68/133 (51%), Positives = 91/133 (68%), Gaps = 3/133 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLPTTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ SP +LKEI+LVDDAST E LK L++YV +L + RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTSPAILLKEIILVDDASTDEHLKERLEQYVQQLQI-VRVVRQRERKGLI 250
Query: 165 KARLLGARQAEGE 177
ARLLGA A+ +
Sbjct: 251 TARLLGASVAQAK 263
>gi|34452725|ref|NP_003765.2| polypeptide N-acetylgalactosaminyltransferase 4 [Homo sapiens]
gi|338817878|sp|Q8N4A0.2|GALT4_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|119617834|gb|EAW97428.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
Length = 578
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 176/396 (44%), Positives = 240/396 (60%), Gaps = 19/396 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T VVCPVID I TF Y++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPKQERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYEN 581
C H Q F ++ + +C +VPE +N
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKN 511
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|348533011|ref|XP_003453999.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oreochromis niloticus]
Length = 587
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 177/442 (40%), Positives = 262/442 (59%), Gaps = 42/442 (9%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP ++I+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S +E
Sbjct: 114 HPNCKHKLYAEKLPNTTIIIPFHNEGWSSLLRTVHSVLNRSPPHLIAEIILVDDFSDKEH 173
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK +L+EY+ +L R++R+ R GLI+ RLLGA A+GE+L FLD+HCE + WL L
Sbjct: 174 LKVALEEYMVRLP-KVRILRTKKREGLIRTRLLGAAAAKGEVLTFLDSHCEANVNWLPPL 232
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A++R +VCP+ID+I F Y ++ + GAF+WE++++ + +++ D
Sbjct: 233 LDRIAQNRKTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYKRIPIPTE---LQKDD 289
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 290 PSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 349
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY PGGVS L NL RVA VWMDE+AE+ ++ PE + + +
Sbjct: 350 VGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDMTVQK 406
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE-KPLAKGSMNQ 537
ELR +L C +FKW+++ V W P H+ P++ +G IR+V + C+E K GS
Sbjct: 407 ELRNRLNCKNFKWFMSEVAWDLPKHYPPVEPPAAAWGEIRNVGSSMCMESKHFVSGS--- 463
Query: 538 ASGPASLLPC---------THLPVLTQMFVMKLPTDLIATD----ESVCLDVPEYENDIS 584
P L C +H Q+F D+ D + VC D + + ++
Sbjct: 464 ---PIRLENCVKGRGDVSWSH----GQVFTFGWREDIRVGDPMHTKKVCFDAISHNSPVT 516
Query: 585 PRVRILACSGF-NRQRWTYDKE 605
+ C G Q W Y K+
Sbjct: 517 ----LYDCHGMKGNQLWRYRKD 534
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/149 (53%), Positives = 111/149 (74%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
L+DA Q Y+ N FN+ VSDRI +NR++PD+R P CK K++ E+ LP ++I+I FHNE
Sbjct: 80 LTDADRVDQAYRENGFNIYVSDRISLNRSVPDIRHPNCKHKLYAEK-LPNTTIIIPFHNE 138
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S +E LK +L+EY+ +L R++R+ R
Sbjct: 139 GWSSLLRTVHSVLNRSPPHLIAEIILVDDFSDKEHLKVALEEYMVRLP-KVRILRTKKRE 197
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ RLLGA A+GE+L FLD+HCE +
Sbjct: 198 GLIRTRLLGAAAAKGEVLTFLDSHCEANV 226
>gi|22137798|gb|AAH36390.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Homo
sapiens]
gi|123981562|gb|ABM82610.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|123996387|gb|ABM85795.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|124000643|gb|ABM87830.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
gi|157928222|gb|ABW03407.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4)
[synthetic construct]
Length = 578
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 176/399 (44%), Positives = 241/399 (60%), Gaps = 19/399 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIGRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T VVCPVID I TF Y++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPKQERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
C H Q F ++ + +C +VPE +N +
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKNYVG 514
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 104/155 (67%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FLD HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 243
>gi|195488539|ref|XP_002092358.1| GE11714 [Drosophila yakuba]
gi|194178459|gb|EDW92070.1| GE11714 [Drosophila yakuba]
Length = 601
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 180/433 (41%), Positives = 251/433 (57%), Gaps = 33/433 (7%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S+VI+F NE +S LLRTVHS +S LKEI+LVDD S L + LD
Sbjct: 143 FDAASLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 252 YVAKLSVPT---RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
YV + +P ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 203 YV-RTRIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRI 261
Query: 309 AEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 262 KESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRREC 319
Query: 365 TE-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
P +P MAGGLFAIDR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE
Sbjct: 320 KHDREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 379
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D
Sbjct: 380 PCSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGD 437
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQ 537
V R+ LRK+L+C SF+WYL +++P F+P D + +G++ + ++ C++ L N+
Sbjct: 438 VTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVHALNSNICLDDLLQN---NE 494
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP--RVRILAC--- 592
A L PC + +Q+F + +E C V E SP RV ++ C
Sbjct: 495 KPYNAGLYPCGKVLQKSQLF--SFTNTNVLRNELSCATVQHSE---SPPYRVVMVPCMEN 549
Query: 593 SGFNRQRWTYDKE 605
FN Q W Y+ +
Sbjct: 550 DEFNEQ-WRYEHQ 561
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 94/152 (61%), Gaps = 4/152 (2%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
+Y+ N +S+++ NR++ D R P C + F+ LP +S+VI+F NE +S LLRTV
Sbjct: 110 IYKKIALNEELSEQLSYNRSVGDHRNPLCAKQRFDAASLPTASVVIIFFNEPYSVLLRTV 169
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT---RVIRSPGRVGLIKAR 167
HS +S LKEI+LVDD S L + LD YV + +P ++R R+GLI+AR
Sbjct: 170 HSTLSTCNEKALKEIILVDDGSDNVELGAKLDYYV-RTRIPAGKVTILRLKNRLGLIRAR 228
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L GAR A G++L+FLDAHCE + + E L +
Sbjct: 229 LAGARIATGDVLIFLDAHCEGNIGWCEPLLQR 260
>gi|410953276|ref|XP_003983298.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
2 [Felis catus]
Length = 527
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 184/428 (42%), Positives = 257/428 (60%), Gaps = 31/428 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK L+EYV K
Sbjct: 69 LPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKY 128
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + ED V
Sbjct: 129 LPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDPRTV 188
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P ++P MAG
Sbjct: 189 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIRSPTMAG 246
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 247 GLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 306
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + + + R+ELR++L C SFK
Sbjct: 307 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTKSYGNISERVELRRKLGCKSFK 364
Query: 496 WYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ +A+G +Q
Sbjct: 365 WYLDNIYPEMQISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCL---VAQGRPSQ 421
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG-FN 596
G L C + Q+++ +L+ + +CLDV E + P R++ C G
Sbjct: 422 KGGLVVLKACDYSDP-GQVWIYNEEHELVLNN-LLCLDVSETRSSDPP--RLMKCHGSGG 477
Query: 597 RQRWTYDK 604
Q+WT+ +
Sbjct: 478 SQQWTFGR 485
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 101/147 (68%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 33 YQKHAFNMLISNRLGYRRDVPDTRNAACKDKSYPAD-LPVASVVICFYNEALSALLRTVH 91
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK L+EYV K L +VIR+ R GLI+ R++G
Sbjct: 92 SVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIG 151
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 152 AAHATGEVLVFLDSHCEVNVLWLQPLL 178
>gi|395732382|ref|XP_002812541.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Pongo abelii]
Length = 967
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 161/354 (45%), Positives = 227/354 (64%), Gaps = 5/354 (1%)
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
AE + +FL C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEI
Sbjct: 501 AEIKTPLFLIHGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEI 558
Query: 234 LLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAH 293
LLVDD ST+++LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H
Sbjct: 559 LLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSH 617
Query: 294 CECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYG 353
EC +GWLE L+ RV R +V CPVI++I+D +Y+ G F W ++F W T
Sbjct: 618 VECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIP 677
Query: 354 SSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCG 413
R T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CG
Sbjct: 678 PDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCG 737
Query: 414 GSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEK 472
G IEI PCS V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++
Sbjct: 738 GEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQ 797
Query: 473 QRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
D + + ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 798 GLDAGNLTQQRELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 850
Score = 123 bits (308), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 70/182 (38%), Positives = 109/182 (59%), Gaps = 27/182 (14%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKP-----------------KCKTKVF 84
H +A++ ++ FN+ +SD IPV+R + D R + KT +F
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGGQLFLPLFPYSHMTLAEIKTPLF 508
Query: 85 -----NEEF----LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 135
E+ LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 509 LIHGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 568
Query: 136 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEE 195
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC + + E
Sbjct: 569 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 627
Query: 196 FL 197
L
Sbjct: 628 LL 629
>gi|32698686|ref|NP_055383.1| polypeptide N-acetylgalactosaminyltransferase 5 [Homo sapiens]
gi|51315940|sp|Q7Z7M9.1|GALT5_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|30841528|gb|AAP34404.1| GalNAc-T5 [Homo sapiens]
gi|119631854|gb|EAX11449.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|148745655|gb|AAI42677.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Homo
sapiens]
gi|158257740|dbj|BAF84843.1| unnamed protein product [Homo sapiens]
Length = 940
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 174/422 (41%), Positives = 249/422 (59%), Gaps = 32/422 (7%)
Query: 126 LLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI------KARLLGARQAEGEIL 179
LL +D+ T + L+ +++ R ++PG+ G K + R EG
Sbjct: 413 LLPEDSGTHQVLR-------IDVTLSPRDPKAPGQFGRPVVVPHGKEKEAERRWKEGNFN 465
Query: 180 VFLD--------------AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 225
V+L A C LV N LP +S+++ F +E WS LLR+VHSVI+RS
Sbjct: 466 VYLSDLIPVDRAIEDTRPAGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVINRS 523
Query: 226 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 285
P ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+ARL GA+ A G+
Sbjct: 524 PPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGD 582
Query: 286 ILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWEL 345
+L FLD+H EC +GWLE L+ RV R +V CPVI++I+D +Y+ G F W +
Sbjct: 583 VLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPM 642
Query: 346 HFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEM 405
+F W T R T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+
Sbjct: 643 NFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMEL 702
Query: 406 SFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYF 464
SF+VW CGG IEI PCS V H+FR +PYSFP + + NL RVA VW+DE+ E FY
Sbjct: 703 SFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
Query: 465 KFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK 524
+ ++ D + + ELRK+LKC SFKWYL +V+P P+ + G + +V K
Sbjct: 763 HGDHLIDQGLDVGNLTQQRELRKKLKCKSFKWYLENVFPDLRAPI-VRASGVLINVALGK 821
Query: 525 CV 526
C+
Sbjct: 822 CI 823
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSVI+RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|219804492|ref|NP_001137331.1| polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
gi|296490560|tpg|DAA32673.1| TPA: polypeptide N-acetylgalactosaminyltransferase 5 [Bos taurus]
Length = 940
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 162/345 (46%), Positives = 225/345 (65%), Gaps = 7/345 (2%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +SI++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCAEQLVHNN--LPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA++A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK- 362
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T D + K K
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTI-PPDVVAKNKI 659
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 660 KETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIVPCS 719
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRS 481
V H+FR +PYSFP + + NL RVA VW+DE+ E FY N ++ D +
Sbjct: 720 RVGHIFRNDNPYSFPKDRMKTVERNLGRVAEVWLDEYKELFYGHGNHLIDQGLDVGNLTQ 779
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
+ ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 780 QRELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H + ++ ++ FN+ +SD IPV+R + D R C ++ + LP +SI++ F +E
Sbjct: 449 HGKEKEVERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSIIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA++A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPLL 602
>gi|291391583|ref|XP_002712189.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Oryctolagus cuniculus]
Length = 941
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 172/401 (42%), Positives = 243/401 (60%), Gaps = 27/401 (6%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
+++ R ++PG+ GL K + R EG V+L A C
Sbjct: 429 VTLSPRDPQAPGQFGLPVVVPHGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 488
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST+++LK
Sbjct: 489 EQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDCSTKDYLKD 546
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 547 NLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLER 605
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTE 366
V R +V CPVI++I+D +Y+ G F W ++F W T D + K K T+
Sbjct: 606 VYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFLWPMNFGWKTI-PPDVVAKNKIKETD 664
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
+ P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H
Sbjct: 665 IIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGH 724
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLEL 485
+FR +PYSFP + + NL RVA VW+DE+ E FY + E+ D + + EL
Sbjct: 725 IFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIEQGLDVGNLTQQREL 784
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
RK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 785 RKKLKCKSFKWYLENVFPDLKAPL-VRAGGVLINVALGKCI 824
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 450 HGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 508
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 509 VWSTLLRSVHSVLNRSPPHLIKEILLVDDCSTKDYLKDNLDKYMSQFP-KVRILRLKERH 567
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 568 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 603
>gi|47228512|emb|CAG05332.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 191/448 (42%), Positives = 254/448 (56%), Gaps = 59/448 (13%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP++S+VI F NEA SALLRTVHSV+ R+P +L EI+LVDD S E LK LD YV A+
Sbjct: 125 LPRASVVICFFNEALSALLRTVHSVLDRTPPFLLHEIILVDDYSELEELKGDLDRYVQAE 184
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEG-------------EILVFLDAHCECTLGWLE 302
L RV+R+ R GLI+ R++GA QA G E+LVFLD+HCE WL+
Sbjct: 185 LRGKVRVLRNQKREGLIRGRMIGAAQASGVSPDPQILDLCSGEVLVFLDSHCEVNQMWLQ 244
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
L+A + +DR VVCPVIDIIS T +Y S + G FNW LHF+W ++ + +
Sbjct: 245 PLLAPIRQDRRTVVCPVIDIISADTLSYSPS-PIVRGGFNWGLHFKWDPVPPAE-LKSPQ 302
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
P ++P MAGGLFAI+R YF IG YD M +WGGENLE+SFR+W CGG + I PCS
Sbjct: 303 GPVGPIRSPTMAGGLFAINRKYFNEIGQYDAGMDIWGGENLEISFRIWMCGGQLFIIPCS 362
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSR 482
V H+FRK PY PGG + + +L R+A VWMDE+ E Y P+ +QRD + R
Sbjct: 363 RVGHIFRKRRPYGSPGGQDTMAHNSL-RLAHVWMDEYKEQYLSMRPDL-RQRDYGDIGER 420
Query: 483 LELRKQLKCHSFKWYLTHVWPHH------------FLPMDDKF-----FGRIRHVQTHKC 525
+ LRK+L+C SF+WYL V+P F+ D K GR+R++ T++C
Sbjct: 421 VALRKRLQCRSFRWYLDTVYPEMQTVAGGNKHQPLFINKDLKRPKVLQRGRLRNLATNRC 480
Query: 526 VEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDES--------VCLDVP 577
+ +A+G +Q G L PC + P A DE +CLDV
Sbjct: 481 L---VAQGRASQKGGVVVLRPCDP----------QDPEQEWAYDEEGQLVLAGLLCLDVS 527
Query: 578 EYENDISPRVRILACSG-FNRQRWTYDK 604
E P R++ C G Q+W+ K
Sbjct: 528 EVRTFDPP--RLMKCHGSGGSQQWSVGK 553
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 82/180 (45%), Positives = 112/180 (62%), Gaps = 20/180 (11%)
Query: 36 SVDGGLHSNLSDAQQL----YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPK 91
S + G+ N +D QQL Y + FNLL+S R+ +R LPD R P+C+ + + + LP+
Sbjct: 70 SAEMGMIFNEAD-QQLRDSGYHRHAFNLLISTRLGYHRELPDTRDPQCRDRTYPGD-LPR 127
Query: 92 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSV 150
+S+VI F NEA SALLRTVHSV+ R+P +L EI+LVDD S E LK LD YV A+L
Sbjct: 128 ASVVICFFNEALSALLRTVHSVLDRTPPFLLHEIILVDDYSELEELKGDLDRYVQAELRG 187
Query: 151 PTRVIRSPGRVGLIKARLLGARQA-------------EGEILVFLDAHCECTLVFNEEFL 197
RV+R+ R GLI+ R++GA QA GE+LVFLD+HCE ++ + L
Sbjct: 188 KVRVLRNQKREGLIRGRMIGAAQASGVSPDPQILDLCSGEVLVFLDSHCEVNQMWLQPLL 247
>gi|410953274|ref|XP_003983297.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11 isoform
1 [Felis catus]
Length = 608
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 184/427 (43%), Positives = 256/427 (59%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK L+EYV K
Sbjct: 150 LPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +VIR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + ED V
Sbjct: 210 LPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVLWLQPLLAAIREDPRTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P ++P MAG
Sbjct: 270 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIRSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + + + R+ELR++L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTKSYGNISERVELRRKLGCKSFK 445
Query: 496 WYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ H+QT+KC+ +A+G +Q
Sbjct: 446 WYLDNIYPEMQISGPNAKPQQPIFINRGPKRPKVLQRGRLYHLQTNKCL---VAQGRPSQ 502
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G L C + Q+++ +L+ + +CLDV E + PR+ SG
Sbjct: 503 KGGLVVLKACDYSDP-GQVWIYNEEHELVLNN-LLCLDVSETRSSDPPRLMKCHGSG-GS 559
Query: 598 QRWTYDK 604
Q+WT+ +
Sbjct: 560 QQWTFGR 566
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 101/147 (68%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYRRDVPDTRNAACKDKSYPAD-LPVASVVICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK L+EYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNTKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 233 AAHATGEVLVFLDSHCEVNVLWLQPLL 259
>gi|341889853|gb|EGT45788.1| hypothetical protein CAEBREN_10062 [Caenorhabditis brenneri]
Length = 597
Score = 318 bits (815), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 157/333 (47%), Positives = 222/333 (66%), Gaps = 16/333 (4%)
Query: 182 LDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
+DA C+ T +NE LP++S+++ FHNEAWS LLRTVHSV+ R+P +L+EI+LVDD S
Sbjct: 169 IDAECK-TEKYNEN-LPRTSVIVCFHNEAWSVLLRTVHSVLERTPDHLLEEIVLVDDFSD 226
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ K L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC GW+
Sbjct: 227 MDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECMEGWI 286
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAII 359
E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F W++ D
Sbjct: 287 EPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERD--- 343
Query: 360 KRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE+SF++W CGG++
Sbjct: 344 -RKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTL 402
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRD 475
EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++ +Y+ + N + D
Sbjct: 403 EIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYKTYYYERIN---NQLGD 458
Query: 476 KQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLP 508
V +R +LR L C SFKWYL +++P F+P
Sbjct: 459 FGDVSARKKLRSDLGCKSFKWYLDNIYPELFVP 491
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 98/145 (67%), Gaps = 1/145 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CKT+ +NE LP++S+++ FHNEAWS LLRTVHSV+
Sbjct: 149 NAFNQYASDMISVHRTLPTNIDAECKTEKYNEN-LPRTSVIVCFHNEAWSVLLRTVHSVL 207
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+EI+LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 208 ERTPDHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIA 267
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
GE+L +LD+HCEC + E L +
Sbjct: 268 TGEVLTYLDSHCECMEGWIEPLLDR 292
>gi|281346614|gb|EFB22198.1| hypothetical protein PANDA_015357 [Ailuropoda melanoleuca]
Length = 491
Score = 318 bits (815), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 171/391 (43%), Positives = 242/391 (61%), Gaps = 15/391 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 68 FDYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 127
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+++D
Sbjct: 128 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISKD 186
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T VVCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 187 ETTVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPIRS 245
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 246 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 305
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++LK
Sbjct: 306 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRERLK 359
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ + +P D + G +R + + +C++ N SL C
Sbjct: 360 CQSFDWYLKNVFSNLHVPEDRPGWHGAVRSMGISSECLD--YNSPDNNPTGANLSLFGC- 416
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPE 578
H Q F ++ + +C +VPE
Sbjct: 417 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPE 447
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 71/157 (45%), Positives = 105/157 (66%), Gaps = 1/157 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ F+ LP +S++I F+NEAWS
Sbjct: 29 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSRKFDYRRLPTTSVIIAFYNEAWS 88
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 89 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 147
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 148 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIS 184
>gi|410914862|ref|XP_003970906.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Takifugu rubripes]
Length = 600
Score = 318 bits (814), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 172/438 (39%), Positives = 261/438 (59%), Gaps = 34/438 (7%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ E++LVDD S +E
Sbjct: 127 HADCKQKLYAEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPQLIAELILVDDFSDKEH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+ ++ R++R+ R GLI+ RLLGA A+GE++ FLD+HCE + WL L
Sbjct: 187 LKVPLEEYMKRMP-KVRILRTKKREGLIRTRLLGASAAKGEVITFLDSHCEANVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A++R +VCP+ID+I F Y ++ + GAF+WE++++ A ++R D
Sbjct: 246 LDRIAQNRKSIVCPMIDVIDHDNFGYDTQAGDAMRGAFDWEMYYKRIPI---PAEMQRDD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 303 PSQPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY PGG+S L NL RVA VWMDE+AE+ ++ PE + + +
Sbjct: 363 VGHIYRKYVPYKVPGGIS--LAKNLKRVAEVWMDEYAEYVYQRRPEY-RHLSAGDMTPQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE-KPLAKGSMNQ 537
ELR +L C +FKW++++V W P H+ P++ +G I++V + C+E K GS
Sbjct: 420 ELRSRLGCKNFKWFMSNVAWDLPKHYPPVEPPAAAWGEIQNVGSGLCMEIKHFVSGS--- 476
Query: 538 ASGPASLLPC---------THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVR 588
P L C +H VLT + + VC D + + ++
Sbjct: 477 ---PIRLENCVKSRGEVGWSHGQVLTFGWREDIRVGDPMHTRKVCFDAVSHNSPVT---- 529
Query: 589 ILACSGF-NRQRWTYDKE 605
+ C G Q W Y K+
Sbjct: 530 LYDCHGMKGNQLWRYRKD 547
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 72/141 (51%), Positives = 105/141 (74%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ +SDRI +NR+LPD+R CK K++ E+ LP +S++I FHNE WS+LLRT
Sbjct: 101 QAYRENGFNIYISDRISLNRSLPDIRHADCKQKLYAEK-LPNTSVIIPFHNEGWSSLLRT 159
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSV++RSP ++ E++LVDD S +E LK L+EY+ ++ R++R+ R GLI+ RLL
Sbjct: 160 VHSVLNRSPPQLIAELILVDDFSDKEHLKVPLEEYMKRMP-KVRILRTKKREGLIRTRLL 218
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA A+GE++ FLD+HCE +
Sbjct: 219 GASAAKGEVITFLDSHCEANV 239
>gi|301780762|ref|XP_002925798.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Ailuropoda melanoleuca]
Length = 578
Score = 318 bits (814), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 171/391 (43%), Positives = 242/391 (61%), Gaps = 15/391 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FDYRRLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R+++D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISKD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T VVCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 248 ETTVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPIRS 306
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 307 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 366
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++LK
Sbjct: 367 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRERLK 420
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLPCT 548
C SF WYL +V+ + +P D + G +R + + +C++ N SL C
Sbjct: 421 CQSFDWYLKNVFSNLHVPEDRPGWHGAVRSMGISSECLD--YNSPDNNPTGANLSLFGC- 477
Query: 549 HLPVLTQMFVMKLPTDL-IATDESVCLDVPE 578
H Q F ++ + +C +VPE
Sbjct: 478 HGQGGNQFFEYTSNKEIRFNSVTELCAEVPE 508
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 71/157 (45%), Positives = 105/157 (66%), Gaps = 1/157 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ F+ LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSRKFDYRRLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIS 245
>gi|195583656|ref|XP_002081633.1| GD11122 [Drosophila simulans]
gi|194193642|gb|EDX07218.1| GD11122 [Drosophila simulans]
Length = 601
Score = 318 bits (814), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 182/433 (42%), Positives = 252/433 (58%), Gaps = 33/433 (7%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ E LP +S+VI+F NE +S LLRTVHS +S LKEI+LVDD S L + LD
Sbjct: 143 FDSESLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDY 202
Query: 252 YVAKLSVPT---RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
YV + +P ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 203 YV-RTRIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRI 261
Query: 309 AEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 262 KESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRREC 319
Query: 365 TE-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+ P +P MAGGLFAIDR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE
Sbjct: 320 RQEREICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 379
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D
Sbjct: 380 PCSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGD 437
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQ 537
V R+ LRK+L+C SF+WYL +++P F+P D + +G++ V + C++ L N+
Sbjct: 438 VTHRVMLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVHAVNANICLDDLLQN---NE 494
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP--RVRILAC--- 592
A L PC + +Q+F + +E C V E SP RV ++ C
Sbjct: 495 KPYNAGLYPCGKVLQKSQLF--SFTNTNVLRNELSCATVQHSE---SPPYRVVMVPCMEN 549
Query: 593 SGFNRQRWTYDKE 605
FN Q W Y+ +
Sbjct: 550 DEFNEQ-WRYEHQ 561
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 96/155 (61%), Gaps = 4/155 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
++Y+ N +S+++ NR++ D R P C + F+ E LP +S+VI+F NE +S LL
Sbjct: 107 GDEIYKKIALNEELSEQLTYNRSVGDHRNPLCAKQRFDSESLPTASVVIIFFNEPYSVLL 166
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT---RVIRSPGRVGLI 164
RTVHS +S LKEI+LVDD S L + LD YV + +P ++R R+GLI
Sbjct: 167 RTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDYYV-RTRIPAGKVTILRLKNRLGLI 225
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 226 RARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQR 260
>gi|355689592|gb|AER98884.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 [Mustela putorius
furo]
Length = 609
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 185/430 (43%), Positives = 258/430 (60%), Gaps = 32/430 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD S + LK L+EYV K
Sbjct: 150 LPVASVVICFYNEALSALLRTVHSVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT---LGWLENLVARVAEDR 312
L +VIR+ R GLI+ R++GA + GE+LVFLD+HCE L WL+ L+A + +DR
Sbjct: 210 LPGKIKVIRNAKREGLIRGRMIGAAHSTGEVLVFLDSHCEVNVMWLMWLQPLLAAIQQDR 269
Query: 313 TRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
VVCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P
Sbjct: 270 RTVVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGPEGATAPIKSPT 327
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK
Sbjct: 328 MAGGLFAMNRHYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRR 387
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
PY P G + + +L R+A VW+D++ E YF P+ + + + R+ELRK+L C
Sbjct: 388 PYGSPEGQDTMTHNSL-RLAHVWLDDYKEQYFSLRPDL-RTKSYGNISERVELRKKLGCK 445
Query: 493 SFKWYLTHVWPHHFL------PMDDKFF------------GRIRHVQTHKCVEKPLAKGS 534
SFKWYL +++P + P F GR+ H+QT+KC+ +A+G
Sbjct: 446 SFKWYLDNIYPEMQISGPNAKPQQPIFINRGPKRPKILQRGRLYHLQTNKCL---VAQGR 502
Query: 535 MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG 594
+Q G L C + +Q+++ +L+ + +CLD+ E + PR+ SG
Sbjct: 503 PSQKGGLVVLKACDYSDP-SQIWIYNEEHELVLNN-LLCLDMSETRSSDPPRLMKCHGSG 560
Query: 595 FNRQRWTYDK 604
Q+WT+ K
Sbjct: 561 -GSQQWTFGK 569
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/142 (50%), Positives = 100/142 (70%), Gaps = 2/142 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACKDKSYPVD-LPVASVVICFYNEALSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK L+EYV K L +VIR+ R GLI+ R++G
Sbjct: 173 SVLDRTPAQLLHEIILVDDDSDFDDLKGELEEYVQKYLPGKIKVIRNAKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVF 192
A + GE+LVFLD+HCE +++
Sbjct: 233 AAHSTGEVLVFLDSHCEVNVMW 254
>gi|307214182|gb|EFN89299.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Harpegnathos
saltator]
Length = 442
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 176/407 (43%), Positives = 239/407 (58%), Gaps = 19/407 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 8 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 66
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC W+E L+ RVAED TRVV
Sbjct: 67 ----RVIRNEKREGLMRSRVRGADAATANVLTFLDSHCECNADWIEPLLERVAEDPTRVV 122
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W + ++KD T+ +TP +AGG
Sbjct: 123 CPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQIERQARQKDPTQAIRTPMIAGG 182
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF I++AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 183 LFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYSF 242
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ +FY+ P A + ++ R+EL+++L C F W
Sbjct: 243 PGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKRRLHCKPFSW 301
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLPCTHLPVLTQ 555
YL +V+P +P + G Q C++ SM G L PC H Q
Sbjct: 302 YLKNVYPELVIPTSEGGPGGSLK-QGTACLD------SMGHLLDGNVGLYPC-HDTGGNQ 353
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ L D + +CL +P Y + ++I C G Q+W +
Sbjct: 354 EW--GLTKDGLIKHHDLCLTLPVYAKGTTLLMQI--CDGSENQKWRH 396
>gi|432901709|ref|XP_004076908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 677
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 176/438 (40%), Positives = 257/438 (58%), Gaps = 34/438 (7%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S ++
Sbjct: 204 HPNCKQKLYAERLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDKDH 263
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK +L+EY+ +L R++R+ R GLI+ RLLGA A+GE++ FLD+HCE + WL L
Sbjct: 264 LKGALEEYMVRLP-KVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANINWLPPL 322
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ A +++ D
Sbjct: 323 LDRIALNRKTIVCPMIDVIDHDNFGYETQAGDAMRGAFDWEMYYKRIPI---PAELQKND 379
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 380 PSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 439
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY PGGVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 440 VGHIYRKYVPYKVPGGVS--LARNLKRVAEVWMDEYAEYVYQRRPEY-RHLSAGDVAAQK 496
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVEKPLAKGSMNQA 538
ELR L C SFKW++ V W P H+ P++ +G +R + C+E S +
Sbjct: 497 ELRSTLNCKSFKWFMKEVAWDLPKHYPPVEPPAAAWGEVRSAASGLCLE------SKHFV 550
Query: 539 SG-PASLLPCTHLPVLT-----QMFVMKLPTDLIATD----ESVCLDVPEYENDISPRVR 588
SG P L C Q+F D+ D + VC D + + ++
Sbjct: 551 SGTPIRLESCVKGRADVSWGHGQVFTFGWREDIRVGDPMHTKKVCFDAVSHHSPVT---- 606
Query: 589 ILACSGF-NRQRWTYDKE 605
+ C G Q W Y K+
Sbjct: 607 LYDCHGMRGNQLWRYRKD 624
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 78/149 (52%), Positives = 110/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
L+DA Q Y+ N FN+ VSDRI +NR++PD+R P CK K++ E LP +SI+I FHNE
Sbjct: 170 LTDADRVDQAYRENGFNIFVSDRISLNRSVPDIRHPNCKQKLYAER-LPNTSIIIPFHNE 228
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S ++ LK +L+EY+ +L R++R+ R
Sbjct: 229 GWSSLLRTVHSVLNRSPPQLIAEIILVDDFSDKDHLKGALEEYMVRLP-KVRILRTKKRE 287
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ RLLGA A+GE++ FLD+HCE +
Sbjct: 288 GLIRTRLLGAAAAKGEVITFLDSHCEANI 316
>gi|345483668|ref|XP_001601037.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 587
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 179/425 (42%), Positives = 249/425 (58%), Gaps = 27/425 (6%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C V + LP +SI+I+FHNEA+S LLRTV+SVI +P +LKEI+LVDD S E L
Sbjct: 122 CKNVTYDSVLPSASIIIIFHNEAFSVLLRTVYSVIKETPPKLLKEIILVDDKSNEELL-- 179
Query: 248 SLDEYVAKLSVPTRV--IRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
L EY + +P +V +R R GL++ARL GA+ A G++L+FLDAHCE T WLE L+
Sbjct: 180 GLLEYYIQTRLPKKVKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLL 239
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFE---LHWGAFNWELHFRWYTYGSSDAIIKRK 362
R+ E + VV P+ID IS+ TF Y S E G F W HF W +D K
Sbjct: 240 QRIKEKKNAVVTPIIDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSKTS 299
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
+ P K+P MAGGLFAI+R YF+ IG+YD++M+ WGGENLEMSFR+WQCGG +E PCS
Sbjct: 300 AIS-PVKSPTMAGGLFAINRKYFWDIGSYDDKMEGWGGENLEMSFRIWQCGGVLETIPCS 358
Query: 423 HVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA--- 478
V H+FR PY FP + + +G N AR+A VWMD++ Y+ E+ +DK
Sbjct: 359 RVGHVFRNFLPYKFP--MDKDTHGINTARLANVWMDDYKRLYY---LHREEYKDKPELIG 413
Query: 479 -VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMN 536
++ R+ LR++LKC SFKWYL +V+P F+P ++ + FGR++ + + C++ N
Sbjct: 414 DIKERVNLREKLKCKSFKWYLDNVYPEKFIPDENVQAFGRVQVQKGNLCLD-----NLQN 468
Query: 537 QASGPASL--LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG 594
P +L C +Q F + +L D + E+ DI +V + AC
Sbjct: 469 DEEKPYNLGVYECHSQLFPSQYFSLSKVGELRREDTCATVVEDEHSKDIF-KVSMKACDE 527
Query: 595 FNRQR 599
R
Sbjct: 528 IENDR 532
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 71/144 (49%), Positives = 99/144 (68%), Gaps = 5/144 (3%)
Query: 58 NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 117
N+++S++IP+ R LPDVR P CK + + LP +SI+I+FHNEA+S LLRTV+SVI +
Sbjct: 101 NIILSNKIPLQRKLPDVRDPLCKNVTY-DSVLPSASIIIIFHNEAFSVLLRTVYSVIKET 159
Query: 118 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRV--IRSPGRVGLIKARLLGARQAE 175
P +LKEI+LVDD S E L L EY + +P +V +R R GL++ARL GA+ A
Sbjct: 160 PPKLLKEIILVDDKSNEELL--GLLEYYIQTRLPKKVKLLRLDERQGLVRARLKGAKSAT 217
Query: 176 GEILVFLDAHCECTLVFNEEFLPK 199
G++L+FLDAHCE T + E L +
Sbjct: 218 GDVLMFLDAHCEVTKQWLEPLLQR 241
>gi|6688167|emb|CAB65104.1| GalNAc-T5 [Homo sapiens]
Length = 668
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 174/422 (41%), Positives = 249/422 (59%), Gaps = 32/422 (7%)
Query: 126 LLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI------KARLLGARQAEGEIL 179
LL +D+ T + L+ +++ R ++PG+ G K + R EG
Sbjct: 141 LLPEDSGTHQVLR-------IDVTLSPRDPKAPGQFGRPVVVPHGKEKEAERRWKEGNFN 193
Query: 180 VFLD--------------AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 225
V+L A C LV N LP +S+++ F +E WS LLR+VHSVI+RS
Sbjct: 194 VYLSDLIPVDRAIEDTRPAGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVINRS 251
Query: 226 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 285
P ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+ARL GA+ A G+
Sbjct: 252 PPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGD 310
Query: 286 ILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWEL 345
+L FLD+H EC +GWLE L+ RV R +V CPVI++I+D +Y+ G F W +
Sbjct: 311 VLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPM 370
Query: 346 HFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEM 405
+F W T R T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+
Sbjct: 371 NFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMEL 430
Query: 406 SFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYF 464
SF+VW CGG IEI PCS V H+FR +PYSFP + + NL RVA VW+DE+ E FY
Sbjct: 431 SFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 490
Query: 465 KFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK 524
+ ++ D + + ELRK+LKC SFKWYL +V+P P+ + G + +V K
Sbjct: 491 HGDHLIDQGLDVGNLTQQRELRKKLKCKSFKWYLENVFPDLRAPI-VRASGVLINVALGK 549
Query: 525 CV 526
C+
Sbjct: 550 CI 551
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 177 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 235
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSVI+RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 236 VWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 294
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 295 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 330
>gi|6525067|gb|AAF15313.1|AF154107_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 5 [Homo
sapiens]
Length = 610
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 174/422 (41%), Positives = 249/422 (59%), Gaps = 32/422 (7%)
Query: 126 LLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI------KARLLGARQAEGEIL 179
LL +D+ T + L+ +++ R ++PG+ G K + R EG
Sbjct: 83 LLPEDSGTHQVLR-------IDVTLSPRDPKAPGQFGRPVVVPHGKEKEAERRWKEGNFN 135
Query: 180 VFLD--------------AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 225
V+L A C LV N LP +S+++ F +E WS LLR+VHSVI+RS
Sbjct: 136 VYLSDLIPVDRAIEDTRPAGCAEQLVXNN--LPTTSVIMCFVDEVWSTLLRSVHSVINRS 193
Query: 226 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 285
P ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+ARL GA+ A G+
Sbjct: 194 PPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGD 252
Query: 286 ILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWEL 345
+L FLD+H EC +GWLE L+ RV R +V CPVI++I+D +Y+ G F W +
Sbjct: 253 VLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPM 312
Query: 346 HFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEM 405
+F W T R T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+
Sbjct: 313 NFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMEL 372
Query: 406 SFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYF 464
SF+VW CGG IEI PCS V H+FR +PYSFP + + NL RVA VW+DE+ E FY
Sbjct: 373 SFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 432
Query: 465 KFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK 524
+ ++ D + + ELRK+LKC SFKWYL +V+P P+ + G + +V K
Sbjct: 433 HGDHLIDQGLDVGNLTQQRELRKKLKCKSFKWYLENVFPDLRAPI-VRASGVLINVALGK 491
Query: 525 CV 526
C+
Sbjct: 492 CI 493
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/156 (43%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ LP +S+++ F +E
Sbjct: 119 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVXNN-LPTTSVIMCFVDE 177
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSVI+RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 178 VWSTLLRSVHSVINRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 236
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 237 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 272
>gi|449271781|gb|EMC82021.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Columba
livia]
Length = 314
Score = 317 bits (813), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 160/308 (51%), Positives = 212/308 (68%), Gaps = 9/308 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S+VI F+NEAWS LLRTVHSV+ SP +L+EI+LVDD S +E LK +L+ YVA L
Sbjct: 11 LPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLKETLENYVAGL 70
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A+G+IL FLD HCEC GWLE L+AR+AE+ + VV
Sbjct: 71 R-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLLARIAEEESAVV 129
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ + E G F+W L F W++ + +RK T+ ++P MAG
Sbjct: 130 CPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHSTPEREQK-RRKSKTDVIRSPTMAG 188
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++ + YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K +PYS
Sbjct: 189 GLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPKQAPYS 248
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMDE+ E Y+ NP A + RL LR++LKC FK
Sbjct: 249 RSKALA-----NSVRAAEVWMDEYKELYYHRNPHARLEPYGDVTERRL-LREKLKCKDFK 302
Query: 496 WYLTHVWP 503
W+L +V+P
Sbjct: 303 WFLENVYP 310
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 63/110 (57%), Positives = 82/110 (74%), Gaps = 1/110 (0%)
Query: 79 CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 138
C+ K ++ LPK+S+VI F+NEAWS LLRTVHSV+ SP +L+EI+LVDD S +E LK
Sbjct: 1 CREKKYDYYSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLK 60
Query: 139 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
+L+ YVA L R+IR+ R GL++ARLLGA A+G+IL FLD HCEC
Sbjct: 61 ETLENYVAGLR-KVRLIRANKREGLVRARLLGASVAKGDILTFLDCHCEC 109
>gi|312374382|gb|EFR21947.1| hypothetical protein AND_15990 [Anopheles darlingi]
Length = 669
Score = 317 bits (813), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 175/408 (42%), Positives = 249/408 (61%), Gaps = 19/408 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S++I FHNEA S LLRTV SV++RSP ++ EI+LVDD S +F + + +A
Sbjct: 229 ESLPATSVIITFHNEARSTLLRTVVSVLNRSPERLIHEIILVDDFS--DFPEDG--QELA 284
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ R+IR+ R GL+++R+ GA A ++L FLD+HCEC + WLE L+ARVAED TR
Sbjct: 285 KIQ-KVRLIRNAKREGLVRSRVTGAAAATAKVLTFLDSHCECNVHWLEPLLARVAEDPTR 343
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS TF Y+ + G F+W L F+W ++ +++D T P +TP +A
Sbjct: 344 VVCPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSGAERKERQRDPTAPIRTPMIA 403
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF IDR+YF +G YD +M +WGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 404 GGLFVIDRSYFEKLGTYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 463
Query: 435 SFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
+FPGG S ++ N R A VWMDE+ +Y+ P A + RL LR++L+C
Sbjct: 464 TFPGGGSGNIFAKNTRRAAEVWMDEYKRYYYAAVPLATNIPFGD-IEDRLRLREELQCKP 522
Query: 494 FKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVL 553
F+WYL +V+P +P + + G IR Q C++ S+ +G L H
Sbjct: 523 FRWYLENVYPQLSVP-ERRNNGSIR--QGAFCLD------SLGNVAGAIVGLYSCHGNGG 573
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYE-NDISPRVRILACSGFNRQRW 600
Q +++ ++ D +CL + ++ N V + C G Q+W
Sbjct: 574 NQNWILNRKGEVKHHD--LCLTLIKFSVNARYNSVIMKYCDGSENQQW 619
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/159 (41%), Positives = 96/159 (60%), Gaps = 9/159 (5%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNE----EFLPKSSIVIVFHN 100
L + Y NRFN SD + NR LPD R C+ ++ E LP +S++I FHN
Sbjct: 183 LQRGEDPYLRNRFNQQASDGLKSNRELPDTRNAMCRRTSWSSATSIESLPATSVIITFHN 242
Query: 101 EAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGR 160
EA S LLRTV SV++RSP ++ EI+LVDD S +F + + +AK+ R+IR+ R
Sbjct: 243 EARSTLLRTVVSVLNRSPERLIHEIILVDDFS--DFPEDG--QELAKIQ-KVRLIRNAKR 297
Query: 161 VGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GL+++R+ GA A ++L FLD+HCEC + + E L +
Sbjct: 298 EGLVRSRVTGAAAATAKVLTFLDSHCECNVHWLEPLLAR 336
>gi|426337441|ref|XP_004032714.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Gorilla
gorilla gorilla]
Length = 940
Score = 317 bits (813), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 173/422 (40%), Positives = 249/422 (59%), Gaps = 32/422 (7%)
Query: 126 LLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI------KARLLGARQAEGEIL 179
LL +D+ T + L+ +++ R ++PG+ G K + R EG
Sbjct: 413 LLPEDSGTHQVLR-------IDVTLSPRDPKAPGQFGRPVVVPQGKEKEAERRWKEGNFN 465
Query: 180 VFLD--------------AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 225
V+L A C LV N LP +S+++ F +E WS LLR+VHSV++RS
Sbjct: 466 VYLSDLIPVDRAIEDTRPAGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRS 523
Query: 226 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 285
P ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+ARL GA+ A G+
Sbjct: 524 PPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGD 582
Query: 286 ILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWEL 345
+L FLD+H EC +GWLE L+ RV R +V CPVI++I+D +Y+ G F W +
Sbjct: 583 VLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPM 642
Query: 346 HFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEM 405
+F W T R T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+
Sbjct: 643 NFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMEL 702
Query: 406 SFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYF 464
SF+VW CGG IEI PCS V H+FR +PYSFP + + NL RVA VW+DE+ E FY
Sbjct: 703 SFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
Query: 465 KFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK 524
+ ++ D + + ELRK+LKC SFKWYL +V+P P+ + G + +V K
Sbjct: 763 HGDHLIDQGLDVGNLTQQRELRKKLKCKSFKWYLENVFPDLRAPI-VRASGVLINVALGK 821
Query: 525 CV 526
C+
Sbjct: 822 CI 823
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 105/151 (69%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E WS L
Sbjct: 454 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDEVWSTL 512
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+A
Sbjct: 513 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRA 571
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 572 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|348568069|ref|XP_003469821.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Cavia porcellus]
Length = 608
Score = 317 bits (813), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 187/429 (43%), Positives = 255/429 (59%), Gaps = 31/429 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD S + LK LDEYV K
Sbjct: 150 LPVASVVICFYNEAFSALLRTVHSVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYVQK- 208
Query: 257 SVPTR--VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
S+PT+ VIR+ R GLI+ R++GA A GE+LVFLD+HCE WL+ L+A + D
Sbjct: 209 SLPTKIKVIRNAKREGLIRGRMIGAAHATGEVLVFLDSHCEVNEMWLQPLLATIRGDPHT 268
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVIDIIS T AY S + G FNW LHF+W S+ + T P K+P MA
Sbjct: 269 VVCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LGGEDGATAPIKSPTMA 326
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 327 GGLFAMNRQYFNELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPY 386
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
P G + + +L R+A VW+DE+ + YF P+ K + + R+ELRK+L C SF
Sbjct: 387 GSPEGQDTMTHNSL-RLAHVWLDEYKDQYFSLRPDL-KTKSYGNISERVELRKRLGCRSF 444
Query: 495 KWYLTHVWPHHFL----------------PMDDKFF--GRIRHVQTHKCVEKPLAKGSMN 536
KWYL +++P + P + GR+ H QT+KC+ +A+G +
Sbjct: 445 KWYLDNIYPEMQVQGPNAKAQQPVFVNRGPKRPRVLRRGRLYHFQTNKCL---VAQGRPS 501
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN 596
Q L C + Q+++ +L+ + +CLDV E + PR+ SG
Sbjct: 502 QKGSLVVLKACDYRDP-AQVWIYNEEHELVLNN-LLCLDVSETRSSDPPRLMKCHGSG-G 558
Query: 597 RQRWTYDKE 605
Q+WT+ K
Sbjct: 559 SQQWTFGKN 567
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/148 (50%), Positives = 105/148 (70%), Gaps = 4/148 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK + + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNVLISNRLGYHRDVPDTRNAACKEQSYPAD-LPVASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT--RVIRSPGRVGLIKARLL 169
SV+ R+P +L EI+LVDD S + LK LDEYV K S+PT +VIR+ R GLI+ R++
Sbjct: 173 SVLDRTPAYLLHEIILVDDDSDFDDLKGELDEYVQK-SLPTKIKVIRNAKREGLIRGRMI 231
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFL 197
GA A GE+LVFLD+HCE ++ + L
Sbjct: 232 GAAHATGEVLVFLDSHCEVNEMWLQPLL 259
>gi|148356242|ref|NP_001038243.2| polypeptide N-acetylgalactosaminyltransferase 4 precursor [Danio
rerio]
gi|60416047|gb|AAH90692.1| WD repeat domain 51B, like [Danio rerio]
gi|182890540|gb|AAI64662.1| Wdr51bl protein [Danio rerio]
Length = 582
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 159/339 (46%), Positives = 222/339 (65%), Gaps = 11/339 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
+N LP +S+VI F+NEAWS LLRT+HSV+ +P +LK+I+LVDD S R +LKS L +
Sbjct: 134 YNIRRLPTTSVVIAFYNEAWSTLLRTIHSVLETTPAVLLKDIILVDDFSDRGYLKSQLAQ 193
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G +L FLD HCEC GW+E L+ R+AE+
Sbjct: 194 YISNLER-VRLIRTKKREGLVRARLIGATYATGSVLTFLDCHCECVPGWIEPLLERIAEN 252
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T ++CPVID I TF Y+++ E G F+W L F+W+ D I RK +P ++
Sbjct: 253 ETTIICPVIDTIDWNTFEFYMQTEEPMVGGFDWRLTFQWHAVPEIDRKI-RKSRIDPIRS 311
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ +AYF ++G YD M+VWGGENLE+SFRVWQCGGS+EI PCSHV H+F K
Sbjct: 312 PTMAGGLFAVSKAYFEYLGTYDMGMEVWGGENLELSFRVWQCGGSLEIHPCSHVGHVFPK 371
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ N R A VWMD + + ++ NP A K+ + R+ LR +L+
Sbjct: 372 KAPYA-----RSNFLQNTVRAAEVWMDTYKQHFYNRNPPARKE-SYGDISERIVLRNRLQ 425
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTH-KCVE 527
C SF+WYL +V+P +P D + G +R H +C++
Sbjct: 426 CKSFEWYLQNVYPGLHVPEDRPGWHGAVRSAGIHSECLD 464
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 68/143 (47%), Positives = 97/143 (67%), Gaps = 1/143 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+ +SD+I ++R + D R +CK K +N LP +S+VI F+NEAWS LLRT+HSV+
Sbjct: 107 INIFISDKISLHRHIQDNRMHECKAKKYNIRRLPTTSVVIAFYNEAWSTLLRTIHSVLET 166
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P +LK+I+LVDD S R +LKS L +Y++ L R+IR+ R GL++ARL+GA A G
Sbjct: 167 TPAVLLKDIILVDDFSDRGYLKSQLAQYISNLER-VRLIRTKKREGLVRARLIGATYATG 225
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
+L FLD HCEC + E L +
Sbjct: 226 SVLTFLDCHCECVPGWIEPLLER 248
>gi|109732606|gb|AAI16333.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 170/401 (42%), Positives = 245/401 (61%), Gaps = 27/401 (6%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
+++ R + +PG+ G K + R EG V+L A C
Sbjct: 418 VTLSPRDLNAPGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 477
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N+ LP +SI++ F +E WSALLR+VHSV++RSP ++KEILLVDD ST+E+LK+
Sbjct: 478 EQLVHND--LPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKA 535
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 536 DLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLER 594
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTE 366
V +R +V CPVI++I+D +Y+ G F W ++F W T D + K T+
Sbjct: 595 VYLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFGWKTI-PPDVVAKNGIKETD 653
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
+ P MAGGLF+ID++YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H
Sbjct: 654 IIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGH 713
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLEL 485
+FR +PYSFP + + NL RVA VW+D++ E FY + ++ D + + EL
Sbjct: 714 IFRNDNPYSFPKDRMKTVERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQREL 773
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
RK+LKC SFKWYL +V+P P+ + G + ++ KCV
Sbjct: 774 RKKLKCKSFKWYLDNVFPDLKAPV-VRASGVLINMALGKCV 813
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/151 (46%), Positives = 107/151 (70%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A+Q ++ FN+ +SD IPV+R + D R C ++ + + LP +SI++ F +E WSAL
Sbjct: 444 EAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHND-LPTTSIIMCFVDEVWSAL 502
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+E+LK+ LD+Y+++ R++R R GLI+A
Sbjct: 503 LRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQFP-KVRILRLKERHGLIRA 561
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 562 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 592
>gi|363730612|ref|XP_419065.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Gallus
gallus]
Length = 590
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 162/325 (49%), Positives = 217/325 (66%), Gaps = 10/325 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S+VI F+NEAWS LLRTVHSV+ SP +L+E++LVDD S ++ LK L+ YVA L
Sbjct: 142 LPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKDHLKEPLENYVAGL 201
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A G+IL FLD HCEC GWLE L+ R+AE+ + VV
Sbjct: 202 RK-VRLIRANKREGLVRARLLGASIARGDILTFLDCHCECHEGWLEPLLERIAEEESAVV 260
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ + E G F+W L F W+T + +RK + ++P MAG
Sbjct: 261 CPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTTPEREQK-RRKSKIDVIRSPTMAG 319
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++ + YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K +PYS
Sbjct: 320 GLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPKQAPYS 379
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMDE+ E Y+ NP A + RL LR++LKC FK
Sbjct: 380 RSKALA-----NSVRAAEVWMDEYKELYYHRNPHARLEPYGDVSERRL-LREKLKCKDFK 433
Query: 496 WYLTHVWPHHFLPMD-DKFFGRIRH 519
W+L +V+P +P D FFG +++
Sbjct: 434 WFLENVYPELHVPEDRPGFFGMLKN 458
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/134 (52%), Positives = 96/134 (71%), Gaps = 1/134 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
++ N+ +SDRI ++R LP+ P CK K ++ LPK+S+VI F+NEAWS LLRTVHSV+
Sbjct: 108 HQINIYLSDRISLHRRLPERWHPLCKGKKYDYYSLPKTSVVIAFYNEAWSTLLRTVHSVL 167
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +L+E++LVDD S ++ LK L+ YVA L R+IR+ R GL++ARLLGA A
Sbjct: 168 ETSPDILLEEVILVDDYSDKDHLKEPLENYVAGLRK-VRLIRANKREGLVRARLLGASIA 226
Query: 175 EGEILVFLDAHCEC 188
G+IL FLD HCEC
Sbjct: 227 RGDILTFLDCHCEC 240
>gi|402890489|ref|XP_003908519.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14-like
[Papio anubis]
Length = 551
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 175/410 (42%), Positives = 247/410 (60%), Gaps = 21/410 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +++ + FHNEA S LLRT+ SV++R+P +++EI+LVDD S + D+ +
Sbjct: 109 LSRTARLXXFHNEARSTLLRTIRSVLNRTPMHLIREIILVDDFS------NDPDDCKQLI 162
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+P + +R+ R GL+++R+ GA A+G L FLD+HCE WL+ L+ RV ED TRV
Sbjct: 163 KLPKVKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRV 222
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDII+ TF Y+ S G F+W LHF+W S + +R D TEP +TP +AG
Sbjct: 223 VCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAG 281
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF ID+A+F ++G YD +M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY
Sbjct: 282 GLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYV 341
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
FP G + N R A VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFK
Sbjct: 342 FPDGNANTYIKNTKRTAEVWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFK 400
Query: 496 WYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP--- 551
WYL +V+P +P + G IR Q KC+E ++ NQ + L PC +
Sbjct: 401 WYLENVYPELSIPKESSIQKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGED 455
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
+Q++ ++ E +CL V +P V +L +G +RQ+WT
Sbjct: 456 AKSQVWAFTYTQQILQ--EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 502
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 84/137 (61%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R + T V + L +++ + FHNEA S LLRT+
Sbjct: 73 YKLYAFNQRESERISSNRAIPDTRHLRMSTSVGCTD-LSRTARLXXFHNEARSTLLRTIR 131
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 132 SVLNRTPMHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 185
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 186 ADIAQGTTLTFLDSHCE 202
>gi|158749624|ref|NP_766443.2| polypeptide N-acetylgalactosaminyltransferase 5 [Mus musculus]
gi|341940730|sp|Q8C102.2|GALT5_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 5;
AltName: Full=Polypeptide GalNAc transferase 5;
Short=GalNAc-T5; Short=pp-GaNTase 5; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 5;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5
gi|148694985|gb|EDL26932.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 [Mus musculus]
Length = 930
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 166/384 (43%), Positives = 237/384 (61%), Gaps = 26/384 (6%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
+++ R + +PG+ G K + R EG V+L A C
Sbjct: 418 VTLSPRDLNAPGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 477
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N+ LP +SI++ F +E WSALLR+VHSV++RSP ++KEILLVDD ST+E+LK+
Sbjct: 478 EQLVHND--LPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKA 535
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 536 DLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLER 594
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTE 366
V +R +V CPVI++I+D +Y+ G F W ++F W T D + K T+
Sbjct: 595 VYLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFGWKTI-PPDVVAKNGIKETD 653
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
+ P MAGGLF+ID++YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H
Sbjct: 654 IIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGH 713
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLEL 485
+FR +PYSFP + + NL RVA VW+D++ E FY + ++ D + + EL
Sbjct: 714 IFRNDNPYSFPKDRMKTVERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQREL 773
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPM 509
RK+LKC SFKWYL +V+P P+
Sbjct: 774 RKKLKCKSFKWYLDNVFPDLKAPV 797
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/151 (46%), Positives = 107/151 (70%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A+Q ++ FN+ +SD IPV+R + D R C ++ + + LP +SI++ F +E WSAL
Sbjct: 444 EAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHND-LPTTSIIMCFVDEVWSAL 502
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+E+LK+ LD+Y+++ R++R R GLI+A
Sbjct: 503 LRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQFP-KVRILRLKERHGLIRA 561
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 562 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 592
>gi|440896773|gb|ELR48609.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Bos grunniens
mutus]
Length = 940
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/345 (46%), Positives = 225/345 (65%), Gaps = 7/345 (2%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +SI++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCAEQLVHNN--LPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA++A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK- 362
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T D + K K
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTI-PPDVVAKNKI 659
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 660 KETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIVPCS 719
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRS 481
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D +
Sbjct: 720 RVGHIFRNDNPYSFPKDRMKTVERNLGRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQ 779
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
+ ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 780 QRELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H + ++ ++ FN+ +SD IPV+R + D R C ++ + LP +SI++ F +E
Sbjct: 449 HGKEKEVERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSIIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA++A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPLL 602
>gi|326917280|ref|XP_003204928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Meleagris gallopavo]
Length = 528
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 162/325 (49%), Positives = 218/325 (67%), Gaps = 10/325 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S+VI F+NEAWS LLRTVHSV+ SP +L+E++LVDD S ++ LK L+ YVA L
Sbjct: 80 LPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILVDDYSDKDHLKEPLENYVAGL 139
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A+G+IL FLD HCEC GWLE L+ R+AE+ + VV
Sbjct: 140 RK-VRLIRANKREGLVRARLLGASVAKGDILTFLDCHCECHEGWLEPLLERIAEEESAVV 198
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ + E G F+W L F W+T + +RK + ++P MAG
Sbjct: 199 CPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTTPEREQK-RRKSKIDVIRSPTMAG 257
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++ + YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K +PYS
Sbjct: 258 GLFSVSKKYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPKQAPYS 317
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMDE+ E Y+ NP A + RL LR++LKC FK
Sbjct: 318 RSKALA-----NSVRAAEVWMDEYKELYYHRNPHARLEPYGDVTERRL-LREKLKCKDFK 371
Query: 496 WYLTHVWPHHFLPMD-DKFFGRIRH 519
W+L +V+P +P D FFG +++
Sbjct: 372 WFLENVYPELHVPEDRPGFFGMLKN 396
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/120 (55%), Positives = 86/120 (71%), Gaps = 1/120 (0%)
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R LP P+CK K ++ LPK+S+VI F+NEAWS LLRTVHSV+ SP +L+E++LV
Sbjct: 60 RKLPWRFSPRCKEKKYDYYSLPKTSVVIAFYNEAWSTLLRTVHSVLETSPDILLEEVILV 119
Query: 129 DDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
DD S ++ LK L+ YVA L R+IR+ R GL++ARLLGA A+G+IL FLD HCEC
Sbjct: 120 DDYSDKDHLKEPLENYVAGLRK-VRLIRANKREGLVRARLLGASVAKGDILTFLDCHCEC 178
>gi|327279823|ref|XP_003224655.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Anolis carolinensis]
Length = 941
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 161/356 (45%), Positives = 228/356 (64%), Gaps = 14/356 (3%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C LV N+ LP +SI++ F +E WS LLR+VHSV++RSP ++KEI+LVDD ST+E+L
Sbjct: 487 CSDILVHND--LPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPQLIKEIILVDDFSTKEYL 544
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K LD+Y+A+ R++ R GLI+ARL GA A+G++L FLD+H EC +GWLE L+
Sbjct: 545 KDKLDKYMAQFP-KVRILHLKERYGLIRARLAGAEIAKGDVLTFLDSHVECNVGWLEPLL 603
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DF 364
R+ +R +V CPVI++ISD +Y+ G FNW ++F W D I K K
Sbjct: 604 ERIHLNRKKVPCPVIEVISDKDMSYMTVDNFQRGIFNWPMNFGWKPI-PPDVIEKNKIKE 662
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+ + P MAGGLF+ID+ YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V
Sbjct: 663 TDVIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIIPCSRV 722
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE-AEKQRDKQAVRSRL 483
H+FR +PYSFP + NLARVA VW+D++ + ++ +K D + +
Sbjct: 723 GHIFRSDNPYSFPKDRLTTVERNLARVAEVWLDDYKDLFYGHGYHLVQKNLDVGDLTQQK 782
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQAS 539
ELRK+L+C SFKWYL +V+P P+ K G I ++ KC+ ++NQ+S
Sbjct: 783 ELRKRLQCKSFKWYLENVYPDIEAPL-VKASGLIINIALAKCI-------TVNQSS 830
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 101/151 (66%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ + FN+ +SD IP++R + D R C + + LP +SI++ F +E WS L
Sbjct: 455 EAKRRWNEGNFNVYLSDMIPIDRAIDDTRPIGC-SDILVHNDLPTTSIIMCFVDEVWSTL 513
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEI+LVDD ST+E+LK LD+Y+A+ R++ R GLI+A
Sbjct: 514 LRSVHSVLNRSPPQLIKEIILVDDFSTKEYLKDKLDKYMAQFP-KVRILHLKERYGLIRA 572
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA A+G++L FLD+H EC + + E L
Sbjct: 573 RLAGAEIAKGDVLTFLDSHVECNVGWLEPLL 603
>gi|195172682|ref|XP_002027125.1| GL20074 [Drosophila persimilis]
gi|194112938|gb|EDW34981.1| GL20074 [Drosophila persimilis]
Length = 597
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 172/430 (40%), Positives = 250/430 (58%), Gaps = 27/430 (6%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++++F+NE +S LLRTVHS + + LKEI+LVDD S L LD
Sbjct: 139 FDSSTLPTASVIVIFYNEPYSVLLRTVHSTLITCNQQALKEIILVDDGSDNPELGGKLDY 198
Query: 252 YVAKLSVPTRV--IRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
Y+ + P +V +R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 199 YIRTRTPPGKVTVLRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNVGWCEPLLHRIK 258
Query: 310 EDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 259 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRRECK 316
Query: 366 E-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
+ P +P MAGGLFA+DR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE P
Sbjct: 317 QEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIP 376
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
CS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D V
Sbjct: 377 CSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEFINIFFLNRPDLKFHADIGDV 434
Query: 480 RSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQA 538
R+ LRK+L+C SF WYL +++P F+P D +G+++ V ++ C++ L N+
Sbjct: 435 THRVMLRKKLRCKSFAWYLKNIYPEKFVPNADVVGWGKVKSVSSNLCLDDLLQN---NEK 491
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC---SGF 595
L PC + +Q+F + +E C V ++ + RV ++ C F
Sbjct: 492 PYNVGLYPCGKVLQKSQLF--SFTNSQVLRNELSCATV-QHSDSPPYRVVMVPCMENDEF 548
Query: 596 NRQRWTYDKE 605
N Q W ++ +
Sbjct: 549 NEQ-WKFEHQ 557
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 93/149 (62%), Gaps = 2/149 (1%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
+Y+ N +S+++ NR++ D R P C + F+ LP +S++++F+NE +S LLRTV
Sbjct: 106 IYKKIALNEELSEQLSYNRSVGDHRNPLCLAQHFDSSTLPTASVIVIFYNEPYSVLLRTV 165
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR--VIRSPGRVGLIKARL 168
HS + + LKEI+LVDD S L LD Y+ + P + V+R R+GLI+ARL
Sbjct: 166 HSTLITCNQQALKEIILVDDGSDNPELGGKLDYYIRTRTPPGKVTVLRLKNRLGLIRARL 225
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
GAR A G++L+FLDAHCE + + E L
Sbjct: 226 AGARIATGDVLIFLDAHCEGNVGWCEPLL 254
>gi|196001853|ref|XP_002110794.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
gi|190586745|gb|EDV26798.1| hypothetical protein TRIADDRAFT_23130 [Trichoplax adhaerens]
Length = 536
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 177/417 (42%), Positives = 246/417 (58%), Gaps = 26/417 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VIVFHNEA S LLRTVHSV+ RS ++ +I+LVDD S+ + L +Y+A L
Sbjct: 90 LPQTSVVIVFHNEALSTLLRTVHSVLDRSAPDLIHQIILVDDFSSIKG-HDPLKKYIADL 148
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++R+P R GLI++R++G +A I+ FLDAHCE T+GWLE L+ RV ++R+ VV
Sbjct: 149 K-KVILVRNPKREGLIRSRIIGYSRATAPIVTFLDAHCEVTIGWLEPLLDRVHQNRSVVV 207
Query: 317 CPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF-KTPAMA 374
CP ID+I D TF Y S G FNW++ FRW S + +R ++ F ++P MA
Sbjct: 208 CPEIDVIDDKTFQYRAGSSGDIRGVFNWDMKFRWRLTPSQEQK-RRNNYNVLFARSPTMA 266
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAIDR YF IG YD +M +WGGENLE+SFR+WQCGG +EI PCSHV H+FR PY
Sbjct: 267 GGLFAIDRQYFQEIGLYDSQMDIWGGENLELSFRIWQCGGQLEIMPCSHVGHVFRNVIPY 326
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
FP + N R A VWMD + EF ++ P + + RLELRK+L+C SF
Sbjct: 327 KFPKDAGLTINKNSVRTAEVWMDGYKEFVYQRQPYM-RNIHFGNITERLELRKKLQCKSF 385
Query: 495 KWYLTHVWPHHFLPMDDKFF-GRIRHVQTHKCVEKPLAKGSMNQASGPA----SLLPCTH 549
KWYL HV+ LP + G++R+ ++ C +N P L PC H
Sbjct: 386 KWYLDHVFTDVILPNESAIAKGKVRNPESEMC---------LNTLGRPKHAFLGLSPCAH 436
Query: 550 LPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
M + + +A DE VC DV ++++ ++ +L C Q W++ K
Sbjct: 437 EG--KTMIISLTVLNELAMDE-VCFDVSDHQS--GGKITLLDCHSMGGNQFWSHKKN 488
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 106/154 (68%), Gaps = 3/154 (1%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
+++ +L++ + FN SD + ++RTLPD+R CK++VF ++ LP++S+VIVFHNEA S
Sbjct: 48 AESDKLFKNHGFNQWASDHMSLHRTLPDLRPSLCKSQVFPKD-LPQTSVVIVFHNEALST 106
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLRTVHSV+ RS ++ +I+LVDD S+ + L +Y+A L ++R+P R GLI+
Sbjct: 107 LLRTVHSVLDRSAPDLIHQIILVDDFSSIKG-HDPLKKYIADLK-KVILVRNPKREGLIR 164
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+R++G +A I+ FLDAHCE T+ + E L +
Sbjct: 165 SRIIGYSRATAPIVTFLDAHCEVTIGWLEPLLDR 198
>gi|307173963|gb|EFN64693.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Camponotus
floridanus]
Length = 597
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 175/424 (41%), Positives = 245/424 (57%), Gaps = 31/424 (7%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + ++ LP +S+VI+F+NE WS LLRTVHSV+ SP +LKEI+LVDD S E L+
Sbjct: 133 CMNITYDKLLPSASVVIIFYNEPWSVLLRTVHSVLKGSPPHLLKEIILVDDHSEEEELQG 192
Query: 248 SLDEYVA-KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD Y++ +L +++R R GLI+ARL GAR A+G++LVFLDAHCE WL+ L+
Sbjct: 193 QLDYYLSTRLPAKVKLLRLSHRQGLIRARLHGARNAKGDVLVFLDAHCEVIKDWLQPLLQ 252
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHW---GAFNWELHFRWYTYGSSDAIIKRKD 363
R+ +++ V+ P+ID IS+ T Y E + G F W HF W + + R
Sbjct: 253 RIKDNKNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHE-VESRPS 311
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
P ++P MAGGLFAI+R YF+ IG+YD++M WGGENLEMSFR+WQCGG++EI PCS
Sbjct: 312 PISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEMSFRIWQCGGTLEIIPCSR 371
Query: 424 VAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFY------FKFNPEAEKQRDK 476
V H+FR PY FP + +G N AR+A VWMD + + FK NP+
Sbjct: 372 VGHIFRNFHPYKFPN--DKDTHGINTARLAFVWMDGYKRLFLLHRSEFKDNPKLFGD--- 426
Query: 477 QAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSM 535
V RLELRK+LKC SFKWYL +++P F+P +D +GR+R KPL ++
Sbjct: 427 --VSERLELRKRLKCKSFKWYLDNIYPEKFIPDEDAVAYGRVR------LRNKPLCLDNL 478
Query: 536 NQASGPA---SLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC 592
Q L C +Q F + +L D C + + +V+++ C
Sbjct: 479 QQEEDKPYNLGLYTCHSKLYPSQFFSLSNAGELRKDDS--CGIILDDNQKPYAQVQMIEC 536
Query: 593 SGFN 596
+ N
Sbjct: 537 NNEN 540
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/132 (50%), Positives = 94/132 (71%), Gaps = 2/132 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+++S++I + R LPD+R C + ++ LP +S+VI+F+NE WS LLRTVHSV+
Sbjct: 111 LNVILSNKISLTRKLPDIRNSLCMNITY-DKLLPSASVVIIFYNEPWSVLLRTVHSVLKG 169
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVA-KLSVPTRVIRSPGRVGLIKARLLGARQAE 175
SP +LKEI+LVDD S E L+ LD Y++ +L +++R R GLI+ARL GAR A+
Sbjct: 170 SPPHLLKEIILVDDHSEEEELQGQLDYYLSTRLPAKVKLLRLSHRQGLIRARLHGARNAK 229
Query: 176 GEILVFLDAHCE 187
G++LVFLDAHCE
Sbjct: 230 GDVLVFLDAHCE 241
>gi|189237799|ref|XP_001814012.1| PREDICTED: similar to N-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270008127|gb|EFA04575.1| PNR-like protein [Tribolium castaneum]
Length = 614
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 212/317 (66%), Gaps = 5/317 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP ++I+I F+NE + LLRTVHS+I R+P S+LKEILLVDD S E L +L Y+ K
Sbjct: 152 LPTAAIIICFYNEHYYTLLRTVHSIIDRTPASVLKEILLVDDFSDLENLHENLSTYITKN 211
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
++I++ R GLI+ARL GAR+ + ++++FLD+H E +GW+E L+ R+ ++ T V
Sbjct: 212 FDDRVKLIKTERREGLIRARLFGARRTKQDVIIFLDSHIEVNVGWIEPLLQRIKDNYTNV 271
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
PVIDII+ TFAY S L G FNW LHF+W + + DF +P K+P MAG
Sbjct: 272 AMPVIDIINADTFAYTAS-PLVRGGFNWGLHFKWENLPKG-TLSTKMDFIKPIKSPTMAG 329
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+ R YF +G YD M +WGGENLE+SFR+W CGG +E+ PCS V H+FR+ PY
Sbjct: 330 GLFAMSRKYFTDLGEYDAGMNIWGGENLEISFRIWMCGGRLELIPCSRVGHVFRQRRPYG 389
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G +L+ +L RVA VWMD + E++ P+A K+ D V SR++LRK+LKCH F
Sbjct: 390 APDGQDTMLHNSL-RVANVWMDSYKEYFLNHRPDA-KRIDFGDVSSRVQLRKELKCHDFD 447
Query: 496 WYLTHVWPHHFLPMDDK 512
WYL +V+P LP D++
Sbjct: 448 WYLKNVYPELALPTDNE 464
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 106/164 (64%), Gaps = 5/164 (3%)
Query: 40 GLHSNLSDAQQL---YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
GL N+ D ++ Y+ + +N+L+S+R+ +R +PD R CK ++ + LP ++I+I
Sbjct: 101 GLVKNIDDQRKKDEGYKKHAYNVLISERLSYHRDVPDTRNELCKNISYSAD-LPTAAIII 159
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVI 155
F+NE + LLRTVHS+I R+P S+LKEILLVDD S E L +L Y+ K ++I
Sbjct: 160 CFYNEHYYTLLRTVHSIIDRTPASVLKEILLVDDFSDLENLHENLSTYITKNFDDRVKLI 219
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++ R GLI+ARL GAR+ + ++++FLD+H E + + E L +
Sbjct: 220 KTERREGLIRARLFGARRTKQDVIIFLDSHIEVNVGWIEPLLQR 263
>gi|351709330|gb|EHB12249.1| Polypeptide N-acetylgalactosaminyltransferase 4 [Heterocephalus
glaber]
Length = 582
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 173/397 (43%), Positives = 242/397 (60%), Gaps = 16/397 (4%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + ++ LP +S+VI F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +L
Sbjct: 127 ECKSKTYDYRRLPTTSVVIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYL 186
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K+ L+ Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+
Sbjct: 187 KAQLETYISSLER-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLL 245
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ D T VVCPVID I TF Y+++ E G F+W L F+W++ + +R
Sbjct: 246 ERIGRDETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKQERD-RRTSR 304
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV
Sbjct: 305 IDPIRSPTMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHV 364
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P + N AR A VWMD++ E ++ NP A K+ +L
Sbjct: 365 GHVFPKRAPYARPNFLQ-----NTARAAEVWMDDYKEHFYNRNPPARKEAYGDISERKL- 418
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPA 542
LRKQL+C SF WYL +V+ + +P D + G IR + + +C++ N
Sbjct: 419 LRKQLRCKSFDWYLKNVFSNLHVPEDRPGWHGAIRSLGISSECLD--YNSPDNNPTGANL 476
Query: 543 SLLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPE 578
SL C H Q F ++ + +C +VPE
Sbjct: 477 SLFGC-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPE 512
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 71/157 (45%), Positives = 104/157 (66%), Gaps = 1/157 (0%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
L ++L + N+ +SDRI ++R + D R +CK+K ++ LP +S+VI F+NEA
Sbjct: 92 GELKQQEELIERYAINIYLSDRISLHRHIEDKRMSECKSKTYDYRRLPTTSVVIAFYNEA 151
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R G
Sbjct: 152 WSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKAQLETYISSLER-VRLIRTNKREG 210
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L++ARL+GA A G++L FLD HCEC + E L +
Sbjct: 211 LVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLER 247
>gi|344268426|ref|XP_003406061.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13
[Loxodonta africana]
Length = 560
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 180/454 (39%), Positives = 257/454 (56%), Gaps = 51/454 (11%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T V+ +E LP +S+VIVFHNEAWS LLRTVHSVI+RSP +L E++LVDDAS R+FLK +
Sbjct: 107 TKVYPDE-LPNTSVVIVFHNEAWSTLLRTVHSVINRSPHYLLSEVILVDDASERDFLKLT 165
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L+ +V L VP ++IR R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+
Sbjct: 166 LENHVKNLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARI 225
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
+DR VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 226 KDDRKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPV 285
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCG--GSIEIAPCSHVAH 426
+TP MAGGLF+IDR YF IG YD M +WGGENLEMSFR + S P S ++
Sbjct: 286 RTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRTYSLMELESKNTVPYSVMSC 345
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEW-----AEFYFKFNPEAEKQRDKQAVRS 481
A Y ++ V+ + W + W +F++ +P K D V
Sbjct: 346 HEAHAVVYVNSRALTHVIN---KKQQEDWQEVWDGMNLKDFFYIISPGVVKV-DYGDVSV 401
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------- 534
R LR+ LKC F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 402 RKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFN 461
Query: 535 --------------------------MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLI 566
+++ +GP +L C H+ L + +L I
Sbjct: 462 CHGMGGNQVFSYTADKEIRTDDLCLDVSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHI 521
Query: 567 ATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
+++ CLD P E+ + P ++ CSG Q+W
Sbjct: 522 NSNQ--CLDEPSEEDKMVPTMQ--DCSGSRSQQW 551
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 118/151 (78%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L++IN+FNL+ SD I +NR+LPDVR CKTKV+ +E LP +S+VIVFHNEAWS LLR
Sbjct: 75 KELFKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDE-LPNTSVVIVFHNEAWSTLLR 133
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSVI+RSP +L E++LVDDAS R+FLK +L+ +V L VP ++IR R GLI+ARL
Sbjct: 134 TVHSVINRSPHYLLSEVILVDDASERDFLKLTLENHVKNLEVPVKIIRMEERSGLIRARL 193
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GA ++G+++ FLDAHCECTL + E L +
Sbjct: 194 RGAAASKGQVITFLDAHCECTLGWLEPLLAR 224
>gi|403258971|ref|XP_003922013.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Saimiri boliviensis boliviensis]
Length = 967
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 159/354 (44%), Positives = 225/354 (63%), Gaps = 5/354 (1%)
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
AE + +FL C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEI
Sbjct: 501 AEIKTSLFLIHGCTEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEI 558
Query: 234 LLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAH 293
LLVDD ST+++LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H
Sbjct: 559 LLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLRERHGLIRARLAGAQNATGDVLTFLDSH 617
Query: 294 CECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYG 353
EC +GWLE L+ RV R +V CPVI++I+D +Y+ G F W ++F W T
Sbjct: 618 VECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIP 677
Query: 354 SSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCG 413
R T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CG
Sbjct: 678 PDVIAKNRIKETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCG 737
Query: 414 GSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQ 473
G IEI PCS V H+FR +PYSFP + + NL RVA VW+DE+ E ++ Q
Sbjct: 738 GEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLINQ 797
Query: 474 R-DKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
D + + ELRK+L+C SFKWYL +V+P P+ + G + +V KC+
Sbjct: 798 GLDVGNLTQQRELRKKLRCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 850
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 70/182 (38%), Positives = 109/182 (59%), Gaps = 27/182 (14%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKP-----------------KCKTKVF 84
H +A++ ++ FN+ +SD IPV+R + D R + KT +F
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGEQLLLPLFPCSHMTLAEIKTSLF 508
Query: 85 -----NEEF----LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 135
E+ LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 509 LIHGCTEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 568
Query: 136 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEE 195
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC + + E
Sbjct: 569 YLKDNLDKYMSQFP-KVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 627
Query: 196 FL 197
L
Sbjct: 628 LL 629
>gi|355689595|gb|AER98885.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mustela putorius
furo]
Length = 452
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/330 (47%), Positives = 219/330 (66%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRTVHSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 3 FNYRRLPTTSVIIAFYNEAWSTLLRTVHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 62
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R++ D
Sbjct: 63 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERISYD 121
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T VVCPVID I TF Y+++ E G F+W L F+W++ + +RK +P ++
Sbjct: 122 ETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRKSRIDPIRS 180
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 181 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 240
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 241 QAPYSRNKALA-----NCVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 294
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C F+W+L +V+P +P D FFG +++
Sbjct: 295 CKDFRWFLENVYPELHVPEDRPGFFGMLQN 324
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/120 (51%), Positives = 83/120 (69%), Gaps = 1/120 (0%)
Query: 82 KVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 141
K FN LP +S++I F+NEAWS LLRTVHSV+ SP +LKEI+LVDD S R +LK+ L
Sbjct: 1 KKFNYRRLPTTSVIIAFYNEAWSTLLRTVHSVLETSPAVLLKEIILVDDLSDRVYLKTQL 60
Query: 142 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 61 ETYISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNSGWLEPLLERIS 119
>gi|170572320|ref|XP_001892064.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158602953|gb|EDP39125.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 576
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 159/345 (46%), Positives = 226/345 (65%), Gaps = 18/345 (5%)
Query: 181 FLDAHCECTLVFNEEF---LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD 237
++D C+ NE++ LP +S++I FHNEAWS LLRTVHSV+ R+P ++L E++LVD
Sbjct: 75 YIDEECK-----NEKYTSDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAELILVD 129
Query: 238 DASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 297
D S LK+ L+ Y+ + S R++R R GLI+AR+ GA ++G ++ +LD+HCEC
Sbjct: 130 DFSDMAHLKADLEIYMRQFS-KVRILRLEKREGLIRARIRGAAISKGSVITYLDSHCECL 188
Query: 298 LGWLENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSS 355
GW+E L+ R+ + VVCPVID+I D TF Y +++ + G F+W L F W+
Sbjct: 189 EGWVEPLLDRIKRNPKTVVCPVIDVIDDNTFEYHYSKAYFTNVGGFDWSLQFNWHAIPEK 248
Query: 356 DAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGS 415
D +R +P K+P MAGGLF+IDR +F +G+YD + +WGGENLE+SF++W CGG
Sbjct: 249 DRKGRRD--IDPVKSPTMAGGLFSIDRTFFEELGSYDPGLDIWGGENLELSFKIWMCGGI 306
Query: 416 IEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQR 474
+EI PCSHV H+FRK SPY + GV+ VL N R+A VWMDE+ ++Y+ + N
Sbjct: 307 LEIVPCSHVGHIFRKRSPYKWRSGVN-VLKRNSVRLAEVWMDEYKKYYYERIN---NNLG 362
Query: 475 DKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH 519
D V SR LRK+L+C SFKWYL +V+P F+P D G IR+
Sbjct: 363 DFGDVSSRKALRKKLQCKSFKWYLDNVYPELFVPGDAIGKGEIRN 407
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 112/190 (58%), Gaps = 24/190 (12%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN +SD I ++R+LP +CK + + + LP +S++I FHNEAWS LLRTVHSV+
Sbjct: 56 NAFNQYISDMISIHRSLPSYIDEECKNEKYTSD-LPNTSVIICFHNEAWSVLLRTVHSVL 114
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P ++L E++LVDD S LK+ L+ Y+ + S R++R R GLI+AR+ GA +
Sbjct: 115 ERTPENLLAELILVDDFSDMAHLKADLEIYMRQFS-KVRILRLEKREGLIRARIRGAAIS 173
Query: 175 EGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEIL 234
+G ++ +LD+HCEC + E L + I R+P++++ ++
Sbjct: 174 KGSVITYLDSHCECLEGWVEPLLDR----------------------IKRNPKTVVCPVI 211
Query: 235 LVDDASTREF 244
V D +T E+
Sbjct: 212 DVIDDNTFEY 221
>gi|327282475|ref|XP_003225968.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Anolis carolinensis]
Length = 583
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 170/406 (41%), Positives = 250/406 (61%), Gaps = 16/406 (3%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + ++ LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S + +L
Sbjct: 128 ECRSKTYDYRRLPTTSVIIAFYNEAWSTLLRTIHSVLESSPSVLLKEIILVDDLSDKVYL 187
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K L++Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+
Sbjct: 188 KGELEKYISNLQR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECVPGWLEPLL 246
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
RVAE+ + ++CPVID I TF Y++ E G F+W L F+W++ + +RK
Sbjct: 247 QRVAENESVIICPVIDTIDWNTFEFYMQPGEPMIGGFDWRLTFQWHSVPDYERQ-RRKSK 305
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+ + YF ++G YD M VWGGENLE+SFRVWQCGG +EI PCSHV
Sbjct: 306 VDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMDVWGGENLELSFRVWQCGGILEIHPCSHV 365
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P + N AR A VWMD++ E ++ NP A K+ +L
Sbjct: 366 GHVFPKRAPYARPNFLQ-----NTARAAEVWMDDYKEHFYNRNPPARKENFGDLSERKL- 419
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPA 542
LRK+L+C++F WYL +++P+ +P D + G IR + + +C++ N
Sbjct: 420 LRKKLQCNNFDWYLKNIFPNLHVPEDRPGWHGAIRSMGISSECLD--YNSPEHNPTGAHV 477
Query: 543 SLLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDISPRV 587
SL C H Q F + ++ + +C +VPE ++ + R
Sbjct: 478 SLFGC-HGQGGNQFFEYTVNQEIRFNSVTELCAEVPEQKDFVGMRT 522
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 66/140 (47%), Positives = 99/140 (70%), Gaps = 1/140 (0%)
Query: 58 NLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 117
N+ +SD+I ++R + D R P+C++K ++ LP +S++I F+NEAWS LLRT+HSV+ S
Sbjct: 108 NIYLSDKISLHRHIDDGRMPECRSKTYDYRRLPTTSVIIAFYNEAWSTLLRTIHSVLESS 167
Query: 118 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 177
P +LKEI+LVDD S + +LK L++Y++ L R+IR+ R GL++ARL+GA A G+
Sbjct: 168 PSVLLKEIILVDDLSDKVYLKGELEKYISNLQR-VRLIRTNKREGLVRARLIGATFATGD 226
Query: 178 ILVFLDAHCECTLVFNEEFL 197
+L FLD HCEC + E L
Sbjct: 227 VLTFLDCHCECVPGWLEPLL 246
>gi|125810093|ref|XP_001361353.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
gi|54636528|gb|EAL25931.1| GA20875 [Drosophila pseudoobscura pseudoobscura]
Length = 597
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 172/430 (40%), Positives = 250/430 (58%), Gaps = 27/430 (6%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++++F+NE +S LLRTVHS + + LKEI+LVDD S L LD
Sbjct: 139 FDSSTLPTASVIVIFYNEPYSVLLRTVHSTLITCNQQALKEIILVDDGSDNPELGGKLDY 198
Query: 252 YVAKLSVPTRV--IRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
Y+ + P +V +R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 199 YIRTRTPPGKVTVLRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNVGWCEPLLHRIK 258
Query: 310 EDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 259 ESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRRECK 316
Query: 366 E-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
+ P +P MAGGLFA+DR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE P
Sbjct: 317 QEREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIP 376
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
CS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D V
Sbjct: 377 CSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEFINIFFLNRPDLKFHADIGDV 434
Query: 480 RSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQA 538
R+ LRK+L+C SF WYL +++P F+P D +G+++ V ++ C++ L N+
Sbjct: 435 THRVMLRKKLRCKSFAWYLKNIYPEKFVPNADVVGWGKVKSVSSNLCLDDLLQN---NEK 491
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC---SGF 595
L PC + +Q+F + +E C V ++ + RV ++ C F
Sbjct: 492 PYNVGLYPCGKVLQKSQLF--SFTNSQVLRNELSCATV-QHSDSPPYRVVMVPCMENDEF 548
Query: 596 NRQRWTYDKE 605
N Q W ++ +
Sbjct: 549 NEQ-WKFEHQ 557
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 62/149 (41%), Positives = 93/149 (62%), Gaps = 2/149 (1%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
+Y+ N +S+++ NR++ D R P C + F+ LP +S++++F+NE +S LLRTV
Sbjct: 106 IYKKIALNEELSEQLSYNRSVGDHRNPLCLAQHFDSSTLPTASVIVIFYNEPYSVLLRTV 165
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR--VIRSPGRVGLIKARL 168
HS + + LKEI+LVDD S L LD Y+ + P + V+R R+GLI+ARL
Sbjct: 166 HSTLITCNQQALKEIILVDDGSDNPELGGKLDYYIRTRTPPGKVTVLRLKNRLGLIRARL 225
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
GAR A G++L+FLDAHCE + + E L
Sbjct: 226 AGARIATGDVLIFLDAHCEGNVGWCEPLL 254
>gi|393908333|gb|EFO20718.2| glycosyl transferase [Loa loa]
Length = 622
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 178/428 (41%), Positives = 254/428 (59%), Gaps = 27/428 (6%)
Query: 181 FLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS 240
++D C+ N+ LP +S++I FHNEAWS LLRTVHSV+ R+P ++L EI+LVDD S
Sbjct: 161 YIDEECKTEKYAND--LPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFS 218
Query: 241 TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGW 300
LK+SL+ Y+ + R++R R GLI+AR+ GA ++G ++ +LD+HCEC GW
Sbjct: 219 DMAHLKASLEIYMRQFP-KVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGW 277
Query: 301 LENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAI 358
+E L+ R+ ++ VVCPVID+I D TF Y +++ + G F+W L F W+ D
Sbjct: 278 MEPLLDRIKKNPKTVVCPVIDVIDDNTFEYHYSKAYFTNVGGFDWSLQFNWHAIPEKDRK 337
Query: 359 IKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEI 418
+R +P K+P MAGGLF+IDR +F +G+YD + +WGGENLE+SF+ W CGG +EI
Sbjct: 338 GRRD--IDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEI 395
Query: 419 APCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQ 477
PCSHV H+FRK SPY + GV+ VL N R+A VWMDE+ ++Y+ + N D
Sbjct: 396 VPCSHVGHIFRKRSPYKWLSGVN-VLKRNSVRLAEVWMDEYKKYYYERIN---NNLGDFG 451
Query: 478 AVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH--VQTHKCVEKPLAKGSM 535
V SR LR++L+C SFKWYL +V+P F+P D G IR+ + C++
Sbjct: 452 DVSSRKALREKLQCKSFKWYLDNVYPELFVPGDAIGKGEIRNRGGGSKNCLD---WASHG 508
Query: 536 NQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF 595
Q S A L C H Q +++ + I DES C+D + V + C G
Sbjct: 509 RQRSVNAGLYWC-HKKGGNQYWMLSKDGE-IRRDES-CIDYAGVD------VMVYPCHGM 559
Query: 596 -NRQRWTY 602
Q W Y
Sbjct: 560 KGNQEWKY 567
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/193 (37%), Positives = 114/193 (59%), Gaps = 24/193 (12%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
+ N FN VSD I ++R+LP +CKT+ + + LP +S++I FHNEAWS LLRTVH
Sbjct: 139 FNKNAFNQYVSDMISIHRSLPSYIDEECKTEKYAND-LPNTSVIICFHNEAWSVLLRTVH 197
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV+ R+P ++L EI+LVDD S LK+SL+ Y+ + R++R R GLI+AR+ GA
Sbjct: 198 SVLERTPENLLAEIILVDDFSDMAHLKASLEIYMRQFP-KVRILRLEKREGLIRARIKGA 256
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLK 231
++G ++ +LD+HCEC + E L + I ++P++++
Sbjct: 257 AISKGSVITYLDSHCECLEGWMEPLLDR----------------------IKKNPKTVVC 294
Query: 232 EILLVDDASTREF 244
++ V D +T E+
Sbjct: 295 PVIDVIDDNTFEY 307
>gi|114581297|ref|XP_525944.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Pan troglodytes]
gi|410296312|gb|JAA26756.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
gi|410333399|gb|JAA35646.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 5 (GalNAc-T5) [Pan
troglodytes]
Length = 940
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 222/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDTIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLKCKSFKWYLENVFPDLRAPI-VRASGVLINVALGKCI 823
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|340378190|ref|XP_003387611.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 512
Score = 316 bits (809), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 160/365 (43%), Positives = 227/365 (62%), Gaps = 17/365 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C LP +S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S
Sbjct: 74 HSWCYNQVYHPTLPSTSVIITFHNEARSTLLRTIVSVLNRSPPHLIEEIILVDDFS---- 129
Query: 245 LKSSLDEYVAKLSVPT-RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
++ + +P ++IR+ R GL+++R+ GA A+GEIL FLD+HCEC +GWLE
Sbjct: 130 --EDVNTGLLLTQMPKIKLIRNERREGLVRSRIFGADAAKGEILTFLDSHCECNIGWLEP 187
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV++DRT VV P+ID+IS TF Y+ + G F+W LHF+W + + KRK
Sbjct: 188 LLHRVSQDRTIVVSPIIDVISMDTFDYIGASSELRGGFDWSLHFKWDGFTPAQRA-KRKS 246
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
EP KTP +AGGLF+I+R F G YD++M +WGGEN E+SFR W CGGS+EI PCS
Sbjct: 247 PIEPIKTPMIAGGLFSINRQRFIETGKYDDQMDIWGGENFEISFRTWMCGGSLEIIPCSR 306
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FPGG + N R A VWMD + ++Y+ P A K RD +++SR+
Sbjct: 307 VGHVFRKRHPYVFPGGNAMTYMKNTKRAAEVWMDNYKDYYYSARPSA-KGRDMGSIKSRV 365
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
LRK+L C +F WY+ +V+P +P + G+++ Q + C++ +QA P
Sbjct: 366 ALRKRLNCTTFDWYMKNVYPELSVPSSTNNKHGKLK--QNNLCLDT-----LGHQAGEPV 418
Query: 543 SLLPC 547
L C
Sbjct: 419 GLQDC 423
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/188 (36%), Positives = 110/188 (58%), Gaps = 8/188 (4%)
Query: 15 NQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDV 74
N QD + + + G +V + S+ D+ Y+ N FN SD+ ++R +PD
Sbjct: 13 NHQDKLSSDDLIPGKPNIAGFNVTAYILSSPKDSNDAYKKNAFNQEASDKTSIDRKVPDT 72
Query: 75 RKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 134
R C +V++ LP +S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S
Sbjct: 73 RHSWCYNQVYHPT-LPSTSVIITFHNEARSTLLRTIVSVLNRSPPHLIEEIILVDDFS-- 129
Query: 135 EFLKSSLDEYVAKLSVPT-RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFN 193
++ + +P ++IR+ R GL+++R+ GA A+GEIL FLD+HCEC + +
Sbjct: 130 ----EDVNTGLLLTQMPKIKLIRNERREGLVRSRIFGADAAKGEILTFLDSHCECNIGWL 185
Query: 194 EEFLPKSS 201
E L + S
Sbjct: 186 EPLLHRVS 193
>gi|149730635|ref|XP_001491185.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Equus
caballus]
Length = 940
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 221/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGVFVWPMNFGWRTIPPDIVAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 DTDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLKCKSFKWYLDNVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|26325284|dbj|BAC26396.1| unnamed protein product [Mus musculus]
Length = 930
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 237/384 (61%), Gaps = 26/384 (6%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
+++ R + +PG+ G K + R EG V+L A C
Sbjct: 418 VTLSPRDLNAPGQFGRPVVVPPEKKKEAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 477
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N+ LP +SI++ F +E WSALLR+VHSV++RSP ++KEILLVDD ST+E+LK+
Sbjct: 478 EQLVHND--LPTTSIIMCFVDEVWSALLRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKA 535
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 536 DLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLER 594
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTE 366
V +R +V CPVI++I+D +Y+ G F W ++F W T D + K T+
Sbjct: 595 VYLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFGWKTI-PPDVVAKNGIKETD 653
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
+ P MAGGLF+ID++YF+ +G YD + VWGGEN+E+SF+VW CGG IE+ PCS V H
Sbjct: 654 IIRCPVMAGGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIELIPCSRVGH 713
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLEL 485
+FR +PYSFP + + NL RVA VW+D++ E FY + ++ D + + EL
Sbjct: 714 IFRNDNPYSFPKDRMKTVERNLVRVAEVWLDDYRELFYGHGDHLIDQGLDVGNLTQQREL 773
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPM 509
RK+LKC SFKWYL +V+P P+
Sbjct: 774 RKKLKCKSFKWYLDNVFPDLKAPV 797
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/151 (46%), Positives = 107/151 (70%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A+Q ++ FN+ +SD IPV+R + D R C ++ + + LP +SI++ F +E WSAL
Sbjct: 444 EAEQRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHND-LPTTSIIMCFVDEVWSAL 502
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+E+LK+ LD+Y+++ R++R R GLI+A
Sbjct: 503 LRSVHSVLNRSPPHLIKEILLVDDFSTKEYLKADLDKYMSQFP-KVRILRLKERHGLIRA 561
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 562 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 592
>gi|348539520|ref|XP_003457237.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Oreochromis niloticus]
Length = 619
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 183/435 (42%), Positives = 246/435 (56%), Gaps = 46/435 (10%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP +S+VI F NEA SALLRTVHSV+ R+P +L EI+LVDD S E LK LD YV A+
Sbjct: 126 LPSASVVICFFNEALSALLRTVHSVLDRTPAYLLHEIILVDDHSELEELKDELDRYVRAE 185
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +++R+ R GLI+ R++GA A GE+LVFLD+HCE WL+ L+A + +D V
Sbjct: 186 LQGKVQLVRNQRREGLIRGRMIGASHATGEVLVFLDSHCEVNQAWLQPLLAPIQKDHRTV 245
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + + P ++P MAG
Sbjct: 246 VCPVIDIISADTLAYSPS-PIVRGGFNWGLHFKWDPVPPSE-LSGPEGASGPIRSPTMAG 303
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 304 GLFAMNRKYFNELGQYDAGMDIWGGENLEISFRIWMCGGQLFIIPCSRVGHIFRKRRPYG 363
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMD + E Y PE + R + R+ LRK+L+CHSF+
Sbjct: 364 SPGGHDTMAHNSL-RLAHVWMDGYKEQYLSLRPEL-RNRSYGDIGERVALRKRLQCHSFR 421
Query: 496 WYLTHVWPHHFLPMDDK----FF-------------GRIRHVQTHKCVEKPLAKGSMNQA 538
WYL V+P + F GR+R++ +C+ +A+G +Q
Sbjct: 422 WYLDTVYPEMQTAANGNKQQPLFINKGLKRPKVLQRGRLRNLAIRRCL---VAQGRASQK 478
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDES--------VCLDVPEYENDISPRVRIL 590
G L PC + P A DE +CLDV E P R++
Sbjct: 479 GGAVVLRPCDP----------RDPDQDWAYDEEGQLILAGLLCLDVSEVRTFDPP--RLM 526
Query: 591 ACSG-FNRQRWTYDK 604
C G Q+W+ K
Sbjct: 527 KCHGSGGSQQWSLGK 541
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 98/147 (66%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN+L+S+R+ +R LP+ R +C+ K + LP +S+VI F NEA SALLRTVH
Sbjct: 90 YHRHAFNVLISNRLGFHRQLPETRDAQCREKSYPVA-LPSASVVICFFNEALSALLRTVH 148
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S E LK LD YV A+L +++R+ R GLI+ R++G
Sbjct: 149 SVLDRTPAYLLHEIILVDDHSELEELKDELDRYVRAELQGKVQLVRNQRREGLIRGRMIG 208
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE + + L
Sbjct: 209 ASHATGEVLVFLDSHCEVNQAWLQPLL 235
>gi|348533009|ref|XP_003453998.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oreochromis niloticus]
Length = 600
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 157/357 (43%), Positives = 234/357 (65%), Gaps = 14/357 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ E++LVDD S +E
Sbjct: 127 HENCRQKLYAEKLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPSRLITEVILVDDFSDKEH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK +L+EY+ ++ R++R+ R GLI+ RLLGA A+GE++ FLD+HCE + WL L
Sbjct: 187 LKVALEEYMKRMP-KVRILRTKKREGLIRTRLLGAAAAKGEVITFLDSHCEANVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A++R +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LDRIAQNRKAIVCPMIDVIDHDNFGYDTQAGDAMRGAFDWEMYYKRIPIPPE---MQRDD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF++W CGG +E PCS
Sbjct: 303 PSEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKLWMCGGRMEDIPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY PGG+S L NL RVA VWMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPGGIS--LAKNLKRVAEVWMDEYAEYVYQRRPEY-RHLSAGDMTAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE-KPLAKGS 534
ELR +L C SFKW++ V W P H+ P++ +G I++V + C+E K GS
Sbjct: 420 ELRTRLNCKSFKWFMNEVAWDLPKHYPPVEPPAAAWGEIQNVGSGMCMEVKHFVSGS 476
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 73/141 (51%), Positives = 106/141 (75%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ VSDRI +NR+LPD+R C+ K++ E+ LP +SI+I FHNE WS+LLRT
Sbjct: 101 QAYRENGFNIYVSDRISLNRSLPDIRHENCRQKLYAEK-LPNTSIIIPFHNEGWSSLLRT 159
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSV++RSP ++ E++LVDD S +E LK +L+EY+ ++ R++R+ R GLI+ RLL
Sbjct: 160 VHSVLNRSPSRLITEVILVDDFSDKEHLKVALEEYMKRMP-KVRILRTKKREGLIRTRLL 218
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA A+GE++ FLD+HCE +
Sbjct: 219 GAAAAKGEVITFLDSHCEANV 239
>gi|426221067|ref|XP_004004733.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Ovis
aries]
Length = 938
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 160/345 (46%), Positives = 225/345 (65%), Gaps = 7/345 (2%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 482 AGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 539
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA++A G++L FLD+H EC +GWLE
Sbjct: 540 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQKATGDVLTFLDSHVECNIGWLEP 598
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK- 362
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T D + K K
Sbjct: 599 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTI-PPDVVAKNKI 657
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 658 KETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCS 717
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRS 481
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D +
Sbjct: 718 RVGHIFRNDNPYSFPKDRMKTVERNLGRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQ 777
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
+ ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 778 QRELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 821
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H + ++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 447 HGKEKEVERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 505
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 506 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 564
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA++A G++L FLD+H EC + + E L
Sbjct: 565 GLIRARLAGAQKATGDVLTFLDSHVECNIGWLEPLL 600
>gi|449666442|ref|XP_002161887.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6-like [Hydra
magnipapillata]
Length = 591
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 186/428 (43%), Positives = 262/428 (61%), Gaps = 40/428 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE---FLKSSLDEYV 253
L +S++++FHNEAWS LLRTVH+V++RSP MLKEI+LVDDAS +E L L+ YV
Sbjct: 131 LSTTSVIVIFHNEAWSVLLRTVHTVLARSPPHMLKEIILVDDASVKEKYGHLGEKLENYV 190
Query: 254 AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
LS ++IRSP RVGL +ARL+GA A GE+LVFLD+HCE + GWLE L+AR+ E+
Sbjct: 191 NTLS-KVKLIRSPVRVGLTQARLIGADNAVGEVLVFLDSHCEASFGWLEPLLARLQENPK 249
Query: 314 RVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
V P I++IS F Y + G F+WEL F W + ++RK ++P K+P
Sbjct: 250 LAVVPDIEVISFKNFEYSSEKGSYNRGIFSWELMFNWGPLPPREK-MRRKYESDPIKSPT 308
Query: 373 MAGGLFAIDRAYFFHIGAYDEE---------MQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
MAGGLFA++R YFF GAYD + + WGGEN+EMSFR+W CG IEI PCS
Sbjct: 309 MAGGLFAMNRKYFFESGAYDRQNILGRXXXXLTYWGGENVEMSFRLWMCGEGIEIIPCSR 368
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKF--NPEAEKQRDKQAVRS 481
V H+FR+ +PY P G ++ N RVA VWMDE+ E ++ F N + E+ D V
Sbjct: 369 VGHVFRERAPYKSPDGSTD---HNSIRVAEVWMDEFKEIFYSFRANLKPEQGGD---VSE 422
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGR--IRHVQTHKCVEKPLAKGSMNQA 538
R +LR+ LKC SFKWYL ++ P +P DK+ +GR ++++ T C++ LA+ N
Sbjct: 423 RKKLREDLKCKSFKWYLQNIIPELEIP--DKYPYGRGDVKNLGTLSCLDT-LAQ---NNQ 476
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFN 596
G L PC H Q F+ ++ + +CLD+ ++D++ +V++ C G N
Sbjct: 477 GGKPGLYPC-HKMGTNQYFIFTKKFEIW--HDGLCLDLS--DSDLNAKVKLWPCHKQGGN 531
Query: 597 RQRWTYDK 604
Q+W + K
Sbjct: 532 -QKWKHTK 538
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 82/174 (47%), Positives = 111/174 (63%), Gaps = 22/174 (12%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEF---LPKSSIVIVFHNEAWSALLRTVHSV 113
FN L S +I + R++PD R C FN ++ L +S++++FHNEAWS LLRTVH+V
Sbjct: 100 FNELASSKISLERSIPDNRDSSC----FNVDYPVKLSTTSVIVIFHNEAWSVLLRTVHTV 155
Query: 114 ISRSPRSMLKEILLVDDASTRE---FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLG 170
++RSP MLKEI+LVDDAS +E L L+ YV LS ++IRSP RVGL +ARL+G
Sbjct: 156 LARSPPHMLKEIILVDDASVKEKYGHLGEKLENYVNTLS-KVKLIRSPVRVGLTQARLIG 214
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL------PKSSI-----VIVFHNEAWSA 213
A A GE+LVFLD+HCE + + E L PK ++ VI F N +S+
Sbjct: 215 ADNAVGEVLVFLDSHCEASFGWLEPLLARLQENPKLAVVPDIEVISFKNFEYSS 268
>gi|443720685|gb|ELU10336.1| hypothetical protein CAPTEDRAFT_176696 [Capitella teleta]
Length = 587
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 223/342 (65%), Gaps = 9/342 (2%)
Query: 190 LVFNEEF---LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
L +EE+ LP +S+VI F+NEAWS LLRTVHS+I R+P ++L EI+LVDD S + L
Sbjct: 110 LCQSEEYPAELPSASVVICFYNEAWSVLLRTVHSIIDRTPSALLHEIILVDDFSDLDHLA 169
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD YV++ T+++R+ R GLI+AR++G+ A GE+LVFLD+HCE + W++ L++
Sbjct: 170 EQLDAYVSEHLPQTKLVRNTRREGLIRARVIGSEHATGEVLVFLDSHCEVNVEWIQPLLS 229
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+ + RV P+IDII TF Y S L G FNW L +RW S + K++D+ +
Sbjct: 230 HIHGNHKRVAVPIIDIIDQDTFRY-ESSPLVRGGFNWGLFYRWDQIPES-LLRKQEDYVK 287
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P KTP MAGGLFA++R YF +G YD M VWGGENLE+SFRVWQCGGS+ I PCS V H
Sbjct: 288 PIKTPTMAGGLFAMNRKYFNDLGRYDTGMDVWGGENLEISFRVWQCGGSMHILPCSRVGH 347
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
+FRK PY P GV + +L RVA VWMDE+ +++F+ A+ + V R LR
Sbjct: 348 IFRKRRPYGSPVGVDTITKNSL-RVAHVWMDEYIKYFFQVRKTAD-HAEYGDVSDRKALR 405
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEK 528
+L+C SFKW+L +V+P LP D + G I + H ++K
Sbjct: 406 NELQCQSFKWFLDNVYPEQTLPSDKEGGGLI--AKGHNLIKK 445
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/161 (45%), Positives = 109/161 (67%), Gaps = 4/161 (2%)
Query: 40 GLHSNLSDA---QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
GL N D +Q Y++ FN L+SDR+ +R +PDVR C+++ + E LP +S+VI
Sbjct: 69 GLVRNPQDQVKREQGYKLYAFNELISDRLSFHRPIPDVRHQLCQSEEYPAE-LPSASVVI 127
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
F+NEAWS LLRTVHS+I R+P ++L EI+LVDD S + L LD YV++ T+++R
Sbjct: 128 CFYNEAWSVLLRTVHSIIDRTPSALLHEIILVDDFSDLDHLAEQLDAYVSEHLPQTKLVR 187
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ R GLI+AR++G+ A GE+LVFLD+HCE + + + L
Sbjct: 188 NTRREGLIRARVIGSEHATGEVLVFLDSHCEVNVEWIQPLL 228
>gi|118404432|ref|NP_001072705.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
(Silurana) tropicalis]
gi|115313486|gb|AAI24052.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
gi|134026084|gb|AAI35912.1| polypeptide N-acetylgalactosaminyltransferase 4 [Xenopus (Silurana)
tropicalis]
Length = 582
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 172/403 (42%), Positives = 246/403 (61%), Gaps = 16/403 (3%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + FN LP +S+VI F+NEA S LLRT+HSV+ SP +L+EI+LVDD S + +L
Sbjct: 127 ECKSKTFNYRKLPTTSVVIAFYNEALSTLLRTIHSVLETSPAVLLREIILVDDFSDKVYL 186
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
KS L++Y+ L R+IR+ R GL++AR++GA A G++L FLD HCEC GWLE L+
Sbjct: 187 KSQLEDYIGGLDR-VRLIRTTKREGLVRARIIGATYAIGDVLTFLDCHCECISGWLEPLL 245
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ E+ T VVCPVID I TF Y+++ E G F+W L F+W+ + +RK
Sbjct: 246 QRIGENETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHAVPEKERQ-RRKSR 304
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG++EI PCSHV
Sbjct: 305 IDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEIEPCSHV 364
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P + N AR A VWMD + E ++ NP A K+ + + R
Sbjct: 365 GHVFPKKAPYARPNFLQ-----NTARAAEVWMDGYKELFYNRNPPARKE-NYGDISERKL 418
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPA 542
LR++L+C SF WYL +V+P +P D + G +R + +++C++ N
Sbjct: 419 LRERLQCKSFDWYLKNVFPDLHIPEDRPGWHGAVRSMGISNECLD--YNAPDHNPTGAHL 476
Query: 543 SLLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
SL C H Q F ++ + +C +VPE + I
Sbjct: 477 SLFGC-HGQGGNQFFEYTTMREIRFNSVTELCAEVPEQQTYIG 518
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 67/143 (46%), Positives = 98/143 (68%), Gaps = 1/143 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+ +SD+I ++R + D R +CK+K FN LP +S+VI F+NEA S LLRT+HSV+
Sbjct: 106 LNIYLSDQISLHRHIMDNRMYECKSKTFNYRKLPTTSVVIAFYNEALSTLLRTIHSVLET 165
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+EI+LVDD S + +LKS L++Y+ L R+IR+ R GL++AR++GA A G
Sbjct: 166 SPAVLLREIILVDDFSDKVYLKSQLEDYIGGLDR-VRLIRTTKREGLVRARIIGATYAIG 224
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
++L FLD HCEC + E L +
Sbjct: 225 DVLTFLDCHCECISGWLEPLLQR 247
>gi|332233960|ref|XP_003266176.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Nomascus
leucogenys]
Length = 940
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 222/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWKTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDIIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|432901498|ref|XP_004076865.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Oryzias latipes]
Length = 607
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 172/435 (39%), Positives = 255/435 (58%), Gaps = 28/435 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP ++I+I FHNE WS+LLRTVHSVI+RSP ++ EI+LVDD S +E
Sbjct: 134 HENCRQKLYAEKLPNTTIIIPFHNEGWSSLLRTVHSVINRSPPRLVAEIILVDDFSDKEH 193
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK +L+EY+ + R++R+ R GLI+ RLLGA A+GE++ FLD+HCE + WL L
Sbjct: 194 LKVALEEYMKRFP-KVRILRTKKREGLIRTRLLGAGAAKGEVITFLDSHCEANVNWLPPL 252
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+ ++R +VCP+ID+I F Y ++ + GAF+WE++++ A ++ D
Sbjct: 253 LDRIVQNRKTIVCPMIDVIDHDNFGYDTQAGDAMRGAFDWEMYYKRIPI---PAEMRTDD 309
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
TEPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 310 PTEPFESPVMAGGLFAVDRKWFWELGGYDTGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 369
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY PGG+S L NL RVA VWMDE+AE+ ++ PE + + ++
Sbjct: 370 VGHIYRKYVPYKVPGGIS--LAKNLKRVAEVWMDEYAEYVYQRRPEY-RHLSAGDMSAQK 426
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE-------KPLA 531
ELR L C +F+W++ V W P H+ P++ +G IR V + C+E P+
Sbjct: 427 ELRSHLNCKNFRWFMEEVAWDLPKHYPPVEPPAAAWGEIRSVGSGMCMEIKHFVSGSPIR 486
Query: 532 KGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILA 591
S + G S +H VLT + + VC D + + ++ +
Sbjct: 487 LESCVKGRGDVSW---SHGQVLTFGWREDIRVGDPMHTRKVCFDAVSHHSPVT----LYD 539
Query: 592 CSGF-NRQRWTYDKE 605
C G Q W Y K+
Sbjct: 540 CHGMKGNQLWRYRKD 554
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 73/141 (51%), Positives = 105/141 (74%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ VS+RI +NR+LPD+R C+ K++ E+ LP ++I+I FHNE WS+LLRT
Sbjct: 108 QAYRENGFNIYVSNRISLNRSLPDIRHENCRQKLYAEK-LPNTTIIIPFHNEGWSSLLRT 166
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSVI+RSP ++ EI+LVDD S +E LK +L+EY+ + R++R+ R GLI+ RLL
Sbjct: 167 VHSVINRSPPRLVAEIILVDDFSDKEHLKVALEEYMKRFP-KVRILRTKKREGLIRTRLL 225
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA A+GE++ FLD+HCE +
Sbjct: 226 GAGAAKGEVITFLDSHCEANV 246
>gi|402888383|ref|XP_003907542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Papio
anubis]
Length = 940
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 157/344 (45%), Positives = 219/344 (63%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCTEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++ R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ K P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR-DKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E ++ Q D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLMDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++ R
Sbjct: 508 VWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILHLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|260787295|ref|XP_002588689.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
gi|229273857|gb|EEN44700.1| hypothetical protein BRAFLDRAFT_248153 [Branchiostoma floridae]
Length = 415
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 155/319 (48%), Positives = 214/319 (67%), Gaps = 10/319 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI++ F E+WS LLR+VHSVI+RSP +++EILL+DDAS R LK LD+Y++K
Sbjct: 88 LPTTSIIMCFCEESWSTLLRSVHSVINRSPPHLVEEILLIDDASRRSHLKQKLDQYMSKF 147
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+ R GLI+ARL GA A G +L FLD+H EC +GWLE L+ R+ EDRTRVV
Sbjct: 148 P-QVRVVHLKERAGLIRARLKGAELATGTVLTFLDSHIECNVGWLEPLLDRIREDRTRVV 206
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT----EPFKTPA 372
CP ID +++ TFAY + E G F+WEL F+W + + +A KR+ E ++P
Sbjct: 207 CPSIDRVNEATFAYEVANENVRGGFDWELFFQWVSLPAVEA--KRRTHNVFQHEVIRSPT 264
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLF+IDR +F+ +G YD Q+WGGENLE+SF++W CGGS+EI PCS V H+FRK+
Sbjct: 265 MAGGLFSIDRGFFYELGGYDPGFQIWGGENLELSFKIWMCGGSLEILPCSRVGHVFRKSQ 324
Query: 433 PYSFPGGVS--EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA-VRSRLELRKQL 489
PY++ S EV++ N R+A VW+DE+ + Y+ +P E + K + R LR+ L
Sbjct: 325 PYNYSNATSIMEVVHHNNVRLAEVWLDEYKKIYYALHPGVEVELAKMGDISERKLLRENL 384
Query: 490 KCHSFKWYLTHVWPHHFLP 508
C SF+WYL +V P +P
Sbjct: 385 GCRSFQWYLENVIPELHIP 403
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/166 (43%), Positives = 106/166 (63%), Gaps = 2/166 (1%)
Query: 34 GDSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSS 93
G +V+ + + +D + ++ FN VSD IP R+LPD R P+C + ++ LP +S
Sbjct: 34 GRAVEDAMPQHQADIEAGWKAASFNQFVSDLIPYERSLPDTRPPRCAEQEVADD-LPTTS 92
Query: 94 IVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR 153
I++ F E+WS LLR+VHSVI+RSP +++EILL+DDAS R LK LD+Y++K R
Sbjct: 93 IIMCFCEESWSTLLRSVHSVINRSPPHLVEEILLIDDASRRSHLKQKLDQYMSKFP-QVR 151
Query: 154 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
V+ R GLI+ARL GA A G +L FLD+H EC + + E L +
Sbjct: 152 VVHLKERAGLIRARLKGAELATGTVLTFLDSHIECNVGWLEPLLDR 197
>gi|432097047|gb|ELK27545.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Myotis davidii]
Length = 558
Score = 315 bits (807), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 178/409 (43%), Positives = 240/409 (58%), Gaps = 28/409 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA SALLRTVHSV+ R+P +L EI+LVDD+S + LK LDE+V K
Sbjct: 135 LPVASVVICFYNEALSALLRTVHSVLDRTPARLLHEIILVDDSSDFDDLKGELDEFVQKH 194
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L ++IR+ R GLI+ R++GA A GE+LVFLD+HCE + WL+ L+A + EDR V
Sbjct: 195 LPGKIKLIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIREDRRTV 254
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T AY S + G FNW LHF+W S+ + + T P K+P MAG
Sbjct: 255 VCPVIDIISADTLAYSSSPVVR-GGFNWGLHFKWDLVPLSE-LEGPEGATAPIKSPTMAG 312
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R+YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 313 GLFAMNRSYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 372
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ + R V R+ELRK+L C SFK
Sbjct: 373 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-RTRSYGNVSERVELRKKLGCKSFK 430
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
WYL ++P +Q KP +N+ +L + + +
Sbjct: 431 WYLDSIYP---------------EMQISGPNAKPQQPIFINRGPKRPKILQRGRIWIYNE 475
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDK 604
+ L L CLD+ E + PR+ SG Q+WT+ K
Sbjct: 476 EHELVLSNLL-------CLDMSETRSSDPPRLMKCHGSG-GSQQWTFGK 516
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/147 (49%), Positives = 104/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R +PD R CK K++ + LP +S+VI F+NEA SALLRTVH
Sbjct: 99 YQKHAFNLLISNRLGHHRDVPDTRNAACKDKIYPTD-LPVASVVICFYNEALSALLRTVH 157
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD+S + LK LDE+V K L ++IR+ R GLI+ R++G
Sbjct: 158 SVLDRTPARLLHEIILVDDSSDFDDLKGELDEFVQKHLPGKIKLIRNTKREGLIRGRMIG 217
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A A GE+LVFLD+HCE +++ + L
Sbjct: 218 AAHATGEVLVFLDSHCEVNVMWLQPLL 244
>gi|328699727|ref|XP_001944936.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Acyrthosiphon pisum]
Length = 581
Score = 315 bits (807), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 160/352 (45%), Positives = 220/352 (62%), Gaps = 14/352 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S S+ + ++K+
Sbjct: 143 LPQTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSD----DSTDGQELSKI 198
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR+ R GL+++R+ G+ A +L FLD+H EC + WLE L+ RVAED TRVV
Sbjct: 199 Q-KVKLIRNEKREGLMRSRVRGSEIATAPVLTFLDSHVECNVNWLEPLLDRVAEDPTRVV 257
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+I+ F Y+ + G F+W L F+W ++KD T P +TP +AGG
Sbjct: 258 CPIIDVINMDNFQYIGASSELRGGFDWNLVFKWEYLSKEVRAQRQKDPTLPIRTPMIAGG 317
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD+EM +WGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 318 LFVMDKDYFVKLGTYDKEMNIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTF 377
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ +Y+ P + + RL L+K L C FKW
Sbjct: 378 PGGSGNVFAHNTRRAAEVWMDQYKRYYYNAVP-LSRIVPFGNIADRLALKKNLGCKPFKW 436
Query: 497 YLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
YL +V+P LP D+F G IR Q + C++ NQ A + PC
Sbjct: 437 YLDNVYPELKLPATVDEFVGSIR--QGYMCLDT-----LENQVGKTAGIFPC 481
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 96/155 (61%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L Y N+FN L SD + NR +PD R KC TK + + LP++S++I FHNEA S
Sbjct: 100 LRHGDDAYSRNKFNQLASDSLRSNRPVPDTRNAKCLTKKYRID-LPQTSVIITFHNEARS 158
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV SV++RSP ++KEI+LVDD S S+ + ++K+ ++IR+ R GL+
Sbjct: 159 TLLRTVVSVLNRSPEHLIKEIILVDDFSD----DSTDGQELSKIQ-KVKLIRNEKREGLM 213
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ G+ A +L FLD+H EC + + E L +
Sbjct: 214 RSRVRGSEIATAPVLTFLDSHVECNVNWLEPLLDR 248
>gi|281341921|gb|EFB17505.1| hypothetical protein PANDA_013078 [Ailuropoda melanoleuca]
Length = 936
Score = 315 bits (807), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 221/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 481 AGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 538
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 539 YLKGNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 597
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 598 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVVAKNRIK 657
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 658 ETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 717
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 718 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTEQ 777
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 778 RELRKKLKCKSFKWYLENVFPDLKAPI-VRANGVLINVALGKCI 820
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 105/151 (69%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E WS L
Sbjct: 451 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDEVWSTL 509
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+A
Sbjct: 510 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKGNLDKYMSQFP-KVRILRLKERHGLIRA 568
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 569 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 599
>gi|301776863|ref|XP_002923851.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Ailuropoda melanoleuca]
Length = 937
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 221/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 481 AGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 538
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 539 YLKGNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 597
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 598 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVVAKNRIK 657
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 658 ETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 717
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 718 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTEQ 777
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 778 RELRKKLKCKSFKWYLENVFPDLKAPI-VRANGVLINVALGKCI 820
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 105/151 (69%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E WS L
Sbjct: 451 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDEVWSTL 509
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+A
Sbjct: 510 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKGNLDKYMSQFP-KVRILRLKERHGLIRA 568
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 569 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 599
>gi|391347961|ref|XP_003748222.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 658
Score = 315 bits (806), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 171/397 (43%), Positives = 232/397 (58%), Gaps = 12/397 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
HC + + +P +S+VI+F +E +S LLRT+ SVI RSPR +L+EI+LVDD S E
Sbjct: 190 HCR-AITYPVAEMPTASVVIIFTDEIFSTLLRTIVSVIDRSPRHLLREIILVDDFSQSED 248
Query: 245 LKSSLDEYVAK--LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
LK L+ Y+ + R+IR P R GLI+ARL+GAR A G++L+FLD+HCE T GWLE
Sbjct: 249 LKDRLERYIEHHFRADVVRLIRLPERSGLIRARLVGARAARGDVLIFLDSHCETTPGWLE 308
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYV--RSFELHWGAFNWELHFRWYTYGSSDAIIK 360
L+ + DR VVCPVID+I T YV G FNW F W+ S+
Sbjct: 309 PLLEPIRRDRRAVVCPVIDVIDYRTLQYVAAEGDRFQIGGFNWRGEFTWHNIPSA-WRRN 367
Query: 361 RKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
R EP ++P MAGGLFAI+R YF+ G+YDEEM WGGENLEMSFR+WQCGG I IAP
Sbjct: 368 RVSVAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHIVIAP 427
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
CSHV H+FR PY PGG + N R VWMDE+ ++ ++ PE +K R +
Sbjct: 428 CSHVGHIFRDYQPYKIPGG-KDTNAINTKRAVEVWMDEFKKYIYQARPELKKIRIGD-IS 485
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQAS 539
+R R+ +C FKWYL +V+PH +L +D + FG +R+ T+ C++ G +
Sbjct: 486 ARRAFRELNRCKPFKWYLDNVYPHKYLIEEDSQGFGIVRNPLTNMCLD---TYGKARGKT 542
Query: 540 GPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDV 576
+ C +P ++ L E +C V
Sbjct: 543 SDLGIFECHPIPEEATNQLLSLSRKGELRQEDLCAKV 579
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/153 (47%), Positives = 102/153 (66%), Gaps = 2/153 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A + + FN+ VSDR+P+NR+LPD R C+ + +P +S+VI+F +E +S L
Sbjct: 159 EADREFSKAAFNVYVSDRLPLNRSLPDTRHRHCRAITYPVAEMPTASVVIIFTDEIFSTL 218
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK--LSVPTRVIRSPGRVGLI 164
LRT+ SVI RSPR +L+EI+LVDD S E LK L+ Y+ + R+IR P R GLI
Sbjct: 219 LRTIVSVIDRSPRHLLREIILVDDFSQSEDLKDRLERYIEHHFRADVVRLIRLPERSGLI 278
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ARL+GAR A G++L+FLD+HCE T + E L
Sbjct: 279 RARLVGARAARGDVLIFLDSHCETTPGWLEPLL 311
>gi|355750550|gb|EHH54877.1| hypothetical protein EGM_03977 [Macaca fascicularis]
Length = 940
Score = 315 bits (806), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 221/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCTEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++ R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ K P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++ R
Sbjct: 508 VWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILHLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|195384663|ref|XP_002051034.1| GJ22477 [Drosophila virilis]
gi|194145831|gb|EDW62227.1| GJ22477 [Drosophila virilis]
Length = 598
Score = 315 bits (806), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 173/427 (40%), Positives = 246/427 (57%), Gaps = 31/427 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I+F+NE +S L+RTVHS ++ + LKE++LVDD S L LD Y
Sbjct: 144 LPTASVIIIFYNEPYSVLVRTVHSTLNTCNQKALKEVILVDDGSDNAELGGKLDHYTRTR 203
Query: 257 --SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
S ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+ + RT
Sbjct: 204 FPSGKVTILRLKNRLGLIRARLAGARIASGDVLIFLDAHCEANVGWCEPLLQRIKDSRTS 263
Query: 315 VVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFK- 369
V+ P+ID+I F Y +SF++ G F W HF W + + + ++ ++P +
Sbjct: 264 VLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWVNLSEREKLRQSRECSQPREI 321
Query: 370 ----TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+P MAGGLFA+DR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE PCS V
Sbjct: 322 CPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRVG 381
Query: 426 HLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D V R+
Sbjct: 382 HIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINVFFLNRPDLKFHADIGDVTHRVM 439
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LRK+L+C SF WYL +V+P F+P + K +GRI+ V + C + L S N+
Sbjct: 440 LRKKLRCKSFDWYLKNVYPEKFVPNKNVKAWGRIKAVHANLCADDLL---SNNEKPYNLG 496
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRI-----LACSGFNRQ 598
L PC +Q+F + +E C V ++ SP RI L +N Q
Sbjct: 497 LYPCGKELQKSQLF--SYTKSQVLRNEISCATV---QHSSSPPYRIVMVPCLENDDYNEQ 551
Query: 599 RWTYDKE 605
W Y+ +
Sbjct: 552 -WKYENQ 557
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 3/155 (1%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEE-FLPKSSIVIVFHNEAWSAL 106
++Y+ N +S+++ NRT+ D R P C + +++ LP +S++I+F+NE +S L
Sbjct: 102 GDKIYKKIALNEELSEQLSYNRTVGDHRNPLCLAQKYDDPGTLPTASVIIIFYNEPYSVL 161
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL--SVPTRVIRSPGRVGLI 164
+RTVHS ++ + LKE++LVDD S L LD Y S ++R R+GLI
Sbjct: 162 VRTVHSTLNTCNQKALKEVILVDDGSDNAELGGKLDHYTRTRFPSGKVTILRLKNRLGLI 221
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 222 RARLAGARIASGDVLIFLDAHCEANVGWCEPLLQR 256
>gi|327270185|ref|XP_003219870.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Anolis carolinensis]
Length = 592
Score = 315 bits (806), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 161/338 (47%), Positives = 222/338 (65%), Gaps = 11/338 (3%)
Query: 185 HCECTLV-FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
H +CT ++ LPK+S++I F+NEAWS LLRTVHSV+ SP +L+EI+LVDD S +E
Sbjct: 131 HPQCTEKRYDYYNLPKTSVIIAFYNEAWSTLLRTVHSVLETSPDILLEEIILVDDYSDKE 190
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
LK L+ YVA L R+IR+ R GL++ARLLGA A+G++L FLD HCEC WLE
Sbjct: 191 HLKEKLENYVANLR-KVRLIRANKREGLVRARLLGASIAKGDVLTFLDCHCECHEEWLEP 249
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
L+ R+ E+ + VVCPVID+I TF Y+ + E G F+W L F W+ + +R+
Sbjct: 250 LLERIKEEPSAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPEREQK-QRR 308
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
T+ ++P MAGGLFA+++ YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCS
Sbjct: 309 SKTDVIRSPTMAGGLFAVNKNYFSYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCS 368
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSR 482
HV H+F K +PYS ++ N R A VWMD + E Y+ NP A + R
Sbjct: 369 HVGHVFPKQAPYSRAKALA-----NSVRAAEVWMDSYKELYYHRNPHARMEPYGDVTERR 423
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRH 519
L LR++LKC FKWYL +++P +P D +FG +++
Sbjct: 424 L-LREKLKCKDFKWYLDNIYPELHVPEDRLGYFGMLKN 460
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 92/247 (37%), Positives = 135/247 (54%), Gaps = 42/247 (17%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
S L ++ +++ N+ +SDRI ++R LP+ R P+C K ++ LPK+S++I F+NEA
Sbjct: 98 SELRRQEESVALHQINVYLSDRISLHRRLPERRHPQCTEKRYDYYNLPKTSVIIAFYNEA 157
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRTVHSV+ SP +L+EI+LVDD S +E LK L+ YVA L R+IR+ R G
Sbjct: 158 WSTLLRTVHSVLETSPDILLEEIILVDDYSDKEHLKEKLENYVANLR-KVRLIRANKREG 216
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 222
L++ARLLGA A+G++L FLD HCEC +E W L + I
Sbjct: 217 LVRARLLGASIAKGDVLTFLDCHCEC-------------------HEEW---LEPLLERI 254
Query: 223 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS---------VPTR----------VI 263
P +++ ++ V D +T E+L ++ + + VP R VI
Sbjct: 255 KEEPSAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPEREQKQRRSKTDVI 314
Query: 264 RSPGRVG 270
RSP G
Sbjct: 315 RSPTMAG 321
>gi|355564907|gb|EHH21396.1| hypothetical protein EGK_04452 [Macaca mulatta]
Length = 940
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 221/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCTEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++ R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ K P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++ R
Sbjct: 508 VWSTLLRSVHSVLNRSPPYLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILHLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|195120520|ref|XP_002004772.1| GI19414 [Drosophila mojavensis]
gi|193909840|gb|EDW08707.1| GI19414 [Drosophila mojavensis]
Length = 604
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 175/436 (40%), Positives = 254/436 (58%), Gaps = 30/436 (6%)
Query: 190 LVFNEEF-----LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
L N+++ LP +S+VI+F+NE +S L+RTVHS ++ LKEI+LVDD S
Sbjct: 138 LCLNQKYDDPSTLPTASVVIIFYNEPYSVLVRTVHSTLNTCNEKSLKEIILVDDGSDNAE 197
Query: 245 LKSSLDEYVAKLSVPTRV--IRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
L LD YV P +V +R R+GLI+ARL GAR A G++L+FLDAHCE GW E
Sbjct: 198 LGGKLDYYVRTRFPPGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEANEGWCE 257
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAI 358
L+ R+ E RT V+ P+ID+I F Y +SF++ G F W HF W +
Sbjct: 258 PLLQRIKESRTSVLVPIIDVIDAKDFQYSTNGYKSFQV--GGFQWSGHFDWVNLPEREKQ 315
Query: 359 IKRKDFTEPFK-----TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCG 413
+ ++ ++P + +P MAGGLFA+DR YF+ +G+YDE+M WGGENLEMSFR+WQCG
Sbjct: 316 RQLRECSQPREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCG 375
Query: 414 GSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQ 473
G+IE PCS V H+FR PY FP + N AR+ALVWMDE+ +F P+ +
Sbjct: 376 GTIETIPCSRVGHIFRDFHPYKFPND-RDTHGINTARMALVWMDEYINVFFLNRPDLKFH 434
Query: 474 RDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAK 532
D V R+ LRK+L+C SF+WYL +V+P F+P M+ K +G+++ V ++ C++ L
Sbjct: 435 PDIGDVTHRVVLRKKLRCKSFEWYLKNVYPEKFVPNMNVKAWGKVKAVNSNLCLDDLL-- 492
Query: 533 GSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC 592
+ N+ L C +Q+F L+ +E C V ++ + RV ++ C
Sbjct: 493 -NNNEKPYNLGLYACGKALQKSQLF--SYTNSLVLRNELSCATV-QHSSSPPHRVVMVPC 548
Query: 593 ---SGFNRQRWTYDKE 605
+N Q W Y+ +
Sbjct: 549 LENDDYNEQ-WKYENQ 563
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 64/155 (41%), Positives = 95/155 (61%), Gaps = 3/155 (1%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSAL 106
+++Y+ N +S+++ NRT+ D R P C + +++ LP +S+VI+F+NE +S L
Sbjct: 108 GERIYKKIALNEELSEQLSYNRTVGDHRNPLCLNQKYDDPSTLPTASVVIIFYNEPYSVL 167
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRV--IRSPGRVGLI 164
+RTVHS ++ LKEI+LVDD S L LD YV P +V +R R+GLI
Sbjct: 168 VRTVHSTLNTCNEKSLKEIILVDDGSDNAELGGKLDYYVRTRFPPGKVTILRLKNRLGLI 227
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + E L +
Sbjct: 228 RARLAGARIATGDVLIFLDAHCEANEGWCEPLLQR 262
>gi|397525624|ref|XP_003832760.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Pan
paniscus]
Length = 940
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 172/422 (40%), Positives = 248/422 (58%), Gaps = 32/422 (7%)
Query: 126 LLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI------KARLLGARQAEGEIL 179
LL +D+ T + L+ +++ R ++PG+ G K + R EG
Sbjct: 413 LLPEDSGTHQVLR-------IDVTLSPRDPKAPGQFGRPVVVPHGKEKEAERRWKEGNFN 465
Query: 180 VFLD--------------AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRS 225
V+L A C LV N LP +S+++ F +E WS LLR+VHSV++RS
Sbjct: 466 VYLSDLIPVDRAIEDTRPAGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRS 523
Query: 226 PRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGE 285
P ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+ARL GA+ A G+
Sbjct: 524 PPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGD 582
Query: 286 ILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWEL 345
+L FLD+H EC +GWLE L+ RV R +V CPVI++I+D +Y+ G F W +
Sbjct: 583 VLTFLDSHVECNVGWLEPLLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPM 642
Query: 346 HFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEM 405
+F W T R T+ + P MAGGLF+I ++YFF +G YD + VWGGEN+E+
Sbjct: 643 NFGWRTIPPDVIAKNRIKETDTIRCPVMAGGLFSIHKSYFFELGTYDPGLDVWGGENMEL 702
Query: 406 SFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYF 464
SF+VW CGG IEI PCS V H+FR +PYSFP + + NL RVA VW+DE+ E FY
Sbjct: 703 SFKVWMCGGEIEIIPCSRVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYG 762
Query: 465 KFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHK 524
+ ++ D + + ELRK+LKC SFKWYL +V+P P+ + G + +V K
Sbjct: 763 HGDHLIDQGLDVGNLTQQRELRKKLKCKSFKWYLENVFPDLRAPI-VRASGVLINVALGK 821
Query: 525 CV 526
C+
Sbjct: 822 CI 823
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|431894831|gb|ELK04624.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Pteropus alecto]
Length = 939
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 167/400 (41%), Positives = 239/400 (59%), Gaps = 25/400 (6%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
+++ R ++PG+ G K + R EG V+L A C
Sbjct: 427 MTLSPRDFKAPGQFGRPVVVPHGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 486
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N LP +S+++ F +E WS L+R+VHSV++RSP ++KEILLVDD ST+++LK
Sbjct: 487 KQLVHNN--LPTTSVIMCFVDEVWSTLVRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKD 544
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 545 NLDKYMSQFP-KVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLER 603
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
V R +V CPVI++I+D +Y+ G F W ++F W T R T+
Sbjct: 604 VYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVVAKNRIKETDI 663
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+ P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H+
Sbjct: 664 IRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHI 723
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLELR 486
FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + + ELR
Sbjct: 724 FRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQRELR 783
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
K+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 784 KKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 822
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 448 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAKQLVHNN-LPTTSVIMCFVDE 506
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS L+R+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 507 VWSTLVRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLRERH 565
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 566 GLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLL 601
>gi|354487360|ref|XP_003505841.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Cricetulus griseus]
Length = 633
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 175/362 (48%), Positives = 231/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVF+NEAWS LLRTVHSV+ SP +LKEI+LVDDAS ++L L+EY+ +
Sbjct: 184 LPTTSVVIVFYNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDDYLHEKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQKERKGLITARLLGAAAATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVRSFEL----HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGNNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I R YF HIG+YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISREYFEHIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE---KQRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A KQ+ + R E++K+L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKSFGDLSKRFEIKKRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL V+P ++P ++ G I+ V C L G NQ P L C
Sbjct: 481 QCKNFTWYLNTVYPEVYVPDLNPVISGYIKSVGQPLC----LDVGENNQGGKPLILYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S+VIVF+NEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVVIVFYNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS ++L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDDYLHEKLEEYIKQFSI-VKIVRQKERKGLITARLLGAAAA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|340723540|ref|XP_003400147.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus terrestris]
gi|340723542|ref|XP_003400148.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus terrestris]
Length = 602
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 183/422 (43%), Positives = 257/422 (60%), Gaps = 38/422 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S L+ L++Y+ +
Sbjct: 160 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--M 217
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P ++IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 218 NYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 277
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 278 VCPVIDVIDDTTLEY------HWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEP 330
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF+IDRA+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 331 VWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 390
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELR 486
FRK SPY + GV+ VL N R++ VW+DE+A++Y+ + + K D V R LR
Sbjct: 391 FRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGD---VSERKALR 446
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASL 544
K+L C SFKWYL +V+P F+P + G +R++ + C++ P K +++ PA L
Sbjct: 447 KKLGCKSFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGL 503
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
PC H Q +++ T I DES CLD + V + C G Q+W Y+
Sbjct: 504 YPC-HRQGGNQYWMLS-KTGEIRRDES-CLDYSGTD------VILYPCHGSKGNQQWIYN 554
Query: 604 KE 605
+
Sbjct: 555 HQ 556
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 114/191 (59%), Gaps = 25/191 (13%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R P CK + LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 126 NAFNQYVSDLISVHRTLPDPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVL 185
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S L+ L++Y+ ++ P ++IR+ R GLI+ARLLGA
Sbjct: 186 DRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--MNYPKVQIIRAQKREGLIRARLLGAAA 243
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
A+ +L +LD+HCECT E W L + I+R P +++ +
Sbjct: 244 AKAPVLTYLDSHCECT-------------------EGW---LEPLLDRIARDPTTVVCPV 281
Query: 234 LLVDDASTREF 244
+ V D +T E+
Sbjct: 282 IDVIDDTTLEY 292
>gi|350426661|ref|XP_003494505.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 1 [Bombus impatiens]
Length = 602
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 183/422 (43%), Positives = 257/422 (60%), Gaps = 38/422 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S L+ L++Y+ +
Sbjct: 160 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--M 217
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P ++IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 218 NYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 277
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 278 VCPVIDVIDDTTLEY------HWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEP 330
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF+IDRA+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 331 VWSPTMAGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 390
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELR 486
FRK SPY + GV+ VL N R++ VW+DE+A++Y+ + + K D V R LR
Sbjct: 391 FRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGD---VSERKALR 446
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASL 544
K+L C SFKWYL +V+P F+P + G +R++ + C++ P K +++ PA L
Sbjct: 447 KKLGCKSFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGL 503
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
PC H Q +++ T I DES CLD + V + C G Q+W Y+
Sbjct: 504 YPC-HRQGGNQYWMLS-KTGEIRRDES-CLDYSGTD------VILYPCHGSKGNQQWIYN 554
Query: 604 KE 605
+
Sbjct: 555 HQ 556
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 114/191 (59%), Gaps = 25/191 (13%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R P CK + LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 126 NAFNQYVSDLISVHRTLPDPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVL 185
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S L+ L++Y+ ++ P ++IR+ R GLI+ARLLGA
Sbjct: 186 DRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--MNYPKVQIIRAQKREGLIRARLLGAAA 243
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
A+ +L +LD+HCECT E W L + I+R P +++ +
Sbjct: 244 AKAPVLTYLDSHCECT-------------------EGW---LEPLLDRIARDPTTVVCPV 281
Query: 234 LLVDDASTREF 244
+ V D +T E+
Sbjct: 282 IDVIDDTTLEY 292
>gi|334333371|ref|XP_001365881.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Monodelphis domestica]
Length = 757
Score = 314 bits (805), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 169/360 (46%), Positives = 233/360 (64%), Gaps = 14/360 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP++S+VI F+NEAWS LLRTV+SV+ SP +LKE++LVDD S RE LK L+
Sbjct: 153 YDYENLPQTSVVIAFYNEAWSTLLRTVYSVLETSPDILLKEVILVDDYSDREHLKEQLEN 212
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+++ L R+IR+ R GL++ARLLGA A GEIL FLD HCEC GWLE L+ R+ E+
Sbjct: 213 HLSDLPK-VRLIRANKREGLVRARLLGASIATGEILTFLDCHCECHDGWLEPLLERIHEE 271
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ + E G F+W L F W+ + +R+ + ++
Sbjct: 272 ESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPQREQK-RRRSQIDVIRS 330
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+++ YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K
Sbjct: 331 PTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPK 390
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ N A K+ V RL+LR +LK
Sbjct: 391 QAPYSRNKALA-----NSVRAAEVWMDEYKEIYYHRNMHARKE-PYGDVTERLQLRAKLK 444
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH-VQTHKCVEKPLAKGSMNQASGPASLL-PC 547
C +FKW+L +V+P +P D FFG +++ T C + S N+ +G +L PC
Sbjct: 445 CKNFKWFLDNVYPELHVPEDRPGFFGMLKNRGMTDYCFD--YNPPSENEITGSQVILYPC 502
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/136 (52%), Positives = 97/136 (71%), Gaps = 1/136 (0%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
Q ++ N +SDRI ++R LP+ P CK K ++ E LP++S+VI F+NEAWS LLRTV+S
Sbjct: 122 QRHQINTYLSDRISLHRRLPERWHPMCKEKKYDYENLPQTSVVIAFYNEAWSTLLRTVYS 181
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +LKE++LVDD S RE LK L+ +++ L R+IR+ R GL++ARLLGA
Sbjct: 182 VLETSPDILLKEVILVDDYSDREHLKEQLENHLSDLPK-VRLIRANKREGLVRARLLGAS 240
Query: 173 QAEGEILVFLDAHCEC 188
A GEIL FLD HCEC
Sbjct: 241 IATGEILTFLDCHCEC 256
>gi|296204771|ref|XP_002749473.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Callithrix jacchus]
Length = 940
Score = 314 bits (804), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 157/344 (45%), Positives = 221/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCTEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDDLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+L+C SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLRCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDDLDKYMSQFP-KVRILRLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|7657112|ref|NP_056552.1| polypeptide N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|51315802|sp|O08832.1|GALT4_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 4;
AltName: Full=Polypeptide GalNAc transferase 4;
Short=GalNAc-T4; Short=pp-GaNTase 4; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 4;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 4
gi|2121220|gb|AAB58301.1| polypeptide GalNAc transferase-T4 [Mus musculus]
gi|26329157|dbj|BAC28317.1| unnamed protein product [Mus musculus]
gi|34786032|gb|AAH57882.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
gi|74140684|dbj|BAE31844.1| unnamed protein product [Mus musculus]
gi|74195122|dbj|BAE28303.1| unnamed protein product [Mus musculus]
gi|148689697|gb|EDL21644.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 [Mus musculus]
Length = 578
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/320 (48%), Positives = 214/320 (66%), Gaps = 9/320 (2%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R++ D
Sbjct: 189 YISNLER-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERISRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T +VCPVID I TF Y+++ E G F+W L F+W++ + +R +P ++
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRTSRIDPIRS 306
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 307 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 366
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++LK
Sbjct: 367 RAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDLSERKL-LRERLK 420
Query: 491 CHSFKWYLTHVWPHHFLPMD 510
C SF WYL +V+ + +P D
Sbjct: 421 CKSFDWYLKNVFSNLHVPED 440
Score = 141 bits (356), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 104/157 (66%), Gaps = 1/157 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK K F+ LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKAKKFHYRSLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQLETYISNLER-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIS 245
>gi|242024227|ref|XP_002432530.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212517982|gb|EEB19792.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 603
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 176/405 (43%), Positives = 239/405 (59%), Gaps = 19/405 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRT+ SV++RSP ++KEI+LVDD S S E +AK+
Sbjct: 167 LPPTSVIITFHNEARSTLLRTIVSVMNRSPEHLIKEIILVDDFSD----NPSDGEELAKI 222
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++R+ R GL+++R+ GA A IL FLD+H EC + WLE L+ RV ED+TRVV
Sbjct: 223 Q-KIKLVRNEKREGLMRSRVRGADLATAPILTFLDSHVECNVNWLEPLLERVVEDKTRVV 281
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+IS TF Y+ + G F+W L F+W + +++D T KTP +AGG
Sbjct: 282 CPIIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLTLDQRLRRQQDPTRAIKTPMIAGG 341
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF IDR YF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 342 LFVIDRLYFDTLGKYDMQMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTF 401
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ ++Y+ P A K + RLEL+++L C SFKW
Sbjct: 402 PGGSGNVFARNTRRAAEVWMDDYKKYYYAAVPLA-KSIPFGNIDDRLELKRKLHCKSFKW 460
Query: 497 YLTHVWPHHFLPMDDK-FFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
YL +V+P +P FG IR Q C++ S+ Q G L H Q
Sbjct: 461 YLENVYPELSIPHSTSPAFGSIRQRQL--CLDT--LGHSIEQTVG----LYVCHDTGGNQ 512
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
+ M+ D +CL +P Y +R+ C + Q+W
Sbjct: 513 EWGME--DDSYIKHHDLCLTIPNYVPGALVLMRL--CEDADNQKW 553
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 95/155 (61%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y N+FN SD +P NR +PD R CK K + + LP +S++I FHNEA S
Sbjct: 124 LKEGEDPYFKNKFNQAASDSLPSNREIPDTRGQACKRKKWRTD-LPPTSVIITFHNEARS 182
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP ++KEI+LVDD S S E +AK+ +++R+ R GL+
Sbjct: 183 TLLRTIVSVMNRSPEHLIKEIILVDDFSD----NPSDGEELAKIQ-KIKLVRNEKREGLM 237
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A IL FLD+H EC + + E L +
Sbjct: 238 RSRVRGADLATAPILTFLDSHVECNVNWLEPLLER 272
>gi|403258969|ref|XP_003922012.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Saimiri boliviensis boliviensis]
Length = 940
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 220/344 (63%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 484 AGCTEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILRLRERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDVIRCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR-DKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E ++ Q D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLINQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+L+C SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLRCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 508 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLRERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|256071383|ref|XP_002572020.1| n-acetylgalactosaminyltransferase [Schistosoma mansoni]
Length = 697
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 175/430 (40%), Positives = 253/430 (58%), Gaps = 17/430 (3%)
Query: 181 FLDAHCECTLVFNE--EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDD 238
+L + E T N+ LP +SI+I FHNEAWS LLR+VHSVI RSP ++L+EI+LVDD
Sbjct: 231 YLPDYREGTCKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNLLQEIILVDD 290
Query: 239 ASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTL 298
S R LK +L+EY+ L++ +++R+ R GLI+AR++GA + G++LVFLD+H ECT
Sbjct: 291 FSDRPHLKEALEEYMGMLNI-VKIVRTKQREGLIRARMIGAELSTGKVLVFLDSHIECTT 349
Query: 299 GWLENLVARVAEDRTRVVCPVIDIISDVTFA--YVRSFELHWGAFNWELHFRWYTYGSSD 356
GWLE L+ R+A + + VV PVI I+D T ++++ + G F+W L FRW+ D
Sbjct: 350 GWLEPLLDRIAYNSSIVVVPVISTINDKTLKMNFLKADNVQVGGFDWSLTFRWHEQTERD 409
Query: 357 AIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
++ P ++P MAGGLFAI R YF H+G YD M++WGGENLE+SF+VW CGG +
Sbjct: 410 RNRSGAPYS-PVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKVWMCGGIL 468
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDK 476
E CS V H+FR SPY + V + L NL R+A VW+D++ FY+ K D
Sbjct: 469 ETVVCSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRFYYAR--IGFKTIDF 526
Query: 477 QAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMN 536
V R LR++LKC SF WYLT+++P F+P G I C++ P + +
Sbjct: 527 GDVSERKALREKLKCRSFDWYLTNIYPELFIPSKALASGDIESAAGPHCLDSPTPRNG-D 585
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF- 595
+ + PC H Q +++ P + I DE C D + I + C G
Sbjct: 586 KKRTVIKIWPC-HKQGGNQFWLLS-PNNEIRRDE-YCFDSGMKNHTIG----LYRCHGAK 638
Query: 596 NRQRWTYDKE 605
Q++TY ++
Sbjct: 639 GNQKFTYGED 648
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 82/171 (47%), Positives = 116/171 (67%), Gaps = 11/171 (6%)
Query: 44 NLSDAQQL-----YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVF 98
+LS A+Q+ ++ N +N SDRI V R LPD R+ CK + LP +SI+I F
Sbjct: 200 DLSPAEQIIFDKGWEDNAYNQYASDRISVRRYLPDYREGTCKVNQYGSN-LPSASIIICF 258
Query: 99 HNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSP 158
HNEAWS LLR+VHSVI RSP ++L+EI+LVDD S R LK +L+EY+ L++ +++R+
Sbjct: 259 HNEAWSVLLRSVHSVIDRSPPNLLQEIILVDDFSDRPHLKEALEEYMGMLNI-VKIVRTK 317
Query: 159 GRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK----SSIVIV 205
R GLI+AR++GA + G++LVFLD+H ECT + E L + SSIV+V
Sbjct: 318 QREGLIRARMIGAELSTGKVLVFLDSHIECTTGWLEPLLDRIAYNSSIVVV 368
>gi|334310655|ref|XP_001378662.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Monodelphis domestica]
Length = 563
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 169/418 (40%), Positives = 246/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V LP +SIVI FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 118 HYRCTSVHYASDLPTTSIVITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 173
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 174 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQP 231
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 232 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 290
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+A+F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 291 PTQPIRTPVIAGGIFVIDKAWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 350
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + ++ R
Sbjct: 351 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSFGSIADRE 409
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P + G I+ Q C+E + N S
Sbjct: 410 EQRKKMNCKSFQWYLENVYPELKIPEKEMIPGIIK--QGTICLESQGQDTAGNNLVVMGS 467
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
++ P +TQ +V +D + + CL + + +V + AC+ + RQ+W
Sbjct: 468 CKGTSNNPSMTQEWVF---SDPLIRQQDKCLAITSFST--GSQVTLEACNQKDGRQKW 520
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 93/154 (60%), Gaps = 12/154 (7%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C T V LP +SIVI FHNEA S
Sbjct: 87 LKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRC-TSVHYASDLPTTSIVITFHNEARS 145
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGL 163
LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R GL
Sbjct: 146 TLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREGL 199
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
I++R+ GA A +IL FLD+HCE N E+L
Sbjct: 200 IRSRVRGAEVATADILTFLDSHCE----VNSEWL 229
>gi|363730187|ref|XP_418741.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 2 [Gallus gallus]
Length = 638
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 164/394 (41%), Positives = 246/394 (62%), Gaps = 16/394 (4%)
Query: 190 LVFNEEF---LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
L +E+ LP +S++I FH+EAWS LLRTVHS+++ +P++ LK+I+LVDD S + LK
Sbjct: 181 LCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSILNTAPKASLKDIILVDDLSQQGPLK 240
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
S+L EY++KL ++IRS R+G+I+ R+LGA +A G++LVF+D+HCEC GWLE L+A
Sbjct: 241 SALSEYISKLD-GVKLIRSNRRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPLLA 299
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R++ +R VV P+ID+I TF Y S LH G F+W+L F W + + R+ T
Sbjct: 300 RLSSNRNSVVSPIIDVIDWKTFQYYHSVSLHRGVFDWKLDFHWEPVPEHEEKV-RQSPTS 358
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++PA+AG + A+DR YF +IGAYD +M +WG ENLE+S R W CGGS+EI PCS V H
Sbjct: 359 PIRSPAVAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSIRTWLCGGSVEIIPCSRVGH 418
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA--EKQRDKQAVRSRLE 484
++R P++F E + N R+A W+D + E ++K + A + +K RL+
Sbjct: 419 VYRHHIPHAF--SYEEAIVRNKIRIAETWLDSFKENFYKNDTVAFLISKAEKPDCSERLQ 476
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
L+K+L C SF+W++T+V+P P D + G++ + C + + M A G
Sbjct: 477 LQKRLGCRSFQWFITNVYPELSRPEDAPRLSGKLYNTGAGFCAD---YRPGMALADGSIK 533
Query: 544 LLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDV 576
L PCT+ LTQ F ++ + + CLDV
Sbjct: 534 LSPCTN--SLTQHFEYNSKKEIRVGSALLFCLDV 565
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/145 (48%), Positives = 106/145 (73%), Gaps = 2/145 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN +S+RIP+ R LP+VR P C + ++ LP +S++I FH+EAWS LLRTVHS+++
Sbjct: 160 FNEALSERIPLRRELPEVRHPLCLQQEYDSS-LPTASVIICFHDEAWSTLLRTVHSILNT 218
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P++ LK+I+LVDD S + LKS+L EY++KL ++IRS R+G+I+ R+LGA +A G
Sbjct: 219 APKASLKDIILVDDLSQQGPLKSALSEYISKLD-GVKLIRSNRRLGVIRGRMLGAARATG 277
Query: 177 EILVFLDAHCECTLVFNEEFLPKSS 201
++LVF+D+HCEC + E L + S
Sbjct: 278 DVLVFMDSHCECQKGWLEPLLARLS 302
>gi|350645519|emb|CCD59759.1| n-acetylgalactosaminyltransferase, putative [Schistosoma mansoni]
Length = 654
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 175/427 (40%), Positives = 251/427 (58%), Gaps = 17/427 (3%)
Query: 181 FLDAHCECTLVFNE--EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDD 238
+L + E T N+ LP +SI+I FHNEAWS LLR+VHSVI RSP ++L+EI+LVDD
Sbjct: 231 YLPDYREGTCKVNQYGSNLPSASIIICFHNEAWSVLLRSVHSVIDRSPPNLLQEIILVDD 290
Query: 239 ASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTL 298
S R LK +L+EY+ L++ +++R+ R GLI+AR++GA + G++LVFLD+H ECT
Sbjct: 291 FSDRPHLKEALEEYMGMLNI-VKIVRTKQREGLIRARMIGAELSTGKVLVFLDSHIECTT 349
Query: 299 GWLENLVARVAEDRTRVVCPVIDIISDVTFA--YVRSFELHWGAFNWELHFRWYTYGSSD 356
GWLE L+ R+A + + VV PVI I+D T ++++ + G F+W L FRW+ D
Sbjct: 350 GWLEPLLDRIAYNSSIVVVPVISTINDKTLKMNFLKADNVQVGGFDWSLTFRWHEQTERD 409
Query: 357 AIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
++ P ++P MAGGLFAI R YF H+G YD M++WGGENLE+SF+VW CGG +
Sbjct: 410 RNRSGAPYS-PVRSPTMAGGLFAISREYFSHLGKYDSGMEIWGGENLELSFKVWMCGGIL 468
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDK 476
E CS V H+FR SPY + V + L NL R+A VW+D++ FY+ K D
Sbjct: 469 ETVVCSLVGHIFRGRSPYKWNVNVKDPLKRNLLRLADVWLDDYKRFYYAR--IGFKTIDF 526
Query: 477 QAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMN 536
V R LR++LKC SF WYLT+++P F+P G I C++ P + +
Sbjct: 527 GDVSERKALREKLKCRSFDWYLTNIYPELFIPSKALASGDIESAAGPHCLDSPTPRNG-D 585
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF- 595
+ + PC H Q +++ P + I DE C D + I + C G
Sbjct: 586 KKRTVIKIWPC-HKQGGNQFWLLS-PNNEIRRDE-YCFDSGMKNHTIG----LYRCHGAK 638
Query: 596 NRQRWTY 602
Q++TY
Sbjct: 639 GNQKFTY 645
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 82/171 (47%), Positives = 116/171 (67%), Gaps = 11/171 (6%)
Query: 44 NLSDAQQL-----YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVF 98
+LS A+Q+ ++ N +N SDRI V R LPD R+ CK + LP +SI+I F
Sbjct: 200 DLSPAEQIIFDKGWEDNAYNQYASDRISVRRYLPDYREGTCKVNQYGSN-LPSASIIICF 258
Query: 99 HNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSP 158
HNEAWS LLR+VHSVI RSP ++L+EI+LVDD S R LK +L+EY+ L++ +++R+
Sbjct: 259 HNEAWSVLLRSVHSVIDRSPPNLLQEIILVDDFSDRPHLKEALEEYMGMLNI-VKIVRTK 317
Query: 159 GRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK----SSIVIV 205
R GLI+AR++GA + G++LVFLD+H ECT + E L + SSIV+V
Sbjct: 318 QREGLIRARMIGAELSTGKVLVFLDSHIECTTGWLEPLLDRIAYNSSIVVV 368
>gi|147907290|ref|NP_001085038.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Xenopus
laevis]
gi|47506925|gb|AAH71009.1| MGC81150 protein [Xenopus laevis]
Length = 582
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/326 (47%), Positives = 217/326 (66%), Gaps = 10/326 (3%)
Query: 187 EC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC + F+ LP +S++I F+NEA S LLRT+HSV+ SP +L+EI+LVDD S + +L
Sbjct: 127 ECKSKTFSYRKLPTTSVIIAFYNEALSTLLRTIHSVLESSPAVLLREIILVDDFSDKVYL 186
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
KS L++Y+ L R+IR+ R GL++AR++GA A G++L FLD HCEC GWLE L+
Sbjct: 187 KSQLEDYIGGLDR-VRLIRTTKREGLVRARIIGATYAIGDVLTFLDCHCECVTGWLEPLL 245
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ E+ T VVCPVID I TF Y+++ E G F+W L F+W+ + +RK
Sbjct: 246 ERIGENETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHAVPEKERQ-RRKSR 304
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG++EI PCSHV
Sbjct: 305 IDPIRSPTMAGGLFAVSKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEIEPCSHV 364
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P + N AR A VWMD + E ++ NP A+K+ + + R
Sbjct: 365 GHVFPKKAPYARPNFLQ-----NTARAAEVWMDGYKELFYNRNPPAQKE-NYGDISERKL 418
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD 510
LR++L+C SF WYL V+P +P D
Sbjct: 419 LRERLQCKSFDWYLKKVFPELHIPED 444
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 98/143 (68%), Gaps = 1/143 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+ +SD+I ++R + D R +CK+K F+ LP +S++I F+NEA S LLRT+HSV+
Sbjct: 106 LNIYLSDQISLHRHIMDNRMYECKSKTFSYRKLPTTSVIIAFYNEALSTLLRTIHSVLES 165
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+EI+LVDD S + +LKS L++Y+ L R+IR+ R GL++AR++GA A G
Sbjct: 166 SPAVLLREIILVDDFSDKVYLKSQLEDYIGGLDR-VRLIRTTKREGLVRARIIGATYAIG 224
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
++L FLD HCEC + E L +
Sbjct: 225 DVLTFLDCHCECVTGWLEPLLER 247
>gi|301611308|ref|XP_002935181.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Xenopus (Silurana) tropicalis]
Length = 532
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/323 (47%), Positives = 215/323 (66%), Gaps = 9/323 (2%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T+ + LPK+S++I F+NEAWS LLRTVHSV+ SP +L+EI+LVDD S RE LK
Sbjct: 110 TVKYKYRRLPKTSVIIAFYNEAWSTLLRTVHSVLETSPDLLLEEIILVDDYSDREHLKEP 169
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L++Y++ R+IR+ R GL++ARLLGA A+ E+L FLD HCEC GWLE L+ R+
Sbjct: 170 LEKYISSWRK-VRLIRANKREGLVRARLLGASIAKAEVLTFLDCHCECHEGWLEPLLERI 228
Query: 309 AEDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
E + V+CPVID+I TF Y+ + E G F+W + F W+T ++ KR+ +
Sbjct: 229 REKESAVICPVIDVIDWNTFEYLGNAGEPQIGGFDWRMVFTWHTVPETEQK-KRRSPIDV 287
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF++ + YF H+G+YD M+VWGGENLE SFR+WQCGGS+E+ PCSHV H+
Sbjct: 288 ISSPTMAGGLFSVSKKYFEHLGSYDTGMEVWGGENLEFSFRIWQCGGSLEVHPCSHVGHV 347
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
F + +PYS +S N R A VW+DE+ E Y+ NP A K+ V RL+L++
Sbjct: 348 FPRQAPYSRSKALS-----NSVRAAEVWLDEYKEIYYHRNPHARKE-PYGDVSERLQLKE 401
Query: 488 QLKCHSFKWYLTHVWPHHFLPMD 510
+L+C FKW+L +++P +P D
Sbjct: 402 RLQCKHFKWFLDNIYPELHVPED 424
Score = 122 bits (306), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 65/132 (49%), Positives = 86/132 (65%), Gaps = 8/132 (6%)
Query: 64 RIPVNRTLPDVRKPK-------CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
R P + D R P+ CKT + LPK+S++I F+NEAWS LLRTVHSV+
Sbjct: 86 RPPPSSHPKDQRPPRPRFDPERCKTVKYKYRRLPKTSVIIAFYNEAWSTLLRTVHSVLET 145
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+EI+LVDD S RE LK L++Y++ R+IR+ R GL++ARLLGA A+
Sbjct: 146 SPDLLLEEIILVDDYSDREHLKEPLEKYISSWRK-VRLIRANKREGLVRARLLGASIAKA 204
Query: 177 EILVFLDAHCEC 188
E+L FLD HCEC
Sbjct: 205 EVLTFLDCHCEC 216
>gi|194755004|ref|XP_001959782.1| GF13042 [Drosophila ananassae]
gi|190621080|gb|EDV36604.1| GF13042 [Drosophila ananassae]
Length = 599
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 175/433 (40%), Positives = 250/433 (57%), Gaps = 33/433 (7%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ + LP +S+VI+F NE +S LLRTVHS ++ LKEI+LVDD S L LD
Sbjct: 141 FDVDTLPTASVVIIFFNEPYSVLLRTVHSTLTTCNEKALKEIILVDDGSDNPELGGKLDY 200
Query: 252 YVAKLSVPT---RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
Y+ + +P ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+
Sbjct: 201 YI-RTRIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRI 259
Query: 309 AEDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
E RT V+ P+ID+I F Y +SF++ G F W HF W + +R++
Sbjct: 260 KESRTSVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRREC 317
Query: 365 TE-----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+ P +P MAGGLFA+DR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE
Sbjct: 318 KQQREICPAYSPTMAGGLFAMDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETI 377
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS V H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D
Sbjct: 378 PCSRVGHIFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGD 435
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQ 537
V R+ LRK+L+C +F+WYL +++P F+P + +G+++ V + C++ L N+
Sbjct: 436 VTHRVMLRKKLRCKNFEWYLKNIYPEKFVPTHNVNAWGKVQAVSGNLCLDDLLQN---NE 492
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP--RVRILAC--- 592
L PC +Q+F + +E C V E SP RV ++ C
Sbjct: 493 KPYNVGLYPCGKTLQKSQLF--SFTKSQVLRNELSCATVQHSE---SPPYRVVMVPCLEN 547
Query: 593 SGFNRQRWTYDKE 605
FN Q W Y+++
Sbjct: 548 DEFNEQ-WKYERQ 559
Score = 118 bits (295), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 95/155 (61%), Gaps = 4/155 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
+ +Y+ N +S+++ NR++ D R P C + F+ + LP +S+VI+F NE +S LL
Sbjct: 105 GEAIYKKIALNEELSEQLLYNRSVGDHRNPLCAAERFDVDTLPTASVVIIFFNEPYSVLL 164
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT---RVIRSPGRVGLI 164
RTVHS ++ LKEI+LVDD S L LD Y+ + +P ++R R+GLI
Sbjct: 165 RTVHSTLTTCNEKALKEIILVDDGSDNPELGGKLDYYI-RTRIPAGKVTILRLKNRLGLI 223
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 224 RARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQR 258
>gi|224051278|ref|XP_002200509.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Taeniopygia guttata]
Length = 570
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 167/418 (39%), Positives = 246/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 125 HYRCTSVRYDTDLPATSLIITFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFS---- 180
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 181 --SDPEDCQLLTKIPKVKCLRNTHREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQP 238
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 239 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 297
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 298 PTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 357
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + +V R+
Sbjct: 358 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSFGSVADRV 416
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
ELR +L C SF+WYL +V+P +P + G IR Q C+E + N +G +
Sbjct: 417 ELRHKLNCKSFQWYLENVYPELKIPEKELIPGIIR--QGENCLESQAQDITGNVLAGMGN 474
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
+ P +TQ ++ P+ + CL + + ++ + AC+ + RQ+W
Sbjct: 475 CKGTVNNPPVTQEWIFSDPS---IRQQDKCLSIASFST--GSQITLEACNQKDGRQKW 527
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 93/154 (60%), Gaps = 12/154 (7%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C T V + LP +S++I FHNEA S
Sbjct: 94 LKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRC-TSVRYDTDLPATSLIITFHNEARS 152
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGL 163
LLRTV SV++R+P S+++EI+LVDD S S ++ +P + +R+ R GL
Sbjct: 153 TLLRTVKSVLNRTPPSLIQEIILVDDFS------SDPEDCQLLTKIPKVKCLRNTHREGL 206
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
I++R+ GA A +IL FLD+HCE N E+L
Sbjct: 207 IRSRVRGAEVATADILTFLDSHCE----VNSEWL 236
>gi|354482531|ref|XP_003503451.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Cricetulus griseus]
Length = 929
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/363 (44%), Positives = 230/363 (63%), Gaps = 7/363 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N+ LP +SI++ F +E WSALLR+VHS+++RSP ++KEILLVDD ST++
Sbjct: 473 AECAEQLVHND--LPTTSIIMCFVDEVWSALLRSVHSILNRSPPHLIKEILLVDDFSTKD 530
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 531 YLKDNLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 589
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK- 362
L+ RV +R +V CPVI++I+D +Y+ G F W ++F W T D + K
Sbjct: 590 LLERVYLNRKKVACPVIEVINDKDMSYMTVDNFQRGVFTWPMNFGWRTI-PPDVVAKSGI 648
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
T+ + P M GLF+ID++YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 649 KETDIIRCPVMGCGLFSIDKSYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCS 708
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRS 481
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D +
Sbjct: 709 RVGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQ 768
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGP 541
+ ELRK+LKC SFKWYL +V+P P+ + G + +V KCV ++ G
Sbjct: 769 QRELRKRLKCKSFKWYLENVFPDLKAPV-VRASGVLVNVALGKCVSIENTTATLEDCDGS 827
Query: 542 ASL 544
+ L
Sbjct: 828 SQL 830
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 108/151 (71%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R +C ++ + + LP +SI++ F +E WSAL
Sbjct: 443 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPAECAEQLVHND-LPTTSIIMCFVDEVWSAL 501
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHS+++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+A
Sbjct: 502 LRSVHSILNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRA 560
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 561 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 591
>gi|326437922|gb|EGD83492.1| hypothetical protein PTSG_04099 [Salpingoeca sp. ATCC 50818]
Length = 699
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 180/418 (43%), Positives = 244/418 (58%), Gaps = 39/418 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD-EYVAK 255
LP+++++I F NEAWS LLRTV SV+ R+P +LKEILLVDDAS +E L L+ E
Sbjct: 269 LPQATVIICFVNEAWSTLLRTVWSVLDRTPPHLLKEILLVDDASDQEHLLDKLEVEVRDN 328
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +++RSP R+GLI+AR+LGA A + +VFLD+HCE LGWLE L+A +A+D+TRV
Sbjct: 329 LPDKVKLVRSPKRLGLIRARVLGAEHATADYMVFLDSHCEANLGWLEPLLAWMAKDKTRV 388
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRW----YTYGSSDAIIKRKDFTEPFKTP 371
VCP ID IS T YV G F+W L F W +G + A +P K+P
Sbjct: 389 VCPTIDRISAQTMDYVGGGASSRGTFHWTLDFTWEYAVRQHGETPA--------DPIKSP 440
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLF I+R YF+ +G YD M WGGENLEMSFR+WQCGGS+ I PCS V H+FR
Sbjct: 441 TMAGGLFGINRDYFYELGTYDMGMDGWGGENLEMSFRIWQCGGSLHIIPCSRVGHIFRDW 500
Query: 432 SPYSFPGG-VSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY+ P V+E N R+A VWMDE+ + ++ P A + D V R LR++L
Sbjct: 501 HPYAIPNSTVNETFLKNSIRLAEVWMDEYKDIFYDIKPSA-RSVDFGDVSERKALREKLG 559
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFF--GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C SFKWYL +V P +P D G++R+ + C++K G SL
Sbjct: 560 CKSFKWYLDNVVPGKLIPNSDVVLHKGQVRN-SLNICMDK-----------GAGSLAYPC 607
Query: 549 HLPVL---TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYD 603
H P + +Q F + + ++ +CL ++N RV + C G N ++W YD
Sbjct: 608 HTPGVHSTSQAFWLTVYKEVRHV-WDLCL--TSHDN---KRVMLSTC-GPNSRKWEYD 658
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/224 (38%), Positives = 120/224 (53%), Gaps = 23/224 (10%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN S ++ ++R +PD R P C+ + + LP+++++I F NEAWS LLRTV SV+
Sbjct: 236 NAFNEYRSSKLSLHRDIPDSRNPLCRQQEHPRD-LPQATVIICFVNEAWSTLLRTVWSVL 294
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLD-EYVAKLSVPTRVIRSPGRVGLIKARLLGARQ 173
R+P +LKEILLVDDAS +E L L+ E L +++RSP R+GLI+AR+LGA
Sbjct: 295 DRTPPHLLKEILLVDDASDQEHLLDKLEVEVRDNLPDKVKLVRSPKRLGLIRARVLGAEH 354
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSA--LLRTVHSVISRSPRSMLK 231
A + +VFLD+HCE L + E L AW A R V I R +
Sbjct: 355 ATADYMVFLDSHCEANLGWLEPLL------------AWMAKDKTRVVCPTIDRISAQTMD 402
Query: 232 EILLVDDASTREFLKSSLD---EYVAKL--SVPTRVIRSPGRVG 270
+ AS+R +LD EY + P I+SP G
Sbjct: 403 YV--GGGASSRGTFHWTLDFTWEYAVRQHGETPADPIKSPTMAG 444
>gi|427789289|gb|JAA60096.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 526
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 166/364 (45%), Positives = 232/364 (63%), Gaps = 16/364 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST-REFLKSSLD 250
++ E LP +S+VI+F +E +SALLRTV+SVI+R+P +L+EI+LVDD S E L+
Sbjct: 154 YDVENLPTASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQIDEMANGRLE 213
Query: 251 EYVAKLSVP--TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
++ + P ++I P R GLI+ARL GAR A G++LVFLD+HCE T WLE +V +
Sbjct: 214 RFIRRHFRPGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEATDHWLEPMVELI 273
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHW--GAFNWELHFRWYTYGSSDAIIK-RKDFT 365
+DRT VVCP+ID+I D T Y+ + + G FNW+ F W + +A K RK
Sbjct: 274 KKDRTTVVCPIIDVIDDKTLQYMGTSSDFYQIGGFNWKGEFIWIN--TPEAWRKARKSKA 331
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+P ++P MAGGLFAIDR YF+ G+YD EM+ WGGENLEMSFR+W CGGS+ IAPCSHV
Sbjct: 332 DPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGGSLVIAPCSHVG 391
Query: 426 HLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FR PY FP ++ +G N AR+A VWMD + ++++ PE K + R
Sbjct: 392 HIFRDYHPYKFPS--NKDTHGINTARLAEVWMDNYKYYFYQNRPELRKISFGD-ISERKA 448
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LR +L+C SFKWYL +V+P+ F+P + F FG R+ T C++ + + P
Sbjct: 449 LRNKLQCKSFKWYLDNVYPNKFVPSEKVFAFGNARNPNTGMCLD---SMSHNYDNTEPLG 505
Query: 544 LLPC 547
+ PC
Sbjct: 506 IYPC 509
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 93/136 (68%), Gaps = 3/136 (2%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN V DR+P+NRTL D R C+ ++ E LP +S+VI+F +E +SALLRTV+SVI+R
Sbjct: 127 FNAYVCDRLPLNRTLGDRRHRSCRNAEYDVENLPTASVVIIFTDELFSALLRTVYSVINR 186
Query: 117 SPRSMLKEILLVDDAST-REFLKSSLDEYVAKLSVP--TRVIRSPGRVGLIKARLLGARQ 173
+P +L+EI+LVDD S E L+ ++ + P ++I P R GLI+ARL GAR
Sbjct: 187 TPHRLLREIILVDDYSQIDEMANGRLERFIRRHFRPGFVKLITLPKREGLIRARLTGARA 246
Query: 174 AEGEILVFLDAHCECT 189
A G++LVFLD+HCE T
Sbjct: 247 ASGDVLVFLDSHCEAT 262
>gi|270008661|gb|EFA05109.1| hypothetical protein TcasGA2_TC015209 [Tribolium castaneum]
Length = 565
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 172/422 (40%), Positives = 247/422 (58%), Gaps = 24/422 (5%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T +N + K+S+V++F+NE S +LRTV SVI ++P+ +L+EI+LVDDAST E LK
Sbjct: 113 TFTYNPKL--KASVVVIFYNELLSVILRTVWSVILQTPKELLEEIILVDDASTEESLKGL 170
Query: 249 LDEYVAKL--SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD Y+ S R+I R+GLI+ARL GAR A G++L+FLDAHCE T W+E L++
Sbjct: 171 LDYYIETRLSSTKLRLIHLKTRMGLIRARLQGARIATGDVLIFLDAHCEATTDWMEPLLS 230
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHW--GAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ ++ T V+ P+ID+I T AY + + + G F+W HF W + + K
Sbjct: 231 RIEQEPTAVLVPIIDVIEANTLAYSTNGDTSYQVGGFSWSGHFTWIDIQNEED----KHK 286
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
P K+P MAGGLFAIDR +F+ IG+YDE+M WGGENLEMSFR+WQCGG +E PCS V
Sbjct: 287 LTPVKSPTMAGGLFAIDRKFFWEIGSYDEQMDGWGGENLEMSFRIWQCGGRLETVPCSRV 346
Query: 425 AHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
H+FR PYSFP ++ +G N AR+A VWMD++ F+F + P E + R
Sbjct: 347 GHIFRDFHPYSFPD--NKDTHGINTARLAHVWMDDYKRFFFMYQPALENNPVVGDLTHRK 404
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
+LR++L+C SFKWYL +V+P F+P D+ + + + L G GP
Sbjct: 405 QLRQKLRCKSFKWYLENVYPEKFIP-DENVYAHGQVQNDYGMCLDDLQLGE--DKIGPLG 461
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQR-WTY 602
L C ++Q F + +L E+ C + V++ C G R++ W +
Sbjct: 462 LYQCHPYLAMSQYFSLNFKGEL--RKENFCAETFGVR-----EVQLTECHGHKREQFWMF 514
Query: 603 DK 604
K
Sbjct: 515 YK 516
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 72/145 (49%), Positives = 100/145 (68%), Gaps = 4/145 (2%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N ++SDR+P++R L D R PKCKT +N + K+S+V++F+NE S +LRTV SVI +
Sbjct: 89 LNTVLSDRMPLDRKLRDPRNPKCKTFTYNPKL--KASVVVIFYNELLSVILRTVWSVILQ 146
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKL--SVPTRVIRSPGRVGLIKARLLGARQA 174
+P+ +L+EI+LVDDAST E LK LD Y+ S R+I R+GLI+ARL GAR A
Sbjct: 147 TPKELLEEIILVDDASTEESLKGLLDYYIETRLSSTKLRLIHLKTRMGLIRARLQGARIA 206
Query: 175 EGEILVFLDAHCECTLVFNEEFLPK 199
G++L+FLDAHCE T + E L +
Sbjct: 207 TGDVLIFLDAHCEATTDWMEPLLSR 231
>gi|326921959|ref|XP_003207221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Meleagris gallopavo]
Length = 596
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 175/448 (39%), Positives = 264/448 (58%), Gaps = 17/448 (3%)
Query: 136 FLKSSLDEYVAKLSVPT-RVIRSPGRVG--LIKARLLGARQAEGEILVFLDAHCECTLVF 192
F+ DE +A + PT R S R G ++K + EG+ C
Sbjct: 86 FISLREDELIAAVVSPTGRRNHSKARKGYRVVKQQSRRPEAKEGDPXXXXXXX-XCLQQE 144
Query: 193 NEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEY 252
+ LP +S++I FH+EAWS LLRTVHS+I+ +P++ LK+I+LVDD S + LKS+L EY
Sbjct: 145 YDSSLPTASVIICFHDEAWSTLLRTVHSIINTAPKASLKDIILVDDLSQQGPLKSALSEY 204
Query: 253 VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDR 312
++KL ++IRS R+G+I+ R+LGA +A G++LVF+D+HCEC GWLE L+AR++ +R
Sbjct: 205 ISKLD-GVKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPLLARLSSNR 263
Query: 313 TRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
VV P+ID+I TF Y S LH G F+W+L F W + + R+ T P ++PA
Sbjct: 264 NSVVSPIIDVIDWKTFQYYHSVSLHRGVFDWKLDFHWEPVPEHEEKV-RQSPTSPIRSPA 322
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
+AG + A+DR YF +IGAYD +M +WG ENLE+S R W CGGS+EI PCS V H++R
Sbjct: 323 VAGAVVAMDRHYFQNIGAYDSDMTMWGAENLELSIRTWLCGGSVEIIPCSRVGHVYRNHI 382
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA--EKQRDKQAVRSRLELRKQLK 490
P++F E + N R+A W+ + E ++K + A + +K RL+L+K+L
Sbjct: 383 PHAF--SYEEAIVRNKIRIAETWLGSFKENFYKNDTVAFLISKAEKPDCSERLQLQKRLG 440
Query: 491 CHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTH 549
C SF+W++T+V+P P D + G++ + C + + S A G L PC++
Sbjct: 441 CRSFQWFITNVYPELSRPEDTPRLSGKLYNTGAGFCADYRPGRAS---ADGSIKLSPCSN 497
Query: 550 LPVLTQMFVMKLPTDL-IATDESVCLDV 576
LTQ F ++ + + +CLDV
Sbjct: 498 --SLTQHFEYNSMKEIRLGSVPLLCLDV 523
Score = 122 bits (305), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 58/113 (51%), Positives = 86/113 (76%), Gaps = 1/113 (0%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP +S++I FH+EAWS LLRTVHS+I+ +P++ LK+I+LVDD S + LKS+L EY++KL
Sbjct: 149 LPTASVIICFHDEAWSTLLRTVHSIINTAPKASLKDIILVDDLSQQGPLKSALSEYISKL 208
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
++IRS R+G+I+ R+LGA +A G++LVF+D+HCEC + E L + S
Sbjct: 209 D-GVKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPLLARLS 260
>gi|345797223|ref|XP_545481.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Canis
lupus familiaris]
Length = 602
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/362 (44%), Positives = 225/362 (62%), Gaps = 5/362 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST++
Sbjct: 146 AGCAEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKD 203
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 204 YLKDDLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 262
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 263 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVVAKNRIK 322
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ + P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 323 ETDIIRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 382
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 383 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTEQ 442
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+ ++ G +
Sbjct: 443 RELRKKLKCKSFKWYLENVFPDLKAPI-VRANGVLINVALGKCISIENTTATLEDCDGSS 501
Query: 543 SL 544
L
Sbjct: 502 EL 503
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 104/151 (68%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E WS L
Sbjct: 116 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDEVWSTL 174
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK LD+Y+++ R++R R GLI+A
Sbjct: 175 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDDLDKYMSQFP-KVRILRLKERHGLIRA 233
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 234 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 264
>gi|148706466|gb|EDL38413.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_b [Mus
musculus]
Length = 551
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 184/452 (40%), Positives = 252/452 (55%), Gaps = 28/452 (6%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 69 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTL 128
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 129 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 182
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 183 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 242
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 243 ASELRGGFDWSLHFQWEQLSLEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 301
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGG +EI PCS V H+FRK PY FP G + N R A
Sbjct: 302 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 361
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L C +FKWYL +V+P +P D
Sbjct: 362 VWMDEYKQYYYAARPFA-LERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSI 420
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E + L PC + +Q++ +I
Sbjct: 421 QKGNIR--QRQKCLE-----SQKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQ-- 471
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G RQ WT
Sbjct: 472 EELCLSVVTLFPG-APVVLALCKNGDERQLWT 502
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 75 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 133
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 134 SVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRMRG 187
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 188 ADIAQGTTLTFLDSHCE 204
>gi|391346326|ref|XP_003747427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Metaseiulus occidentalis]
Length = 622
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 177/420 (42%), Positives = 253/420 (60%), Gaps = 27/420 (6%)
Query: 185 HCECTLV---FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
H +C + +N LPK+S+VI FHNEA SALLRTV SV+ R+P +LKEI+LVDDAS
Sbjct: 151 HQDCAALHTSYNTSTLPKTSVVITFHNEARSALLRTVVSVLQRTPSHLLKEIVLVDDASD 210
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
L K+ + I + R GL+++R+ GA++A+G +L FLD+HCEC GW+
Sbjct: 211 DPTDGIELQMKYDKIEL----ITNRERQGLMRSRVFGAKKAKGPVLTFLDSHCECNEGWI 266
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
E L+AR+ ++ ++VVCPVID++S TF Y + G F+W L F+W S +
Sbjct: 267 EPLLARIRDEPSKVVCPVIDVLSMDTFGYFPASSDLRGGFDWNLVFKWEFITSKPELA-- 324
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
T+P KTPAMAGGLFAI + F +G+YD +M +WG ENLEMSFRVWQCG IEI PC
Sbjct: 325 ---TDPIKTPAMAGGLFAITKKEFERLGSYDTQMDIWGAENLEMSFRVWQCGSGIEILPC 381
Query: 422 SHVAHLFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
S V H+FRK PY+FPGG S +V N R A VWMD++ ++Y++ P A K +
Sbjct: 382 SRVGHVFRKQHPYTFPGGGSGKVFARNSRRAAEVWMDDYKKYYYEQVP-AAKSVAYGDIS 440
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASG 540
RL+LR++L+C SF+WY+ +V+P LP + G +R Q ++C++ G+++ S
Sbjct: 441 ERLKLREKLRCKSFEWYMKNVYPELKLPSN--VHGYVR--QNNRCLD---TLGAISDGS- 492
Query: 541 PASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
+ PC +L Q F + L+ D V L + V++ C+G N Q+W
Sbjct: 493 TVHVYPCHYLGG-NQDFRLAKNHLLMVHDMCVSLGSLAGQQ----LVKLRTCNGENSQKW 547
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/154 (42%), Positives = 91/154 (59%), Gaps = 6/154 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKT--KVFNEEFLPKSSIVIVFHNEAWSA 105
Q + ++FN + SD + R LPD R C +N LPK+S+VI FHNEA SA
Sbjct: 123 GQDPFAAHKFNQVASDDTLIGRALPDTRHQDCAALHTSYNTSTLPKTSVVITFHNEARSA 182
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLRTV SV+ R+P +LKEI+LVDDAS L K+ +I + R GL++
Sbjct: 183 LLRTVVSVLQRTPSHLLKEIVLVDDASDDPTDGIELQMKYDKIE----LITNRERQGLMR 238
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+R+ GA++A+G +L FLD+HCEC + E L +
Sbjct: 239 SRVFGAKKAKGPVLTFLDSHCECNEGWIEPLLAR 272
>gi|109099754|ref|XP_001087663.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Macaca mulatta]
Length = 940
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/344 (45%), Positives = 221/344 (64%), Gaps = 5/344 (1%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C LV N LP +S+++ F +E WS LLR+VHSV++RSP +++EILLVDD ST++
Sbjct: 484 AGCTEQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPYLIEEILLVDDFSTKD 541
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+LK +LD+Y+++ R++ R GLI+ARL GA+ A G++L FLD+H EC +GWLE
Sbjct: 542 YLKDNLDKYMSQFP-KVRILHLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEP 600
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV R +V CPVI++I+D +Y+ G F W ++F W T R
Sbjct: 601 LLERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIK 660
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ K P MAGGLF+ID++YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS
Sbjct: 661 ETDAIKCPVMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSR 720
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSR 482
V H+FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + +
Sbjct: 721 VGHIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQ 780
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
ELRK+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 781 RELRKKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 823
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 65/156 (41%), Positives = 105/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 449 HGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCTEQLVHNN-LPTTSVIMCFVDE 507
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP +++EILLVDD ST+++LK +LD+Y+++ R++ R
Sbjct: 508 VWSTLLRSVHSVLNRSPPYLIEEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILHLKERH 566
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 567 GLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLL 602
>gi|224047294|ref|XP_002195048.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Taeniopygia guttata]
Length = 552
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 181/419 (43%), Positives = 247/419 (58%), Gaps = 17/419 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT + + LP +S++I FHNEA S LLRT+ SV++R+P ++ EI+LVDD S
Sbjct: 98 HHRCTTLHYSQDLPPTSVIITFHNEARSTLLRTIRSVLNRTPVHLVHEIILVDDFSDDPD 157
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L + + +R+ R GLI++R+ GA A+ +L FLD+HCE WL L
Sbjct: 158 DCRLLGQLPK-----VKCLRNGRREGLIRSRIRGADVAKASVLTFLDSHCEVNKDWLLPL 212
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ R+ ED TRVV PVIDII+ TFAYV + G F+W LHF+W S + KR D
Sbjct: 213 LQRIKEDPTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQL-SPEQKAKRLDP 271
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
TEP KTP +AGGLF ID+A+F H+G YD M +WGGEN E+SFRVW CGGS+EI PCS V
Sbjct: 272 TEPIKTPIIAGGLFVIDKAWFNHLGKYDSAMDIWGGENFEISFRVWMCGGSLEIIPCSRV 331
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY FP G + N R A VWMDE+ ++Y+ P A+ R ++SR+E
Sbjct: 332 GHVFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKQYYYAARPAAQG-RPFGNIQSRVE 390
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LRK+LKCHSFKWYL +V+P +P + + G IR Q+ C+E + S +Q S
Sbjct: 391 LRKKLKCHSFKWYLENVYPELRIPKESLYQTGIIRQRQS--CLE---SHKSEDQEFPILS 445
Query: 544 LLPC-THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS-GFNRQRW 600
L PC + ++ + + + +CL V Y +V + C G N+QRW
Sbjct: 446 LTPCNSSKGIVPKAQEWTYTYNHHIRQQQLCLSV--YTLFPGSQVLLSPCKEGDNKQRW 502
Score = 93.6 bits (231), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 85/136 (62%), Gaps = 6/136 (4%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+++ FN S+RIP +R + D R +C T ++++ LP +S++I FHNEA S LLRT+
Sbjct: 74 YRLHAFNQRESERIPSDRAVRDTRHHRCTTLHYSQD-LPPTSVIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV++R+P ++ EI+LVDD S L + + +R+ R GLI++R+ GA
Sbjct: 133 SVLNRTPVHLVHEIILVDDFSDDPDDCRLLGQLPK-----VKCLRNGRREGLIRSRIRGA 187
Query: 172 RQAEGEILVFLDAHCE 187
A+ +L FLD+HCE
Sbjct: 188 DVAKASVLTFLDSHCE 203
>gi|254910954|ref|NP_082140.2| polypeptide N-acetylgalactosaminyltransferase 14 [Mus musculus]
gi|115527999|gb|AAI17801.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Mus musculus]
Length = 550
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 184/452 (40%), Positives = 252/452 (55%), Gaps = 28/452 (6%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQLSLEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGG +EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L C +FKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E + L PC + +Q++ +I
Sbjct: 420 QKGNIR--QRQKCLE-----SQKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQ-- 470
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G RQ WT
Sbjct: 471 EELCLSVVTLFPG-APVVLALCKNGDERQLWT 501
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRMRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|427789065|gb|JAA59984.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 626
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 166/364 (45%), Positives = 232/364 (63%), Gaps = 16/364 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST-REFLKSSLD 250
++ E LP +S+VI+F +E +SALLRTV+SVI+R+P +L+EI+LVDD S E L+
Sbjct: 154 YDVENLPTASVVIIFTDELFSALLRTVYSVINRTPHRLLREIILVDDYSQIDEMANGRLE 213
Query: 251 EYVAKLSVP--TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
++ + P ++I P R GLI+ARL GAR A G++LVFLD+HCE T WLE +V +
Sbjct: 214 RFIRRHFRPGFVKLITLPKREGLIRARLTGARAASGDVLVFLDSHCEATDHWLEPMVELI 273
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHW--GAFNWELHFRWYTYGSSDAIIK-RKDFT 365
+DRT VVCP+ID+I D T Y+ + + G FNW+ F W + +A K RK
Sbjct: 274 KKDRTTVVCPIIDVIDDKTLQYMGTSSDFYQIGGFNWKGEFIWIN--TPEAWRKARKSKA 331
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+P ++P MAGGLFAIDR YF+ G+YD EM+ WGGENLEMSFR+W CGGS+ IAPCSHV
Sbjct: 332 DPMRSPTMAGGLFAIDRKYFWESGSYDSEMEGWGGENLEMSFRIWMCGGSLVIAPCSHVG 391
Query: 426 HLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FR PY FP ++ +G N AR+A VWMD + ++++ PE K + R
Sbjct: 392 HIFRDYHPYKFPS--NKDTHGINTARLAEVWMDNYKYYFYQNRPELRKISFGD-ISERKA 448
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LR +L+C SFKWYL +V+P+ F+P + F FG R+ T C++ + + P
Sbjct: 449 LRNKLQCKSFKWYLDNVYPNKFVPSEKVFAFGNARNPNTGMCLD---SMSHNYDNTEPLG 505
Query: 544 LLPC 547
+ PC
Sbjct: 506 IYPC 509
Score = 128 bits (322), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 93/136 (68%), Gaps = 3/136 (2%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN V DR+P+NRTL D R C+ ++ E LP +S+VI+F +E +SALLRTV+SVI+R
Sbjct: 127 FNAYVCDRLPLNRTLGDRRHRSCRNAEYDVENLPTASVVIIFTDELFSALLRTVYSVINR 186
Query: 117 SPRSMLKEILLVDDAST-REFLKSSLDEYVAKLSVP--TRVIRSPGRVGLIKARLLGARQ 173
+P +L+EI+LVDD S E L+ ++ + P ++I P R GLI+ARL GAR
Sbjct: 187 TPHRLLREIILVDDYSQIDEMANGRLERFIRRHFRPGFVKLITLPKREGLIRARLTGARA 246
Query: 174 AEGEILVFLDAHCECT 189
A G++LVFLD+HCE T
Sbjct: 247 ASGDVLVFLDSHCEAT 262
>gi|395515411|ref|XP_003761898.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Sarcophilus harrisii]
Length = 590
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 160/328 (48%), Positives = 218/328 (66%), Gaps = 10/328 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LPK+S+VI F+NEAWS LLRTV+SV+ SP +LKEI+LVDD S RE LK L+
Sbjct: 140 YDYENLPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLKEIILVDDYSDREHLKEPLEN 199
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+++ L R+IR+ R GL++ARLLGA A GEIL FLD HCEC GWLE L+ R+ E+
Sbjct: 200 HLSGLR-KVRLIRANKREGLVRARLLGASIATGEILTFLDCHCECHDGWLEPLLERIHEE 258
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S + G F+W L F W++ + +R+ + ++
Sbjct: 259 ESAVVCPVIDVIDWNTFEYLGNSGDPQIGGFDWRLVFTWHSVPEKEQK-RRRSKIDVIRS 317
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+++ YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K
Sbjct: 318 PTMAGGLFAVNKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPK 377
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ N A K+ + R ELR +LK
Sbjct: 378 QAPYSRSKALA-----NSVRAAEVWMDEFKEIYYHRNMHARKE-PYGDITERKELRDKLK 431
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRI 517
C F+W+L +V+P +P D +FG +
Sbjct: 432 CKDFRWFLENVYPELHIPEDRPGYFGML 459
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 93/237 (39%), Positives = 130/237 (54%), Gaps = 42/237 (17%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
Q ++ N+ +SD I ++R LP+ P C+ K ++ E LPK+S+VI F+NEAWS LLRTV+S
Sbjct: 109 QRHQINIYLSDLISLHRRLPERWHPLCREKKYDYENLPKTSVVIAFYNEAWSTLLRTVYS 168
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +LKEI+LVDD S RE LK L+ +++ L R+IR+ R GL++ARLLGA
Sbjct: 169 VLETSPDILLKEIILVDDYSDREHLKEPLENHLSGLR-KVRLIRANKREGLVRARLLGAS 227
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 232
A GEIL FLD HCEC H+ LL +H S +++
Sbjct: 228 IATGEILTFLDCHCEC------------------HDGWLEPLLERIHEEES----AVVCP 265
Query: 233 ILLVDDASTREFLKSSLDEYVAKL---------SVPTR----------VIRSPGRVG 270
++ V D +T E+L +S D + SVP + VIRSP G
Sbjct: 266 VIDVIDWNTFEYLGNSGDPQIGGFDWRLVFTWHSVPEKEQKRRRSKIDVIRSPTMAG 322
>gi|344268422|ref|XP_003406059.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Loxodonta africana]
Length = 939
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 152/331 (45%), Positives = 216/331 (65%), Gaps = 3/331 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++
Sbjct: 494 LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQF 553
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ RV R +V
Sbjct: 554 P-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLLERVYLSRKKVA 612
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVI++I+D +Y+ G F W ++F W T R T+ + P MAGG
Sbjct: 613 CPVIEVINDKDMSYMTVDNFQRGVFVWPMNFGWRTIPPDVVAKNRIKETDVIRCPVMAGG 672
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H+FR +PY+F
Sbjct: 673 LFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRNDNPYTF 732
Query: 437 PGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P + + NL RVA VW+DE+ E FY + ++ D + + ELR++LKC SFK
Sbjct: 733 PKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQRELRRKLKCKSFK 792
Query: 496 WYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
WYL +V+P P+ K G + +V KC+
Sbjct: 793 WYLENVFPDLKAPI-VKAGGVLINVALDKCI 822
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/156 (42%), Positives = 106/156 (67%), Gaps = 2/156 (1%)
Query: 42 HSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
H +A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E
Sbjct: 448 HGKEKEAKRRWKEGNFNVYLSDLIPVDRAIEDTRPTGCAEQLVHSN-LPTTSVIMCFVDE 506
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R
Sbjct: 507 VWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERH 565
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
GLI+ARL GA+ A G++L FLD+H EC + + E L
Sbjct: 566 GLIRARLAGAQNATGDVLTFLDSHVECNIGWLEPLL 601
>gi|383857913|ref|XP_003704448.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Megachile rotundata]
Length = 638
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 180/421 (42%), Positives = 254/421 (60%), Gaps = 39/421 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S L+ L++Y+ +
Sbjct: 161 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDYSDMPHLQRQLEDYM--M 218
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P ++IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 219 NYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 278
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 279 VCPVIDVIDDTTLEY------HWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEP 331
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF+IDRA+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 332 VWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 391
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK SPY + GV+ VL N R++ VW+DE+A++Y++ + + V R LRK
Sbjct: 392 FRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRI--GHDKGNYGDVSDRKALRK 448
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLL 545
+L C SFKWYL +V+P F+P + G +R++ + C++ P K +++ PA L
Sbjct: 449 KLGCKSFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGLY 505
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFNRQRWTYD 603
PC H Q+ L++ S+C+D P D+ V + C G N Q W
Sbjct: 506 PC-HRQGGNQI------RHLVS---SICIDSPGKPEDLHQPVGLYPCHRQGGN-QYWMLS 554
Query: 604 K 604
K
Sbjct: 555 K 555
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 114/191 (59%), Gaps = 25/191 (13%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R P CK + LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 127 NAFNQYVSDLISVHRTLPDPRDPWCKEPGRYLKELPPTAVIICFHNEAWSVLLRTVHSVL 186
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S L+ L++Y+ ++ P ++IR+ R GLI+ARLLGA
Sbjct: 187 DRSPEHLIQEIILVDDYSDMPHLQRQLEDYM--MNYPKVQIIRAQKREGLIRARLLGAAA 244
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
A+ +L +LD+HCECT E W L + I+R P +++ +
Sbjct: 245 AKAPVLTYLDSHCECT-------------------EGW---LEPLLDRIARDPTTVVCPV 282
Query: 234 LLVDDASTREF 244
+ V D +T E+
Sbjct: 283 IDVIDDTTLEY 293
>gi|432096894|gb|ELK27469.1| Polypeptide N-acetylgalactosaminyltransferase 5 [Myotis davidii]
Length = 940
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 157/342 (45%), Positives = 219/342 (64%), Gaps = 5/342 (1%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST+++L
Sbjct: 486 CAKQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYL 543
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K +LD+Y+++ R++ R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+
Sbjct: 544 KDNLDKYMSQFP-KVRILHLKERHGLIRARLAGAQIATGDVLTFLDSHVECNIGWLEPLL 602
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
RV R +V CPVI++I+D +Y+ G F W ++F W T R T
Sbjct: 603 ERVYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIKET 662
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+ + P MAGGLF+ID+ YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V
Sbjct: 663 DVIRCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVG 722
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLE 484
H+FR +PYSFP + + NL RVA VW+DE+ E FY N ++ D + + E
Sbjct: 723 HIFRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGNHLIDQGLDVGNLTQQRE 782
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
LRK+LKC SFKWYL +V+P PM + G + +V KC+
Sbjct: 783 LRKKLKCKSFKWYLENVFPDLKAPM-VRAGGVLINVALGKCI 823
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 104/151 (68%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E WS L
Sbjct: 454 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPVGCAKQLVHNN-LPTTSVIMCFVDEVWSTL 512
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++ R GLI+A
Sbjct: 513 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILHLKERHGLIRA 571
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 572 RLAGAQIATGDVLTFLDSHVECNIGWLEPLL 602
>gi|344268030|ref|XP_003405867.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3
[Loxodonta africana]
Length = 633
Score = 312 bits (799), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 172/362 (47%), Positives = 229/362 (63%), Gaps = 15/362 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTVHSV+ SP +LKEI+LVDDAS E+L L+EY+ +
Sbjct: 184 LPTTSVIIVFHNEAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHGKLEEYIKQF 243
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S+ +++R R GLI ARLLGA A E L FLDAHCEC GWLE L+AR+AE+ T VV
Sbjct: 244 SI-VKIVRQKERKGLITARLLGAAAATAETLTFLDAHCECFYGWLEPLLARIAENYTAVV 302
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P KTP
Sbjct: 303 SPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQ-RRKDETYPIKTPT 361
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I + YF +IG YDEEM++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 362 FAGGLFSISKEYFEYIGTYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVFRSKS 421
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEK---QRDKQAVRSRLELRKQL 489
P++FP G ++V+ N R+A VWMDE+ E +++ N +A K Q+ + R E++ +L
Sbjct: 422 PHTFPKG-TQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVRQKSFGDLSKRFEIKHRL 480
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
+C +F WYL V+P ++P ++ G I+ H C L G NQ P + C
Sbjct: 481 QCKNFTWYLNSVYPEVYVPDLNPVISGYIKSFGQHLC----LDVGENNQGGKPLIMYTCH 536
Query: 549 HL 550
L
Sbjct: 537 GL 538
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/134 (55%), Positives = 92/134 (68%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R L PD R P+C + F LP +S++IVFHNEAWS LLRTVHSV+
Sbjct: 150 FNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHNEAWSTLLRTVHSVL 209
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS E+L L+EY+ + S+ +++R R GLI ARLLGA A
Sbjct: 210 YSSPAILLKEIILVDDASVDEYLHGKLEEYIKQFSI-VKIVRQKERKGLITARLLGAAAA 268
Query: 175 EGEILVFLDAHCEC 188
E L FLDAHCEC
Sbjct: 269 TAETLTFLDAHCEC 282
>gi|170051778|ref|XP_001861920.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
quinquefasciatus]
gi|167872876|gb|EDS36259.1| polypeptide N-acetylgalactosaminyltransferase 12 [Culex
quinquefasciatus]
Length = 601
Score = 312 bits (799), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 155/347 (44%), Positives = 221/347 (63%), Gaps = 13/347 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP +S++I+F+NE +S L+RTVHSV++ + +LKEI+LVDD ST E LK LD
Sbjct: 141 YDIENLPSTSVIIIFYNEPYSVLVRTVHSVLNTADERLLKEIVLVDDGSTNEELKGKLDY 200
Query: 252 YV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAE 310
YV +L +V+R RVGLI+ARL GAR A+ ++LVFLDAHCEC WLE L+ R+ E
Sbjct: 201 YVRTRLPSKVKVLRQRNRVGLIRARLAGARFAKADVLVFLDAHCECMPQWLEPLLERIRE 260
Query: 311 DRTRVVCPVIDIISDVTFAYVRS--FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE-- 366
RT V+ P+ID+I F Y + + G F W+ HF W+ + ++++ +E
Sbjct: 261 SRTSVLVPIIDVIEAKNFFYSTNGFTDFQIGGFTWDGHFDWHDVTQREKERQKRECSEKD 320
Query: 367 ----PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
P +P MAGGLFAI R YF+ IG+YDE+M WGGENLEMSFRVWQCGG++E PCS
Sbjct: 321 VAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGGENLEMSFRVWQCGGTLETIPCS 380
Query: 423 HVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
+ H+FR PYSFP +G N R+A VWMD++ + + P+ + V
Sbjct: 381 RIGHIFRDFHPYSFPN--DRDTHGINTVRMATVWMDDYIDLLYLNRPDLRDHPEVGDVTH 438
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVE 527
R LR++L+C SF WY+ +V+P F+P + + FGR+ + + C++
Sbjct: 439 RRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAFGRVTSLAENLCLD 485
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/144 (50%), Positives = 98/144 (68%), Gaps = 1/144 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N +S+ + NRT PD R P CK K ++ E LP +S++I+F+NE +S L+RTVHSV++
Sbjct: 114 LNEELSEHLSYNRTPPDERHPSCKRKSYDIENLPSTSVIIIFYNEPYSVLVRTVHSVLNT 173
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAE 175
+ +LKEI+LVDD ST E LK LD YV +L +V+R RVGLI+ARL GAR A+
Sbjct: 174 ADERLLKEIVLVDDGSTNEELKGKLDYYVRTRLPSKVKVLRQRNRVGLIRARLAGARFAK 233
Query: 176 GEILVFLDAHCECTLVFNEEFLPK 199
++LVFLDAHCEC + E L +
Sbjct: 234 ADVLVFLDAHCECMPQWLEPLLER 257
>gi|52851353|dbj|BAD52069.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
Length = 550
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 184/452 (40%), Positives = 251/452 (55%), Gaps = 28/452 (6%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRHNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQLSLEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGG +EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L C +FKWYL +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E + L PC + +Q++ +I
Sbjct: 420 QKGNIR--QRQKCLE-----SQKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQ-- 470
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G RQ WT
Sbjct: 471 EELCLSVVTLFPG-APVVLALCKNGDERQLWT 501
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 86/137 (62%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRHNERQGLVRSRMRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|1934912|emb|CAA69875.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 578
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 174/399 (43%), Positives = 239/399 (59%), Gaps = 19/399 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
FN LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FNYRTLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLET 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FL HCEC GWLE L+ R+
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLYCHCECNSGWLEPLLERIGRY 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSS--DAIIKRKDFTEPF 368
T VVCPVID I TF Y++ E G F+W L F+W++ D I R D P
Sbjct: 248 ETAVVCPVIDTIDWNTFEFYMQIGEPMIGGFDWRLTFQWHSVPKQERDRRISRID---PI 304
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F
Sbjct: 305 RSPTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVF 364
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PY+ P + N AR A VWMDE+ E ++ NP A K+ +L LR++
Sbjct: 365 PKRAPYARPNFLQ-----NTARAAEVWMDEYKEHFYNRNPPARKEAYGDISERKL-LRER 418
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQASGPASLLP 546
L+C SF WYL +V+P+ +P D + G IR + +C++ N SL
Sbjct: 419 LRCKSFDWYLKNVFPNLHVPEDRPGWHGAIRSRGISSECLD--YNSPDNNPTGANLSLFG 476
Query: 547 CTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
C H Q F ++ + +C +VPE +N +
Sbjct: 477 C-HGQGGNQFFEYTSNKEIRFNSVTELCAEVPEQKNYVG 514
Score = 139 bits (350), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/155 (45%), Positives = 103/155 (66%), Gaps = 1/155 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK++ FN LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKSQKFNYRTLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRVYLKTQLETYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL+GA A G++L FL HCEC + E L +
Sbjct: 209 RARLIGATFATGDVLTFLYCHCECNSGWLEPLLER 243
>gi|363731300|ref|XP_419370.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14 [Gallus
gallus]
Length = 552
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 173/364 (47%), Positives = 228/364 (62%), Gaps = 13/364 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT + + LP +SIVI FHNEA S LLRT+ SV++R+P ++ EI+LVDD S
Sbjct: 98 HHRCTTLHYRQDLPPTSIVITFHNEARSTLLRTIRSVMNRTPVHLIHEIILVDDFSD--- 154
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
+AKL + +R+ R GLI++R+ GA A+ +L FLD+HCE WL L
Sbjct: 155 -DPDDCRLLAKLP-KVKCLRNRQREGLIRSRIQGADVAQAGVLTFLDSHCEVNKDWLLPL 212
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ R+ ED TRVV PVIDII+ TFAYV + G F+W LHF+W S + KR D
Sbjct: 213 LQRIKEDPTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQL-SPEQKAKRLDP 271
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+P KTP +AGGLF ID+A+F H+G YD M +WGGEN E+SFRVW CGGS+EI PCS V
Sbjct: 272 TKPIKTPIIAGGLFVIDKAWFNHLGKYDNAMDIWGGENFEISFRVWMCGGSLEIIPCSRV 331
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY FP G + N R A VWMDE+ ++Y+ P A+ R ++SR+E
Sbjct: 332 GHVFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKQYYYAARPAAQG-RPYGNIQSRVE 390
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LRK+LKCHSFKWYL +V+P +P + + G IR Q+ C+E + S +Q S
Sbjct: 391 LRKRLKCHSFKWYLENVYPELRIPEELLYQTGMIRQRQS--CLE---SHKSEDQELPILS 445
Query: 544 LLPC 547
L PC
Sbjct: 446 LNPC 449
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/136 (41%), Positives = 86/136 (63%), Gaps = 6/136 (4%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+++ FN S+R+P +R + D R +C T + ++ LP +SIVI FHNEA S LLRT+
Sbjct: 74 YRLHAFNQRESERMPSDRAVRDTRHHRCTTLHYRQD-LPPTSIVITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV++R+P ++ EI+LVDD S +AKL + +R+ R GLI++R+ GA
Sbjct: 133 SVMNRTPVHLIHEIILVDDFSD----DPDDCRLLAKLP-KVKCLRNRQREGLIRSRIQGA 187
Query: 172 RQAEGEILVFLDAHCE 187
A+ +L FLD+HCE
Sbjct: 188 DVAQAGVLTFLDSHCE 203
>gi|395504161|ref|XP_003756425.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Sarcophilus harrisii]
Length = 563
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 166/418 (39%), Positives = 245/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V LP +SIVI FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 118 HYRCTSVHYASDLPATSIVITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 173
Query: 245 LKSSLDEYVAKLSVPT-RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 174 --SDPEDCLLLTRIPKIKCLRNDRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQP 231
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 232 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 290
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 291 PTQPIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 350
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + ++ R
Sbjct: 351 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSFGSIADRE 409
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P + G I+ Q C+E + N
Sbjct: 410 EQRKKMNCKSFQWYLENVYPELKIPEKEVIPGIIK--QGTNCLESQGQDTAGNNLVVMGG 467
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRW 600
++ P++TQ +V +D + + CL + + +V + C+ +RQ+W
Sbjct: 468 CKGTSNNPLMTQEWVF---SDPVIRQQDKCLSITSFST--GSQVTLEVCNQKDDRQKW 520
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/154 (40%), Positives = 93/154 (60%), Gaps = 12/154 (7%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C T V LP +SIVI FHNEA S
Sbjct: 87 LKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRC-TSVHYASDLPATSIVITFHNEARS 145
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT-RVIRSPGRVGL 163
LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R GL
Sbjct: 146 TLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKIKCLRNDRREGL 199
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
I++R+ GA A +IL FLD+HCE N E+L
Sbjct: 200 IRSRVRGAEVATADILTFLDSHCE----VNSEWL 229
>gi|449274705|gb|EMC83783.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Columba livia]
Length = 502
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 166/418 (39%), Positives = 245/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 57 HYRCTSVRYDTDLPATSLIITFHNEARSTLLRTVKSVLNRTPPSLIQEIILVDDFS---- 112
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 113 --SDPEDCQLLTKIPKVKCLRNTRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQP 170
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 171 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 229
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 230 PTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 289
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + +V R+
Sbjct: 290 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSFGSVAERV 348
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E R++L C SF+WYL +V+P +P + G I+ Q C+E + N G +
Sbjct: 349 EQRRKLNCKSFQWYLENVYPELKIPEKELIPGIIK--QGGNCLESQAQDTTGNTLVGMGN 406
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
P +TQ +V +D + + CL + + + + + AC+ + RQ+W
Sbjct: 407 CKGTVSNPPVTQEWVF---SDPLIRQQDKCLSITSFS--MGSHITLEACNQKDGRQKW 459
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 62/155 (40%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R +C T V + LP +S++I FHNEA
Sbjct: 25 QLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRC-TSVRYDTDLPATSLIITFHNEAR 83
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ +P + +R+ R G
Sbjct: 84 STLLRTVKSVLNRTPPSLIQEIILVDDFS------SDPEDCQLLTKIPKVKCLRNTRREG 137
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +IL FLD+HCE N E+L
Sbjct: 138 LIRSRVRGAEVATADILTFLDSHCE----VNSEWL 168
>gi|77736615|ref|NP_001020224.2| polypeptide N-acetylgalactosaminyltransferase 4 [Rattus norvegicus]
gi|76780269|gb|AAI05819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
gi|149067086|gb|EDM16819.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 4 (GalNAc-T4) [Rattus
norvegicus]
Length = 578
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 153/320 (47%), Positives = 214/320 (66%), Gaps = 9/320 (2%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQLEA 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+IR+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R++ D
Sbjct: 189 YISNLDR-VRLIRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERISRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T +VCPVID I TF Y+++ E G F+W L F+W++ + +R +P ++
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRTSRIDPIRS 306
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 307 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFPK 366
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMD++ E ++ NP A K+ +L LR++L+
Sbjct: 367 RAPYARPNFLQ-----NTARAAEVWMDDYKEHFYNRNPPARKETYGDISERKL-LRERLQ 420
Query: 491 CHSFKWYLTHVWPHHFLPMD 510
C SF WYL +V+ + +P D
Sbjct: 421 CKSFDWYLKNVFSNLHVPED 440
Score = 141 bits (356), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 72/157 (45%), Positives = 104/157 (66%), Gaps = 1/157 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK K F+ LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKAKKFHYRSLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+IR+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQLEAYISNLDR-VRLIRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIS 245
>gi|390364218|ref|XP_793815.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 3-like,
partial [Strongylocentrotus purpuratus]
Length = 531
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 163/383 (42%), Positives = 229/383 (59%), Gaps = 14/383 (3%)
Query: 129 DDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
DD R+ + ++ +K +P + R G++ R R +D C
Sbjct: 33 DDDGRRKPSDTVDNDEASKFRIPAEEMMENMRNGVVDWRACVYRTE-------IDKSDMC 85
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
+ LP +SI+I FHNEAWS LLRT++S+I RSP ++KEI+L+DDAST E L+
Sbjct: 86 KNISYPHDLPSTSIIICFHNEAWSTLLRTLNSIIDRSPLRLIKEIILLDDASTMEHLQEP 145
Query: 249 LDEYVAKL-SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+++Y++++ SV R++R+ R+GLIKAR++G +EGE FLD+H E +GWLE L+AR
Sbjct: 146 IEDYISQIHSVRIRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLAR 205
Query: 308 VAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
+A DRT VV PV+D I+ TF Y V L G FNW +RW + D KR
Sbjct: 206 LASDRTIVVMPVVDEINKDTFNYNVVPEPLQRGGFNWRFEYRWKPIPNYD---KRPSKVA 262
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P K+PAM GGL +DR++F +G +D M+VWGGENLE S ++W CGGSIEI PCS V H
Sbjct: 263 PIKSPAMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPCSRVGH 322
Query: 427 LFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
++R SPYSF G +++ N RV VW DE ++ P K RD V R +L
Sbjct: 323 VYRDTSPYSFLGQNPLDIVEHNAMRVVEVWTDEHKHHFYDRLPML-KNRDFGDVSKRKKL 381
Query: 486 RKQLKCHSFKWYLTHVWPHHFLP 508
R+ L C+ F WYLT+V+P ++P
Sbjct: 382 RESLNCYDFNWYLTNVYPELYVP 404
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 69/166 (41%), Positives = 103/166 (62%), Gaps = 3/166 (1%)
Query: 35 DSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSI 94
D+VD S + N N +V R V RT D + CK + + LP +SI
Sbjct: 42 DTVDNDEASKFRIPAEEMMENMRNGVVDWRACVYRTEID-KSDMCKNISYPHD-LPSTSI 99
Query: 95 VIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL-SVPTR 153
+I FHNEAWS LLRT++S+I RSP ++KEI+L+DDAST E L+ +++Y++++ SV R
Sbjct: 100 IICFHNEAWSTLLRTLNSIIDRSPLRLIKEIILLDDASTMEHLQEPIEDYISQIHSVRIR 159
Query: 154 VIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ R+GLIKAR++G +EGE FLD+H E + + E L +
Sbjct: 160 MVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLAR 205
>gi|363734725|ref|XP_001231965.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 1
[Gallus gallus]
Length = 563
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 166/418 (39%), Positives = 246/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V + LP +S++I FHNEA SALLRTV SV++R+P ++++EI+LVDD S
Sbjct: 118 HYRCTSVRYDTDLPATSLIITFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFS---- 173
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 174 --SDPEDCQLLTRIPKVKCLRNIRREGLIRSRVRGAEAATADILTFLDSHCEVNSEWLQP 231
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 232 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 290
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ +TP +AGG+F I++++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 291 PTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 350
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + ++ R+
Sbjct: 351 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSYGSIADRV 409
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E R++L C SF+WYL V+P +P D G IR Q C+E + N +G +
Sbjct: 410 EQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIR--QGGNCLESWAQDTTGNTLAGIGN 467
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
+ P +TQ +V +D + + CL + + + + AC+ + RQ+W
Sbjct: 468 CKGTVNNPPVTQEWVF---SDPLIRQQDKCLSITSFST--GSHITLEACNQKDGRQKW 520
Score = 102 bits (253), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 94/154 (61%), Gaps = 12/154 (7%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C T V + LP +S++I FHNEA S
Sbjct: 87 LKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRC-TSVRYDTDLPATSLIITFHNEARS 145
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGL 163
ALLRTV SV++R+P ++++EI+LVDD S S ++ +P + +R+ R GL
Sbjct: 146 ALLRTVKSVLNRTPPNLIQEIILVDDFS------SDPEDCQLLTRIPKVKCLRNIRREGL 199
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
I++R+ GA A +IL FLD+HCE N E+L
Sbjct: 200 IRSRVRGAEAATADILTFLDSHCE----VNSEWL 229
>gi|307183874|gb|EFN70488.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Camponotus
floridanus]
Length = 451
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 174/416 (41%), Positives = 240/416 (57%), Gaps = 28/416 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 8 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 66
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED TRVV
Sbjct: 67 ----RVIRNEKREGLMRSRVRGADAATANVLTFLDSHCECNADWLEPLLERVAEDPTRVV 122
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W ++ ++KD T+ +TP +AGG
Sbjct: 123 CPVIDVISMDTFQYIGASADLRGGFDWSLVFKWEYLSQAERQARQKDPTQAIRTPMIAGG 182
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENL---------EMSFRVWQCGGSIEIAPCSHVAHL 427
LF I++AYF +G YD +M VWGGENL ++SFRVWQCGGS+EI PCS V H+
Sbjct: 183 LFVINKAYFEKLGKYDTQMDVWGGENLGIVIQFHVQKISFRVWQCGGSLEIIPCSRVGHV 242
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK PYSFPGG V N R A VWMD++ +FY+ P A + ++ R+EL++
Sbjct: 243 FRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA-RNIPYGNIQDRMELKR 301
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLP 546
+L C F WYL +V+P +P + G Q C++ SM G L P
Sbjct: 302 RLHCKPFSWYLKNVYPELVIPTSEGGPGGSLK-QGTACLD------SMGHLLDGNVGLYP 354
Query: 547 CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
C + + + K D + +CL +P Y + ++I C G Q+W +
Sbjct: 355 CHNTGGNQEWGLTK---DGLIKHHDLCLTLPVYAKGTTLLMQI--CDGSENQKWRH 405
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 70/111 (63%), Gaps = 5/111 (4%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S L + K+
Sbjct: 8 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDHPEDGEELSR-IHKV 66
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RVIR+ R GL+++R+ GA A +L FLD+HCEC + E L +
Sbjct: 67 ----RVIRNEKREGLMRSRVRGADAATANVLTFLDSHCECNADWLEPLLER 113
>gi|432882425|ref|XP_004074024.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Oryzias latipes]
Length = 549
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 220/332 (66%), Gaps = 10/332 (3%)
Query: 190 LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
L ++ LP +S+VI F+NEAWS LLRTVHSV+ SP +LKE++LVDD S R LK L
Sbjct: 94 LKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDRAHLKEPL 153
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
++Y++ R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+
Sbjct: 154 EKYLSGFR-KVRLIRATKREGLVRARLLGASIATGDVLTFLDCHCECHEGWLEPLLHRIK 212
Query: 310 EDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
E+ T VVCPVID+I TF Y+ E G F+W L F W++ ++ +R T+
Sbjct: 213 EEPTAVVCPVIDVIDWDTFQYLGNPGEPQIGGFDWRLVFTWHSVPDNEQK-RRHSPTDVI 271
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLF++++ YF+++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F
Sbjct: 272 RSPTMAGGLFSVNKNYFYYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVF 331
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PYS ++ N R A VWMD++ E Y+ NP A + V RL+LR++
Sbjct: 332 PKKAPYSRSKALA-----NSVRAAEVWMDDYKEIYYHRNPHARLEAFGD-VSERLKLREK 385
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
L C +F+WYL +V+P +P D FG +++
Sbjct: 386 LGCKNFQWYLDNVYPEIHVPHDRPGMFGMLKN 417
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 122/235 (51%), Gaps = 42/235 (17%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
++ N+ VSD++ ++R LP+ P C+ ++ LP +S+VI F+NEAWS LLRTVHSV+
Sbjct: 67 HQINIYVSDKVSLHRRLPEKWNPLCRELKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVL 126
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKE++LVDD S R LK L++Y++ R+IR+ R GL++ARLLGA A
Sbjct: 127 ETSPDILLKEVVLVDDYSDRAHLKEPLEKYLSGFR-KVRLIRATKREGLVRARLLGASIA 185
Query: 175 EGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEIL 234
G++L FLD HCEC +E W L + I P +++ ++
Sbjct: 186 TGDVLTFLDCHCEC-------------------HEGW---LEPLLHRIKEEPTAVVCPVI 223
Query: 235 LVDDASTREFL-------------------KSSLDEYVAKLSVPTRVIRSPGRVG 270
V D T ++L S D + PT VIRSP G
Sbjct: 224 DVIDWDTFQYLGNPGEPQIGGFDWRLVFTWHSVPDNEQKRRHSPTDVIRSPTMAG 278
>gi|363734723|ref|XP_003641443.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 isoform 2
[Gallus gallus]
Length = 557
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 166/418 (39%), Positives = 246/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V + LP +S++I FHNEA SALLRTV SV++R+P ++++EI+LVDD S
Sbjct: 112 HYRCTSVRYDTDLPATSLIITFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFS---- 167
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 168 --SDPEDCQLLTRIPKVKCLRNIRREGLIRSRVRGAEAATADILTFLDSHCEVNSEWLQP 225
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 226 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 284
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ +TP +AGG+F I++++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 285 PTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 344
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + ++ R+
Sbjct: 345 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSYGSIADRV 403
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E R++L C SF+WYL V+P +P D G IR Q C+E + N +G +
Sbjct: 404 EQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIR--QGGNCLESWAQDTTGNTLAGIGN 461
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
+ P +TQ +V +D + + CL + + + + AC+ + RQ+W
Sbjct: 462 CKGTVNNPPVTQEWVF---SDPLIRQQDKCLSITSFST--GSHITLEACNQKDGRQKW 514
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 94/154 (61%), Gaps = 12/154 (7%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C T V + LP +S++I FHNEA S
Sbjct: 81 LKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRC-TSVRYDTDLPATSLIITFHNEARS 139
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGL 163
ALLRTV SV++R+P ++++EI+LVDD S S ++ +P + +R+ R GL
Sbjct: 140 ALLRTVKSVLNRTPPNLIQEIILVDDFS------SDPEDCQLLTRIPKVKCLRNIRREGL 193
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
I++R+ GA A +IL FLD+HCE N E+L
Sbjct: 194 IRSRVRGAEAATADILTFLDSHCE----VNSEWL 223
>gi|395846631|ref|XP_003796006.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Otolemur garnettii]
Length = 943
Score = 311 bits (797), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 168/399 (42%), Positives = 241/399 (60%), Gaps = 23/399 (5%)
Query: 148 LSVPTRVIRSPGRVG------LIKARLLGARQAEGEILVFL------DAHCE------CT 189
+++ R ++PG+ G L K + R EG V+L D E C
Sbjct: 431 VTLSPRDPKAPGQFGRPVVVPLGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPVGCA 490
Query: 190 LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST+++LK +L
Sbjct: 491 EQLVHSNLPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDCSTKDYLKDNL 550
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
DEY+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ RV
Sbjct: 551 DEYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVY 609
Query: 310 EDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTEPF 368
R +V CPVI++I+D +Y+ G F W ++F W T D + K K T+
Sbjct: 610 LSRQKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWKTI-PPDVVAKNKIKETDII 668
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+ P MAGGLF+ID+ YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H+F
Sbjct: 669 RCPVMAGGLFSIDKNYFYELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIF 728
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLELRK 487
R +PYSFP + + NL RVA VW+DE+ E FY + ++ + + + ELRK
Sbjct: 729 RNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLEVGNLTQQRELRK 788
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 789 KLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 826
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/151 (44%), Positives = 105/151 (69%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E WS L
Sbjct: 457 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPVGCAEQLVHSN-LPTTSVIMCFVDEVWSTL 515
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK +LDEY+++ R++R R GLI+A
Sbjct: 516 LRSVHSVLNRSPPHLIKEILLVDDCSTKDYLKDNLDEYMSQFP-KVRILRLKERHGLIRA 574
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 575 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 605
>gi|345491789|ref|XP_001607575.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 1-like
[Nasonia vitripennis]
Length = 566
Score = 311 bits (797), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 165/348 (47%), Positives = 215/348 (61%), Gaps = 12/348 (3%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C V + LP +S+VI+FHNEAWS LLRTV+SV+ SP LKEI+LVDD S E L+
Sbjct: 110 CKSVTYDTKLPTTSVVIIFHNEAWSVLLRTVYSVLQESPPKFLKEIILVDDNSNEEELED 169
Query: 248 SLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L Y+ +L +++R P R GLI+ARL GA+QA G++LVFLDAHCE T GWL L+
Sbjct: 170 ILAYYIETRLPKKVKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLLH 229
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY---VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
R+ V+ PVID+I T Y R + G F W F W S KR
Sbjct: 230 RIKARPNAVLIPVIDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSP---KRTT 286
Query: 364 FT--EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
+ +P TP MAGGLFAIDR YF+ IG+YDE M WGGENLEMSFR+WQCGGSIEI PC
Sbjct: 287 ASPIDPINTPTMAGGLFAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVPC 346
Query: 422 SHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE-KQRDKQAVR 480
S V H+FR PY FP L N AR A VWMD++ +F + E ++ +
Sbjct: 347 SRVGHIFRDFFPYEFPSSRDTYLI-NTARAAHVWMDDYKRLFFLHHKNMEGNTKEIGDLT 405
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVE 527
+R +LR++L+C SFKWYL +V+P F+P ++ +GR R + + C++
Sbjct: 406 ARKKLRERLQCASFKWYLQNVYPEKFIPDENVLAYGRARSPRRNLCLD 453
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 86/213 (40%), Positives = 129/213 (60%), Gaps = 29/213 (13%)
Query: 38 DGGLHSNLSDAQQ-----LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKS 92
D G + LSD+++ +Y N+++S++IP+ R + D+R P CK+ ++ + LP +
Sbjct: 64 DFGEAAYLSDSEKQNGSLVYSKRAVNVVLSNKIPLQRRIRDMRDPLCKSVTYDTK-LPTT 122
Query: 93 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVP 151
S+VI+FHNEAWS LLRTV+SV+ SP LKEI+LVDD S E L+ L Y+ +L
Sbjct: 123 SVVIIFHNEAWSVLLRTVYSVLQESPPKFLKEIILVDDNSNEEELEDILAYYIETRLPKK 182
Query: 152 TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW 211
+++R P R GLI+ARL GA+QA G++LVFLDAHCE T + W
Sbjct: 183 VKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVT-------------------KGW 223
Query: 212 SALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ L +H + +R P ++L ++ V DA T E+
Sbjct: 224 LSPL--LHRIKAR-PNAVLIPVIDVIDAKTLEY 253
>gi|194882801|ref|XP_001975498.1| GG20529 [Drosophila erecta]
gi|190658685|gb|EDV55898.1| GG20529 [Drosophila erecta]
Length = 601
Score = 311 bits (797), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 178/428 (41%), Positives = 250/428 (58%), Gaps = 33/428 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI+F NE +S LLRTVHS +S LKEI+LVDD S L + LD YV +
Sbjct: 148 LPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDYYV-RT 206
Query: 257 SVPT---RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
+P ++R R+GLI+ARL GAR A G++L+FLDAHCE +GW E L+ R+ E RT
Sbjct: 207 RIPAGKVTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIKESRT 266
Query: 314 RVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE--- 366
V+ P+ID+I F Y +SF++ G F W HF W + +R++ +
Sbjct: 267 SVLVPIIDVIDANDFQYSTNGYKSFQV--GGFQWNGHFDWINLPEREKQRQRRECRQQRE 324
Query: 367 --PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
P +P MAGGLFAIDR YF+ +G+YDE+M WGGENLEMSFR+WQCGG+IE PCS V
Sbjct: 325 ICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRV 384
Query: 425 AHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
H+FR PY FP +G N AR+ALVWMDE+ +F P+ + D V R+
Sbjct: 385 GHVFRDFHPYKFPN--DRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGDVTHRV 442
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
LRK+L+C SF+WYL +++P F+P D + +G++ + ++ C++ L N+ A
Sbjct: 443 MLRKKLRCKSFEWYLKNIYPEKFVPTKDVQGWGKVHALNSNICLDDLLQN---NEKPYNA 499
Query: 543 SLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISP--RVRILAC---SGFNR 597
L PC + +Q+F + +E C V E SP RV ++ C FN
Sbjct: 500 GLYPCGKVLQKSQLF--SFTNTNVLRNELSCATVQHSE---SPPYRVVMVPCMENDEFN- 553
Query: 598 QRWTYDKE 605
+ W Y+ +
Sbjct: 554 EHWRYEHQ 561
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/155 (41%), Positives = 95/155 (61%), Gaps = 4/155 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
++Y+ N +S+++ NR++ D R P C + F+ LP +S+VI+F NE +S LL
Sbjct: 107 GDEIYKKIALNEELSEQLSYNRSVGDHRNPLCAKQRFDFGSLPTASVVIIFFNEPYSVLL 166
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT---RVIRSPGRVGLI 164
RTVHS +S LKEI+LVDD S L + LD YV + +P ++R R+GLI
Sbjct: 167 RTVHSTLSTCNEKALKEIILVDDGSDNVELGAKLDYYV-RTRIPAGKVTILRLKNRLGLI 225
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ARL GAR A G++L+FLDAHCE + + E L +
Sbjct: 226 RARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQR 260
>gi|297264099|ref|XP_002798960.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Macaca mulatta]
Length = 375
Score = 311 bits (796), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 157/368 (42%), Positives = 220/368 (59%), Gaps = 40/368 (10%)
Query: 268 RVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVT 327
R GLI+ARL GA ++G+++ FLDAHCECTLGWLE L+AR+ EDR VVCP+ID+ISD T
Sbjct: 4 RSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLARIKEDRKTVVCPIIDVISDDT 63
Query: 328 FAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFH 387
F Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGGLF+IDR YF
Sbjct: 64 FEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDRNYFEE 123
Query: 388 IGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGN 447
IG YD M +WGGENLEMSFR+WQCGGS+EI CSHV H+FRKA+PY+FPGG V+ N
Sbjct: 124 IGTYDAGMDIWGGENLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYTFPGGTGHVINKN 183
Query: 448 LARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFL 507
R+A VWMDE+ +F++ +P K D V R LR+ LKC F WYL +++P +
Sbjct: 184 NRRLAEVWMDEFKDFFYIISPGVVKV-DYGDVSVRKTLRENLKCKPFSWYLENIYPDSQI 242
Query: 508 PMDDKFFGRIRHVQTHKCVEKPLAKGS--------------------------------- 534
P G IR+V+T++C++ K +
Sbjct: 243 PRRYYSLGEIRNVETNQCLDNMGRKENEKVGIFNCHGMGGNQVFSYTADKEIRTDDLCLD 302
Query: 535 MNQASGPASLLPCTHL--PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC 592
+++ +GP +L C H+ L + +L + +++ CLD P E+ + P ++ C
Sbjct: 303 VSRLNGPVIMLKCHHMRGNQLWEYDAERLTLRHVNSNQ--CLDEPSEEDKMVPTMQ--DC 358
Query: 593 SGFNRQRW 600
SG Q+W
Sbjct: 359 SGSRSQQW 366
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 30/40 (75%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
R GLI+ARL GA ++G+++ FLDAHCECTL + E L +
Sbjct: 4 RSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLLAR 43
>gi|334348070|ref|XP_001368069.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 4-like
[Monodelphis domestica]
Length = 708
Score = 311 bits (796), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 174/407 (42%), Positives = 244/407 (59%), Gaps = 16/407 (3%)
Query: 183 DAHCECTL-VFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
D EC L F+ LP +S++I F+NEAWS LLRTVHSV+ +P +LKEI+LVDD S
Sbjct: 249 DRMYECRLKSFDYRRLPTTSVIIAFYNEAWSTLLRTVHSVLETAPAVLLKEIILVDDLSD 308
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
+ +LK+ L+ Y++ L R+IR+ R GL++ARL+GA A GE+L FLD HCEC GWL
Sbjct: 309 KVYLKAQLETYISSLQR-VRLIRTKKREGLVRARLIGATFATGEVLTFLDCHCECNQGWL 367
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIK 360
E L+ R+ +D + ++CPVID I TF Y++ E G F+W L F+W + +
Sbjct: 368 EPLLERIGQDESVIICPVIDTIDWNTFDFYMQEGEPVIGGFDWHLTFQWQPVPEHERR-R 426
Query: 361 RKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
+ T+P K+P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG++EI P
Sbjct: 427 WQSRTDPIKSPVMAGGLFAVSKKYFEYLGTYDTGMEVWGGENLELSFRVWQCGGALEIHP 486
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
CSHV H+F K +PY+ P N R A VWMD++ E ++ NP A K+
Sbjct: 487 CSHVGHVFPKRAPYARPN-----FRQNTVRAAEVWMDDYKEHFYNRNPLARKESYGDVSE 541
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQ-THKCVEKPLAKGSMNQA 538
+L LRK+L C SF WYL V+P +P D + G IR V + +C++ + +A
Sbjct: 542 RKL-LRKRLNCKSFDWYLKTVFPALRVPEDRPGWHGAIRSVGISSECLDYKTPERDPTEA 600
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDL-IATDESVCLDVPEYENDIS 584
SL C H Q F L +L + +CL P+ ++ +
Sbjct: 601 H--LSLFGC-HGQGGNQFFEYTLKKELRFSVQTELCLVAPKQKDAVG 644
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 70/150 (46%), Positives = 102/150 (68%), Gaps = 1/150 (0%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
A++L + N+ +SDRI ++R + D R +C+ K F+ LP +S++I F+NEAWS LL
Sbjct: 223 AEELTEKYAINIYLSDRISLHRHIRDDRMYECRLKSFDYRRLPTTSVIIAFYNEAWSTLL 282
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTVHSV+ +P +LKEI+LVDD S + +LK+ L+ Y++ L R+IR+ R GL++AR
Sbjct: 283 RTVHSVLETAPAVLLKEIILVDDLSDKVYLKAQLETYISSLQR-VRLIRTKKREGLVRAR 341
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFL 197
L+GA A GE+L FLD HCEC + E L
Sbjct: 342 LIGATFATGEVLTFLDCHCECNQGWLEPLL 371
>gi|427789023|gb|JAA59963.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 648
Score = 311 bits (796), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 178/412 (43%), Positives = 248/412 (60%), Gaps = 22/412 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+++ FHNEAWS LLRTVHS+I RSP +L EI+LVDD S LK L++YVA
Sbjct: 199 LPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIILVDDYSDMPHLKQKLEDYVAHF 258
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++R+ R GLI+ARLLGA A +L +LD+HCECT GWLE L+ R+A + T VV
Sbjct: 259 P-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVV 317
Query: 317 CPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
CPVID+ISD TF Y S ++ G F+W L F W+ + +RK +P +P MA
Sbjct: 318 CPVIDVISDSTFEYHYRDSGGVNVGGFDWNLQFSWHAVPERERQ-RRKHSWDPVWSPTMA 376
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF+ID+A+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SPY
Sbjct: 377 GGLFSIDKAFFEKLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPY 436
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ GV+ VL N R+A VW+DE+ ++Y++ + D V +R LR LKC SF
Sbjct: 437 KWRSGVN-VLRRNSVRLAEVWLDEYKQYYYQRI--GDDLGDFGDVSARKRLRDNLKCRSF 493
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQT--HKCVEKPLAKGSMNQASGPASLLPCTHLPV 552
WY+ ++P F+P D G +R+ C++ P + +M++ P + PC H
Sbjct: 494 DWYVRTIYPELFVPGDAVASGEVRNKGQGGSSCLDSPSGRDNMHK---PVGMYPC-HGQG 549
Query: 553 LTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
Q +++ + I DE+ CLD + V + C G Q W YD
Sbjct: 550 GNQYWMLSKEGE-IRRDEA-CLDYAGSD------VILYPCHGSKGNQLWIYD 593
Score = 126 bits (316), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/151 (47%), Positives = 102/151 (67%), Gaps = 2/151 (1%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+Q + N FN +SD I ++R+LPDVR +CK + + ++ LP +S+++ FHNEAWS LLR
Sbjct: 160 KQGWDRNAFNQYISDMISLHRSLPDVRDSECKDERYLKD-LPSTSVIVCFHNEAWSVLLR 218
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHS+I RSP +L EI+LVDD S LK L++YVA +++R+ R GLI+ARL
Sbjct: 219 TVHSIIDRSPPKLLHEIILVDDYSDMPHLKQKLEDYVAHFP-KVKIVRAQKREGLIRARL 277
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA A +L +LD+HCECT + E L +
Sbjct: 278 LGAAAATAPVLTYLDSHCECTEGWLEPLLDR 308
>gi|432934421|ref|XP_004081934.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Oryzias latipes]
Length = 758
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 148/327 (45%), Positives = 214/327 (65%), Gaps = 8/327 (2%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C+ + LP +S++ F +E WS LLR+VHSV++RSP +LKEI+LVDD ST+++LK
Sbjct: 304 CSQAVVHDDLPSTSVIFCFVDEVWSTLLRSVHSVLNRSPPHLLKEIILVDDFSTKDYLKE 363
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
LD+Y+++ R++R R GLI+ARL GA A GE+L FLD+H EC +GWLE L+ R
Sbjct: 364 PLDKYMSQFP-KVRIVRLKERQGLIRARLAGAAVATGEVLTFLDSHVECNVGWLEPLLER 422
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT-- 365
+ DR +V CPVI++I+D +Y+ G F W L F W S+ I++ + T
Sbjct: 423 IYLDRRKVPCPVIEVINDKDMSYMLIDNFQRGIFKWPLVFGWNAL--SEDYIRKHNITVS 480
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+P + P MAGGLF+ID+ YF+ +G YD + VWGGEN+E+SF++W CGG IEI PCS V
Sbjct: 481 DPIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSRVG 540
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE---AEKQRDKQAVRSR 482
H+FR +PYSFP + + NLARVA VW+DE+ + ++ + + D + +
Sbjct: 541 HIFRGQNPYSFPKDRQKTVERNLARVAEVWLDEYKDLFYGHGYQHLLDKSVTDIGNLTEQ 600
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPM 509
+ELRK+LKC +FKWYL +V+P P+
Sbjct: 601 IELRKKLKCKNFKWYLENVYPELDAPL 627
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 69/143 (48%), Positives = 100/143 (69%), Gaps = 2/143 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN+ +SDRIPV+R +PD R C V +++ LP +S++ F +E WS LLR+VHSV++R
Sbjct: 282 FNVYLSDRIPVDRAIPDTRPEVCSQAVVHDD-LPSTSVIFCFVDEVWSTLLRSVHSVLNR 340
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +LKEI+LVDD ST+++LK LD+Y+++ R++R R GLI+ARL GA A G
Sbjct: 341 SPPHLLKEIILVDDFSTKDYLKEPLDKYMSQFP-KVRIVRLKERQGLIRARLAGAAVATG 399
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
E+L FLD+H EC + + E L +
Sbjct: 400 EVLTFLDSHVECNVGWLEPLLER 422
>gi|326920610|ref|XP_003206562.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Meleagris gallopavo]
Length = 509
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 165/418 (39%), Positives = 246/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V + LP +S++I FHNEA SALLRTV SV++R+P ++++EI+LVDD S
Sbjct: 64 HYRCTSVRYDADLPATSLIITFHNEARSALLRTVKSVLNRTPPNLIQEIILVDDFS---- 119
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 120 --SDPEDCQLLTKIPKVKCLRNIRREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQP 177
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 178 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 236
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ +TP +AGG+F I++++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 237 PTQSIRTPVIAGGIFVINKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 296
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + ++ R+
Sbjct: 297 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSYGSIADRV 355
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E R++L C SF+WYL V+P +P D G IR Q C+E + N +G +
Sbjct: 356 EQRRKLNCKSFQWYLEKVYPELKVPEKDLIPGIIR--QGGNCLESWAQDTTGNTLAGIGN 413
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
+ P +TQ + +D + + CL + + ++ + AC+ + RQ+W
Sbjct: 414 CKGTVNNPPVTQEWAF---SDPLIRQQDKCLSITSFST--GSQITLEACNQKDGRQKW 466
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/154 (40%), Positives = 94/154 (61%), Gaps = 12/154 (7%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C T V + LP +S++I FHNEA S
Sbjct: 33 LKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRC-TSVRYDADLPATSLIITFHNEARS 91
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGL 163
ALLRTV SV++R+P ++++EI+LVDD S S ++ +P + +R+ R GL
Sbjct: 92 ALLRTVKSVLNRTPPNLIQEIILVDDFS------SDPEDCQLLTKIPKVKCLRNIRREGL 145
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
I++R+ GA A +IL FLD+HCE N E+L
Sbjct: 146 IRSRVRGAEVATADILTFLDSHCE----VNSEWL 175
>gi|118403595|ref|NP_001072369.1| polypeptide N-acetylgalactosaminyltransferase 14 [Xenopus
(Silurana) tropicalis]
gi|111305707|gb|AAI21473.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Xenopus (Silurana)
tropicalis]
Length = 555
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 178/424 (41%), Positives = 250/424 (58%), Gaps = 24/424 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT + + LP +S++I FHNEA S LLRT+ SV++R+P ++ EILLVDD S
Sbjct: 98 HYRCTELHYQSDLPPTSVIITFHNEARSTLLRTIRSVLNRTPMHLIHEILLVDDFS---- 153
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGR-VGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
+LD+ +P R +R+ R GLI++R+ GA A+ +L FLD+HCE WL
Sbjct: 154 --DNLDDCRLLSKLPKVRCLRNEQREAGLIRSRVRGAGVAQAAVLTFLDSHCEVNKDWLP 211
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
L+ R+ ED TRVV PVIDII+ TFAY+ + G F+W LHF+W S++ KR
Sbjct: 212 PLLHRIKEDPTRVVSPVIDIINLDTFAYIAASSDLRGGFDWSLHFKWEQL-SAEQKAKRL 270
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
D TEP KTP +AGGLF I++++F H+G YD M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 271 DPTEPIKTPVIAGGLFVIEKSWFNHLGKYDTAMDIWGGENFEISFRVWMCGGSLEIIPCS 330
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSR 482
V H+FRK PY FP G + N R A VWMDE+ Y+ P A+ R ++ R
Sbjct: 331 RVGHVFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKNHYYAARPAAQG-RPYGDIQKR 389
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQASGP 541
L LR+ LKC SFKWYL +V+P +P + G IR Q +C+E +G +
Sbjct: 390 LSLRRTLKCRSFKWYLENVYPELQIPAESLSKSGIIR--QRQRCIESQKTEGPEPPS--- 444
Query: 542 ASLLPCTHLPVLTQMFVMKLPTDLIATDES-VCLDVPEYENDISP--RVRILAC-SGFNR 597
+L+PC+ L ++ + T + + +C+ V + + P +V +L C G +
Sbjct: 445 LNLVPCSSLKGVSPQSQEWVYTQVQQISQGPLCMSV----HTLFPGTQVVLLPCREGDGK 500
Query: 598 QRWT 601
QRW+
Sbjct: 501 QRWS 504
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 93/149 (62%), Gaps = 13/149 (8%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RIP +R + D R +C T++ + LP +S++I FHNEA S LLRT+
Sbjct: 74 YRLYAFNQRESERIPSDRAIKDTRHYRC-TELHYQSDLPPTSVIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGR-VGLIKARLL 169
SV++R+P ++ EILLVDD S +LD+ +P R +R+ R GLI++R+
Sbjct: 133 SVLNRTPMHLIHEILLVDDFS------DNLDDCRLLSKLPKVRCLRNEQREAGLIRSRVR 186
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFLP 198
GA A+ +L FLD+HCE N+++LP
Sbjct: 187 GAGVAQAAVLTFLDSHCE----VNKDWLP 211
>gi|170046214|ref|XP_001850669.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
gi|167869055|gb|EDS32438.1| polypeptide N-acetylgalactosaminyltransferase 2 [Culex
quinquefasciatus]
Length = 576
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 154/313 (49%), Positives = 215/313 (68%), Gaps = 7/313 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRT+ SV++RSP ++ EI+LVDD S +F + + +AK+
Sbjct: 142 LPPTSVIITFHNEARSTLLRTIVSVLNRSPEHLIHEIILVDDFS--DFPEDG--QELAKI 197
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VIR+ R GL+++R+ GA A ++L FLD+HCEC + WLE L+ RV ED TRVV
Sbjct: 198 H-KVKVIRNEKREGLVRSRVKGADVATAKLLTFLDSHCECNVDWLEPLLVRVQEDPTRVV 256
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W +++ ++KD T P +TP +AGG
Sbjct: 257 CPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSNAERHERQKDPTTPIRTPMIAGG 316
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF ID+AYF +G YD +M +WGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 317 LFVIDKAYFEKLGKYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTF 376
Query: 437 PGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG S ++ N R A VWMD++ ++Y+ P A K + RL+L++QL+C +FK
Sbjct: 377 PGGGSGNIFAKNTRRAAEVWMDDYKQYYYAAVPLA-KNIPFGNIDERLQLKEQLECKNFK 435
Query: 496 WYLTHVWPHHFLP 508
WYL +V+P +P
Sbjct: 436 WYLDNVYPQLTIP 448
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 96/155 (61%), Gaps = 6/155 (3%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
+L + Y NRFN SD + NR LPD R P C+ K + LP +S++I FHNEA
Sbjct: 97 GSLQQGEDPYLRNRFNQQASDSLKSNRELPDTRNPMCRRK-WPSYSLPPTSVIITFHNEA 155
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
S LLRT+ SV++RSP ++ EI+LVDD S +F + + +AK+ +VIR+ R G
Sbjct: 156 RSTLLRTIVSVLNRSPEHLIHEIILVDDFS--DFPEDG--QELAKIH-KVKVIRNEKREG 210
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
L+++R+ GA A ++L FLD+HCEC + + E L
Sbjct: 211 LVRSRVKGADVATAKLLTFLDSHCECNVDWLEPLL 245
>gi|196001847|ref|XP_002110791.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
gi|190586742|gb|EDV26795.1| hypothetical protein TRIADDRAFT_22565 [Trichoplax adhaerens]
Length = 556
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 154/333 (46%), Positives = 217/333 (65%), Gaps = 7/333 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VIVF+NEAWS L+RTVHSV+ RSP +L E++LVDD+S L L+EYV +L
Sbjct: 112 LPSTSVVIVFYNEAWSTLMRTVHSVLDRSPPDLLHEVILVDDSSDE--LHQPLEEYVRQL 169
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+ R+ R GLI+ARL G Q I+ FLDAHCE T+GWLE L+ R+ +DRT VV
Sbjct: 170 D-KVRLHRNSQREGLIRARLRGLEQTSAPIVTFLDAHCEVTIGWLEPLLNRIHQDRTTVV 228
Query: 317 CPVIDIISDVTFAYVRSFE-LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CP ID I FAY + G FNW+L F+W +S+ + +R T+P ++P MAG
Sbjct: 229 CPEIDSIDLNNFAYKYGPSGVLRGTFNWDLSFKWSIAPTSERL-RRTSATDPMRSPTMAG 287
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR YF +G YD +++WG EN+E+SF+VWQCGG +EI PCSHV H+FR+ PY
Sbjct: 288 GLFAIDREYFLELGTYDRGLEIWGAENMELSFKVWQCGGKLEIIPCSHVGHVFREVQPYD 347
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
+ + N RVA VWMD++ +F+++ +P Q + L+LR++LKC SF+
Sbjct: 348 TSVSLHSIANKNYQRVAEVWMDDYKKFFYQRHPYLTDQSFGD-ISENLKLRQRLKCRSFR 406
Query: 496 WYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVE 527
WYL +V+ LP + G++R+ ++ C++
Sbjct: 407 WYLQNVFTDVILPNETAIATGKVRNPISNMCLD 439
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/150 (50%), Positives = 101/150 (67%), Gaps = 4/150 (2%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
+++L+Q N FN VSDRI + RTLPD R CK+ + + LP +S+VIVF+NEAWS L+
Sbjct: 72 SEELFQRNHFNQWVSDRISLQRTLPDPRPEMCKSMTYPVD-LPSTSVVIVFYNEAWSTLM 130
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTVHSV+ RSP +L E++LVDD+S + L L+EYV +L R+ R+ R GLI+AR
Sbjct: 131 RTVHSVLDRSPPDLLHEVILVDDSS--DELHQPLEEYVRQLD-KVRLHRNSQREGLIRAR 187
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFL 197
L G Q I+ FLDAHCE T+ + E L
Sbjct: 188 LRGLEQTSAPIVTFLDAHCEVTIGWLEPLL 217
>gi|307203928|gb|EFN82835.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Harpegnathos saltator]
Length = 482
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 179/421 (42%), Positives = 249/421 (59%), Gaps = 39/421 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S LK L++Y+ +
Sbjct: 5 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLKRQLEDYM--M 62
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P R+IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 63 NYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 122
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 123 VCPVIDVIDDTTLEY------HWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEP 175
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF+IDR +F IG YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 176 VWSPTMAGGLFSIDRVFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 235
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK SPY + GV+ VL N R++ VW+DE+A++Y++ + + V R LRK
Sbjct: 236 FRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQR--IGHDKGNYGDVSERKTLRK 292
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLL 545
+L C SFKWYL +V+P F+P + G +R++ + C++ P K +++ P L
Sbjct: 293 KLGCKSFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PCGLY 349
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFNRQRWTYD 603
PC H Q+ + +C+D P D+ V + C G N Q W
Sbjct: 350 PC-HRQGGNQIRQVA---------SGICIDSPGKSEDLHQPVGLYPCHRQGGN-QYWMLS 398
Query: 604 K 604
K
Sbjct: 399 K 399
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 95/157 (60%), Gaps = 25/157 (15%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S LK L++Y+ +
Sbjct: 5 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLKRQLEDYM--M 62
Query: 149 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFH 207
+ P R+IR+ R GLI+ARLLGA A+ +L +LD+HCECT
Sbjct: 63 NYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECT------------------ 104
Query: 208 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
E W L + I+R P +++ ++ V D +T E+
Sbjct: 105 -EGW---LEPLLDRIARDPTTVVCPVIDVIDDTTLEY 137
>gi|242020636|ref|XP_002430758.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212515955|gb|EEB18020.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 623
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 216/323 (66%), Gaps = 5/323 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP +S++I F+NE ++ LLR+++SV+ R+P +LKEI+LV+D S L ++ YV
Sbjct: 157 LPTASVIICFYNEHFTTLLRSIYSVLERTPSYLLKEIILVNDFSDLAGLHRNISNYVNTN 216
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ ++ +S R+GLI+AR+ G+R+A G++LVFLD+H E + WL+ L++R+ + + V
Sbjct: 217 FTDKVKLFKSKKRLGLIRARIFGSRKASGDVLVFLDSHIEVNVNWLQPLLSRIVDSKKNV 276
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
V P+IDII+ TF Y S L G FNW LHF+W S + +DF +P +P MAG
Sbjct: 277 VVPIIDIINADTFKYSSS-PLVRGGFNWGLHFKWENLPKS-TLKSNEDFVKPILSPTMAG 334
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAI+RAYF +G YD M +WGGENLE+SFR+W CGG++E+ PCS V H+FRK PY
Sbjct: 335 GLFAINRAYFKELGEYDNGMNIWGGENLEISFRIWMCGGNLELIPCSRVGHVFRKRRPYG 394
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + N RVA VWMD++ EF++K +PE K + RL+LRK+L CHSF+
Sbjct: 395 SPNG-EDTMMRNSLRVANVWMDDYKEFFYKQHPEG-KTFPFGDISDRLKLRKKLHCHSFE 452
Query: 496 WYLTHVWPHHFLPMDDKFFGRIR 518
WYL +++P LP D++ +I+
Sbjct: 453 WYLQNIYPELILPSDNEQKSKIK 475
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 58/135 (42%), Positives = 93/135 (68%), Gaps = 2/135 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN+LVSD I ++R LPD R CK K +++ LP +S++I F+NE ++ LLR+++SV+ R
Sbjct: 126 FNILVSDAIGIHRELPDTRHNLCKKKKYSKN-LPTASVIICFYNEHFTTLLRSIYSVLER 184
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAE 175
+P +LKEI+LV+D S L ++ YV + ++ +S R+GLI+AR+ G+R+A
Sbjct: 185 TPSYLLKEIILVNDFSDLAGLHRNISNYVNTNFTDKVKLFKSKKRLGLIRARIFGSRKAS 244
Query: 176 GEILVFLDAHCECTL 190
G++LVFLD+H E +
Sbjct: 245 GDVLVFLDSHIEVNV 259
>gi|55742075|ref|NP_001006904.1| polypeptide N-acetylgalactosaminyltransferase 11 [Xenopus
(Silurana) tropicalis]
gi|49522064|gb|AAH75106.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 11 (GalNAc-T11)
[Xenopus (Silurana) tropicalis]
Length = 563
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 183/427 (42%), Positives = 258/427 (60%), Gaps = 29/427 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SIVI F+NEA+SALLRTVHSV+ R+P +L EI+LVDD S + LK LD Y+ +
Sbjct: 105 LPMASIVICFYNEAFSALLRTVHSVLDRTPAQLLHEIILVDDNSELDDLKKDLDGYMQEN 164
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
LS +++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+A + E+ V
Sbjct: 165 LSKKVKLVRNKQREGLIRGRMVGASHATGDVLVFLDSHCEVNEMWLQPLLAPIKENPRTV 224
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W ++ + + F+ PF++P MAG
Sbjct: 225 VCPVIDIISADTLIYSSSPVVR-GGFNWGLHFKWDPVPLAE-LGGPEGFSAPFRSPTMAG 282
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD M +WGGENLE+SFR+W CGGS+ I PCS V H+FRK PY
Sbjct: 283 GLFAMDREYFNMLGQYDSGMDIWGGENLEISFRIWMCGGSLLIVPCSRVGHIFRKRRPYG 342
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG + + +L R+A VWMDE+ + YF PE + RD +R RL LR++L C SFK
Sbjct: 343 SPGGHDTMAHNSL-RLAHVWMDEYKDQYFALRPEL-RNRDFGDIRERLALRRRLNCKSFK 400
Query: 496 WYLTHVWPH------HFLPMDDKFF------------GRIRHVQTHKCVEKPLAKGSMNQ 537
WYL +++P + P F GR+ ++QT++C+ +A+G +Q
Sbjct: 401 WYLDNIYPEMQVSGPNAKPQPPVFMNKGQKRPKILQRGRLINMQTNRCL---VAQGHPSQ 457
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR 597
G C + Q++ +LI ++ +CLD+ E + PR+ SG
Sbjct: 458 KGGLVVAKECDYNDS-EQVWSYNEEHELILSN-LLCLDMSETRSSDPPRLMKCHGSG-GS 514
Query: 598 QRWTYDK 604
Q+W + K
Sbjct: 515 QQWVFGK 521
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 79/191 (41%), Positives = 117/191 (61%), Gaps = 24/191 (12%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FNLL+S+R+ +R +PD R KC K + + LP +SIVI F+NEA+SALLRTVH
Sbjct: 69 YQKHAFNLLISNRLGYHRDVPDTRDSKCAKKTYPPD-LPMASIVICFYNEAFSALLRTVH 127
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + LK LD Y+ + LS +++R+ R GLI+ R++G
Sbjct: 128 SVLDRTPAQLLHEIILVDDNSELDDLKKDLDGYMQENLSKKVKLVRNKQREGLIRGRMVG 187
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSML 230
A A G++LVFLD+HCE NE W L+ + + I +PR+++
Sbjct: 188 ASHATGDVLVFLDSHCEV-------------------NEMW---LQPLLAPIKENPRTVV 225
Query: 231 KEILLVDDAST 241
++ + A T
Sbjct: 226 CPVIDIISADT 236
>gi|167536139|ref|XP_001749742.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771890|gb|EDQ85551.1| predicted protein [Monosiga brevicollis MX1]
Length = 1275
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 157/362 (43%), Positives = 229/362 (63%), Gaps = 11/362 (3%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
+C V LP ++++I F NEAWSAL RTV SV+ R+P ++L EI+L+DDAS +L+
Sbjct: 239 QCKDVAYPPDLPAATVIICFVNEAWSALFRTVWSVLDRTPENLLHEIILLDDASDASWLQ 298
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L+E + +L +++RSP R+GLI+ARLLGA+ A + ++FLD+HCE +GW++ L+A
Sbjct: 299 QPLEEELQRLPAKVKLVRSPRRLGLIRARLLGAKHATADYMIFLDSHCEANVGWIQPLLA 358
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKDFT 365
+A D +RVV PVID I++ +Y + G F+W L F W A + T
Sbjct: 359 WMAGDPSRVVTPVIDSINNNDMSYHGAGGASSRGTFHWTLDFSWEANPEPVAQV-----T 413
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+P K+P MAGGLF I+R YF+ +G+YD+ M WGGENLEMSFRVWQCGGS+ I PCSHV
Sbjct: 414 DPVKSPTMAGGLFGINRQYFYDVGSYDQGMDGWGGENLEMSFRVWQCGGSLHILPCSHVG 473
Query: 426 HLFRKASPYSFPGG-VSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FR + PY+ P +++ N R+A WMD++ E +++ P A K D V RL
Sbjct: 474 HIFRDSHPYTIPNSTINDTFLRNSIRLAETWMDDYKEIFYQIRPSARKV-DHGDVGERLA 532
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT-HKCVEKPLAKGSMNQASGPAS 543
LR++L+C SF WYL ++ P+ +P D R + + + C++K A S+ PAS
Sbjct: 533 LREKLQCKSFSWYLENIQPNKLIPSADFVLARGQVKNSLNVCLDKGAA--SLVYPCHPAS 590
Query: 544 LL 545
+L
Sbjct: 591 VL 592
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 60/141 (42%), Positives = 96/141 (68%), Gaps = 1/141 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN S ++ ++R +PD R +CK + + LP ++++I F NEAWSAL RTV SV+ R
Sbjct: 218 FNEYRSSQLSLHRDVPDARPMQCKDVAYPPD-LPAATVIICFVNEAWSALFRTVWSVLDR 276
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P ++L EI+L+DDAS +L+ L+E + +L +++RSP R+GLI+ARLLGA+ A
Sbjct: 277 TPENLLHEIILLDDASDASWLQQPLEEELQRLPAKVKLVRSPRRLGLIRARLLGAKHATA 336
Query: 177 EILVFLDAHCECTLVFNEEFL 197
+ ++FLD+HCE + + + L
Sbjct: 337 DYMIFLDSHCEANVGWIQPLL 357
>gi|62148928|dbj|BAD93348.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase-4 [Rattus
norvegicus]
Length = 578
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 213/320 (66%), Gaps = 9/320 (2%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LP +S++I F+NEAWS LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+
Sbjct: 129 FHYRSLPTTSVIIAFYNEAWSTLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQLEA 188
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y++ L R+ R+ R GL++ARL+GA A G++L FLD HCEC GWLE L+ R++ D
Sbjct: 189 YISNLDR-VRLTRTNKREGLVRARLIGATFATGDVLTFLDCHCECNTGWLEPLLERISRD 247
Query: 312 RTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
T +VCPVID I TF Y+++ E G F+W L F+W++ + +R +P ++
Sbjct: 248 ETAIVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWHSVPKHERD-RRTSRIDPIRS 306
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG +EI PCSHV H+F K
Sbjct: 307 PTMAGGLFAVSKKYFQYLGTYDTGMEVWGGENLELSFRVWQCGGKLEIHPCSHVGHVFSK 366
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PY+ P + N AR A VWMD++ E ++ NP A K+ + R LR++L+
Sbjct: 367 RAPYARPNFLQ-----NTAREAEVWMDDYKEHFYNRNPPARKE-TYDDISERKLLRERLQ 420
Query: 491 CHSFKWYLTHVWPHHFLPMD 510
C SF WYL +V+ + +P D
Sbjct: 421 CKSFDWYLKNVFSNLHVPED 440
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 71/157 (45%), Positives = 103/157 (65%), Gaps = 1/157 (0%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++L + N+ +SDRI ++R + D R +CK K F+ LP +S++I F+NEAWS
Sbjct: 90 LKQQEELIERYAINIYLSDRISLHRHIEDKRMYECKAKKFHYRSLPTTSVIIAFYNEAWS 149
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+HSV+ SP +LKEI+LVDD S R +LK+ L+ Y++ L R+ R+ R GL+
Sbjct: 150 TLLRTIHSVLETSPAVLLKEIILVDDLSDRIYLKAQLEAYISNLDR-VRLTRTNKREGLV 208
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ARL+GA A G++L FLD HCEC + E L + S
Sbjct: 209 RARLIGATFATGDVLTFLDCHCECNTGWLEPLLERIS 245
>gi|241746527|ref|XP_002414286.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215508140|gb|EEC17594.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 493
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 159/329 (48%), Positives = 212/329 (64%), Gaps = 11/329 (3%)
Query: 185 HCECT---LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 241
H +C L+ +E LP +S+VI FHNEA SALLRT+ SV++RSP +++EI+LVDD S
Sbjct: 105 HPQCAKEGLLKPQEELPATSVVITFHNEARSALLRTIVSVLNRSPAELIEEIILVDDFSD 164
Query: 242 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
S E +AK+ R++R+ R GL+++R+ GAR A+ +L FLD+HCEC GWL
Sbjct: 165 ----DPSDGEELAKIQ-KIRLLRNTQREGLVRSRVRGARAAKAPVLTFLDSHCECNQGWL 219
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
L+ RV ED RVVCPVID+I+ +F Y + G FNW L F+W + + +
Sbjct: 220 PPLLRRVKEDPRRVVCPVIDVINLESFKYFGASSDLRGGFNWNLVFKWEFLSNKEREERA 279
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
+ T P +TP +AGGLF +DRA F +GAYD M +WGGENLE+SFR WQCGGS+EI PC
Sbjct: 280 NNPTLPIRTPMIAGGLFVVDRAQFERLGAYDTAMDIWGGENLELSFRAWQCGGSLEILPC 339
Query: 422 SHVAHLFRKASPYSFPGGVSEVL--YGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
S V H+FRK PYSFPGG V N R A VWMD++ ++Y+ P A + +V
Sbjct: 340 SRVGHVFRKQHPYSFPGGSGNVFARQANTRRAAEVWMDDYKKYYYATVPVA-RNVPMGSV 398
Query: 480 RSRLELRKQLKCHSFKWYLTHVWPHHFLP 508
RL LRK L CHSF+WYL +V+P +P
Sbjct: 399 EERLNLRKSLGCHSFQWYLDNVYPELKVP 427
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 97/149 (65%), Gaps = 11/149 (7%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVF--NEEFLPKSSIVIVFHNEAWSALLRT 109
YQ ++FN SD + NR +PD R P+C + +E LP +S+VI FHNEA SALLRT
Sbjct: 81 YQRHKFNQEASDALASNRAIPDTRHPQCAKEGLLKPQEELPATSVVITFHNEARSALLRT 140
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
+ SV++RSP +++EI+LVDD S S E +AK+ R++R+ R GL+++R+
Sbjct: 141 IVSVLNRSPAELIEEIILVDDFSD----DPSDGEELAKIQ-KIRLLRNTQREGLVRSRVR 195
Query: 170 GARQAEGEILVFLDAHCECTLVFNEEFLP 198
GAR A+ +L FLD+HCEC N+ +LP
Sbjct: 196 GARAAKAPVLTFLDSHCEC----NQGWLP 220
>gi|108935842|sp|Q8BVG5.2|GLT14_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 14;
AltName: Full=Polypeptide GalNAc transferase 14;
Short=GalNAc-T14; Short=pp-GaNTase 14; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 14;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 14
Length = 550
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 183/452 (40%), Positives = 251/452 (55%), Gaps = 28/452 (6%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQLSLEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGG +EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L C +FKW L +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERPFGNIENRLNLRKNLHCQTFKWNLENVYPELRVPPDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E + L PC + +Q++ +I
Sbjct: 420 QKGNIR--QRQKCLE-----SQKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQ-- 470
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G RQ WT
Sbjct: 471 EELCLSVVTLFPG-APVVLALCKNGDERQLWT 501
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRMRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|26347119|dbj|BAC37208.1| unnamed protein product [Mus musculus]
Length = 550
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 183/452 (40%), Positives = 251/452 (55%), Gaps = 28/452 (6%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 68 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPHTSIIITFHNEARSTL 127
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 128 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 181
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRS 333
+R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y+ S
Sbjct: 182 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFNYIES 241
Query: 334 FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDE 393
G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G YD
Sbjct: 242 ASELRGGFDWSLHFQWEQLSLEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGKYDV 300
Query: 394 EMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVAL 453
+M +WGGEN E+SFRVW CGG +EI PCS V H+FRK PY FP G + N R A
Sbjct: 301 DMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKRTAE 360
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF 513
VWMDE+ ++Y+ P A +R + +RL LRK L C +FKW L +V+P +P D
Sbjct: 361 VWMDEYKQYYYAARPFA-LERHFGNIENRLNLRKNLHCQTFKWNLENVYPELRVPPDSSI 419
Query: 514 -FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLIATD 569
G IR Q KC+E + L PC + +Q++ +I
Sbjct: 420 QKGNIR--QRQKCLE-----SQKQKKQEILRLSPCAKVKGDGAKSQVWAFTYTQQIIQ-- 470
Query: 570 ESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G RQ WT
Sbjct: 471 EELCLSVVTLFPG-APVVLALCKNGDERQLWT 501
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 74 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPHTSIIITFHNEARSTLLRTIR 132
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 133 SVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRMRG 186
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 187 ADIAQGTTLTFLDSHCE 203
>gi|345319818|ref|XP_001521442.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Ornithorhynchus anatinus]
Length = 628
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 192 LPATSVVITFHNEARSALLRTVASVLKKSPPHLVKEIILVDDYSN-DPEDGALLGKIEKV 250
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ +L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 251 ----RVLRNDRREGLMRSRVRGADAAQARVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 306
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 307 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 366
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D++YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 367 LFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTF 426
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 427 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKRLSCKPFKW 485
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 486 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 515
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 97/150 (64%), Gaps = 10/150 (6%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 152 GQDPYARNKFNQVESDKLRMDRAVPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 210
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 211 RTVASVLKKSPPHLVKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 265
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ GA A+ +L FLD+HCEC NE +L
Sbjct: 266 VRGADAAQARVLTFLDSHCEC----NEHWL 291
>gi|395531657|ref|XP_003767891.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Sarcophilus harrisii]
Length = 542
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 214/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 106 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 164
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 165 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 220
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 221 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 280
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 281 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYSF 340
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 341 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 399
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q + C++
Sbjct: 400 YLENVYPELRVPDHQDIAFGALQ--QGNNCLD 429
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 66 GQDPYARNKFNQVESDKLRMDRNIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 124
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 125 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 179
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 180 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 211
>gi|307207692|gb|EFN85329.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Harpegnathos
saltator]
Length = 598
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 155/338 (45%), Positives = 213/338 (63%), Gaps = 10/338 (2%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + + LP S++I+F+NE WS LLRTVHSV+ S +LKEI+LVDD S E L+
Sbjct: 133 CANLTYDTLLPSVSVIIIFYNEPWSVLLRTVHSVLKGSLPHLLKEIILVDDHSEEEELQG 192
Query: 248 SLDEYVA-KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD Y++ +L +++R P R GLI+ARL GA+ A G++LVFLDAHCE WL+ L+
Sbjct: 193 QLDYYLSTRLPTKVKLLRLPYRQGLIRARLHGAKNATGDVLVFLDAHCEVIKDWLQPLLQ 252
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHW---GAFNWELHFRWYTYGSSDAIIKRKD 363
R+ E R V+ P+ID IS+ T Y E + G F W HF W + + R
Sbjct: 253 RIKEKRNAVLMPIIDNISEETLEYFHDNEASFFQVGGFTWSGHFTWINIQKHE-LKSRLS 311
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
P ++P MAGGLFAIDR YF+ +G+YD++M WGGENLEMSFR+WQCGG++EI PCS
Sbjct: 312 LISPTRSPTMAGGLFAIDRKYFWEVGSYDDKMDGWGGENLEMSFRIWQCGGTLEIIPCSR 371
Query: 424 VAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA-VRS 481
V H+FR PY FP + +G N AR+A VWMDE+ + E + + +
Sbjct: 372 VGHIFRNFHPYKFPN--DKDTHGINTARLAFVWMDEYKRLFLLHRSEFKNKSSLFGDISE 429
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIR 518
RL+LR++LKC SFKWYL +++P F+P + +GR+R
Sbjct: 430 RLKLRRKLKCKSFKWYLDNIYPEKFIPDEHAIAYGRVR 467
Score = 132 bits (331), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 79/185 (42%), Positives = 115/185 (62%), Gaps = 7/185 (3%)
Query: 6 VVQLLKLARNQQDYIDRRGVHVVVGHYLGDSVDGG-LHSNLS-DAQQLYQINRFNLLVSD 63
+Q LK R +Q + + VVVG LG++ + LH + + + N+++S+
Sbjct: 61 ALQRLKKERYEQQIL-KLEYDVVVG--LGENGEPAYLHGKEKVEGETVLAKKALNVVLSN 117
Query: 64 RIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLK 123
+I + R LPDVR P C + + LP S++I+F+NE WS LLRTVHSV+ S +LK
Sbjct: 118 KISLTRKLPDVRNPLCANLTY-DTLLPSVSVIIIFYNEPWSVLLRTVHSVLKGSLPHLLK 176
Query: 124 EILLVDDASTREFLKSSLDEYVA-KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFL 182
EI+LVDD S E L+ LD Y++ +L +++R P R GLI+ARL GA+ A G++LVFL
Sbjct: 177 EIILVDDHSEEEELQGQLDYYLSTRLPTKVKLLRLPYRQGLIRARLHGAKNATGDVLVFL 236
Query: 183 DAHCE 187
DAHCE
Sbjct: 237 DAHCE 241
>gi|432950788|ref|XP_004084611.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 11-like [Oryzias
latipes]
Length = 574
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 181/445 (40%), Positives = 246/445 (55%), Gaps = 46/445 (10%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
+C + LP +S+VI F NEA SALLRTVHSV+ R+P +L EI+LVDD S E LK
Sbjct: 107 QCRKRSYPQALPSASVVICFFNEALSALLRTVHSVLDRTPAYLLHEIILVDDQSELEELK 166
Query: 247 SSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
LD V +L R++R+ R GLI+ R++GA A G++LVFLD+HCE WL+ L+
Sbjct: 167 EGLDRCVREELQGKVRLVRNRKREGLIRGRMIGAAHATGDVLVFLDSHCEVNQDWLQPLL 226
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
A + +DR VVCP+IDIIS T Y S + G FNW LHF+W S+ I +
Sbjct: 227 APIQKDRRTVVCPIIDIISADTLTYSSS-PIVRGGFNWGLHFKWDPVPPSE-ISGPEGAA 284
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
P ++P MAGGLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V
Sbjct: 285 GPIRSPTMAGGLFAMNREYFNELGRYDPGMDIWGGENLEISFRIWMCGGQLLIIPCSRVG 344
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
H+FRK PY PGG + + +L R+A VWMDE+ E Y PE + R + R+ L
Sbjct: 345 HIFRKRRPYGSPGGQDTMAHNSL-RLAHVWMDEYKEQYLSLRPEL-RNRSYGDISERVAL 402
Query: 486 RKQLKCHSFKWYLTHVWPHH------------FLPMDDKF-----FGRIRHVQTHKCVEK 528
RK+L+C SF+WYL V+P F+ K GR+R++ +C+
Sbjct: 403 RKRLQCRSFRWYLDTVYPEMQAVASGNRPPPLFVNKGLKRPKVLQRGRLRNLAVGRCL-- 460
Query: 529 PLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDES--------VCLDVPEYE 580
A+G +Q G + PC + P + DE +CLDV E
Sbjct: 461 -TAQGRASQKGGAVVVRPCDP----------RDPEQEWSYDEEGQLVLAGLLCLDVSEVR 509
Query: 581 NDISPRVRILACSG-FNRQRWTYDK 604
P R++ C G Q+W+ K
Sbjct: 510 TFDPP--RLMKCHGSGGSQQWSLAK 532
Score = 128 bits (322), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 67/137 (48%), Positives = 94/137 (68%), Gaps = 2/137 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN+L+S+R+ +R LPD R +C+ + + + LP +S+VI F NEA SALLRTVH
Sbjct: 81 YHRHAFNVLISNRLGSHRELPDTRDKQCRKRSY-PQALPSASVVICFFNEALSALLRTVH 139
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S E LK LD V +L R++R+ R GLI+ R++G
Sbjct: 140 SVLDRTPAYLLHEIILVDDQSELEELKEGLDRCVREELQGKVRLVRNRKREGLIRGRMIG 199
Query: 171 ARQAEGEILVFLDAHCE 187
A A G++LVFLD+HCE
Sbjct: 200 AAHATGDVLVFLDSHCE 216
>gi|326427851|gb|EGD73421.1| GALNT4 protein [Salpingoeca sp. ATCC 50818]
Length = 537
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 161/315 (51%), Positives = 209/315 (66%), Gaps = 6/315 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S++++F+NEA S LLRTV SV+ RSPRS++KEILLVDD S+ L LD+ VA +
Sbjct: 204 LPTVSVILIFYNEARSTLLRTVWSVLDRSPRSLIKEILLVDDHSSMPHLGYPLDQEVAGI 263
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
TRVIR P R GLI+A++ GA+QA G++LV+LD+HCE GWLE L+ R+ +R V
Sbjct: 264 P-KTRVIRLPERSGLIRAKVYGAQQARGDVLVYLDSHCEVNDGWLEPLLDRIRRNRKTVA 322
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID I T+ + R+ L G F+W L F+W ++D R D T+PF +PAMAGG
Sbjct: 323 MPIIDAIDYETWEH-RTGLLERGIFDWSLVFKWKQL-TADDKRGRPDDTDPFASPAMAGG 380
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFA+DR YFF +GAYD M+ WGGEN+EMS RVW CGG IE PCSHVAH+FRK +PY F
Sbjct: 381 LFAMDRKYFFEVGAYDMGMETWGGENIEMSMRVWACGGRIEALPCSHVAHVFRKKTPYEF 440
Query: 437 P-GGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
E + NL RVA VWMDE+ + Y+ ++ V R +LR+QL C SFK
Sbjct: 441 KTKDPQETIARNLNRVAEVWMDEYKDVYYAVT--HNRKYGYGDVSQRKQLRQQLGCKSFK 498
Query: 496 WYLTHVWPHHFLPMD 510
WYL +V+ LP D
Sbjct: 499 WYLDNVFYDLELPYD 513
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 71/155 (45%), Positives = 103/155 (66%), Gaps = 2/155 (1%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKC-KTKVFNEEFLPKSSIVIVFHNEA 102
++ + + Y+ N+FN +SDR+ ++R D R +C K + LP S++++F+NEA
Sbjct: 158 DIKEYNEGYKNNQFNQWISDRLSLHRRAYDTRPVECLHKKYYPLSELPTVSVILIFYNEA 217
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
S LLRTV SV+ RSPRS++KEILLVDD S+ L LD+ VA + TRVIR P R G
Sbjct: 218 RSTLLRTVWSVLDRSPRSLIKEILLVDDHSSMPHLGYPLDQEVAGIP-KTRVIRLPERSG 276
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI+A++ GA+QA G++LV+LD+HCE + E L
Sbjct: 277 LIRAKVYGAQQARGDVLVYLDSHCEVNDGWLEPLL 311
>gi|332019618|gb|EGI60096.1| Putative polypeptide N-acetylgalactosaminyltransferase 9
[Acromyrmex echinatior]
Length = 566
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 166/362 (45%), Positives = 231/362 (63%), Gaps = 26/362 (7%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S LK L++Y+ +
Sbjct: 89 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLKRQLEDYM--M 146
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P R+IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 147 NYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 206
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 207 VCPVIDVIDDTTLEY------HWRDSSGVNVGGFDWNLQFNWHAVPERERK-RHKNPAEP 259
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF+IDRA+F IG YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 260 VWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 319
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK SPY + GV+ VL N R++ VW+DE+A++Y++ + + + R LRK
Sbjct: 320 FRKRSPYKWRNGVN-VLKRNSIRLSEVWLDEYAKYYYQR--IGHDKGNYGDISERKALRK 376
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT--HKCVEKPLAKGSMNQASGPASLL 545
+L C SFKWYL +V+P F+P + G +R++ + C++ P K +++ P L
Sbjct: 377 KLGCKSFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PCGLY 433
Query: 546 PC 547
PC
Sbjct: 434 PC 435
Score = 99.0 bits (245), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 103/177 (58%), Gaps = 26/177 (14%)
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
+ P + +P CK + LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LV
Sbjct: 70 KRFPTLFEP-CKESGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILV 128
Query: 129 DDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
DD S LK L++Y+ ++ P R+IR+ R GLI+ARLLGA A+ +L +LD+HCE
Sbjct: 129 DDFSDMPHLKRQLEDYM--MNYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCE 186
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
CT E W L + I+R P +++ ++ V D +T E+
Sbjct: 187 CT-------------------EGW---LEPLLDRIARDPTTVVCPVIDVIDDTTLEY 221
>gi|410956556|ref|XP_003984908.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Felis catus]
Length = 601
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|321477075|gb|EFX88034.1| hypothetical protein DAPPUDRAFT_305669 [Daphnia pulex]
Length = 553
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 153/319 (47%), Positives = 205/319 (64%), Gaps = 6/319 (1%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C + + LP +S++I FHNEA S LLRT+ SV++RSP ++KEI+LVDD S
Sbjct: 102 HKKCLDLEFSKDLPSTSVIITFHNEARSTLLRTIVSVLNRSPSHLIKEIILVDDFSND-- 159
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
S E V V ++R+ R GL+++R+ GA A GE L FLD+HCEC GWLE L
Sbjct: 160 -ASDGRELVQIEKVI--LVRNSKREGLVRSRVKGAEIATGEFLTFLDSHCECNEGWLEPL 216
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ARV EDRTR+VCPVID+I+ +F Y+ + G F+W L F+W + + ++ D
Sbjct: 217 LARVVEDRTRIVCPVIDVIAMDSFQYIAASTELRGGFDWNLVFKWELLPAEEKANRKTDP 276
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T P +TP +AGGLF IDR YF +G+YD +M +WGGENLE+SFR WQCGG +EI PCS V
Sbjct: 277 TIPIRTPMIAGGLFVIDRQYFQKLGSYDLQMDIWGGENLEISFRTWQCGGRLEIVPCSRV 336
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PYSFPGG + N R A VWMD++ ++YF P A + + RL
Sbjct: 337 GHVFRKQHPYSFPGGSGTIFARNTRRAAEVWMDDYKKYYFAAVPMA-RTVTFGNITDRLA 395
Query: 485 LRKQLKCHSFKWYLTHVWP 503
LR L C FKWY+ +V+P
Sbjct: 396 LRNSLNCKPFKWYVENVYP 414
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 92/155 (59%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y N+FN SD + NR +PD R KC F+++ LP +S++I FHNEA S
Sbjct: 71 LKPGEDAYHNNKFNQEASDTLESNRAIPDYRHKKCLDLEFSKD-LPSTSVIITFHNEARS 129
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP ++KEI+LVDD S S E V V ++R+ R GL+
Sbjct: 130 TLLRTIVSVLNRSPSHLIKEIILVDDFSND---ASDGRELVQIEKVI--LVRNSKREGLV 184
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A GE L FLD+HCEC + E L +
Sbjct: 185 RSRVKGAEIATGEFLTFLDSHCECNEGWLEPLLAR 219
>gi|328785249|ref|XP_393950.3| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis mellifera]
Length = 635
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 180/416 (43%), Positives = 256/416 (61%), Gaps = 29/416 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S L+ L++Y+ +
Sbjct: 158 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDFSDMPHLQRQLEDYM--M 215
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P ++IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A + T V
Sbjct: 216 NYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARNPTTV 275
Query: 316 VCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VCPVID+I D T Y S ++ G F+W L F W+ + + K+ EP +P M
Sbjct: 276 VCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEPVWSPTM 334
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+IDRA+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 335 AGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 394
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELRKQLKCH 492
Y + GV+ VL N R++ VW+DE+A++Y+ + + K D V R LRK+L C
Sbjct: 395 YKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGD---VSERKALRKRLGCK 450
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
SFKWYL +V+P F+P + G +R++ + C++ P K +++ PA L PC H
Sbjct: 451 SFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGLYPC-HR 506
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFNRQRWTYDK 604
Q+ L++ S+C+D P D+ V + C G N Q W K
Sbjct: 507 QGGNQI------RHLVS---SMCIDSPGKPEDLHQPVGLYPCHRQGGN-QYWMLSK 552
Score = 121 bits (304), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 114/191 (59%), Gaps = 25/191 (13%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R P CK LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 124 NAFNQYVSDLISVHRTLPDPRDPWCKEPGRYLTDLPPTAVIICFHNEAWSVLLRTVHSVL 183
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S L+ L++Y+ ++ P ++IR+ R GLI+ARLLGA
Sbjct: 184 DRSPDHLIQEIILVDDFSDMPHLQRQLEDYM--MNYPKVQIIRAQKREGLIRARLLGAAA 241
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
A+ +L +LD+HCECT E W L + I+R+P +++ +
Sbjct: 242 AKAPVLTYLDSHCECT-------------------EGW---LEPLLDRIARNPTTVVCPV 279
Query: 234 LLVDDASTREF 244
+ V D +T E+
Sbjct: 280 IDVIDDTTLEY 290
>gi|322785490|gb|EFZ12159.1| hypothetical protein SINV_06585 [Solenopsis invicta]
Length = 466
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 178/432 (41%), Positives = 242/432 (56%), Gaps = 45/432 (10%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRTV SV++RSP ++KEI+LVDD S E ++++
Sbjct: 8 LPPTSVIITFHNEARSTLLRTVVSVLNRSPEHLIKEIILVDDFSDH----PEDGEELSRI 63
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED TRVV
Sbjct: 64 H-KVRVIRNEKREGLMRSRVRGADAATASVLTFLDSHCECNADWLEPLLERVAEDPTRVV 122
Query: 317 CPVIDIISDVTFAYVR-------------------------SFELHWGAFNWELHFRWYT 351
CPVID+IS TF Y+ S +L G F+W L F+W
Sbjct: 123 CPVIDVISMDTFQYIEICLRCNLKRISETRRDKILFRFLGASADLR-GGFDWSLVFKWEY 181
Query: 352 YGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQ 411
+ ++KD T+ +TP +AGGLF I++AYF +G YD +M VWGGENLE+SFRVWQ
Sbjct: 182 LSQGERQARQKDPTQSIRTPMIAGGLFVINKAYFEKLGKYDTQMDVWGGENLEISFRVWQ 241
Query: 412 CGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE 471
CGGS+EI PCS V H+FRK PYSFPGG V N R A VWMD++ +FY+ P A
Sbjct: 242 CGGSLEIIPCSRVGHVFRKRHPYSFPGGSGNVFARNTRRAAEVWMDDYKQFYYNAVPLA- 300
Query: 472 KQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLA 531
+ ++ R+EL+++L C F WYL +V+P +P + G Q C++
Sbjct: 301 RNIPYGNIQDRMELKRRLHCKPFSWYLKNVYPELVIPTSEGGPGGSLK-QGTACLD---- 355
Query: 532 KGSMNQ-ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRIL 590
SM G L PC H Q + L D + +CL +P Y + ++I
Sbjct: 356 --SMGHLLDGNVGLYPC-HDTGGNQEW--GLTKDGLIKHHDLCLTLPVYAKGTTLLMQI- 409
Query: 591 ACSGFNRQRWTY 602
C G Q+W +
Sbjct: 410 -CDGSENQKWRH 420
>gi|348566779|ref|XP_003469179.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Cavia porcellus]
Length = 601
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HTNCKHKMYLETLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EYVA+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKEKLEEYVARFS-KVRILRTRKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + C++
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCID 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 74/139 (53%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHTNCKHKMYLET-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EYVA+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKEKLEEYVARFS-KVRILRTRKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|340723544|ref|XP_003400149.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 3 [Bombus terrestris]
Length = 637
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 166/357 (46%), Positives = 234/357 (65%), Gaps = 16/357 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S L+ L++Y+ +
Sbjct: 160 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--M 217
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P ++IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 218 NYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 277
Query: 316 VCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VCPVID+I D T Y S ++ G F+W L F W+ + + K+ EP +P M
Sbjct: 278 VCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEPVWSPTM 336
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+IDRA+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 337 AGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 396
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELRKQLKCH 492
Y + GV+ VL N R++ VW+DE+A++Y+ + + K D V R LRK+L C
Sbjct: 397 YKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGD---VSERKALRKKLGCK 452
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPC 547
SFKWYL +V+P F+P + G +R++ + C++ P K +++ PA L PC
Sbjct: 453 SFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGLYPC 506
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 114/191 (59%), Gaps = 25/191 (13%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R P CK + LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 126 NAFNQYVSDLISVHRTLPDPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVL 185
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S L+ L++Y+ ++ P ++IR+ R GLI+ARLLGA
Sbjct: 186 DRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--MNYPKVQIIRAQKREGLIRARLLGAAA 243
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
A+ +L +LD+HCECT E W L + I+R P +++ +
Sbjct: 244 AKAPVLTYLDSHCECT-------------------EGW---LEPLLDRIARDPTTVVCPV 281
Query: 234 LLVDDASTREF 244
+ V D +T E+
Sbjct: 282 IDVIDDTTLEY 292
>gi|109076171|ref|XP_001084788.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1
[Macaca mulatta]
gi|355687723|gb|EHH26307.1| hypothetical protein EGK_16237 [Macaca mulatta]
Length = 601
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|354468855|ref|XP_003496866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Cricetulus griseus]
gi|344247257|gb|EGW03361.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Cricetulus
griseus]
Length = 535
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 164/344 (47%), Positives = 216/344 (62%), Gaps = 9/344 (2%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S +
Sbjct: 87 HDQCQRKQWRGDLPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DP 145
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
+L + K+ RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L
Sbjct: 146 EDGALLGKIEKV----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPL 201
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ RVAEDRTRVV P+ID+I+ F YV + G F+W L F+W ++ +
Sbjct: 202 LERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNP 261
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
P KTP +AGGLF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V
Sbjct: 262 VAPIKTPMIAGGLFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRV 321
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY+FPGG V N R A VWMDE+ FY+ P A + ++SRLE
Sbjct: 322 GHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLE 380
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
LRK+L C FKWYL +V+P +P D FG ++ Q C++
Sbjct: 381 LRKKLSCKPFKWYLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 422
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 59 GQDPYARNKFNQVESDKLRMDRGIPDTRHDQCQRKQWRGD-LPATSVVITFHNEARSALL 117
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 118 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 172
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 173 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 204
>gi|345798845|ref|XP_003434499.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Canis
lupus familiaris]
Length = 588
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 152 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 210
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 211 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 266
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 267 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 326
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 327 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTF 386
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 387 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 445
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 446 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 475
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 112 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 170
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 171 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 225
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 226 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 257
>gi|126307024|ref|XP_001369295.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Monodelphis domestica]
Length = 571
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 214/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PYSF
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYSF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLNCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q + C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGNNCLD 458
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRNIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|402870847|ref|XP_003899411.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Papio anubis]
Length = 601
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNSIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|350426664|ref|XP_003494506.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like isoform 2 [Bombus impatiens]
Length = 637
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 166/357 (46%), Positives = 234/357 (65%), Gaps = 16/357 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S L+ L++Y+ +
Sbjct: 160 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--M 217
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P ++IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 218 NYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 277
Query: 316 VCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VCPVID+I D T Y S ++ G F+W L F W+ + + K+ EP +P M
Sbjct: 278 VCPVIDVIDDTTLEYHWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEPVWSPTM 336
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLF+IDRA+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+FRK SP
Sbjct: 337 AGGLFSIDRAFFDRLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSP 396
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELRKQLKCH 492
Y + GV+ VL N R++ VW+DE+A++Y+ + + K D V R LRK+L C
Sbjct: 397 YKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGD---VSERKALRKKLGCK 452
Query: 493 SFKWYLTHVWPHHFLPMDDKFFGRIRHV--QTHKCVEKPLAKGSMNQASGPASLLPC 547
SFKWYL +V+P F+P + G +R++ + C++ P K +++ PA L PC
Sbjct: 453 SFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGLYPC 506
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 114/191 (59%), Gaps = 25/191 (13%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R P CK + LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 126 NAFNQYVSDLISVHRTLPDPRDPWCKEPGRYLKDLPPTAVIICFHNEAWSVLLRTVHSVL 185
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S L+ L++Y+ ++ P ++IR+ R GLI+ARLLGA
Sbjct: 186 DRSPEHLIQEIILVDDFSDMPHLQRQLEDYM--MNYPKVQIIRAQKREGLIRARLLGAAA 243
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
A+ +L +LD+HCECT E W L + I+R P +++ +
Sbjct: 244 AKAPVLTYLDSHCECT-------------------EGW---LEPLLDRIARDPTTVVCPV 281
Query: 234 LLVDDASTREF 244
+ V D +T E+
Sbjct: 282 IDVIDDTTLEY 292
>gi|363731636|ref|XP_419581.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gallus
gallus]
Length = 566
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 214/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 130 LPATSVVITFHNEARSALLRTVVSVLKKSPSHLIKEIILVDDYSN-DPDDGALLGKIEKV 188
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 189 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 244
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 245 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 304
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D++YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 305 LFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 364
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SR+ELRK+L C FKW
Sbjct: 365 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRMELRKRLSCKPFKW 423
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 424 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 453
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 90 GQDPYARNKFNQVESDKLRMDRNIPDTRHDQCQRKQWRID-LPATSVVITFHNEARSALL 148
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 149 RTVVSVLKKSPSHLIKEIILVDDYSN-DPDDGALLGKIEKV----RVLRNDRREGLMRSR 203
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 204 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 235
>gi|344278311|ref|XP_003410938.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Loxodonta africana]
Length = 572
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 214/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 136 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 194
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 195 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 250
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 251 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 310
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D++YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 311 LFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 370
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 371 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 429
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 430 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 459
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 97/152 (63%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + LP +S+VI FHNEA SALL
Sbjct: 96 GQDPYARNKFNQVESDKLRMDRNVPDTRHDQCQRKQW-RVGLPATSVVITFHNEARSALL 154
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 155 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 209
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 210 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 241
>gi|334331052|ref|XP_001372346.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Monodelphis domestica]
Length = 573
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 227/349 (65%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 99 HANCKHKMYLEKLPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREH 158
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A+GE+L FLD+HCE + WL L
Sbjct: 159 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPL 217
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A +R +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 218 LNQIALNRKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 274
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 275 PSDPFESPVMAGGLFAVDRRWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSR 334
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 335 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 391
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK LKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 392 ELRKHLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 440
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 79/167 (47%), Positives = 116/167 (69%), Gaps = 12/167 (7%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E+ LP +SI+I FHNE W++LLRT+H
Sbjct: 75 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLEK-LPNTSIIIPFHNEGWTSLLRTIH 133
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 134 SIINRTPDSLIAEIILVDDFSDREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 192
Query: 172 RQAEGEILVFLDAHCECTL-----VFNEEFLPKSSIV-----IVFHN 208
A+GE+L FLD+HCE + + N+ L + +IV ++ HN
Sbjct: 193 SMAKGEVLTFLDSHCEVNVNWLPPLLNQIALNRKTIVCPMIDVIDHN 239
>gi|113677422|ref|NP_001038460.1| polypeptide N-acetylgalactosaminyltransferase 14 [Danio rerio]
Length = 554
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 169/364 (46%), Positives = 217/364 (59%), Gaps = 13/364 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT + + LP ++IVI FHNEA S LLRTV SV++R+P ++ EI+LVDD F
Sbjct: 100 HYRCTTLHYDPDLPSTTIVITFHNEARSTLLRTVRSVLNRTPVHLIHEIILVDD-----F 154
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
+ D + + +R+ R GLI++R+ GA A +IL FLD+HCE WL L
Sbjct: 155 SEDPNDCLLLTKLPKVKCLRNKHREGLIRSRVRGADAAGAQILTFLDSHCEVNKDWLPPL 214
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ RV ED T V PVIDII+ TFAYV + G F+W LHF+W S++ KR D
Sbjct: 215 LQRVKEDPTSVASPVIDIINMDTFAYVAASSDLRGGFDWSLHFKWEQL-SAEKRAKRADP 273
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
TEP KTP +AGGLF IDR++F +G YD M +WGGEN E+SFRVW CGGS+EI PCS V
Sbjct: 274 TEPIKTPIIAGGLFVIDRSWFNRLGKYDTAMDIWGGENFEISFRVWMCGGSLEIIPCSRV 333
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY FP G + N R A VWMDE+ FY+ P A + + + R E
Sbjct: 334 GHVFRKKHPYIFPEGNANTYIKNTRRTAEVWMDEFKLFYYSARP-AARGKSYGDIHGRQE 392
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LRK L C SFKWYL +V+P +P D D G IR Q C+E + +G Q +
Sbjct: 393 LRKSLNCKSFKWYLDNVYPELKVPDDSDAKSGVIR--QRQNCLESRVVEG---QDLPVLT 447
Query: 544 LLPC 547
L PC
Sbjct: 448 LAPC 451
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/151 (41%), Positives = 90/151 (59%), Gaps = 10/151 (6%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y + FN S+RIP NR L D R +C T ++ + LP ++IVI FHNEA S LL
Sbjct: 72 GQDPYSLYAFNQRESERIPSNRALRDTRHYRCTTLHYDPD-LPSTTIVITFHNEARSTLL 130
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV++R+P ++ EI+LVDD F + D + + +R+ R GLI++R
Sbjct: 131 RTVRSVLNRTPVHLIHEIILVDD-----FSEDPNDCLLLTKLPKVKCLRNKHREGLIRSR 185
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
+ GA A +IL FLD+HCE N+++LP
Sbjct: 186 VRGADAAGAQILTFLDSHCE----VNKDWLP 212
>gi|449497211|ref|XP_002190803.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Taeniopygia guttata]
Length = 669
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 214/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 233 LPATSVVITFHNEARSALLRTVVSVLKKSPSHLIKEIILVDDYSN-DPDDGALLGKIEKV 291
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 292 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 347
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 348 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 407
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D++YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 408 LFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 467
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SR+ELRK+L C FKW
Sbjct: 468 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRMELRKRLSCKPFKW 526
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 527 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 556
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 99/152 (65%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R++PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 193 GQDPYARNKFNQVESDKLRMDRSIPDTRHDQCQRKQWRID-LPATSVVITFHNEARSALL 251
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 252 RTVVSVLKKSPSHLIKEIILVDDYSN-DPDDGALLGKIEKV----RVLRNDRREGLMRSR 306
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 307 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 338
>gi|410975135|ref|XP_003993990.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Felis
catus]
Length = 653
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 217 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 275
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 276 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 331
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 332 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 391
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 392 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTF 451
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 452 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 510
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 511 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 540
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 177 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 235
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 236 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 290
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 291 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 322
>gi|47225457|emb|CAG11940.1| unnamed protein product [Tetraodon nigroviridis]
Length = 534
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 171/437 (39%), Positives = 238/437 (54%), Gaps = 85/437 (19%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VIVFHNEAWS LLRTVHSVI RSP ++L+EI+LVDDAS R+FLK L++YV +L
Sbjct: 139 LPRTSVVIVFHNEAWSTLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPLEQYVRRL 198
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
VP RV+R R GLI+ARL GA + G+++ FLDAHCECT GWLE L+AR+ +DR VV
Sbjct: 199 EVPVRVVRMEQRSGLIRARLKGASLSTGQVITFLDAHCECTTGWLEPLLARIKKDRKTVV 258
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P +TP MAGG
Sbjct: 259 CPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGG 318
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG YD M +WGGENLE+SFR+ + C V
Sbjct: 319 LFSIDRDYFQEIGTYDAGMDIWGGENLEISFRL-----QMWFVVCVCV------------ 361
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
GV++V YG+++ SR LR++L+C F W
Sbjct: 362 --GVTKVDYGDIS-------------------------------SRTTLRQKLQCKPFSW 388
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS---------------------- 534
YL +++P +P G IR+V+T++C++ K +
Sbjct: 389 YLENIYPDSQIPRHYYSLGEIRNVETNQCLDNMARKENEKVGIFNCHGMGGNQVFSYTAN 448
Query: 535 -----------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDI 583
+++ +GP +L C HL + LI + + CLD E+
Sbjct: 449 KEIRTDDLCLDVSKLNGPVMMLKCHHLKGNQLWEYDPVKLTLIHVNSNQCLDKASEEDSQ 508
Query: 584 SPRVRILACSGFNRQRW 600
P VR C+ Q+W
Sbjct: 509 VPSVR--DCTHSRSQQW 523
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/156 (55%), Positives = 118/156 (75%), Gaps = 1/156 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
N +++++IN+FNL+ S+ I +NR+LPDVR CK K++ + LP++S+VIVFHNEAW
Sbjct: 95 NQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKNKLYPDN-LPRTSVVIVFHNEAW 153
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRTVHSVI RSP ++L+EI+LVDDAS R+FLK L++YV +L VP RV+R R GL
Sbjct: 154 STLLRTVHSVIDRSPHTLLEEIILVDDASERDFLKRPLEQYVRRLEVPVRVVRMEQRSGL 213
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
I+ARL GA + G+++ FLDAHCECT + E L +
Sbjct: 214 IRARLKGASLSTGQVITFLDAHCECTTGWLEPLLAR 249
>gi|296195172|ref|XP_002745263.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Callithrix jacchus]
Length = 601
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR++ + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNIAANLCVD 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|13938114|gb|AAH07172.1| Galnt2 protein, partial [Mus musculus]
Length = 526
Score = 308 bits (790), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 90 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 148
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 149 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 204
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 205 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 264
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 265 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 324
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 325 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 383
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 384 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 413
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 50 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 108
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 109 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 163
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 164 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 195
>gi|149698080|ref|XP_001498934.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Equus
caballus]
Length = 601
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKRREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKDKLEEYMARFS-KVRIVRTKRREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|351699379|gb|EHB02298.1| Polypeptide N-acetylgalactosaminyltransferase-like 6, partial
[Heterocephalus glaber]
Length = 522
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 45 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 104
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 105 LKDKLEEYMARFS-KVRILRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 163
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 164 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 220
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 221 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 280
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 281 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 337
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 338 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 386
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 21 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 79
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 80 SIINRTPESLIAEIILVDDFSDREHLKDKLEEYMARFS-KVRILRTKKREGLIRTRLLGA 138
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 139 SMARGEVLTFLDSHCEVNV 157
>gi|332217746|ref|XP_003258022.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Nomascus leucogenys]
Length = 601
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMLDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|308485401|ref|XP_003104899.1| CRE-GLY-5 protein [Caenorhabditis remanei]
gi|308257220|gb|EFP01173.1| CRE-GLY-5 protein [Caenorhabditis remanei]
Length = 685
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 162/383 (42%), Positives = 233/383 (60%), Gaps = 37/383 (9%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC + E LP++S+++ FHNEAWS LLRTVHSV+ R+P +L+EI+LVDD S + K
Sbjct: 167 ECKVEKYNENLPRTSVIVCFHNEAWSVLLRTVHSVLERTPEHLLEEIVLVDDFSDMDHTK 226
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLG------- 299
L+EY+++ +++R R GLI+ARL GA A GE+L +LD+HCEC G
Sbjct: 227 RPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIATGEVLTYLDSHCECMEGKETENRV 286
Query: 300 ----------WLENLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHF 347
W+E L+ R+ D T VVCPVID+I D TF Y +++ G F+W L F
Sbjct: 287 RTRNKKCKKRWIEPLLDRIKRDPTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQF 346
Query: 348 RWYTYGSSDAIIKRKDFT---EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLE 404
W++ D RK+ T +P ++P MAGGLF+ID+ YF +G YD +WGGENLE
Sbjct: 347 NWHSIPERD----RKNRTRAIDPVRSPTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLE 402
Query: 405 MSFRV----WQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWA 460
+SF+V W CGG++EI PCSHV H+FRK SPY + GV+ VL N R+A VW+D++
Sbjct: 403 LSFKVRKCIWMCGGTLEIVPCSHVGHVFRKRSPYKWRTGVN-VLKRNSIRLAEVWLDDYK 461
Query: 461 EFYF-KFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH 519
+Y+ + N + D V +R +LR L C SFKWYL +++P F+P + G +R+
Sbjct: 462 TYYYERINNQLGDFGD---VSARKKLRSDLGCKSFKWYLDNIYPELFVPGESVAKGEVRN 518
Query: 520 --VQTHKCVEKPLAKGSMNQASG 540
VQ +C++ + + N+ G
Sbjct: 519 SAVQPARCLDCMVGRHEKNRPVG 541
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/193 (39%), Positives = 115/193 (59%), Gaps = 6/193 (3%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN SD I V+RTLP +CK + +NE LP++S+++ FHNEAWS LLRTVHSV+
Sbjct: 144 NAFNQYASDMISVHRTLPTNIDAECKVEKYNEN-LPRTSVIVCFHNEAWSVLLRTVHSVL 202
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
R+P +L+EI+LVDD S + K L+EY+++ +++R R GLI+ARL GA A
Sbjct: 203 ERTPEHLLEEIVLVDDFSDMDHTKRPLEEYMSQFGGKVKILRMEKREGLIRARLRGAAIA 262
Query: 175 EGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEIL 234
GE+L +LD+HCEC E + V + + + + I R P +++ ++
Sbjct: 263 TGEVLTYLDSHCECM-----EGKETENRVRTRNKKCKKRWIEPLLDRIKRDPTTVVCPVI 317
Query: 235 LVDDASTREFLKS 247
V D +T E+ S
Sbjct: 318 DVIDDNTFEYHHS 330
>gi|241557818|ref|XP_002400302.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
gi|215501764|gb|EEC11258.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase, putative
[Ixodes scapularis]
Length = 464
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 156/327 (47%), Positives = 218/327 (66%), Gaps = 11/327 (3%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST- 241
+ CE L F+++ LP +S++I+F +E +SALLRTV+SV++R+P +L+E++LVDDAS+
Sbjct: 142 NPSCE-RLRFDDKDLPSASVIIIFTDEIFSALLRTVYSVVNRTPAKLLREVILVDDASSI 200
Query: 242 REFLKSSLDEYVAKLSVP--TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLG 299
E LD+Y+ + P ++IR P R GLI+ARL GA+ A G++LVFLD+HCE T
Sbjct: 201 EELANQRLDKYLRRHFRPGLVKLIRLPQRQGLIRARLTGAQAASGDVLVFLDSHCEATDY 260
Query: 300 WLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHW--GAFNWELHFRWYTYGSSDA 357
WLE L+ + EDRT VVCP+ID++ D + Y+ + ++ G FNW+ F W S
Sbjct: 261 WLEPLLQPIREDRTTVVCPIIDVVDDKSLQYMGNGADYFQIGGFNWKGEFVWINL-PSGW 319
Query: 358 IIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIE 417
+ RK +P TP MAGGLFAIDR YF+ G+YD EM+ WGGENLEMSFR+W CGG +
Sbjct: 320 KVARKTKADPVNTPTMAGGLFAIDRKYFWESGSYDNEMEGWGGENLEMSFRIWMCGGKLV 379
Query: 418 IAPCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDK 476
IAPCSHV H+FR PY FP ++ +G N R+A VWMD + ++++ PE +K +
Sbjct: 380 IAPCSHVGHIFRDYHPYKFPN--NKDTHGINTVRLAEVWMDGYKNYFYQNRPELKKT-NY 436
Query: 477 QAVRSRLELRKQLKCHSFKWYLTHVWP 503
+ R LRK L C SFKWYL +V+P
Sbjct: 437 GDISKRKALRKSLGCKSFKWYLDNVYP 463
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 68/144 (47%), Positives = 103/144 (71%), Gaps = 3/144 (2%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN +SDR+P+NR+L + R P C+ F+++ LP +S++I+F +E +SALLRTV+SV++R
Sbjct: 123 FNAYISDRVPLNRSLGNRRNPSCERLRFDDKDLPSASVIIIFTDEIFSALLRTVYSVVNR 182
Query: 117 SPRSMLKEILLVDDAST-REFLKSSLDEYVAKLSVP--TRVIRSPGRVGLIKARLLGARQ 173
+P +L+E++LVDDAS+ E LD+Y+ + P ++IR P R GLI+ARL GA+
Sbjct: 183 TPAKLLREVILVDDASSIEELANQRLDKYLRRHFRPGLVKLIRLPQRQGLIRARLTGAQA 242
Query: 174 AEGEILVFLDAHCECTLVFNEEFL 197
A G++LVFLD+HCE T + E L
Sbjct: 243 ASGDVLVFLDSHCEATDYWLEPLL 266
>gi|156353877|ref|XP_001623135.1| predicted protein [Nematostella vectensis]
gi|156209801|gb|EDO31035.1| predicted protein [Nematostella vectensis]
Length = 454
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 167/382 (43%), Positives = 231/382 (60%), Gaps = 15/382 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+S+VIVFHNE WS L+RTVH+V+ RSP ML+EI++VDD S ++FLK LD+Y KL
Sbjct: 59 LPKASVVIVFHNEGWSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNKDFLKQKLDDYTKKL 118
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++R+ RVGLIKAR++GA A GE+++FLDAHCEC GWL L+ R+A +R V
Sbjct: 119 G-KIKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPLLERIALNRRTAV 177
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP ID I TF Y G FNW ++ + + KR+D T+ K+P MAGG
Sbjct: 178 CPTIDFIDHKTFQYKPMDPYIRGTFNWRFDYKERAV-RPEEMAKRRDPTQEVKSPVMAGG 236
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI+R +F +G YD M +WGGE E+SF++WQCGG +E PCS V H++R PY++
Sbjct: 237 LFAINREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQLENIPCSRVGHVYRHHVPYTY 296
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
P + ++ N RVA VWMDE+ ++ + PE K D + R+ LRK+LKC SFKW
Sbjct: 297 PKHDATLV--NFRRVAEVWMDEYKDWLYDKRPEI-KSVDYGDISDRIALRKRLKCKSFKW 353
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V F ++R+ + C++ SM + G L C ++ Q
Sbjct: 354 YLENVANDTVKTKLCACF-QVRNQGKNMCLD------SMGRKDGHVGLASCHNMGG-NQA 405
Query: 557 FVMKLPTDLIATDESVCLDVPE 578
F +L TDE+ C DV E
Sbjct: 406 FQYTYIREL-RTDET-CFDVHE 425
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 75/142 (52%), Positives = 102/142 (71%), Gaps = 6/142 (4%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN VSD+I + RT+ D R CK + + LPK+S+VIVFHNE WS L+RTVH+V+ R
Sbjct: 28 FNQFVSDQISLERTISDTRHQACKQRSYPIN-LPKASVVIVFHNEGWSTLMRTVHTVLLR 86
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP ML+EI++VDD S ++FLK LD+Y KL +++R+ RVGLIKAR++GA A G
Sbjct: 87 SPPHMLQEIVMVDDFSNKDFLKQKLDDYTKKLG-KIKIVRTKERVGLIKARVIGANNAVG 145
Query: 177 EILVFLDAHCECTLVFNEEFLP 198
E+++FLDAHCEC N+ +LP
Sbjct: 146 EVVIFLDAHCEC----NKGWLP 163
>gi|397508104|ref|XP_003824510.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
paniscus]
Length = 533
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 209/332 (62%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S D V
Sbjct: 97 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-----DPEDGAVLGK 151
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 152 IEKVRVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 212 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 272 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 331
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 332 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 390
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 391 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 420
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 94/152 (61%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 57 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 115
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S D V RV+R+ R GL+++R
Sbjct: 116 RTVVSVLKKSPPHLIKEIILVDDYSN-----DPEDGAVLGKIEKVRVLRNDRREGLMRSR 170
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 171 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 202
>gi|157128332|ref|XP_001661405.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108872614|gb|EAT36839.1| AAEL011095-PA [Aedes aegypti]
Length = 573
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 212/313 (67%), Gaps = 7/313 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA S LLRT+ SV++RSP ++ EI+LVDD S +F + + +AK+
Sbjct: 139 LPPTSVIITFHNEARSTLLRTIVSVLNRSPEHLIHEIILVDDYS--DFPEDGQE--LAKI 194
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+VIR+ R GL+++R+ GA A +L FLD+HCEC + WLE L+ RV ED TRVV
Sbjct: 195 H-KVKVIRNEQREGLVRSRVRGADAATASVLTFLDSHCECNVDWLEPLLIRVKEDPTRVV 253
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y+ + G F+W L F+W +++ ++KD T P +TP +AGG
Sbjct: 254 CPVIDVISMDTFQYIGASADLRGGFDWNLVFKWEYLSTAERHERQKDPTTPIRTPMIAGG 313
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF ID+ YF +G YD +M +WGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 314 LFVIDKVYFEKLGKYDTQMDIWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTF 373
Query: 437 PGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
PGG S ++ N R A VWMD++ ++Y+ P A K + R+EL+++L+C FK
Sbjct: 374 PGGGSGNIFAKNTRRAAEVWMDDYKQYYYAAVPLA-KNIPFGDIEERMELKERLQCKPFK 432
Query: 496 WYLTHVWPHHFLP 508
WYL +V+P +P
Sbjct: 433 WYLANVYPQLTIP 445
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/155 (42%), Positives = 95/155 (61%), Gaps = 7/155 (4%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
+L + Y NRFN SD + NR LPD R P C+ K + LP +S++I FHNEA
Sbjct: 95 GSLQQGEDPYLRNRFNQQASDSLKSNRELPDTRNPMCRKKWPSN--LPPTSVIITFHNEA 152
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
S LLRT+ SV++RSP ++ EI+LVDD S +F + + +AK+ +VIR+ R G
Sbjct: 153 RSTLLRTIVSVLNRSPEHLIHEIILVDDYS--DFPEDGQE--LAKIH-KVKVIRNEQREG 207
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
L+++R+ GA A +L FLD+HCEC + + E L
Sbjct: 208 LVRSRVRGADAATASVLTFLDSHCECNVDWLEPLL 242
>gi|380021258|ref|XP_003694487.1| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
9-like [Apis florea]
Length = 537
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 181/422 (42%), Positives = 255/422 (60%), Gaps = 41/422 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S L+ L++Y+ +
Sbjct: 60 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPDHLIQEIILVDDFSDMPHLQRQLEDYM--M 117
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P ++IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A + T V
Sbjct: 118 NYPKVQIIRAQKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARNPTTV 177
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 178 VCPVIDVIDDTTLEY------HWRDSGGVNVGGFDWNLQFNWHAVPEREKK-RHKNPAEP 230
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF+IDRA+F +G YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 231 VWSPTMAGGLFSIDRAFFERLGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 290
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQAVRSRLELR 486
FRK SPY + GV+ VL N R++ VW+DE+A++Y+ + + K D V R LR
Sbjct: 291 FRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRIGHDKGKYGD---VSERKALR 346
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT--HKCVEKPLAKGSMNQASGPASL 544
K+L C SFKWYL +V+P F+P + G +R++ + C++ P K +++ PA L
Sbjct: 347 KRLGCKSFKWYLDNVYPELFIPGEAVASGEVRNLGEGGNTCLDSPARKADLHK---PAGL 403
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC--SGFNRQRWTY 602
PC H Q+ L++ S+C+D P D+ V + C G N Q W
Sbjct: 404 YPC-HRQGGNQI------RHLVS---SMCIDSPGKPEDLHQPVGLYPCHRQGGN-QYWML 452
Query: 603 DK 604
K
Sbjct: 453 SK 454
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/191 (41%), Positives = 114/191 (59%), Gaps = 25/191 (13%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I V+RTLPD R P CK LP ++++I FHNEAWS LLRTVHSV+
Sbjct: 26 NAFNQYVSDLISVHRTLPDPRDPWCKEPGRYLTDLPPTAVIICFHNEAWSVLLRTVHSVL 85
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQ 173
RSP +++EI+LVDD S L+ L++Y+ ++ P ++IR+ R GLI+ARLLGA
Sbjct: 86 DRSPDHLIQEIILVDDFSDMPHLQRQLEDYM--MNYPKVQIIRAQKREGLIRARLLGAAA 143
Query: 174 AEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEI 233
A+ +L +LD+HCECT E W L + I+R+P +++ +
Sbjct: 144 AKAPVLTYLDSHCECT-------------------EGW---LEPLLDRIARNPTTVVCPV 181
Query: 234 LLVDDASTREF 244
+ V D +T E+
Sbjct: 182 IDVIDDTTLEY 192
>gi|148706465|gb|EDL38412.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14, isoform CRA_a [Mus
musculus]
Length = 515
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 185/455 (40%), Positives = 252/455 (55%), Gaps = 31/455 (6%)
Query: 160 RVGLIKARLLGARQAEGEILVFLDA-----HCECTLVFNEEFLPKSSIVIVFHNEAWSAL 214
RVG +L Q E E + A H C+L+ LP +SI+I FHNEA S L
Sbjct: 30 RVGDDPYKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTDLPPTSIIITFHNEARSTL 89
Query: 215 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIK 273
LRT+ SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL++
Sbjct: 90 LRTIRSVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVR 143
Query: 274 ARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAE---DRTRVVCPVIDIISDVTFAY 330
+R+ GA A+G L FLD+HCE WL+ L+ RV E D TRVVCPVIDII+ TF Y
Sbjct: 144 SRMRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEVLQDYTRVVCPVIDIINLDTFNY 203
Query: 331 VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGA 390
+ S G F+W LHF+W + R D TEP +TP +AGGLF ID+A+F ++G
Sbjct: 204 IESASELRGGFDWSLHFQWEQLSLEQKAL-RLDPTEPIRTPIIAGGLFVIDKAWFDYLGK 262
Query: 391 YDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLAR 450
YD +M +WGGEN E+SFRVW CGG +EI PCS V H+FRK PY FP G + N R
Sbjct: 263 YDVDMDIWGGENFEISFRVWMCGGGLEIIPCSRVGHVFRKKHPYVFPDGNANTYIKNTKR 322
Query: 451 VALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD 510
A VWMDE+ ++Y+ P A +R + +RL LRK L C +FKWYL +V+P +P D
Sbjct: 323 TAEVWMDEYKQYYYAARPFA-LERPFGNIENRLNLRKNLHCQTFKWYLENVYPELRVPPD 381
Query: 511 DKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLI 566
G IR Q KC+E K L PC + +Q++ +I
Sbjct: 382 SSIQKGNIR--QRQKCLESQKQK-----KQEILRLSPCAKVKGDGAKSQVWAFTYTQQII 434
Query: 567 ATDESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V L +G RQ WT
Sbjct: 435 Q--EELCLSVVTLFPG-APVVLALCKNGDERQLWT 466
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/137 (40%), Positives = 87/137 (63%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R +C V+ + LP +SI+I FHNEA S LLRT+
Sbjct: 36 YKLYAFNQRESERISSNRAVPDTRHKRCSLLVYCTD-LPPTSIIITFHNEARSTLLRTIR 94
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + ++ + +P + +R+ R GL+++R+ G
Sbjct: 95 SVLNRTPMHLIQEIILVDDFS------NDPEDCKQLIKLPKVKCLRNNERQGLVRSRMRG 148
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 149 ADIAQGTTLTFLDSHCE 165
>gi|194018457|ref|NP_001030017.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
gi|296434516|sp|Q49A17.2|GLTL6_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase-like 6;
AltName: Full=Polypeptide GalNAc transferase 17;
Short=GalNAc-T17; Short=pp-GaNTase 17; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 17;
AltName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase 17; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 17
gi|311103007|gb|ADP69004.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Homo sapiens]
Length = 601
Score = 308 bits (789), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|31418564|gb|AAH53063.1| Galnt2 protein [Mus musculus]
Length = 536
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 100 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 158
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 159 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 214
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 215 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 274
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 275 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 334
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 335 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 393
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 394 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 423
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 60 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 118
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 119 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 173
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 174 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 205
>gi|403295707|ref|XP_003938772.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Saimiri boliviensis boliviensis]
Length = 601
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR++ + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNIAANLCVD 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|402870849|ref|XP_003899412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Papio anubis]
Length = 584
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 110 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREH 169
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 170 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 228
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 229 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 285
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 286 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 345
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 346 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 402
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 403 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 451
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 86 YRENGFNIFVSNSIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 144
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 145 SIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 203
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 204 SMARGEVLTFLDSHCEVNV 222
>gi|74195843|dbj|BAE30483.1| unnamed protein product [Mus musculus]
Length = 544
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 108 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 166
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 167 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 222
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 223 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 282
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 283 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 342
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 343 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 401
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 402 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 431
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 68 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 126
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 127 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 181
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 182 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 213
>gi|109076173|ref|XP_001084905.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 2
[Macaca mulatta]
Length = 584
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 110 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSEREH 169
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 170 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 228
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 229 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 285
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 286 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 345
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 346 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 402
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 403 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 451
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 86 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 144
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 145 SIINRTPESLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 203
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 204 SMARGEVLTFLDSHCEVNV 222
>gi|149758073|ref|XP_001496259.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Equus
caballus]
Length = 539
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 103 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 161
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 162 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 217
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 218 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 277
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 278 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 337
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 338 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 396
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 397 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 426
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 63 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 121
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 122 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 176
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 177 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 208
>gi|148679819|gb|EDL11766.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 548
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 112 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 170
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 171 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 226
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 227 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 286
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 287 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 346
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 347 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 405
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 406 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 435
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 72 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 130
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 131 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 185
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 186 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 217
>gi|149043194|gb|EDL96726.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (predicted), isoform
CRA_a [Rattus norvegicus]
Length = 504
Score = 308 bits (788), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 68 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 126
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 127 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 182
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 183 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 242
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 243 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 302
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 303 PGGSGTVFARNTRRAAEVWMDEFKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 361
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 362 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 391
Score = 117 bits (292), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 99/152 (65%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R++PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 28 GQDPYARNKFNQVESDKLRMDRSIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 86
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 87 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 141
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 142 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 173
>gi|395840008|ref|XP_003792862.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 2 [Otolemur garnettii]
Length = 600
Score = 307 bits (787), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S R+
Sbjct: 126 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDH 185
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 186 LKDKLEEYMARFSQ-VRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 244
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 245 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LRRAD 301
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 302 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 361
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 362 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 418
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 419 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 467
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 102 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 160
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S R+ LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 161 SIINRTPESLIAEIILVDDFSDRDHLKDKLEEYMARFS-QVRIVRTKKREGLIRTRLLGA 219
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 220 SMARGEVLTFLDSHCEVNV 238
>gi|395840006|ref|XP_003792861.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
isoform 1 [Otolemur garnettii]
Length = 601
Score = 307 bits (787), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S R+
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-QVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LRRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S R+ LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDRDHLKDKLEEYMARFS-QVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|344288241|ref|XP_003415859.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Loxodonta africana]
Length = 601
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L++Y+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|13650039|gb|AAK37548.1| polypeptide GalNAc transferase-T2 [Mus musculus]
Length = 570
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 134 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 193 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 248
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 249 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 308
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 309 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 368
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 369 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 427
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 428 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 457
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 94 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 152
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 153 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 207
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 208 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 239
>gi|119590314|gb|EAW69908.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_a [Homo sapiens]
Length = 508
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 97 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 155
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 156 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 212 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 272 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 331
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 332 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 390
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 391 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 420
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 57 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 115
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 116 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 170
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 171 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 202
>gi|417402857|gb|JAA48260.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 571
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ +L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQARVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D++YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKSYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 97/152 (63%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ +L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQARVLTFLDSHCECNEHWLEPLLER 240
>gi|88192992|pdb|2FFU|A Chain A, Crystal Structure Of Human Ppgalnact-2 Complexed With Udp
And Ea2
gi|88192994|pdb|2FFV|A Chain A, Human Ppgalnact-2 Complexed With Manganese And Udp
gi|88192995|pdb|2FFV|B Chain B, Human Ppgalnact-2 Complexed With Manganese And Udp
Length = 501
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 65 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 123
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 124 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 179
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 180 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 239
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 240 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 299
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 300 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 358
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 359 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 388
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 25 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 83
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 84 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 138
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 139 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 170
>gi|195032291|ref|XP_001988471.1| GH11183 [Drosophila grimshawi]
gi|193904471|gb|EDW03338.1| GH11183 [Drosophila grimshawi]
Length = 640
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 172/407 (42%), Positives = 239/407 (58%), Gaps = 18/407 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP +S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 206 EDLPATSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 261
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ R+IR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 262 KID-KVRIIRNDKREGLVRSRVRGADAAVSNVLTFLDSHVECNEQWLEPLLERVREDPTR 320
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W +S+ + D T +TP +A
Sbjct: 321 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSASERTARHNDPTTAIRTPMIA 380
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 381 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 440
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 441 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 499
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G L PC H
Sbjct: 500 KWYLENVYPDLQAP-DPQEVGQFRQDMT-ECLD---TMGHL--VDGTVGLFPC-HNTGGN 551
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
Q + ++ D +CL + ++ +V + +C QRW
Sbjct: 552 QEWAYSKRGEIKHDD--LCLTLVQFSR--GSQVVLKSCDDTENQRWN 594
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 100/153 (65%), Gaps = 10/153 (6%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP +S++I FHNEA S
Sbjct: 165 LRNGEDPYIRNRFNQEASDDLPSNREIPDTRNPMCRTKKYRED-LPATSVIITFHNEARS 223
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ R+IR+ R GL+
Sbjct: 224 TLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRIIRNDKREGLV 278
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
++R+ GA A +L FLD+H EC NE++L
Sbjct: 279 RSRVRGADAAVSNVLTFLDSHVEC----NEQWL 307
>gi|395504936|ref|XP_003756802.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Sarcophilus harrisii]
Length = 651
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 172/435 (39%), Positives = 253/435 (58%), Gaps = 34/435 (7%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 177 HPNCNSKLYLEKLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLVAEIVLVDDFSDREH 236
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 237 LKKRLEDYMAQFPN-VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 295
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I + F Y ++ + GAF+WE++++ +++ D
Sbjct: 296 LDRIASNRKTIVCPMIDVIDNDHFGYKTQAGDAMRGAFDWEMYYKRIPIPLE---LQKSD 352
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 353 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 412
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 413 VGHIYRKYIPYKIPTGVS--LARNLKRVAEVWMDEYAEYIYQRLPEY-RHLSTGDVTAQK 469
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVEKPLAKGSMNQA 538
+LR L C SFKW++T + W P ++ P++ +G IR+V T C+ G+ + A
Sbjct: 470 DLRNHLNCKSFKWFMTEIAWDLPRYYPPVEPAAAAWGEIRNVGTQLCI------GTKHGA 523
Query: 539 SG-PASLLPCTHLPVLT-----QMFVMKLPTDLIATD----ESVCLDVPEYENDISPRVR 588
G P L C Q+F D+ D + C D + S V
Sbjct: 524 PGSPLRLESCVKGRAEAAWSNVQVFTFSWREDIRPGDPQHTKKFCFDTISH----SSPVT 579
Query: 589 ILACSGF-NRQRWTY 602
+ C G Q W Y
Sbjct: 580 LYDCHGMKGNQLWKY 594
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 110/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E+ LP +SI+I FHNE
Sbjct: 143 ITDAERVDQAYRENGFNIFVSDKIALNRSLPDIRHPNCNSKLYLEK-LPNTSIIIPFHNE 201
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A+ R++R+ R
Sbjct: 202 GWSSLLRTVHSVLNRSPPQLVAEIVLVDDFSDREHLKKRLEDYMAQFPN-VRILRTKKRE 260
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 261 GLIRTRMLGASVAIGDVITFLDSHCEANV 289
>gi|440897357|gb|ELR49068.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos grunniens mutus]
Length = 557
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 167/418 (39%), Positives = 240/418 (57%), Gaps = 15/418 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEFKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + N G
Sbjct: 402 EQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIK--QGTNCLESQGQDTAGNFQLGMGI 459
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRW 600
P Q ++ TD + + CL V + AC+ RQRW
Sbjct: 460 CRGSAKNPPAAQAWLF---TDHLIQQQGKCLAATSTSVSPGSLVVLQACNPREGRQRW 514
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVAAAAVLTFLDSHCE----VNTEWL 221
>gi|74203117|dbj|BAE26246.1| unnamed protein product [Mus musculus]
Length = 618
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 137 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 195
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 196 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 251
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 252 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 311
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 312 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 371
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 372 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 430
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 431 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 460
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 97 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 155
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 156 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 210
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 211 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 242
>gi|348575518|ref|XP_003473535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Cavia porcellus]
Length = 531
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 95 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 153
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 154 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 209
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 210 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 269
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 270 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 329
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C F+W
Sbjct: 330 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKRLGCRPFQW 388
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 389 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 418
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 55 GQDPYARNKFNQVESDKLRMDRAIPDTRHEQCQRKQWRVD-LPATSVVITFHNEARSALL 113
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 114 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 168
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 169 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 200
>gi|155371981|ref|NP_001094597.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Bos taurus]
gi|151554939|gb|AAI47930.1| GALNTL1 protein [Bos taurus]
gi|296482974|tpg|DAA25089.1| TPA: polypeptide N-acetylgalactosaminyltransferase-like 1 [Bos
taurus]
Length = 557
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 167/418 (39%), Positives = 240/418 (57%), Gaps = 15/418 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEFKQYYYEARPSAIG-KAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + N G
Sbjct: 402 EQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIK--QGTNCLESQGQDTAGNFQLGMGI 459
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRW 600
P Q ++ TD + + CL V + AC+ RQRW
Sbjct: 460 CRGSAKNPPAAQAWLF---TDHLIQQQGKCLAATSTSVSPGSLVVLQACNPREGRQRW 514
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVAAAAVLTFLDSHCE----VNTEWL 221
>gi|86475571|emb|CAF25036.1| pp-GalNAc-transferase 17 [Homo sapiens]
Length = 584
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 110 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREH 169
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 170 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 228
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 229 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 285
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 286 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 345
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 346 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 402
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 403 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 451
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 86 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 144
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 145 SIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 203
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 204 SMARGEVLTFLDSHCEVNV 222
>gi|380798879|gb|AFE71315.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor, partial
[Macaca mulatta]
Length = 554
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 118 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 176
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 177 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 232
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 233 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 292
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 293 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 352
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 353 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 411
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 412 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 441
Score = 115 bits (288), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 78 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 136
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 137 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 191
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 192 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 223
>gi|46877109|ref|NP_644678.2| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Mus
musculus]
gi|51315867|sp|Q6PB93.1|GALT2_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|37590571|gb|AAH59818.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [Mus musculus]
Length = 570
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 134 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 193 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 248
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 249 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 308
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 309 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 368
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 369 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 427
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 428 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 457
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 94 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 152
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 153 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 207
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 208 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 239
>gi|291231066|ref|XP_002735481.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Saccoglossus kowalevskii]
Length = 2434
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 170/406 (41%), Positives = 240/406 (59%), Gaps = 20/406 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S++I FHNEA S LLRTV SV++RSP S++KEI+LVDD S L E + K+
Sbjct: 1994 LPQTSVIITFHNEARSTLLRTVVSVLNRSPTSIIKEIILVDDYSDNAEDGKEL-EKIPKV 2052
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+V+R+ R GL+++R+ GA A G IL FLD+HCEC W+E L+ ++ E+ VV
Sbjct: 2053 ----KVLRNEKREGLMRSRVRGADYATGTILTFLDSHCECNQNWIEPLITKIQENNKAVV 2108
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ ++ D +TP +AGG
Sbjct: 2109 SPIIDVINMDNFQYVAASADLKGGFDWNLVFKWDYMTPAERNKRKSDPIAAIRTPMIAGG 2168
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI +++F +G YD M VWGGENLE+SFRVWQCGG++EI PCS V H+FRK PY+F
Sbjct: 2169 LFAISKSWFEELGKYDMMMDVWGGENLEISFRVWQCGGTLEIIPCSRVGHVFRKQHPYTF 2228
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ ++Y+ P + K ++SRL+LRK+L+C SF W
Sbjct: 2229 PGGSGNVFAKNTRRAAEVWMDEYKKYYYSAVP-SSKNIAFGNIQSRLDLRKKLQCKSFGW 2287
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQ 555
YL +V+P +P D FG ++ Q H C++ G G L C H Q
Sbjct: 2288 YLENVYPELRIPDKKDIAFGALQ--QGHLCMD---TLGHF--TDGTLGLYEC-HNTGGNQ 2339
Query: 556 MFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
F L D + +CL V ++ S +++ CS N Q+W
Sbjct: 2340 EFA--LTKDKAIRHQDLCLTVMDHR--PSGVIKLHGCSESNLNQKW 2381
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 97/152 (63%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ +R +PD R P CK K+ + LP++S++I FHNEA S LL
Sbjct: 1954 GQDAYARNKFNQVESDKLSYDRDIPDTRNPLCK-KLDWKTALPQTSVIITFHNEARSTLL 2012
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV++RSP S++KEI+LVDD S L E + K+ +V+R+ R GL+++R
Sbjct: 2013 RTVVSVLNRSPTSIIKEIILVDDYSDNAEDGKEL-EKIPKV----KVLRNEKREGLMRSR 2067
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A G IL FLD+HCEC + E + K
Sbjct: 2068 VRGADYATGTILTFLDSHCECNQNWIEPLITK 2099
>gi|426220611|ref|XP_004004508.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Ovis aries]
Length = 601
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDLSAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELR+QLKC F+W++ V W P H+ P++ +G +R+V + CV+
Sbjct: 420 ELRRQLKCKDFRWFMAAVAWDVPRHYPPVEPPPAAWGELRNVAANLCVD 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|351708624|gb|EHB11543.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Heterocephalus
glaber]
Length = 567
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 131 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 189
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 190 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 245
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 246 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 305
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 306 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 365
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C F+W
Sbjct: 366 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKRLGCQPFQW 424
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 425 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 454
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 104/168 (61%), Gaps = 13/168 (7%)
Query: 32 YLGDSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPK 91
Y+G +V + Q Y N+FN + SD++ ++R +PD R +C+ K + + LP
Sbjct: 82 YVGGTV-------VRSGQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPA 133
Query: 92 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP 151
+S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 134 TSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV--- 189
Query: 152 TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 190 -RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLER 236
>gi|355689604|gb|AER98888.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 [Mustela putorius
furo]
Length = 306
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 199/308 (64%), Gaps = 6/308 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S S D +
Sbjct: 2 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-----DSEDGALLGK 56
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 57 IEKVRVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 116
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 117 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 176
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 177 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGHVFRKQHPYTF 236
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 237 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 295
Query: 497 YLTHVWPH 504
YL +V+P
Sbjct: 296 YLENVYPE 303
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 49/101 (48%), Positives = 67/101 (66%), Gaps = 5/101 (4%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S S D +
Sbjct: 2 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-----DSEDGALLGK 56
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC
Sbjct: 57 IEKVRVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECN 97
>gi|440891991|gb|ELR45390.1| Polypeptide N-acetylgalactosaminyltransferase 2, partial [Bos
grunniens mutus]
Length = 530
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 94 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 152
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 153 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 208
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 209 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 268
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 269 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 328
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 329 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLNCKPFKW 387
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 388 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 417
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 54 GQDPYARNKFNQVESDKLRMDRAVPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 112
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 113 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 167
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 168 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 199
>gi|332812183|ref|XP_001147638.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
4 [Pan troglodytes]
Length = 533
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 97 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 155
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 156 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 212 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 272 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 331
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 332 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 390
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 391 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 420
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 57 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 115
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 116 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 170
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 171 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 202
>gi|119590315|gb|EAW69909.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
gi|119590316|gb|EAW69910.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_b [Homo sapiens]
Length = 533
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 97 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 155
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 156 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 212 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 272 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 331
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 332 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 390
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 391 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 420
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 57 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 115
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 116 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 170
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 171 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 202
>gi|221043222|dbj|BAH13288.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 97 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 155
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 156 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 212 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 272 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 331
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 332 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 390
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 391 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 420
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 57 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 115
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 116 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 170
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 171 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 202
>gi|426334121|ref|XP_004028610.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Gorilla
gorilla gorilla]
Length = 533
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 97 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 155
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 156 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 212 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 272 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 331
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 332 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 390
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 391 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 420
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 57 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 115
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 116 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 170
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 171 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 202
>gi|327281948|ref|XP_003225707.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Anolis carolinensis]
Length = 574
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 162/418 (38%), Positives = 243/418 (58%), Gaps = 17/418 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +SI+I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 129 HYRCASIHYGADLPSTSIIITFHNEARSTLLRTVTSVLNRTPANLIQEIILVDDFS---- 184
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ +P + +R+ R GLI++R+ GA A +IL FLD+HCE WL+
Sbjct: 185 --SDPEDCQLLTKIPKVKCLRNNRREGLIRSRVRGADMATADILTFLDSHCEVNSEWLQP 242
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 243 MLQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKLSRTD 301
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+ +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 302 PTQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 361
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y++ P A + ++ R+
Sbjct: 362 VGHVFRKRHPYDFPEGNALTYIKNTKRTAEVWMDEYKQYYYEARPSA-IGKSFGSIADRV 420
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
+ R++L C SF+WYL +V+P +P + G I+ Q C+E + N + S
Sbjct: 421 DQRRKLNCKSFQWYLENVYPELKVPEKELIPGIIK--QGANCLESQGQDSAGNSLAAVGS 478
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
+ P TQ ++ +D + + CL + + ++ I AC+ + RQ+W
Sbjct: 479 CKGTANDPAATQEWIF---SDPLIRQQDKCLSITSFST--GSQITIEACNQKDGRQKW 531
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 93/154 (60%), Gaps = 12/154 (7%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C + + + LP +SI+I FHNEA S
Sbjct: 98 LKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYRCASIHYGAD-LPSTSIIITFHNEARS 156
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGL 163
LLRTV SV++R+P ++++EI+LVDD S S ++ +P + +R+ R GL
Sbjct: 157 TLLRTVTSVLNRTPANLIQEIILVDDFS------SDPEDCQLLTKIPKVKCLRNNRREGL 210
Query: 164 IKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
I++R+ GA A +IL FLD+HCE N E+L
Sbjct: 211 IRSRVRGADMATADILTFLDSHCE----VNSEWL 240
>gi|355559183|gb|EHH15963.1| hypothetical protein EGK_02147, partial [Macaca mulatta]
Length = 530
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 94 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 152
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 153 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 208
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 209 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 268
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 269 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 328
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 329 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 387
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 388 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 417
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 54 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 112
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 113 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 167
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 168 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 199
>gi|332265851|ref|XP_003281927.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
1 [Nomascus leucogenys]
Length = 556
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 120 LPPTSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 178
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 179 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 234
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 235 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 294
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 295 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 354
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 355 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 413
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 414 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 443
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 80 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPPTSVVITFHNEARSALL 138
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 139 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 193
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 194 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 225
>gi|403300209|ref|XP_003940844.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Saimiri
boliviensis boliviensis]
Length = 724
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 288 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 346
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 347 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 402
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 403 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 462
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 463 LFVMDKFYFEALGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 522
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 523 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 581
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 582 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 611
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 248 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 306
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 307 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 361
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 362 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 393
>gi|345790655|ref|XP_543189.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Canis lupus
familiaris]
Length = 601
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNSIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|197246167|gb|AAI68926.1| Galnt2 protein [Rattus norvegicus]
Length = 569
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 162/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 133 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 191
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 192 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 247
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 248 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 307
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 308 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 367
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 368 PGGSGTVFARNTRRAAEVWMDEFKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 426
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 427 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 456
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 99/152 (65%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R++PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 93 GQDPYARNKFNQVESDKLRMDRSIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 151
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 152 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 206
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 207 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 238
>gi|301607546|ref|XP_002933365.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 1 [Xenopus (Silurana) tropicalis]
Length = 600
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 171/436 (39%), Positives = 256/436 (58%), Gaps = 30/436 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HSVI+R+P S+++E++LVDD S RE
Sbjct: 126 HPNCKHKLYLERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREH 185
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L+ L+EY+A R++R+ R GLI+ RLLGA A+GE+L FLD+HCE + WL L
Sbjct: 186 LREKLEEYMAYYP-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPL 244
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 245 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRTD 301
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 302 PSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCSR 361
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + S+
Sbjct: 362 VGHIYRKYVPYKVPTGTS--LARNLKRVAETWMDEYAEYIYQRRPEY-RHLSTGDISSQK 418
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE-KPLAKGS--- 534
ELRK LKC FKWY++ V W P + P++ +G IR+V ++ C++ K A G+
Sbjct: 419 ELRKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVASNLCIDSKHGATGTELR 478
Query: 535 MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATD----ESVCLDVPEYENDISPRVRIL 590
++ S +H Q+F D+ + C D + S V +
Sbjct: 479 LDTCVKDGSERTWSH----EQLFTFGWREDIRPGEPLHTRKFCFDSISH----SSPVTLY 530
Query: 591 ACSGF-NRQRWTYDKE 605
C G Q+W+Y K+
Sbjct: 531 DCHGMKGNQQWSYRKD 546
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/140 (51%), Positives = 105/140 (75%), Gaps = 2/140 (1%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
+Y+ N FN+ VS++I + R+LPD+R P CK K++ E LP +SI+I FHNE W++LLRT+
Sbjct: 101 VYRENGFNIFVSNKIALARSLPDIRHPNCKHKLYLER-LPNTSIIIPFHNEGWTSLLRTI 159
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLG 170
HSVI+R+P S+++E++LVDD S RE L+ L+EY+A R++R+ R GLI+ RLLG
Sbjct: 160 HSVINRTPDSLIEEMILVDDFSDREHLREKLEEYMAYYP-KVRIVRTKKREGLIRTRLLG 218
Query: 171 ARQAEGEILVFLDAHCECTL 190
A A+GE+L FLD+HCE +
Sbjct: 219 ASMAKGEVLTFLDSHCEVNV 238
>gi|301763305|ref|XP_002917071.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ailuropoda melanoleuca]
Length = 555
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 166/418 (39%), Positives = 240/418 (57%), Gaps = 15/418 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 108 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 163
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 164 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQP 221
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 222 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 280
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 281 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 340
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 341 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 399
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + N G
Sbjct: 400 EQRKKMNCRSFRWYLDNVYPELTVPVKEVLPGIIK--QGVNCLESQGQDSAGNFLLGMGI 457
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRW 600
P Q ++ +D + + CL V V + C+ RQRW
Sbjct: 458 CRGSAKNPPAPQAWLF---SDHLIQQQGKCLTVSSTSVTPGSLVLLQGCNPREGRQRW 512
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 76 QLKPGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 134
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 135 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 188
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 189 LIRSRVRGADMATAAVLTFLDSHCE----VNTEWL 219
>gi|327262105|ref|XP_003215866.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Anolis carolinensis]
Length = 575
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 139 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 197
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R++R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 198 ----RILRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 253
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 254 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 313
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 314 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 373
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLEL+K+L C FKW
Sbjct: 374 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELKKRLNCKPFKW 432
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 433 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 462
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 97/152 (63%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R + D R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 99 GQDPYARNKFNQVESDKLRMDRNVQDTRHDQCQRKQWRID-LPATSVVITFHNEARSALL 157
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ R++R+ R GL+++R
Sbjct: 158 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RILRNDRREGLMRSR 212
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 213 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 244
>gi|301607548|ref|XP_002933366.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like isoform 2 [Xenopus (Silurana) tropicalis]
Length = 601
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HSVI+R+P S+++E++LVDD S RE
Sbjct: 127 HPNCKHKLYLERLPNTSIIIPFHNEGWTSLLRTIHSVINRTPDSLIEEMILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L+ L+EY+A R++R+ R GLI+ RLLGA A+GE+L FLD+HCE + WL L
Sbjct: 187 LREKLEEYMAYYP-KVRIVRTKKREGLIRTRLLGASMAKGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRTD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYELSFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + S+
Sbjct: 363 VGHIYRKYVPYKVPTGTS--LARNLKRVAETWMDEYAEYIYQRRPEY-RHLSTGDISSQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK LKC FKWY++ V W P + P++ +G IR+V ++ C++
Sbjct: 420 ELRKHLKCKDFKWYMSEVAWDVPKFYPPVEPPPASWGEIRNVASNLCID 468
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 72/140 (51%), Positives = 105/140 (75%), Gaps = 2/140 (1%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
+Y+ N FN+ VS++I + R+LPD+R P CK K++ E LP +SI+I FHNE W++LLRT+
Sbjct: 102 VYRENGFNIFVSNKIALARSLPDIRHPNCKHKLYLER-LPNTSIIIPFHNEGWTSLLRTI 160
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLG 170
HSVI+R+P S+++E++LVDD S RE L+ L+EY+A R++R+ R GLI+ RLLG
Sbjct: 161 HSVINRTPDSLIEEMILVDDFSDREHLREKLEEYMAYYP-KVRIVRTKKREGLIRTRLLG 219
Query: 171 ARQAEGEILVFLDAHCECTL 190
A A+GE+L FLD+HCE +
Sbjct: 220 ASMAKGEVLTFLDSHCEVNV 239
>gi|291402210|ref|XP_002717436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oryctolagus cuniculus]
Length = 571
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|410342331|gb|JAA40112.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410342333|gb|JAA40113.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 163/344 (47%), Positives = 216/344 (62%), Gaps = 9/344 (2%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S +
Sbjct: 128 HDQCQRKQWRGGLPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DP 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
+L + K+ RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L
Sbjct: 187 EDGALLGKIEKV----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPL 242
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ RVAEDRTRVV P+ID+I+ F YV + G F+W L F+W ++ +
Sbjct: 243 LERVAEDRTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNP 302
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
P KTP +AGGLF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V
Sbjct: 303 VAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRV 362
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY+FPGG V N R A VWMDE+ FY+ P A + ++SRLE
Sbjct: 363 GHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLE 421
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
LRK+L C FKWYL +V+P +P D FG ++ Q C++
Sbjct: 422 LRKKLSCKPFKWYLENVYPELRVPDHQDIAFGALQ--QGTNCLD 463
Score = 114 bits (285), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 97/152 (63%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + LP +S+VI FHNEA SALL
Sbjct: 100 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRGG-LPATSVVITFHNEARSALL 158
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 159 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 213
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 214 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 245
>gi|209364560|ref|NP_001129228.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Rattus
norvegicus]
Length = 601
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ + R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEDYMARFPI-VRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHSHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EPF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSEPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK+LKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKRLKCKDFKWFMAAVAWDVPKYYPPVEPLPAAWGEIRNVAANLCVD 468
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 71/139 (51%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L++Y+A+ + R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARFPI-VRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SVARGEVLTFLDSHCEVNV 239
>gi|195114158|ref|XP_002001634.1| GI15842 [Drosophila mojavensis]
gi|193912209|gb|EDW11076.1| GI15842 [Drosophila mojavensis]
Length = 628
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 174/407 (42%), Positives = 240/407 (58%), Gaps = 20/407 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + ++ +A
Sbjct: 194 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGME--LA 249
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ R+IR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 250 KID-KVRIIRNDKREGLVRSRVRGADAAVSSVLTFLDSHVECNEQWLEPLLERVREDPTR 308
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W ++ + D T +TP +A
Sbjct: 309 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERAARHNDPTTAIRTPMIA 368
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 369 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 428
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMDE+ + Y+ P A K + RL L+++L+C F
Sbjct: 429 TFPGGSGNVFAKNTRRAAEVWMDEYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLQCKPF 487
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ-ASGPASLLPCTHLPVL 553
KWYL HV+P P D + G+ R T +C++ +M G L PC H
Sbjct: 488 KWYLEHVYPDLQTP-DPQDVGQFRQDAT-ECLD------TMGHIVDGTVGLFPC-HNTGG 538
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + + I DE +CL + ++ +V + C QRW
Sbjct: 539 NQEWTFSKRGE-IKHDE-LCLTLVQFAR--GSQVILKPCDESENQRW 581
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 101/155 (65%), Gaps = 10/155 (6%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA
Sbjct: 151 GGLRNGEDPYIRNRFNQEASDALPSNREIPDTRNPMCRTKKYRED-LPETSVIITFHNEA 209
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
S LLRT+ SV++RSP +++EI+LVDD S + + ++ +AK+ R+IR+ R G
Sbjct: 210 RSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGME--LAKID-KVRIIRNDKREG 264
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
L+++R+ GA A +L FLD+H EC NE++L
Sbjct: 265 LVRSRVRGADAAVSSVLTFLDSHVEC----NEQWL 295
>gi|332812181|ref|XP_003308857.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Pan
troglodytes]
gi|410227516|gb|JAA10977.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410264536|gb|JAA20234.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
gi|410296424|gb|JAA26812.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Pan
troglodytes]
Length = 576
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 140 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 198
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 199 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 254
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 255 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 314
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 315 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 374
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 375 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 433
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 434 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 463
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 100 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 158
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 159 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 213
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 214 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 245
>gi|355767580|gb|EHH62635.1| hypothetical protein EGM_21033, partial [Macaca fascicularis]
Length = 453
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 17 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 75
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 76 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 131
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 132 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 191
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 192 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 251
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 252 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 310
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 311 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 340
Score = 91.7 bits (226), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/128 (42%), Positives = 81/128 (63%), Gaps = 6/128 (4%)
Query: 72 PDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDA 131
P +C+ K + + LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD
Sbjct: 1 PSFSSYRCQRKQWRVD-LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDY 59
Query: 132 STREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLV 191
S + +L + K+ RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC
Sbjct: 60 SN-DPEDGALLGKIEKV----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEH 114
Query: 192 FNEEFLPK 199
+ E L +
Sbjct: 115 WLEPLLER 122
>gi|281349386|gb|EFB24970.1| hypothetical protein PANDA_005243 [Ailuropoda melanoleuca]
Length = 553
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 166/418 (39%), Positives = 240/418 (57%), Gaps = 15/418 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 106 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 161
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 162 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADMATAAVLTFLDSHCEVNTEWLQP 219
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 220 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 278
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 279 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 338
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 339 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 397
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + N G
Sbjct: 398 EQRKKMNCRSFRWYLDNVYPELTVPVKEVLPGIIK--QGVNCLESQGQDSAGNFLLGMGI 455
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRW 600
P Q ++ +D + + CL V V + C+ RQRW
Sbjct: 456 CRGSAKNPPAPQAWLF---SDHLIQQQGKCLTVSSTSVTPGSLVLLQGCNPREGRQRW 510
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 74 QLKPGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 132
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 133 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 186
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 187 LIRSRVRGADMATAAVLTFLDSHCE----VNTEWL 217
>gi|311246104|ref|XP_003122084.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Sus
scrofa]
Length = 541
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 175/426 (41%), Positives = 247/426 (57%), Gaps = 28/426 (6%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP +S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 128 YDYDHLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAI 187
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+AKL R+IR+ R GL++ARLLGA A+G++L FLD HCEC GWLE L+ R+ E
Sbjct: 188 ELAKLP-KVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQRIHEK 246
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+ + + + K + ++
Sbjct: 247 ESAVVCPVIDVIDWNTFEYMGNSREPQIGGFDWRLVFTWHVVPERERL-RMKSPIDVIRS 305
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++GAYD M+VWGGENLE SFR+WQCGG++EI PCSHV H+F K
Sbjct: 306 PTMAGGLFAVSKKYFEYLGAYDTGMEVWGGENLEFSFRIWQCGGTLEIHPCSHVGHVFPK 365
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 366 QAPYSRSKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 419
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFG---------RIRHVQTHKCVEKPLAKGSMNQASG 540
C FKW+L V+P +P D FFG IR+ TH+ +G + +G
Sbjct: 420 CKDFKWFLETVYPELHVPEDRPGFFGMFFEYTSRNEIRY-NTHQ------PEGCVAVEAG 472
Query: 541 PASLLP--CTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQ 598
L+ C + Q F+++ L C+ E+ S + C+ + Q
Sbjct: 473 TDKLIMHLCEEVTPEDQKFILQEDGSLFHVQSKKCVQAERAESGKSFEPLLRDCTHSDHQ 532
Query: 599 RWTYDK 604
+W + +
Sbjct: 533 KWFFKE 538
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 87/242 (35%), Positives = 134/242 (55%), Gaps = 44/242 (18%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ ++++ N+ +SDRI ++R LP+ P CK K ++ + LP +S+VI F+NEAWS LLR
Sbjct: 93 EESLRLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDHLPTTSVVIAFYNEAWSTLLR 152
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SV+ SP +L+E++LVDD S RE LK L +AKL R+IR+ R GL++ARL
Sbjct: 153 TVYSVLETSPDILLEEVILVDDYSDREHLKERLAIELAKLP-KVRLIRANKREGLVRARL 211
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPR 227
LGA A+G++L FLD HCEC +E W LL+ +H S
Sbjct: 212 LGASVAKGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEKES---- 248
Query: 228 SMLKEILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGR 268
+++ ++ V D +T E++ +S + + ++ P VIRSP
Sbjct: 249 AVVCPVIDVIDWNTFEYMGNSREPQIGGFDWRLVFTWHVVPERERLRMKSPIDVIRSPTM 308
Query: 269 VG 270
G
Sbjct: 309 AG 310
>gi|4758412|ref|NP_004472.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Homo
sapiens]
gi|51315838|sp|Q10471.1|GALT2_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
AltName: Full=Polypeptide GalNAc transferase 2;
Short=GalNAc-T2; Short=pp-GaNTase 2; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 2;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2; Contains: RecName:
Full=Polypeptide N-acetylgalactosaminyltransferase 2
soluble form
gi|971461|emb|CAA59381.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyl transferase [Homo
sapiens]
gi|26996816|gb|AAH41120.1| GALNT2 protein [Homo sapiens]
gi|119590317|gb|EAW69911.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2), isoform
CRA_c [Homo sapiens]
gi|239740418|gb|ACS13744.1| polypeptide N-acetylgalactosaminyltransferase 2 [Homo sapiens]
gi|307686451|dbj|BAJ21156.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 [synthetic
construct]
Length = 571
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|441612314|ref|XP_004088076.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Nomascus leucogenys]
Length = 570
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 134 LPPTSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 193 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 248
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 249 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 308
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 309 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 368
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 369 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 427
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 428 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 457
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 94 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPPTSVVITFHNEARSALL 152
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 153 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 207
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 208 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 239
>gi|158261119|dbj|BAF82737.1| unnamed protein product [Homo sapiens]
Length = 571
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|114596861|ref|XP_001155128.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 isoform 1 [Pan
troglodytes]
Length = 601
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|386780726|ref|NP_001248284.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Macaca
mulatta]
gi|384941838|gb|AFI34524.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
gi|387540526|gb|AFJ70890.1| polypeptide N-acetylgalactosaminyltransferase 2 [Macaca mulatta]
Length = 571
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|157134100|ref|XP_001663146.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870595|gb|EAT34820.1| AAEL012972-PA [Aedes aegypti]
Length = 600
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 221/350 (63%), Gaps = 17/350 (4%)
Query: 191 VFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD 250
++ E LP +S++I+F+NE +S L+RTVHSV++ + +LKEI+LVDD S+ E LK LD
Sbjct: 139 TYDLESLPSTSVIIIFYNEPYSVLVRTVHSVLNTADDRLLKEIILVDDGSSNEELKGKLD 198
Query: 251 EYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
YV +L +V+R RVGLI+ARL GAR A+ ++LVFLDAHCEC WLE L+ R+
Sbjct: 199 YYVRTRLPSKVKVLRQRNRVGLIRARLAGARFAKADVLVFLDAHCECMHQWLEPLLERIK 258
Query: 310 EDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
+ T V+ P+ID+I F Y V F++ G F W+ HF W+ + ++++
Sbjct: 259 QSSTSVLVPIIDVIEAKNFYYSTNGVTDFQI--GGFTWDGHFDWHDVTQREKERQKRECP 316
Query: 366 E------PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
E P +P MAGGLFAI R YF+ IG+YDE+M WGGENLEMSFRVWQCGG++E
Sbjct: 317 EKDMAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGGENLEMSFRVWQCGGTLETI 376
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS + H+FR PYSFP +G N R+A VWMD++ + + P+ +
Sbjct: 377 PCSRIGHIFRDFHPYSFPN--DRDTHGINTVRMATVWMDDYIDLLYLNRPDLRDHPEVGD 434
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVE 527
V R LR++L+C SF WY+ +V+P F+P + + +GR+ + + C++
Sbjct: 435 VTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAYGRVSSLAENLCLD 484
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/144 (49%), Positives = 98/144 (68%), Gaps = 1/144 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N +S+ + NRT PD R P CK K ++ E LP +S++I+F+NE +S L+RTVHSV++
Sbjct: 113 LNEELSEHLSYNRTPPDERHPACKRKTYDLESLPSTSVIIIFYNEPYSVLVRTVHSVLNT 172
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAE 175
+ +LKEI+LVDD S+ E LK LD YV +L +V+R RVGLI+ARL GAR A+
Sbjct: 173 ADDRLLKEIILVDDGSSNEELKGKLDYYVRTRLPSKVKVLRQRNRVGLIRARLAGARFAK 232
Query: 176 GEILVFLDAHCECTLVFNEEFLPK 199
++LVFLDAHCEC + E L +
Sbjct: 233 ADVLVFLDAHCECMHQWLEPLLER 256
>gi|198422185|ref|XP_002121130.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
4 [Ciona intestinalis]
Length = 582
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 158/365 (43%), Positives = 221/365 (60%), Gaps = 13/365 (3%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C+ F+ LP +S+VI F+NE WS L+RTV SV+ SP ++L EI+LVDD S + +L
Sbjct: 129 CKSRRPFDYRSLPTTSVVIAFYNEGWSTLIRTVFSVLHNSPDALLTEIILVDDYSDKVYL 188
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K L +++ L+ R++R+ R GL++ARLLGA A+GE+L FLD HCEC GWLE L+
Sbjct: 189 KDKLADFLKALAR-VRLVRTTKREGLVRARLLGASLAKGEVLTFLDCHCECVEGWLEPLL 247
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRS-FELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ ED + +V PVID I TF Y E G F+W L F+W+T + +RK
Sbjct: 248 ERIMEDESVIVVPVIDTIDWNTFEYYYGGHEPQIGGFDWRLTFQWHTIPDHERK-RRKSP 306
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P ++P MAGGLFA+ + YF IG YD M++WGGENLE+SFR W CGG +E PCSHV
Sbjct: 307 VDPIRSPTMAGGLFAVSKRYFTRIGTYDAGMEIWGGENLELSFRTWMCGGKLETIPCSHV 366
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K SPY P ++ L R A VWMD++ ++ NP A K+ + + +R +
Sbjct: 367 GHVFPKQSPYPRPKFLTNTL-----RAAEVWMDDYKRHFYIRNPPASKE-NYGDISARKD 420
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPA 542
LR L+CH FKWYL +V+P +P D ++G R+ C++ A N G
Sbjct: 421 LRNSLQCHDFKWYLDNVYPDLHVPEDRPGYYGAFRNSGMSSFCLD--YAPPQHNPTGGRV 478
Query: 543 SLLPC 547
S+ C
Sbjct: 479 SIFGC 483
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 107/164 (65%), Gaps = 9/164 (5%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTK-VFNEEFLPKSSIVIVFHNEAWSA 105
D+ Q Y +N+F VS+RI ++R L D R CK++ F+ LP +S+VI F+NE WS
Sbjct: 100 DSIQTYAVNQF---VSERISLHRRLQDPRHEMCKSRRPFDYRSLPTTSVVIAFYNEGWST 156
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
L+RTV SV+ SP ++L EI+LVDD S + +LK L +++ L+ R++R+ R GL++
Sbjct: 157 LIRTVFSVLHNSPDALLTEIILVDDYSDKVYLKDKLADFLKALAR-VRLVRTTKREGLVR 215
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK----SSIVIV 205
ARLLGA A+GE+L FLD HCEC + E L + S+++V
Sbjct: 216 ARLLGASLAKGEVLTFLDCHCECVEGWLEPLLERIMEDESVIVV 259
>gi|157133631|ref|XP_001662949.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108870752|gb|EAT34977.1| AAEL012823-PA [Aedes aegypti]
Length = 600
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 221/350 (63%), Gaps = 17/350 (4%)
Query: 191 VFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD 250
++ E LP +S++I+F+NE +S L+RTVHSV++ + +LKEI+LVDD S+ E LK LD
Sbjct: 139 TYDLESLPSTSVIIIFYNEPYSVLVRTVHSVLNTADDRLLKEIILVDDGSSNEELKGKLD 198
Query: 251 EYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
YV +L +V+R RVGLI+ARL GAR A+ ++LVFLDAHCEC WLE L+ R+
Sbjct: 199 YYVRTRLPSKVKVLRQRNRVGLIRARLAGARFAKADVLVFLDAHCECMHQWLEPLLERIK 258
Query: 310 EDRTRVVCPVIDIISDVTFAY----VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
+ T V+ P+ID+I F Y V F++ G F W+ HF W+ + ++++
Sbjct: 259 QSSTSVLVPIIDVIEAKNFYYSTNGVTDFQI--GGFTWDGHFDWHDVTQREKERQKRECP 316
Query: 366 E------PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
E P +P MAGGLFAI R YF+ IG+YDE+M WGGENLEMSFRVWQCGG++E
Sbjct: 317 EKDMAICPTYSPTMAGGLFAISRDYFWEIGSYDEQMDGWGGENLEMSFRVWQCGGTLETI 376
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
PCS + H+FR PYSFP +G N R+A VWMD++ + + P+ +
Sbjct: 377 PCSRIGHIFRDFHPYSFPN--DRDTHGINTVRMATVWMDDYIDLLYLNRPDLRDHPEVGD 434
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVE 527
V R LR++L+C SF WY+ +V+P F+P + + +GR+ + + C++
Sbjct: 435 VTHRRVLREKLRCKSFDWYMKNVYPEKFIPTRNVRAYGRVSSLAENLCLD 484
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 71/144 (49%), Positives = 98/144 (68%), Gaps = 1/144 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N +S+ + NRT PD R P CK K ++ E LP +S++I+F+NE +S L+RTVHSV++
Sbjct: 113 LNEELSEHLSYNRTPPDERHPACKRKTYDLESLPSTSVIIIFYNEPYSVLVRTVHSVLNT 172
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAE 175
+ +LKEI+LVDD S+ E LK LD YV +L +V+R RVGLI+ARL GAR A+
Sbjct: 173 ADDRLLKEIILVDDGSSNEELKGKLDYYVRTRLPSKVKVLRQRNRVGLIRARLAGARFAK 232
Query: 176 GEILVFLDAHCECTLVFNEEFLPK 199
++LVFLDAHCEC + E L +
Sbjct: 233 ADVLVFLDAHCECMHQWLEPLLER 256
>gi|402858708|ref|XP_003893834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Papio anubis]
Length = 571
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|332265853|ref|XP_003281928.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 isoform
2 [Nomascus leucogenys]
Length = 571
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPPTSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPPTSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|397506054|ref|XP_003823551.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Pan paniscus]
Length = 518
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 44 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREH 103
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 104 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 162
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 163 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 219
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 220 PSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 279
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 280 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 336
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 337 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 385
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 20 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 78
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 79 SIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 137
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 138 SMARGEVLTFLDSHCEVNV 156
>gi|300797173|ref|NP_001180032.1| polypeptide N-acetylgalactosaminyltransferase 2 precursor [Bos
taurus]
gi|296472282|tpg|DAA14397.1| TPA: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 2 (GalNAc-T2) [Bos
taurus]
Length = 571
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLNCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAVPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 240
>gi|300796651|ref|NP_001178227.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Bos taurus]
Length = 601
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDLSAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELR+QLKC FKW++ V W P ++ P++ +G +R+V + CV+
Sbjct: 420 ELRRQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGELRNVAANLCVD 468
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKEKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|350592744|ref|XP_001927809.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Sus
scrofa]
Length = 571
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAVPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|71297071|gb|AAH47551.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Homo sapiens]
Length = 601
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++ + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIPLNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMARGEVLTFLDSHCEVNV 239
>gi|432107114|gb|ELK32537.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Myotis davidii]
Length = 518
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 167/420 (39%), Positives = 242/420 (57%), Gaps = 17/420 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 67 HYSCPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 122
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 123 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQP 180
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 181 LLQRVQEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 239
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 240 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 299
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V SR+
Sbjct: 300 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVASRI 358
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + I+ Q C+E + N G +
Sbjct: 359 EQRKKMNCKSFRWYLENVYPELTVPVKEVLPSIIK--QGVNCLESQGQDTAGNFLLGVGT 416
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPR--VRILACSGF-NRQRW 600
P Q ++ +D + + CL ISP V + C+ +QRW
Sbjct: 417 CRGSAKNPPAPQAWLF---SDHLIQQQGKCLTATSTSASISPGSPVGLQTCNPREGKQRW 473
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 35 QLKPGEDPYRQHAFNQLESDKLTSDRPIRDTRHYSCPSLSYSSD-LPATSVIITFHNEAR 93
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 94 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 147
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 148 LIRSRVRGADVATAAVLTFLDSHCE----VNTEWL 178
>gi|395836156|ref|XP_003791031.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Otolemur garnettii]
Length = 571
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRARQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 429 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 458
Score = 115 bits (287), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|431909863|gb|ELK12965.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Pteropus alecto]
Length = 543
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 217/330 (65%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP +S+VI F+NEAWS LLRT++SV+ SP ++L+E++LVDD S RE LK L
Sbjct: 92 YDYDHLPTTSVVIAFYNEAWSTLLRTIYSVLETSPDTLLEEVILVDDYSDREHLKERLAN 151
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A+G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 152 ELSGLP-KVRLIRAHKREGLVRARLLGASAAKGDVLTFLDCHCECHEGWLEPLLQRIHEE 210
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E H G F+W L F W+ + + + R + ++
Sbjct: 211 ESAVVCPVIDVIDWNTFEYLGNSGEPHIGGFDWRLVFTWHVVPTRERMRMRSPI-DVIRS 269
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++EI PCSHV H+F K
Sbjct: 270 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLEIHPCSHVGHVFPK 329
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 330 QAPYSRKKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERRQLRAKLQ 383
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 384 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 413
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/246 (36%), Positives = 135/246 (54%), Gaps = 44/246 (17%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L ++ Q ++ N+ +SDRI ++R LP+ P CK ++ + LP +S+VI F+NEAWS
Sbjct: 53 LRRQEESVQRHQINIYLSDRISLHRRLPERWNPLCKETKYDYDHLPTTSVVIAFYNEAWS 112
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT++SV+ SP ++L+E++LVDD S RE LK L ++ L R+IR+ R GL+
Sbjct: 113 TLLRTIYSVLETSPDTLLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRAHKREGLV 171
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVIS 223
+ARLLGA A+G++L FLD HCEC +E W LL+ +H S
Sbjct: 172 RARLLGASAAKGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES 212
Query: 224 RSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS---------VPTR----------VIR 264
+++ ++ V D +T E+L +S + ++ VPTR VIR
Sbjct: 213 ----AVVCPVIDVIDWNTFEYLGNSGEPHIGGFDWRLVFTWHVVPTRERMRMRSPIDVIR 268
Query: 265 SPGRVG 270
SP G
Sbjct: 269 SPTMAG 274
>gi|47847466|dbj|BAD21405.1| mFLJ00205 protein [Mus musculus]
Length = 634
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 226/353 (64%), Gaps = 13/353 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 163 HPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 222
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 223 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 281
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 282 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 338
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 339 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 398
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 399 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVAQK 455
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVEKPLA 531
+LR L C SFKW++T + W P + P++ +G IR+V T C + L
Sbjct: 456 KLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTDTKLG 508
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 109/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E LP +SI+I FHNE
Sbjct: 129 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLET-LPNTSIIIPFHNE 187
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 188 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 246
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 247 GLIRTRMLGASAATGDVITFLDSHCEANV 275
>gi|291385920|ref|XP_002709516.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6
[Oryctolagus cuniculus]
Length = 601
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S R+
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDRDH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMAGGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLF++DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFSVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YKENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S R+ LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDRDHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SMAGGEVLTFLDSHCEVNV 239
>gi|426346013|ref|XP_004040685.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 6,
partial [Gorilla gorilla gorilla]
Length = 555
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 81 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPGSLIAEIILVDDFSEREH 140
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 141 LKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 199
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 200 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 256
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 257 PSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 316
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 317 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 373
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRKQLKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 374 ELRKQLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 422
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 57 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 115
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ S R++R+ R GLI+ RLLGA
Sbjct: 116 SIINRTPGSLIAEIILVDDFSEREHLKDKLEEYMARFS-KVRIVRTKKREGLIRTRLLGA 174
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 175 SMARGEVLTFLDSHCEVNV 193
>gi|390477336|ref|XP_003735278.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Callithrix jacchus]
Length = 571
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 204/312 (65%), Gaps = 6/312 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 135 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 193
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 194 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 249
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 250 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 309
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 310 LFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 369
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 370 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 428
Query: 497 YLTHVWPHHFLP 508
YL +V+P +P
Sbjct: 429 YLENVYPELRVP 440
Score = 115 bits (287), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 95 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 153
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 154 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 208
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 209 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 240
>gi|196007338|ref|XP_002113535.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
gi|190583939|gb|EDV24009.1| hypothetical protein TRIADDRAFT_27318 [Trichoplax adhaerens]
Length = 455
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 161/356 (45%), Positives = 221/356 (62%), Gaps = 23/356 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA SALLRT+ SV++RSP +LKEI+LVDD F ++ D + K+
Sbjct: 65 LPTTSVIITFHNEARSALLRTIRSVLNRSPSELLKEIILVDD-----FSDNANDGRLLKI 119
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ +R+ R GLI++R+ GA A+G++L FLD+HCE WLE L++RVA++ T VV
Sbjct: 120 LPKVKTLRNNKREGLIRSRVRGADLAKGDVLTFLDSHCEVNERWLEPLLSRVAQNETIVV 179
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I TF Y+ S G F W L+F+W + S + + T P KTP +AGG
Sbjct: 180 SPIIDVIHMDTFNYIGSSADLKGGFGWNLNFKWDSMTSEEQSQRAAHPTRPIKTPMIAGG 239
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+I + +F G YD M VWGGENLE+S R+W CGGS+EI PCS V H+FRK PY+F
Sbjct: 240 LFSISKNWFIKSGKYDMGMDVWGGENLEISLRIWMCGGSLEIVPCSRVGHVFRKRHPYTF 299
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A WMD +A+FY+K P A + + RL+LR++LKC SFKW
Sbjct: 300 PGGGGFVFAKNTRRAAEAWMDGYAKFYYKREPGA-RGVPYGDISDRLKLREKLKCRSFKW 358
Query: 497 YLTHVWPHHFLP--MDDKFFGRIRHVQTHKCVEK------------PLAKGSMNQA 538
Y+ +V+P +P ++DK FG +R Q KC++ P G NQA
Sbjct: 359 YMRNVYPELNVPEGVNDK-FGELR--QGGKCLDSIGGKPGDRVSTFPCHGGGGNQA 411
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 104/179 (58%), Gaps = 15/179 (8%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y N FN D++P +R +PD R C++ + + LP +S++I FHNEA S
Sbjct: 22 LKQGEDAYIRNAFNQAECDKLPTDRGVPDTRDYSCRSLEYKHK-LPTTSVIITFHNEARS 80
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
ALLRT+ SV++RSP +LKEI+LVDD F ++ D + K+ + +R+ R GLI
Sbjct: 81 ALLRTIRSVLNRSPSELLKEIILVDD-----FSDNANDGRLLKILPKVKTLRNNKREGLI 135
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK---------SSIVIVFHNEAWSAL 214
++R+ GA A+G++L FLD+HCE + E L + S I+ V H + ++ +
Sbjct: 136 RSRVRGADLAKGDVLTFLDSHCEVNERWLEPLLSRVAQNETIVVSPIIDVIHMDTFNYI 194
>gi|395519600|ref|XP_003763931.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5
[Sarcophilus harrisii]
Length = 945
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 163/385 (42%), Positives = 230/385 (59%), Gaps = 28/385 (7%)
Query: 148 LSVPTRVIRSPGRVG------LIKARLLGARQAEGEILVFLD--------------AHCE 187
+++ R ++PG+ G K + + R EG V+L + C
Sbjct: 433 VTLSPRNPKAPGQFGNPVVVPFGKEKEVKRRWKEGNFNVYLSDLIPLDRAIDDTRPSGCA 492
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N LP +SI++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST+ +LK
Sbjct: 493 DQLVHNN--LPTTSIIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKGYLKD 550
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
LD+Y+++ RV+ R GLI+ARL GA A G++L FLD+H EC +GWLE L+ R
Sbjct: 551 QLDKYMSQFP-KVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPLLER 609
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF--T 365
V ++ +V CPVI+II+D +Y+ G F W ++F W IIK+ T
Sbjct: 610 VYLNKKKVACPVIEIINDKDLSYMTVDNFQRGIFVWPMNFSWKKI--PPEIIKQNKIKET 667
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+ + P MAGGLF+ID+ YFF +G YD ++VWGGEN+E+SF+VW CGG IEI PCS V
Sbjct: 668 DVIRCPVMAGGLFSIDKKYFFELGTYDPGLEVWGGENMELSFKVWMCGGEIEIIPCSRVG 727
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR-DKQAVRSRLE 484
H+FRK +PYSFP + + NL RVA VW+DE+ E ++ Q + + + E
Sbjct: 728 HIFRKDNPYSFPENRIKTIERNLIRVAEVWLDEYKELFYGHGYHLLDQSLNVGNLTQQRE 787
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPM 509
LRK+L C SFKWYL +V+P P+
Sbjct: 788 LRKKLGCKSFKWYLENVFPDIEAPI 812
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 100/151 (66%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+ ++ ++ FN+ +SD IP++R + D R C ++ + LP +SI++ F +E WS L
Sbjct: 459 EVKRRWKEGNFNVYLSDLIPLDRAIDDTRPSGCADQLVHNN-LPTTSIIMCFVDEVWSTL 517
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+ +LK LD+Y+++ RV+ R GLI+A
Sbjct: 518 LRSVHSVLNRSPPHLIKEILLVDDFSTKGYLKDQLDKYMSQFP-KVRVLHLKERHGLIRA 576
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA A G++L FLD+H EC + + E L
Sbjct: 577 RLAGAEIATGDVLTFLDSHVECNVGWLEPLL 607
>gi|348522865|ref|XP_003448944.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Oreochromis niloticus]
Length = 590
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 168/431 (38%), Positives = 246/431 (57%), Gaps = 30/431 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C V + E LP +S+VI F++EAWS LLRTVHSV+ +P+ L+E+LLVDD S +
Sbjct: 172 HPKCLGVEHIESLPSASVVICFNDEAWSTLLRTVHSVLDTAPKQYLQEVLLVDDLSQQGH 231
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK+ L EYV+ L R+IRS R+G+ R LGA +A GE++VF+D+HCEC GWLE L
Sbjct: 232 LKTGLSEYVSHLD-GVRLIRSTKRLGVGGCRTLGAARAVGEVVVFMDSHCECQKGWLEPL 290
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD- 363
+ R+A DRTRVV P++D+I TF Y + G F+W L F W ++I + +D
Sbjct: 291 LERIALDRTRVVSPIMDVIDWQTFRYNATQWPVRGVFDWRLDFFW------ESIPELQDK 344
Query: 364 ----FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+P ++PA+ GG+ AIDR +F +G YD M +WG E +E+S RVW CGGS+E+
Sbjct: 345 EPEMAVQPLQSPALGGGVVAIDRHFFQSVGTYDPGMVLWGAEQIELSIRVWSCGGSMEVV 404
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE--KQRDKQ 477
PCS V HL R PY FP ++L N R+A WMD + + Y++ + A +Q +
Sbjct: 405 PCSRVGHLIRHHLPYRFPD--QDLLQRNKIRIAETWMDTYKKIYYRRDTLAHFIRQSESP 462
Query: 478 AVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEKPLAKGSMN 536
+ RL L++ L C +F W+LT V+P ++P D G + V T +C + P +G
Sbjct: 463 NITERLRLKRSLGCRNFHWFLTTVYPQLYIPQDRTALSGELYSVGTGRCADYPQGRGLQ- 521
Query: 537 QASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRIL---ACS 593
SG + PCT +L L+ +CL+ + E D L C+
Sbjct: 522 --SGAMDVAPCTGTG-------SQLSGQLVHQLSQLCLEAVKEEGDNKRSTGGLFLRQCT 572
Query: 594 GFNRQRWTYDK 604
RQ+W +++
Sbjct: 573 HHPRQQWHFEQ 583
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 101/146 (69%), Gaps = 2/146 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN +VS+RI ++R LP+ R PKC V + E LP +S+VI F++EAWS LLRTVHSV+
Sbjct: 153 FNEVVSERISLHRRLPEARHPKC-LGVEHIESLPSASVVICFNDEAWSTLLRTVHSVLDT 211
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P+ L+E+LLVDD S + LK+ L EYV+ L R+IRS R+G+ R LGA +A G
Sbjct: 212 APKQYLQEVLLVDDLSQQGHLKTGLSEYVSHLD-GVRLIRSTKRLGVGGCRTLGAARAVG 270
Query: 177 EILVFLDAHCECTLVFNEEFLPKSSI 202
E++VF+D+HCEC + E L + ++
Sbjct: 271 EVVVFMDSHCECQKGWLEPLLERIAL 296
>gi|260836359|ref|XP_002613173.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
gi|229298558|gb|EEN69182.1| hypothetical protein BRAFLDRAFT_114107 [Branchiostoma floridae]
Length = 539
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 156/332 (46%), Positives = 212/332 (63%), Gaps = 8/332 (2%)
Query: 189 TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSS 248
T + E+ LP +S++I FHNEA S LLRT+ SV+ RSP +++EI+LVDD S +
Sbjct: 95 THTWKEDLLP-TSVIITFHNEARSTLLRTITSVLLRSPPHLIQEIILVDDYSDKPDDGLE 153
Query: 249 LDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARV 308
L + + K+ +++R+ R GL+++R+ GA A IL FLD+HCEC WLE ++ RV
Sbjct: 154 LAQ-IQKV----KILRNERREGLMRSRVKGADAATAPILTFLDSHCECNQHWLEPMLERV 208
Query: 309 AEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
EDRTRVVCP+ID+I+ F YV + G F+W L F+W ++ +R D P
Sbjct: 209 MEDRTRVVCPIIDVINMDNFQYVGASADLRGGFDWNLVFKWDYMTANQRNARRSDPIAPI 268
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
+TP +AGGLF ID+++F +G YD M VWGGENLE+SFRVWQC GS+EI PCS V H+F
Sbjct: 269 RTPMIAGGLFMIDKSWFDELGKYDMMMDVWGGENLEISFRVWQCQGSLEIIPCSRVGHVF 328
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
RK PY+FPGG V N R A VWMDE+ E+Y+ P A + ++SRLELRK+
Sbjct: 329 RKQHPYTFPGGSGNVFTRNTRRAAEVWMDEYKEYYYAAVPSA-RNVPFGNIQSRLELRKK 387
Query: 489 LKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRH 519
L C F WYL HV+P +P D FG ++
Sbjct: 388 LSCKPFAWYLEHVYPELRIPDKKDVAFGALQQ 419
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 95/157 (60%), Gaps = 6/157 (3%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
+ +S Y N+FN + SD++ R +PD R CKT + E+ LP +S++I FHNEA
Sbjct: 57 TKVSPGGDAYSRNKFNQVESDKLKCTRDIPDTRNQLCKTHTWKEDLLP-TSVIITFHNEA 115
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
S LLRT+ SV+ RSP +++EI+LVDD S + L + + K+ +++R+ R G
Sbjct: 116 RSTLLRTITSVLLRSPPHLIQEIILVDDYSDKPDDGLELAQ-IQKV----KILRNERREG 170
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L+++R+ GA A IL FLD+HCEC + E L +
Sbjct: 171 LMRSRVKGADAATAPILTFLDSHCECNQHWLEPMLER 207
>gi|148675838|gb|EDL07785.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
Length = 603
Score = 305 bits (782), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 226/353 (64%), Gaps = 13/353 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVEKPLA 531
+LR L C SFKW++T + W P + P++ +G IR+V T C + L
Sbjct: 425 KLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTDTKLG 477
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 109/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASAATGDVITFLDSHCEANV 244
>gi|313233395|emb|CBY24510.1| unnamed protein product [Oikopleura dioica]
Length = 679
Score = 305 bits (782), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 172/420 (40%), Positives = 243/420 (57%), Gaps = 16/420 (3%)
Query: 185 HCECT-LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
H EC+ + E LP ++++I +HNEA S LLRTV SV+ RSP +++KEI+LVDD S
Sbjct: 228 HKECSKQKYQVEQLPDTTVIITYHNEAHSTLLRTVISVLHRSPPNLIKEIILVDDFSKNP 287
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+ L + + K+ + IR+P R GLI++R+ GA A G++L FLD+H E GWLE
Sbjct: 288 NIGPPLTK-IKKV----KAIRNPKREGLIRSRVRGAAIATGKVLTFLDSHVEANEGWLEP 342
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ R+ E RT VV P+ID+I F YV + G FNW+L F+W + +R+
Sbjct: 343 LLGRIHESRTAVVSPIIDVIGMDDFHYVGASADLKGGFNWDLVFKWDYMSEQERRERRRA 402
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGGLF+ID+ +F +G YD +M VWGGENLE+SFRVWQC G++EI PCS
Sbjct: 403 PTSPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEISFRVWQCHGTLEIIPCSR 462
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FPGG V N R A VWMDE+ EFYF P A K + R
Sbjct: 463 VGHVFRKKHPYTFPGGSGNVFAKNTRRAAEVWMDEYKEFYFAAVPSA-KMVKFGDISKRT 521
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
E+R++L+C SF W+L +V+P +P D +G + QT+K +E+ + G
Sbjct: 522 EVRERLQCKSFSWFLENVYPELRIPNKDAIGWGAVS--QTNKGLEECIGN---THGGGTL 576
Query: 543 SLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ C H Q F + ++ +C+ E +P V+ C + QRW Y
Sbjct: 577 GMYRC-HGDGGNQEFTLTKEGKEFRHND-LCIGYNAKEPVGNP-VKFNTCHQMSHQRWEY 633
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 89/138 (64%), Gaps = 5/138 (3%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N+FN SDRI +R +PD R +C + + E LP ++++I +HNEA S LLRTV
Sbjct: 204 YEANKFNQAASDRIKSDRAVPDTRHKECSKQKYQVEQLPDTTVIITYHNEAHSTLLRTVI 263
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV+ RSP +++KEI+LVDD S + L + + K+ + IR+P R GLI++R+ GA
Sbjct: 264 SVLHRSPPNLIKEIILVDDFSKNPNIGPPLTK-IKKV----KAIRNPKREGLIRSRVRGA 318
Query: 172 RQAEGEILVFLDAHCECT 189
A G++L FLD+H E
Sbjct: 319 AIATGKVLTFLDSHVEAN 336
>gi|354481325|ref|XP_003502852.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cricetulus griseus]
Length = 715
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 244 HPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 303
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 304 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 362
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 363 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 419
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 420 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 479
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK+ PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 480 VGHIYRKSVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVAQK 536
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 537 RLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTD 585
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 109/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E LP +SI+I FHNE
Sbjct: 210 MTDAEREDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLET-LPNTSIIIPFHNE 268
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 269 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 327
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 328 GLIRTRMLGASAAIGDVITFLDSHCEANV 356
>gi|313246954|emb|CBY35800.1| unnamed protein product [Oikopleura dioica]
Length = 696
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 172/420 (40%), Positives = 243/420 (57%), Gaps = 16/420 (3%)
Query: 185 HCECT-LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
H EC+ + E LP ++++I +HNEA S LLRTV SV+ RSP +++KEI+LVDD S
Sbjct: 245 HKECSKQKYQVEQLPDTTVIITYHNEAHSTLLRTVISVLHRSPPNLIKEIILVDDFSKNP 304
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+ L + + K+ + IR+P R GLI++R+ GA A G++L FLD+H E GWLE
Sbjct: 305 NIGPPLTK-IKKV----KAIRNPKREGLIRSRVRGAAIATGKVLTFLDSHVEANEGWLEP 359
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ R+ E RT VV P+ID+I F YV + G FNW+L F+W + +R+
Sbjct: 360 LLGRIHESRTAVVSPIIDVIGMDDFHYVGASADLKGGFNWDLVFKWDYMSEQERRERRRA 419
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGGLF+ID+ +F +G YD +M VWGGENLE+SFRVWQC G++EI PCS
Sbjct: 420 PTSPIRTPMIAGGLFSIDKNWFHELGEYDMDMDVWGGENLEISFRVWQCHGTLEIIPCSR 479
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FPGG V N R A VWMDE+ EFYF P A K + R
Sbjct: 480 VGHVFRKKHPYTFPGGSGNVFAKNTRRAAEVWMDEYKEFYFAAVPSA-KMVKFGDISKRT 538
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
E+R++L+C SF W+L +V+P +P D +G + QT+K +E+ + G
Sbjct: 539 EVRERLQCKSFSWFLENVYPELRIPNKDAIGWGAVS--QTNKGLEECIGN---THGGGTL 593
Query: 543 SLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+ C H Q F + ++ +C+ E +P V+ C + QRW Y
Sbjct: 594 GMYRC-HGDGGNQEFTLTKEGKEFRHND-LCIGYNAKEPVGNP-VKFNTCHQMSHQRWEY 650
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/138 (43%), Positives = 89/138 (64%), Gaps = 5/138 (3%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N+FN SDRI +R +PD R +C + + E LP ++++I +HNEA S LLRTV
Sbjct: 221 YEANKFNQAASDRIKSDRAVPDTRHKECSKQKYQVEQLPDTTVIITYHNEAHSTLLRTVI 280
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV+ RSP +++KEI+LVDD S + L + + K+ + IR+P R GLI++R+ GA
Sbjct: 281 SVLHRSPPNLIKEIILVDDFSKNPNIGPPLTK-IKKV----KAIRNPKREGLIRSRVRGA 335
Query: 172 RQAEGEILVFLDAHCECT 189
A G++L FLD+H E
Sbjct: 336 AIATGKVLTFLDSHVEAN 353
>gi|194855488|ref|XP_001968556.1| GG24441 [Drosophila erecta]
gi|190660423|gb|EDV57615.1| GG24441 [Drosophila erecta]
Length = 631
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 173/406 (42%), Positives = 239/406 (58%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 197 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 252
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ RVIR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 253 KID-KVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVREDPTR 311
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W S+ ++ D T +TP +A
Sbjct: 312 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIA 371
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 372 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 431
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 432 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 490
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 491 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 542
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + + +V + AC QRW
Sbjct: 543 QEWAFTKRGEIKHDD--LCLTLVTFAR--GSQVVLKACDDSENQRW 584
Score = 125 bits (313), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 101/155 (65%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S
Sbjct: 156 LRNGEDPYIRNRFNQEASDALPSNRDIPDTRNPMCRTKKYRED-LPETSVIITFHNEARS 214
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ RVIR+ R GL+
Sbjct: 215 TLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRVIRNDKREGLV 269
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC ++ E L +
Sbjct: 270 RSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLER 304
>gi|449474909|ref|XP_002194974.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Taeniopygia guttata]
Length = 555
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ E++LVDD S RE
Sbjct: 81 HPNCKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELVAEVVLVDDFSDREH 140
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 141 LKKRLEDYMAQFPS-VRILRTKRREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 199
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 200 LDRIARNRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKPD 256
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 257 PSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 316
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AEF ++ PE + V ++
Sbjct: 317 VGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEFIYQRRPEY-RHLSAGDVAAQK 373
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
ELR L C SFKW++ V W P + P++ +G IR+V T CV+
Sbjct: 374 ELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAWGEIRNVGTGLCVD 422
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 74/149 (49%), Positives = 110/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P CK K++ E+ LP +S++I FHNE
Sbjct: 47 MTDAERVDQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEK-LPNTSVIIPFHNE 105
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ E++LVDD S RE LK L++Y+A+ R++R+ R
Sbjct: 106 GWSSLLRTVHSVLNRSPPELVAEVVLVDDFSDREHLKKRLEDYMAQFPS-VRILRTKRRE 164
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 165 GLIRTRMLGASVAIGDVITFLDSHCEANV 193
>gi|326928540|ref|XP_003210435.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Meleagris gallopavo]
Length = 562
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 87 HPNCKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 146
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 147 LKKRLEDYMAQFPN-VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 205
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 206 LDRIARNRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKLD 262
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 263 PSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 322
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 323 VGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVTAQK 379
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
ELR L C SFKW+++ V W P + P++ +G IR+V T CV+
Sbjct: 380 ELRNNLNCKSFKWFMSEVAWDLPKFYPPVEPPAAAWGEIRNVGTGLCVD 428
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 110/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P CK K++ E+ LP +S++I FHNE
Sbjct: 53 MTDAERVDQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEK-LPNTSVIIPFHNE 111
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A+ R++R+ R
Sbjct: 112 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQFPN-VRILRTKKRE 170
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 171 GLIRTRMLGASVAIGDVITFLDSHCEANV 199
>gi|158286608|ref|XP_308833.4| AGAP006925-PA [Anopheles gambiae str. PEST]
gi|157020549|gb|EAA04096.4| AGAP006925-PA [Anopheles gambiae str. PEST]
Length = 622
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 216/342 (63%), Gaps = 13/342 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP +S++I+F+NE +S LLRTVHSV++ + +L+EI+LVDD ST LK LD YV +
Sbjct: 166 LPSTSVIIIFYNEPYSVLLRTVHSVLNTADGRLLREIILVDDGSTNVELKGKLDYYVRTR 225
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L +V+R RVGLI+ARL GAR A+ ++LVFLDAHCEC + WLE L+ R+ E T V
Sbjct: 226 LPAKVKVLRQRQRVGLIRARLAGARLAKADVLVFLDAHCECMVQWLEPLLERIKESPTSV 285
Query: 316 VCPVIDIISDVTFAYVRS--FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE------P 367
+ P+ID+I F Y + + G F W+ HF W+ + ++++ E P
Sbjct: 286 LVPIIDVIEAKNFYYSTNDYNDFQIGGFTWDGHFDWHDVTKRERERQKRECAEKDLEICP 345
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLFAI R YF+ IG+YDE+M WGGENLEMSFRVWQCGG++E PCS + H+
Sbjct: 346 TYSPTMAGGLFAIARDYFWDIGSYDEQMDGWGGENLEMSFRVWQCGGTLETIPCSRIGHI 405
Query: 428 FRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELR 486
FR PYSFP +G N R+A+VWMD++ E + P+ + + V R LR
Sbjct: 406 FRDFHPYSFPN--DRDTHGINTVRMAIVWMDDYVELLYLNRPDLKDHPELGDVTHRKVLR 463
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVE 527
++L C SF WY+ +V+P F+P + + FGR+ + C++
Sbjct: 464 EKLHCKSFDWYMKNVYPEKFIPTRNVRAFGRLASQADNLCLD 505
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 70/144 (48%), Positives = 97/144 (67%), Gaps = 1/144 (0%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N +S+ + NRT PD R P CK K ++ LP +S++I+F+NE +S LLRTVHSV++
Sbjct: 134 LNEELSEHLSYNRTPPDGRHPACKRKHYDLTGLPSTSVIIIFYNEPYSVLLRTVHSVLNT 193
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAE 175
+ +L+EI+LVDD ST LK LD YV +L +V+R RVGLI+ARL GAR A+
Sbjct: 194 ADGRLLREIILVDDGSTNVELKGKLDYYVRTRLPAKVKVLRQRQRVGLIRARLAGARLAK 253
Query: 176 GEILVFLDAHCECTLVFNEEFLPK 199
++LVFLDAHCEC + + E L +
Sbjct: 254 ADVLVFLDAHCECMVQWLEPLLER 277
>gi|195471053|ref|XP_002087820.1| GE14879 [Drosophila yakuba]
gi|194173921|gb|EDW87532.1| GE14879 [Drosophila yakuba]
Length = 634
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 173/406 (42%), Positives = 238/406 (58%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + L+ +A
Sbjct: 200 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSDNP--EDGLE--LA 255
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ RVIR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 256 KID-KVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVREDPTR 314
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W S+ ++ D T +TP +A
Sbjct: 315 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIA 374
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 375 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 434
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 435 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 493
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 494 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 545
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + + +V + AC QRW
Sbjct: 546 QEWAFTKRGEIKHDD--LCLTLVTFAR--GSQVVLKACDDSENQRW 587
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 100/155 (64%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S
Sbjct: 159 LRNGEDPYIRNRFNQEASDALPSNRDIPDTRNPMCRTKKYRED-LPETSVIITFHNEARS 217
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + L+ +AK+ RVIR+ R GL+
Sbjct: 218 TLLRTIVSVLNRSPEHLIREIVLVDDYSDNP--EDGLE--LAKID-KVRVIRNDKREGLV 272
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC ++ E L +
Sbjct: 273 RSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLER 307
>gi|431904511|gb|ELK09894.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Pteropus alecto]
Length = 557
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 167/418 (39%), Positives = 239/418 (57%), Gaps = 15/418 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S VI FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSVDLPATSFVITFHNEARSTLLRTVKSVLNRTPPNLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A IL FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVASAAILTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKISRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + N G
Sbjct: 402 EQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIK--QGVNCLESQGQDTAGNFLLGVGI 459
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRW 600
P+ +Q + +D + + CL V + AC+ RQRW
Sbjct: 460 CRGSAKNPLASQAWTF---SDHLIQQQEKCLTATSTSISPGSPVILQACNPREGRQRW 514
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S VI FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSVSYSVD-LPATSFVITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPPNLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A IL FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVASAAILTFLDSHCE----VNTEWL 221
>gi|194225134|ref|XP_001495036.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Equus caballus]
Length = 619
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 158/380 (41%), Positives = 229/380 (60%), Gaps = 24/380 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 172 HYSCPSVSYSVDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 227
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 228 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQP 285
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + + G F+W LHF+W + I R D
Sbjct: 286 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASAILRGGFDWSLHFKWEQI-PLEQKIARTD 344
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 345 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 404
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 405 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVATRI 463
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKP-------------L 530
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E +
Sbjct: 464 EQRKKMSCKSFRWYLDNVYPELTVPVKEVLPGIIK--QGVNCLESQGQDTAGNFLLGMGI 521
Query: 531 AKGSMNQASGPASLLPCTHL 550
+GS+ P + L HL
Sbjct: 522 CRGSVKNPPAPQAWLFSDHL 541
Score = 98.6 bits (244), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 140 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSVD-LPATSVIITFHNEAR 198
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 199 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 252
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 253 LIRSRVRGADVATAAVLTFLDSHCE----VNTEWL 283
>gi|46877107|ref|NP_598950.2| polypeptide N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|51315866|sp|Q6P9S7.1|GLT10_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|38148689|gb|AAH60617.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Mus musculus]
gi|74196924|dbj|BAE35020.1| unnamed protein product [Mus musculus]
Length = 603
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 226/353 (64%), Gaps = 13/353 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVEKPLA 531
+LR L C SFKW++T + W P + P++ +G IR+V T C + L
Sbjct: 425 KLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTDTKLG 477
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 109/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASAATGDVVTFLDSHCEANV 244
>gi|26329191|dbj|BAC28334.1| unnamed protein product [Mus musculus]
Length = 528
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 226/353 (64%), Gaps = 13/353 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 57 HPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 116
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 117 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVVTFLDSHCEANVNWLPPL 175
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 176 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 232
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 233 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 292
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 293 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVAQK 349
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVEKPLA 531
+LR L C SFKW++T + W P + P++ +G IR+V T C + L
Sbjct: 350 KLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTDTKLG 402
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 109/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E LP +SI+I FHNE
Sbjct: 23 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLET-LPNTSIIIPFHNE 81
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 82 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 140
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 141 GLIRTRMLGASAATGDVVTFLDSHCEANV 169
>gi|68392893|ref|XP_688194.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Danio
rerio]
Length = 578
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 155/325 (47%), Positives = 212/325 (65%), Gaps = 10/325 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI F+NEAWS LLRTVHSV+ SP +L E++LVDD S RE LK L+ Y+A L
Sbjct: 130 LPTTSVVIAFYNEAWSTLLRTVHSVLETSPDLLLNEVILVDDYSDREHLKEPLENYIADL 189
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A GE+L FLD HCEC GWLE L+ R+ E+ + VV
Sbjct: 190 K-KVRLIRARKREGLVRARLLGASIATGEVLTFLDCHCECHEGWLEPLLQRIKEEPSAVV 248
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ E G F+W L F W++ + +R T+ ++P MAG
Sbjct: 249 CPVIDVIDWNTFQYLGNPGEPQIGGFDWRLVFTWHSIPEHEQK-RRSAATDVVRSPTMAG 307
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+++ YF ++G YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K +PYS
Sbjct: 308 GLFAVNKKYFLYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPKKAPYS 367
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMD++ E Y+ +P A + V R +LR +L+C F+
Sbjct: 368 RNKALA-----NSVRAAEVWMDDFKEVYYHRSPHA-RLEAYGDVTDRRKLRMRLRCKDFR 421
Query: 496 WYLTHVWPHHFLPMDDK-FFGRIRH 519
W+L +++P +P D FG +++
Sbjct: 422 WFLDNIYPDIQVPEDKPGMFGMLKN 446
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 69/134 (51%), Positives = 91/134 (67%), Gaps = 1/134 (0%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
++ N VSDRI ++R L + P C+ ++ LP +S+VI F+NEAWS LLRTVHSV+
Sbjct: 96 HQINTYVSDRISLHRRLAERWNPLCRNLKYDYLSLPTTSVVIAFYNEAWSTLLRTVHSVL 155
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +L E++LVDD S RE LK L+ Y+A L R+IR+ R GL++ARLLGA A
Sbjct: 156 ETSPDLLLNEVILVDDYSDREHLKEPLENYIADLK-KVRLIRARKREGLVRARLLGASIA 214
Query: 175 EGEILVFLDAHCEC 188
GE+L FLD HCEC
Sbjct: 215 TGEVLTFLDCHCEC 228
>gi|260841393|ref|XP_002613900.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
gi|229299290|gb|EEN69909.1| hypothetical protein BRAFLDRAFT_208719 [Branchiostoma floridae]
Length = 442
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 161/357 (45%), Positives = 227/357 (63%), Gaps = 12/357 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S++I+F+NEAWS LLRTVHSV+ SP +L+E++LVDD ST + LK+ L+ Y++ L
Sbjct: 83 LPQTSVIIIFYNEAWSTLLRTVHSVLEASPAELLREVILVDDCSTFDHLKAPLETYLSTL 142
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R++RSP R GLI+ARLLGA A GE+L FLD+HCEC GWLE + +A + T V
Sbjct: 143 PQ-VRLVRSPKRQGLIRARLLGALHARGEVLTFLDSHCECMHGWLEPQLETIARNYTTVP 201
Query: 317 CPVIDIISDVTFAY----VRSFELHWGAFNW-ELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
V+D I TF Y ++S ++ G N+ EL F W + ++K +P ++P
Sbjct: 202 ISVLDNILHDTFQYTFMDLQSTQM--GGINFKELTFIWEPIPEHERR-RQKSPVDPIRSP 258
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGG+F+I++ YF ++GAYD M+VWGGEN+EMSFR+WQCGG+I + PCSHV H+FR
Sbjct: 259 TMAGGIFSINKKYFEYLGAYDTGMEVWGGENIEMSFRIWQCGGTIVVLPCSHVGHVFRPT 318
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKC 491
SPYS G + L N R+A VWMD++ E Y++ +PE K D V R LRK L C
Sbjct: 319 SPYST-GDAWKKLVHNNRRMAEVWMDDYKEIYYRKHPEYRKY-DMGDVTQRKLLRKGLHC 376
Query: 492 HSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPC 547
F WYL+HV+P ++P + G++ HV + + L + PA + PC
Sbjct: 377 RDFSWYLSHVFPTLYVPDIRPIAHGQVSHVTSISSEQTGLCLDVIKAGKEPAGVFPC 433
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/189 (43%), Positives = 119/189 (62%), Gaps = 20/189 (10%)
Query: 38 DGGLHSNLS----DAQQLYQ----INRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFL 89
+GG NL + ++LY+ N FN S +I ++R+LPD+R CK K F L
Sbjct: 25 EGGHGVNLQPSTPEEKRLYKEGLKNNSFNAWASSKISLHRSLPDLRHRLCKQKQFFRP-L 83
Query: 90 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 149
P++S++I+F+NEAWS LLRTVHSV+ SP +L+E++LVDD ST + LK+ L+ Y++ L
Sbjct: 84 PQTSVIIIFYNEAWSTLLRTVHSVLEASPAELLREVILVDDCSTFDHLKAPLETYLSTLP 143
Query: 150 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCEC----------TLVFNEEFLPK 199
R++RSP R GLI+ARLLGA A GE+L FLD+HCEC T+ N +P
Sbjct: 144 Q-VRLVRSPKRQGLIRARLLGALHARGEVLTFLDSHCECMHGWLEPQLETIARNYTTVPI 202
Query: 200 SSIVIVFHN 208
S + + H+
Sbjct: 203 SVLDNILHD 211
>gi|195342262|ref|XP_002037720.1| GM18147 [Drosophila sechellia]
gi|194132570|gb|EDW54138.1| GM18147 [Drosophila sechellia]
Length = 606
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 173/406 (42%), Positives = 239/406 (58%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 172 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 227
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ RVIR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 228 KID-KVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVREDPTR 286
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W S+ ++ D T +TP +A
Sbjct: 287 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIA 346
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 347 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 406
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 407 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 465
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 466 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 517
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + + +V + AC QRW
Sbjct: 518 QEWAFTKRGEIKHDD--LCLTLVTFAR--GSQVVLKACDDSENQRW 559
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 62/139 (44%), Positives = 93/139 (66%), Gaps = 6/139 (4%)
Query: 61 VSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRS 120
SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S LLRT+ SV++RSP
Sbjct: 147 ASDALPSNRDIPDTRNPMCRTKKYRED-LPETSVIITFHNEARSTLLRTIVSVLNRSPEH 205
Query: 121 MLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
+++EI+LVDD S + + L+ +AK+ RVIR+ R GL+++R+ GA A +L
Sbjct: 206 LIREIVLVDDYS--DHPEDGLE--LAKID-KVRVIRNDKREGLVRSRVKGADAAVSSVLT 260
Query: 181 FLDAHCECTLVFNEEFLPK 199
FLD+H EC ++ E L +
Sbjct: 261 FLDSHVECNEMWLEPLLER 279
>gi|74186700|dbj|BAE34806.1| unnamed protein product [Mus musculus]
Length = 603
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 155/353 (43%), Positives = 226/353 (64%), Gaps = 13/353 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVEKPLA 531
+LR L C SFKW++T + W P + P++ +G IR+V T C + L
Sbjct: 425 KLRVSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTDTKLG 477
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 109/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNMYVSDKISLNRSLPDIRHPNCNSKLYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASAATGDVITFLDSHCEANV 244
>gi|18543347|ref|NP_570098.1| polypeptide N-acetylgalactosaminyltransferase 10 [Rattus
norvegicus]
gi|51315730|sp|Q925R7.1|GLT10_RAT RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|14150450|gb|AAK54498.1|AF241241_1 UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase T9 [Rattus
norvegicus]
gi|149052685|gb|EDM04502.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Rattus norvegicus]
Length = 603
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 KLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTD 473
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 109/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKLYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASAATGDVITFLDSHCEANV 244
>gi|403285674|ref|XP_003934138.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Saimiri boliviensis boliviensis]
Length = 682
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 211 HPNCNSKHYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 270
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 271 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 329
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 330 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 386
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 387 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 446
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 447 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVTAQK 503
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 504 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 552
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 177 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKHYLET-LPNTSIIIPFHNE 235
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 236 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 294
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 295 GLIRTRMLGASVATGDVITFLDSHCEANV 323
>gi|34042922|gb|AAQ56700.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Drosophila melanogaster]
Length = 615
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 173/406 (42%), Positives = 239/406 (58%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 181 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 236
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ RVIR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 237 KID-KVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVREDPTR 295
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W S+ ++ D T +TP +A
Sbjct: 296 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIA 355
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 356 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 415
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 416 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 474
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 475 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 526
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + + +V + AC QRW
Sbjct: 527 QEWAFTKRGEIKHDD--LCLTLVTFAR--GSQVVLKACDDSENQRW 568
Score = 125 bits (313), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 101/155 (65%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S
Sbjct: 140 LRNGEDPYIRNRFNQEASDALPSNRDIPDTRNPMCRTKKYRED-LPETSVIITFHNEARS 198
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ RVIR+ R GL+
Sbjct: 199 TLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRVIRNDKREGLV 253
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC ++ E L +
Sbjct: 254 RSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLER 288
>gi|62484229|ref|NP_608773.2| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|320594323|ref|NP_995625.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
gi|195576320|ref|XP_002078024.1| GD22759 [Drosophila simulans]
gi|51315875|sp|Q6WV19.2|GALT2_DROME RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 2;
Short=pp-GaNTase 2; AltName: Full=Protein-UDP
acetylgalactosaminyltransferase 2; AltName:
Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 2
gi|61678274|gb|AAF51113.3| polypeptide GalNAc transferase 2, isoform A [Drosophila
melanogaster]
gi|194190033|gb|EDX03609.1| GD22759 [Drosophila simulans]
gi|318068299|gb|AAS64620.2| polypeptide GalNAc transferase 2, isoform B [Drosophila
melanogaster]
Length = 633
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 173/406 (42%), Positives = 239/406 (58%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 199 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 254
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ RVIR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 255 KID-KVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVREDPTR 313
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W S+ ++ D T +TP +A
Sbjct: 314 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIA 373
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 374 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 433
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 434 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 492
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 493 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 544
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + + +V + AC QRW
Sbjct: 545 QEWAFTKRGEIKHDD--LCLTLVTFAR--GSQVVLKACDDSENQRW 586
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 101/155 (65%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S
Sbjct: 158 LRNGEDPYIRNRFNQEASDALPSNRDIPDTRNPMCRTKKYRED-LPETSVIITFHNEARS 216
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ RVIR+ R GL+
Sbjct: 217 TLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRVIRNDKREGLV 271
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC ++ E L +
Sbjct: 272 RSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLER 306
>gi|33589464|gb|AAQ22499.1| RE02655p [Drosophila melanogaster]
Length = 633
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 173/406 (42%), Positives = 239/406 (58%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 199 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 254
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ RVIR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 255 KID-KVRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVREDPTR 313
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W S+ ++ D T +TP +A
Sbjct: 314 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIA 373
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 374 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 433
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 434 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 492
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 493 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 544
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + + +V + AC QRW
Sbjct: 545 QEWAFTKRGEIKHDD--LCLTLVTFAR--GSQVVLKACDDSENQRW 586
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/155 (43%), Positives = 101/155 (65%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S
Sbjct: 158 LRNGEDPYIRNRFNQEASDALPSNRDIPDTRNPMCRTKKYRED-LPETSVIITFHNEARS 216
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ RVIR+ R GL+
Sbjct: 217 TLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRVIRNDKREGLV 271
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC ++ E L +
Sbjct: 272 RSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLER 306
>gi|350594474|ref|XP_003134177.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Sus
scrofa]
Length = 624
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 153 HPNCNSKRYLEMLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 212
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 213 LKKPLEDYMA-LFPNVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 271
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 272 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 328
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 329 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 388
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 389 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVAAQK 445
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 446 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 494
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 119 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRY-LEMLPNTSIIIPFHNE 177
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 178 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPNVRILRTKKRE 236
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 237 GLIRTRMLGASAATGDVITFLDSHCEANV 265
>gi|194761420|ref|XP_001962927.1| GF15680 [Drosophila ananassae]
gi|190616624|gb|EDV32148.1| GF15680 [Drosophila ananassae]
Length = 630
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 173/406 (42%), Positives = 240/406 (59%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 196 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 251
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ RVIR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 252 KID-KVRVIRNDKREGLVRSRVRGADAAVSGVLTFLDSHVECNERWLEPLLERVREDPTR 310
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W S+ ++ D T +TP +A
Sbjct: 311 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTPMIA 370
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 371 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 430
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 431 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 489
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 490 KWYLENVYPDLQAP-DPQEIGQFRQDGT-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 541
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + ++ +V + AC QRW
Sbjct: 542 QEWAFSKRGEIKHDD--LCLTLVQFAR--GSQVVLKACDESENQRW 583
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 99/155 (63%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+ K + E+ LP++S++I FHNEA S
Sbjct: 155 LRNGEDPYIRNRFNQEASDSLPSNREIPDTRNPMCRNKKYRED-LPETSVIITFHNEARS 213
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ RVIR+ R GL+
Sbjct: 214 TLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRVIRNDKREGLV 268
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC + E L +
Sbjct: 269 RSRVRGADAAVSGVLTFLDSHVECNERWLEPLLER 303
>gi|332839183|ref|XP_001147578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 6 isoform
5 [Pan troglodytes]
Length = 638
Score = 305 bits (780), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 171/380 (45%), Positives = 234/380 (61%), Gaps = 35/380 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L
Sbjct: 176 LATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQL 235
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V RV+R R GLI ARLLGA A+ E+L FLDAHCEC GWLE L+AR+AED+T VV
Sbjct: 236 QV-VRVVRQEERKGLITARLLGASVAQAEVLTFLDAHCECFHGWLEPLLARIAEDKTVVV 294
Query: 317 CPVIDIISDVTFAYVRSFE---LHW-GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + +H G F+W L F W T + +RKD T P K+P
Sbjct: 295 SPDIVTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQ-RRKDETYPIKSPT 353
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 354 FAGGLFSISKSYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 413
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR---SRLELRKQ- 488
P++FP G S V+ N R+A VWMD + + +++ N +A K Q VR SR L ++
Sbjct: 414 PHTFPKGTS-VIARNQVRLAEVWMDSYKKIFYRRNLQAAKM--AQEVRGNGSRRGLDEEK 470
Query: 489 -------LKCHSFKWYLTHVW----------PHHFLP-MDDKFFGRIRHVQTHKCVEKPL 530
+ + ++ +W P F+P + F+G I+++ T++C++
Sbjct: 471 YGPQTLFMGLMAGTHLISTIWRLPSPSGTFYPERFVPDLTPTFYGAIKNLGTNQCLD--- 527
Query: 531 AKGSMNQASGPASLLPCTHL 550
G N+ P + C L
Sbjct: 528 -VGENNRGGKPLIMYSCHGL 546
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 77/144 (53%), Positives = 100/144 (69%), Gaps = 3/144 (2%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R+L PD R P+C + F L +S++IVFHNEAWS
Sbjct: 132 EKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECVDQKFRRCPPLATTSVIIVFHNEAWS 191
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E LK L++YV +L V RV+R R GLI
Sbjct: 192 TLLRTVYSVLHTTPAILLKEIILVDDASTEEHLKEKLEQYVKQLQV-VRVVRQEERKGLI 250
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
ARLLGA A+ E+L FLDAHCEC
Sbjct: 251 TARLLGASVAQAEVLTFLDAHCEC 274
>gi|118097436|ref|XP_414578.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Gallus
gallus]
Length = 611
Score = 305 bits (780), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 137 HPNCKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 196
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 197 LKKRLEDYMAQFPN-VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 255
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 256 LDRIARNRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKLD 312
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 313 PSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 372
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 373 VGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVTAQK 429
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
ELR L C SFKW++ V W P + P++ +G IR+V T CV+
Sbjct: 430 ELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAWGEIRNVGTGLCVD 478
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 110/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P CK K++ E+ LP +S++I FHNE
Sbjct: 103 MTDAERVDQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEK-LPNTSVIIPFHNE 161
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A+ R++R+ R
Sbjct: 162 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQFPN-VRILRTKKRE 220
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 221 GLIRTRMLGASVAIGDVITFLDSHCEANV 249
>gi|260794623|ref|XP_002592308.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
gi|229277524|gb|EEN48319.1| hypothetical protein BRAFLDRAFT_206872 [Branchiostoma floridae]
Length = 374
Score = 304 bits (779), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 207/313 (66%), Gaps = 9/313 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S++I FHNEAWS L+RTVHSV+ +P +L E+++VDD S + LK+ L +YVA L
Sbjct: 62 LPAVSVIICFHNEAWSTLMRTVHSVLRTAPSELLTEVIMVDDDSQYDHLKAQLTDYVAGL 121
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR+ R GLI+ARLLGA A ++LVFLD+HCEC +GWLE L+ R+ ++R+ VV
Sbjct: 122 P-KVKLIRTHQREGLIRARLLGASHARADVLVFLDSHCECNIGWLEPLLDRIVQNRSHVV 180
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT-EPFKTPAMAG 375
PVID+I TF Y + F+W L FRW +S KR+ + +P +P MAG
Sbjct: 181 TPVIDVIDFKTFEYRHLAIIQVRGFDWRLIFRWEKIPASYE--KRRGLSVDPILSPTMAG 238
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAID+ YF H+G YD M++WGGENLE+SFR+WQCGG++EI PCS V H+FR+ PY
Sbjct: 239 GLFAIDKEYFHHLGLYDTGMEIWGGENLELSFRIWQCGGTLEIMPCSRVGHVFRQRFPYQ 298
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
+EV NL RVA VWMD++ E++++ K++ V R ELR++L+C F
Sbjct: 299 TS---TEVTTRNLMRVAEVWMDQYKEYFYQIR--HIKKKSFGDVTERQELRRRLQCRDFH 353
Query: 496 WYLTHVWPHHFLP 508
WYL ++P P
Sbjct: 354 WYLNTIYPELKFP 366
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/171 (45%), Positives = 110/171 (64%), Gaps = 6/171 (3%)
Query: 34 GDSVDGGLHSNLS--DAQQL---YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEF 88
G+ G + NLS D +QL Y+ FN S +IP+ RT+PD R CK+K ++
Sbjct: 2 GELGQGVVLRNLSPQDRKQLEEGYKKYAFNEFASTKIPLTRTIPDGRHWLCKSKEYDVSR 61
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP S++I FHNEAWS L+RTVHSV+ +P +L E+++VDD S + LK+ L +YVA L
Sbjct: 62 LPAVSVIICFHNEAWSTLMRTVHSVLRTAPSELLTEVIMVDDDSQYDHLKAQLTDYVAGL 121
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++IR+ R GLI+ARLLGA A ++LVFLD+HCEC + + E L +
Sbjct: 122 P-KVKLIRTHQREGLIRARLLGASHARADVLVFLDSHCECNIGWLEPLLDR 171
>gi|402873191|ref|XP_003900469.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Papio
anubis]
Length = 637
Score = 304 bits (779), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 166 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 225
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 226 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 284
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 285 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 341
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 342 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 401
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 402 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAAQK 458
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 459 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 507
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 132 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 190
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 191 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 249
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 250 GLIRTRMLGASVATGDVITFLDSHCEANV 278
>gi|291190646|ref|NP_001167159.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
gi|223648406|gb|ACN10961.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Salmo salar]
Length = 560
Score = 304 bits (779), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 158/332 (47%), Positives = 213/332 (64%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SS++I FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + L + + K+
Sbjct: 124 LPASSVIITFHNEARSALLRTVVSVLKKSPPHLVKEIILVDDYSDNPEDGALLGK-IEKI 182
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 183 ----RVLRNDRREGLMRSRVRGADTATASVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 238
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W +++ + T P KTP +AGG
Sbjct: 239 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTVEQRRVRQGNPTAPIKTPMIAGG 298
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 299 LFVMDKDYFELLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 358
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SR+E++K+L C FKW
Sbjct: 359 PGGSGTVFARNTRRAAEVWMDEFKNFYYAAVPSA-RNVPYGNIQSRMEMKKRLGCQPFKW 417
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 418 YLENVYPELRVPDHQDIAFGALQ--QGGNCLD 447
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/152 (42%), Positives = 95/152 (62%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R C+ K + E LP SS++I FHNEA SALL
Sbjct: 84 GQDPYARNKFNQVESDKLRMDRGVPDTRHDHCQHKQWKTE-LPASSVIITFHNEARSALL 142
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + L + + K+ RV+R+ R GL+++R
Sbjct: 143 RTVVSVLKKSPPHLVKEIILVDDYSDNPEDGALLGK-IEKI----RVLRNDRREGLMRSR 197
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A +L FLD+HCEC + E L +
Sbjct: 198 VRGADTATASVLTFLDSHCECNEHWLEPLLER 229
>gi|405951291|gb|EKC19216.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Crassostrea
gigas]
Length = 613
Score = 304 bits (779), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 157/330 (47%), Positives = 209/330 (63%), Gaps = 6/330 (1%)
Query: 200 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP 259
+SI++ F NE SALLR VHS+ ++P+ ++KEI+LVDD+ST + L ++ YV +
Sbjct: 148 TSIIVCFFNEQPSALLRLVHSINDQTPQELVKEIILVDDSSTLDDLSCQIENYVNQHFNN 207
Query: 260 TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPV 319
R++R+P R GLI+AR+ GA A G++LVFLD+HCE WLE L+ R++ D T VV PV
Sbjct: 208 VRLVRTPEREGLIRARVFGANLASGQVLVFLDSHCEVNTDWLEPLLLRISHDPTTVVVPV 267
Query: 320 IDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF-TEPFKTPAMAGGLF 378
IDII+ T Y +S L G FNW LHF W ++ D ++P +P MAGGLF
Sbjct: 268 IDIINHDTMEYQQS-PLVRGGFNWGLHFSWDRLPDNEK--NDPDLGSKPILSPTMAGGLF 324
Query: 379 AIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPG 438
A+ R YF H+G YD M +WGGENLE+SFR+W CGG +EI PCS V H+FRK PY P
Sbjct: 325 AMKRDYFHHLGEYDLGMDIWGGENLEISFRIWMCGGKLEIIPCSRVGHIFRKRRPYGNPK 384
Query: 439 GVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYL 498
G L +L RVA VWMD++ E++ K P+A+ D + R+ LRK L C SFKWYL
Sbjct: 385 GRDTFLKNSL-RVANVWMDKYKEYFLKQRPQAQVV-DYGDISDRISLRKHLSCKSFKWYL 442
Query: 499 THVWPHHFLPMDDKFFGRIRHVQTHKCVEK 528
HV+P LP D K + H Q K +K
Sbjct: 443 DHVYPELSLPGDVKPSNKSSHHQPMKSNDK 472
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/146 (45%), Positives = 95/146 (65%), Gaps = 1/146 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ FN LVSD+I +R +PD R PKC+ F L +SI++ F NE SALLR VH
Sbjct: 109 YQNFAFNALVSDKIGFHRAIPDTRYPKCQDVTFPAINL-DTSIIVCFFNEQPSALLRLVH 167
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+ ++P+ ++KEI+LVDD+ST + L ++ YV + R++R+P R GLI+AR+ GA
Sbjct: 168 SINDQTPQELVKEIILVDDSSTLDDLSCQIENYVNQHFNNVRLVRTPEREGLIRARVFGA 227
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFL 197
A G++LVFLD+HCE + E L
Sbjct: 228 NLASGQVLVFLDSHCEVNTDWLEPLL 253
>gi|432110716|gb|ELK34193.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Myotis davidii]
Length = 466
Score = 304 bits (779), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 215/330 (65%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP +SI+I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L+
Sbjct: 15 YDYDQLPTTSIIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLEN 74
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++KL R+IR+ R GL++ARLLGA A+G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 75 ELSKLP-KVRLIRANKREGLVRARLLGASAAKGQVLTFLDCHCECHEGWLEPLLQRIQEE 133
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+ + + R + ++
Sbjct: 134 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPERERMRMRSP-VDVIRS 192
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 193 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 252
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 253 QAPYSRKKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 306
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 307 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 336
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 115/216 (53%), Gaps = 42/216 (19%)
Query: 74 VRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAST 133
+ + +CK K ++ + LP +SI+I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S
Sbjct: 5 LSQTRCKEKKYDYDQLPTTSIIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSD 64
Query: 134 REFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFN 193
RE LK L+ ++KL R+IR+ R GL++ARLLGA A+G++L FLD HCEC
Sbjct: 65 REHLKERLENELSKLP-KVRLIRANKREGLVRARLLGASAAKGQVLTFLDCHCEC----- 118
Query: 194 EEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV 253
+E W L + I +++ ++ V D +T E+L +S + +
Sbjct: 119 --------------HEGW---LEPLLQRIQEEESAVVCPVIDVIDWNTFEYLGNSGEPQI 161
Query: 254 A-------------------KLSVPTRVIRSPGRVG 270
++ P VIRSP G
Sbjct: 162 GGFDWRLVFTWHVVPERERMRMRSPVDVIRSPTMAG 197
>gi|426233584|ref|XP_004010796.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Ovis
aries]
Length = 557
Score = 304 bits (779), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 165/418 (39%), Positives = 238/418 (56%), Gaps = 15/418 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSMSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAAAFFTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + N G
Sbjct: 402 EQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIK--QGTNCLESQGQDTAGNFQLGMGI 459
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRW 600
P Q ++ TD + + CL V + AC+ RQRW
Sbjct: 460 CRGSAKNPPAAQAWLF---TDHLIQQQGKCLAATSTSVSPGSLVVLQACNPREGRQRW 514
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 91/155 (58%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVAAAAFFTFLDSHCE----VNTEWL 221
>gi|390347275|ref|XP_003726736.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Strongylocentrotus purpuratus]
Length = 507
Score = 304 bits (779), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 171/431 (39%), Positives = 244/431 (56%), Gaps = 23/431 (5%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE---F 244
C + LP +S++IVFHNEA S LLRT+ SVI RSPR +L+EI+LVDDAS+ E +
Sbjct: 34 CRMFRYRHDLPSASVIIVFHNEAKSVLLRTIWSVIVRSPRHLLEEIVLVDDASSLEGEDW 93
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA--EGEILVFLDAHCECTLGWLE 302
+ L++YV+ L VP R++R P R G++ +RL G A +LVFLD+HCECT GWLE
Sbjct: 94 YRGWLEDYVSHLPVPARLLRMPHRSGVVGSRLAGVSAASRNATVLVFLDSHCECTEGWLE 153
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIK 360
L+ + + TRVV P ID ISD F+Y +R G F+W F W +
Sbjct: 154 PLLDCINRNSTRVVSPAIDSISDTDFSYTFIRGIART-GGFSWFPEFMWTHAPQREMKRV 212
Query: 361 RKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
+D P +TP +AGGLFAIDR +F +G YD E+ VWG ENLE+SF+VWQCGGS+E+ P
Sbjct: 213 WQDAATPLRTPTIAGGLFAIDRKFFKSLGYYDPELHVWGSENLELSFKVWQCGGSLEVVP 272
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
CS V H+FR PY FPG VL N RV VW + ++ PE + D +
Sbjct: 273 CSRVGHVFRSKPPYDFPGNPETVLLRNNKRVLEVWGGQIKHLFYGLTPEYQAV-DAGDIS 331
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASG 540
SR+ +R +LKC +F+WYL +V+P + LP++ + GR + + C++ + + + G
Sbjct: 332 SRIRIRDELKCKNFEWYLENVYPENILPLNFQALGRFMNEGVNLCID--VLHATDGRRMG 389
Query: 541 PASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG--FNR- 597
+ L Q F + L + C+ E +N V +L C +NR
Sbjct: 390 AHLAVNACREGALAQTFSWNDLSQL--RHDRFCITAVEGDN----HVMLLECQDVHYNRL 443
Query: 598 ---QRWTYDKE 605
Q+W Y+++
Sbjct: 444 LTNQQWQYNEQ 454
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 107/155 (69%), Gaps = 6/155 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
A L++ ++FN+ SD+I +NRTLPD R P+C+ + + LP +S++IVFHNEA S LL
Sbjct: 3 ADVLFKKHQFNVFASDQIALNRTLPDHRPPRCRMFRYRHD-LPSASVIIVFHNEAKSVLL 61
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTRE---FLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
RT+ SVI RSPR +L+EI+LVDDAS+ E + + L++YV+ L VP R++R P R G++
Sbjct: 62 RTIWSVIVRSPRHLLEEIVLVDDASSLEGEDWYRGWLEDYVSHLPVPARLLRMPHRSGVV 121
Query: 165 KARLLGARQA--EGEILVFLDAHCECTLVFNEEFL 197
+RL G A +LVFLD+HCECT + E L
Sbjct: 122 GSRLAGVSAASRNATVLVFLDSHCECTEGWLEPLL 156
>gi|357625888|gb|EHJ76177.1| hypothetical protein KGM_07902 [Danaus plexippus]
Length = 535
Score = 304 bits (779), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 173/430 (40%), Positives = 242/430 (56%), Gaps = 24/430 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP S+V+ FHNE WS LLRT +SV++RSP ++KE+ LVDDAST++F
Sbjct: 60 HPGCQNRLYIESLPTVSVVVPFHNEHWSTLLRTAYSVLNRSPTFLIKEVFLVDDASTKDF 119
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK LD+YV+K ++IR R GLI ARL GA +A ++LVFLD+H E + WL L
Sbjct: 120 LKEQLDDYVSKHMPKVKIIRLKSRSGLIAARLAGAEKATADVLVFLDSHTEANVNWLPPL 179
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ +A + VVCP ID+++ TFAY E GAF+WEL ++ +D +
Sbjct: 180 LEPIALNYKTVVCPFIDVVAYDTFAYRAQDEGARGAFDWELFYKRLPVLPAD----EANM 235
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
EPF +P MAGGLFAI R +F+ +G YD + +WGGE E+SF++WQCGG + APCS V
Sbjct: 236 PEPFPSPVMAGGLFAISRVFFWELGGYDPGLDIWGGEQYELSFKLWQCGGKMLDAPCSRV 295
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H++RK +P+ PG + + N RVA VWMDE+A++ +K P K D + +
Sbjct: 296 GHIYRKFAPFPNPGH-GDFVGKNYRRVAEVWMDEYAQYLYKRRPHYLKI-DTGDISKQKA 353
Query: 485 LRKQLKCHSFKWYLTHV---WPHHFLPMDDKFF--GRIRHVQ-THKCVEKPLAKGSMNQA 538
LR++L+C FKW++T + + P++ K F GRIR H CV+
Sbjct: 354 LREKLQCKPFKWFMTQIAFDLTAKYPPVEPKPFAEGRIRPATYPHLCVD-----AHHGNQ 408
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS---GF 595
L CT Q F++ D+ + ++C D+P D SPR IL S G
Sbjct: 409 MDKLHLKSCTASTSAEQNFMLSWHKDIKSKTRNMCWDLP----DSSPRSPILLYSCHLGG 464
Query: 596 NRQRWTYDKE 605
Q W Y E
Sbjct: 465 GNQLWRYHPE 474
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 73/142 (51%), Positives = 103/142 (72%), Gaps = 1/142 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+ LY +N FN +SD+IP+NR+LPD+R P C+ +++ E LP S+V+ FHNE WS LLR
Sbjct: 33 KDLYAVNGFNGALSDKIPLNRSLPDIRHPGCQNRLYIES-LPTVSVVVPFHNEHWSTLLR 91
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T +SV++RSP ++KE+ LVDDAST++FLK LD+YV+K ++IR R GLI ARL
Sbjct: 92 TAYSVLNRSPTFLIKEVFLVDDASTKDFLKEQLDDYVSKHMPKVKIIRLKSRSGLIAARL 151
Query: 169 LGARQAEGEILVFLDAHCECTL 190
GA +A ++LVFLD+H E +
Sbjct: 152 AGAEKATADVLVFLDSHTEANV 173
>gi|327278031|ref|XP_003223766.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like [Anolis carolinensis]
Length = 602
Score = 304 bits (779), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HPNCKHKVYLEKLPNTSIIIPFHNEGWTSLLRTIHSIINRTPNSLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+A+ V R++R+ R GLI+ RLLGA A+GE+L FLD+HCE + WL L
Sbjct: 187 LKEKLEEYMARF-VKVRIVRTKKREGLIRTRLLGASIAKGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRTD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVNRKWFWDLGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYVYQRRPEY-RHLSTGDLSAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK LKC FKW++ V W P +F P++ +G IR+V + CV+
Sbjct: 420 ELRKHLKCKDFKWFMAAVAWDVPKYFPPVEPLPASWGEIRNVAANLCVD 468
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 75/139 (53%), Positives = 105/139 (75%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R P CK KV+ E+ LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+A+ V R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPNSLIAEIILVDDFSDREHLKEKLEEYMARF-VKVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A+GE+L FLD+HCE +
Sbjct: 221 SIAKGEVLTFLDSHCEVNV 239
>gi|261244898|ref|NP_778197.2| polypeptide N-acetylgalactosaminyltransferase-like 6 [Mus musculus]
gi|311103009|gb|ADP69005.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 20 [Mus musculus]
Length = 601
Score = 304 bits (779), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK+LKC FKW++ V W P ++ P++ +G IR V + CV+
Sbjct: 420 ELRKRLKCKDFKWFMAAVAWDVPKYYPPVEPLPAAWGEIRSVAANLCVD 468
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L++Y+A+ S R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPESLIAEIILVDDFSDREHLKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SVARGEVLTFLDSHCEVNV 239
>gi|260812139|ref|XP_002600778.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
gi|229286068|gb|EEN56790.1| hypothetical protein BRAFLDRAFT_127524 [Branchiostoma floridae]
Length = 561
Score = 304 bits (779), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 161/390 (41%), Positives = 232/390 (59%), Gaps = 22/390 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S+VI FHNE W+ LLRTVHSV++RSP ++ EI+LVDD S R L L++YVAKL
Sbjct: 98 LPDVSLVIPFHNEGWTTLLRTVHSVLNRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKL 157
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
S RV+R+ R GLI+ RLLGA+ A+G++L+FLD+HCE + WL L+ +A ++ +V
Sbjct: 158 SPKVRVVRTKQREGLIRTRLLGAQVAKGQVLIFLDSHCEANVNWLPPLLEPIALNKKTIV 217
Query: 317 CPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CP ID+I F Y ++ + GAF+WE++++ IK D ++PF++P MAG
Sbjct: 218 CPNIDVIDKDDFHYETQAGDAMRGAFDWEMYYKRIPIPDE---IKNPDPSDPFESPVMAG 274
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+DR YF +G YD + +WGGE E+SF+VWQCGG + APCS V H++RK PY
Sbjct: 275 GLFAVDREYFEELGGYDPGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRKFVPYK 334
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P GV+ L NL RVA VWMDE+ E +K P K D + +L+LR++LKC FK
Sbjct: 335 VPAGVN--LGKNLKRVAEVWMDEYKEHLYKRRPHLRKT-DMGDISGQLQLRERLKCKPFK 391
Query: 496 WYLTHVWPH---HFLPMDDK--FFGRIRHVQTHKCVEKPLAKGS----MNQASGPASLLP 546
W++ V P H+ P++ + G I + ++ C++ G ++Q ++
Sbjct: 392 WFMKVVAPDIILHYPPVEPEPAASGEIWNKASNLCIDSKHGGGQAEVRLDQCVKGGGIMN 451
Query: 547 CTHLPVLTQMFVMKLPTDLIATDESVCLDV 576
Q F M D+ + C D
Sbjct: 452 G------EQNFHMSWHNDIRPKGRTFCFDA 475
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 70/139 (50%), Positives = 100/139 (71%), Gaps = 1/139 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ FN+ S++I ++R +PD+R P C +K + + LP S+VI FHNE W+ LLRTVH
Sbjct: 62 YEECGFNIKASNKISLDRAIPDIRHPNCASKKYVRD-LPDVSLVIPFHNEGWTTLLRTVH 120
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV++RSP ++ EI+LVDD S R L L++YVAKLS RV+R+ R GLI+ RLLGA
Sbjct: 121 SVLNRSPEQLIHEIILVDDFSDRSHLGKDLEDYVAKLSPKVRVVRTKQREGLIRTRLLGA 180
Query: 172 RQAEGEILVFLDAHCECTL 190
+ A+G++L+FLD+HCE +
Sbjct: 181 QVAKGQVLIFLDSHCEANV 199
>gi|417402722|gb|JAA48197.1| Putative polypeptide n-acetylgalactosaminyltransferase [Desmodus
rotundus]
Length = 557
Score = 304 bits (778), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 164/418 (39%), Positives = 240/418 (57%), Gaps = 15/418 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSLSYSADLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVASAAVLTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + N G
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIK--QGVNCLESQGQDTAGNFLLGVGI 459
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS-GFNRQRW 600
P Q ++ +D + + CL V + AC+ RQRW
Sbjct: 460 CRGSAKNPPAPQAWLF---SDHLIQQQGKCLTATSTSVSPGSPVTLQACNLREGRQRW 514
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSSDRPTRDTRHYSCPSLSYSAD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVASAAVLTFLDSHCE----VNTEWL 221
>gi|345803601|ref|XP_537492.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Canis lupus
familiaris]
Length = 557
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 158/380 (41%), Positives = 227/380 (59%), Gaps = 24/380 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSVSYSADLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKP-------------L 530
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E +
Sbjct: 402 EQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIK--QGVNCLESQGQDSAGNFLLGMGI 459
Query: 531 AKGSMNQASGPASLLPCTHL 550
+GS P + L HL
Sbjct: 460 CRGSAKNPPAPQAWLFSDHL 479
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRAIRDTRHYSCPSVSYSAD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVATAAVLTFLDSHCE----VNTEWL 221
>gi|109476381|ref|XP_001066416.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 576
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 174/457 (38%), Positives = 249/457 (54%), Gaps = 52/457 (11%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LPK+S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 125 YDYDNLPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKEPLAN 184
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+++L R+IR+ R GL++ARLLGA A GE+L FLD HCEC GWLE L+ R+ E
Sbjct: 185 ELSQLP-KVRLIRASKREGLVRARLLGASAARGEVLTFLDCHCECHEGWLEPLLQRIHEK 243
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+ + + R + ++
Sbjct: 244 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERKLMRSPI-DVIRS 302
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 303 PTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 362
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMD++ E Y+ NP+A + V R +LR +L+
Sbjct: 363 QAPYSRSKALA-----NSVRAAEVWMDDFKELYYHRNPQA-RLEPFGDVTERKKLRAKLQ 416
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIR-----------------HVQTHKCV------ 526
C FKW+L V+P +P D FFG + +V+ H+ +
Sbjct: 417 CKDFKWFLDTVYPELHVPEDRPGFFGMLENRGLRGYCLDYNPPSENNVEGHQVLLYLCHG 476
Query: 527 -------------------EKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIA 567
+P A ++ + + C Q FV++ L+
Sbjct: 477 MGQNQFFEYTSRQEIRYNTRQPEACIAVEEGKDVLIMDLCKDTVPENQEFVLQEDGTLVH 536
Query: 568 TDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDK 604
C++ E E D S + CS + QRW + +
Sbjct: 537 KQSGKCVEATEKEFDTSFAPHLRDCSSSDNQRWLFKE 573
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 89/238 (37%), Positives = 130/238 (54%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
Q ++ N+ +SDRI ++R LP+ P C+ ++ + LPK+S+VI F+NEAWS LLRTV+S
Sbjct: 94 QRHQINIYLSDRISLHRRLPERWNPLCREVKYDYDNLPKTSVVIAFYNEAWSTLLRTVYS 153
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L +++L R+IR+ R GL++ARLLGA
Sbjct: 154 VLETSPDILLEEVILVDDYSDREHLKEPLANELSQLP-KVRLIRASKREGLVRARLLGAS 212
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A GE+L FLD HCEC +E W LL+ +H S +++
Sbjct: 213 AARGEVLTFLDCHCEC-------------------HEGWLEPLLQRIHEKES----AVVC 249
Query: 232 EILLVDDASTREFLKSSLDEYVAKLS---------VPTR----------VIRSPGRVG 270
++ V D +T E+L +S + + VP R VIRSP G
Sbjct: 250 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERKLMRSPIDVIRSPTMAG 307
>gi|427778457|gb|JAA54680.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 568
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 201/313 (64%), Gaps = 23/313 (7%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F E+ LP +S+V+ F+NEAWSAL+RTVHS++ R+P ++L L
Sbjct: 118 FQEQSLPTASVVVCFYNEAWSALVRTVHSILERTPAALLH----------------XLSR 161
Query: 252 YVA-KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAE 310
YVA +L R+IR+P R GLI+AR+ GA A G++LVFLD+HCE +GWLE ++AR+
Sbjct: 162 YVASELPSHVRLIRTPAREGLIRARMYGAHNASGQVLVFLDSHCEVNVGWLEPMLARIGA 221
Query: 311 DRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+RT V CPVIDII+ TF Y S + G FNW LHF+W S + + +P +
Sbjct: 222 NRTTVTCPVIDIINADTFEYSASPIVR-GGFNWGLHFKWE---SPPRLRGPQQAIDPIPS 277
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+DR YF +G YD+ M +WGGENLE+SFR+W CGG +EI PCS V H+FR+
Sbjct: 278 PTMAGGLFAMDRQYFHELGEYDDGMDIWGGENLEISFRIWMCGGRLEILPCSRVGHVFRR 337
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY P G + L N RVA VWMDE+ +Y + +A Q V +R ELRK+LK
Sbjct: 338 RRPYGSPSG-EDTLTKNSLRVAHVWMDEYKTYYLQTRRDARNQWYGD-VSARKELRKRLK 395
Query: 491 CHSFKWYLTHVWP 503
CHSF WY+ HV+P
Sbjct: 396 CHSFDWYMHHVYP 408
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/149 (42%), Positives = 94/149 (63%), Gaps = 17/149 (11%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y + FN+L+S+R+ R+LPD R P C+ + F E+ LP +S+V+ F+NEAWSAL+RTVH
Sbjct: 86 YHQHAFNVLISNRLGKVRSLPDTRNPLCRQQEFQEQSLPTASVVVCFYNEAWSALVRTVH 145
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA-KLSVPTRVIRSPGRVGLIKARLLG 170
S++ R+P ++L L YVA +L R+IR+P R GLI+AR+ G
Sbjct: 146 SILERTPAALLH----------------XLSRYVASELPSHVRLIRTPAREGLIRARMYG 189
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPK 199
A A G++LVFLD+HCE + + E L +
Sbjct: 190 AHNASGQVLVFLDSHCEVNVGWLEPMLAR 218
>gi|391348383|ref|XP_003748427.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 13-like
[Metaseiulus occidentalis]
Length = 648
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 181/471 (38%), Positives = 258/471 (54%), Gaps = 60/471 (12%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
HC+ E LP +S+VI+F +E +S LLRT+ S I+RSP +L+EI+LVDD S E
Sbjct: 180 HCKAVTYPMAE-LPTASVVIIFTDEIFSTLLRTIVSTINRSPNHLLREIILVDDFSQSED 238
Query: 245 LKSSLDEYVAK--LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
LK L Y+ + R+IR P R GLI+ARL GAR A+G++L+FLD+HCE T GWLE
Sbjct: 239 LKDRLQRYITHHFRADVVRLIRLPERSGLIRARLAGARAAKGDVLIFLDSHCETTPGWLE 298
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYVRS----FELHWGAFNWELHFRWYTYGSSDAI 358
L+ + DR VVCPVIDII D T YV + F++ G FNW+ F W+ ++
Sbjct: 299 PLLEPIRRDRRAVVCPVIDIIDDKTLQYVAAEGDRFQI--GGFNWKGEFSWHNIPAA-WR 355
Query: 359 IKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEI 418
R EP ++P MAGGLFAI+R YF+ G+YDEEM WGGENLEMSFR+WQCGG I I
Sbjct: 356 KNRTSIAEPMRSPTMAGGLFAINREYFWESGSYDEEMDGWGGENLEMSFRIWQCGGHIVI 415
Query: 419 APCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA 478
APCSHV H+FR PY FP G + N R VWMDE+ +++++ PE K +
Sbjct: 416 APCSHVGHIFRDYHPYKFPKG-KDTNAINTKRAVEVWMDEFKKYFYQTRPELTKMKVGD- 473
Query: 479 VRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHKCVEK--------- 528
+ +R R++ +C SFKWYL +V+PH +L + + FG IR+ T+ C++
Sbjct: 474 ISARKAFREKNRCKSFKWYLDNVYPHKYLMEEHSQGFGIIRNPHTNMCLDTYGKSEDEIS 533
Query: 529 --------PLAKGSMNQ---------------------------ASGPASLLPCTHLPVL 553
P+ + + NQ G + C P
Sbjct: 534 DLGVFECHPIPEEATNQLLSLSRKGELRRDDVCAKVSWVDPFRRTKGKIVMEKCDEYPRA 593
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDK 604
+++ K ++ +C+ + E+ R+ ++ C+G + Q W + K
Sbjct: 594 DKLWRHKRGGQIVHAQSGLCI---QSESVAVERIYVVECTGDDLQNWNFHK 641
Score = 116 bits (290), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 71/153 (46%), Positives = 99/153 (64%), Gaps = 2/153 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A + + FN+ VSDR+P+NR+L D R CK + LP +S+VI+F +E +S L
Sbjct: 149 EADKEFSKAAFNVYVSDRLPLNRSLRDTRHRHCKAVTYPMAELPTASVVIIFTDEIFSTL 208
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK--LSVPTRVIRSPGRVGLI 164
LRT+ S I+RSP +L+EI+LVDD S E LK L Y+ + R+IR P R GLI
Sbjct: 209 LRTIVSTINRSPNHLLREIILVDDFSQSEDLKDRLQRYITHHFRADVVRLIRLPERSGLI 268
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
+ARL GAR A+G++L+FLD+HCE T + E L
Sbjct: 269 RARLAGARAAKGDVLIFLDSHCETTPGWLEPLL 301
>gi|410910894|ref|XP_003968925.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Takifugu rubripes]
Length = 577
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 218/341 (63%), Gaps = 11/341 (3%)
Query: 190 LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
L ++ LP +S++I F+NE WS LLRTVHSV+ SP +LKE++LVDD S R LK L
Sbjct: 122 LKYDYRSLPTTSVIIAFYNEGWSTLLRTVHSVLETSPDILLKEVVLVDDYSDRAHLKEPL 181
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
+ Y++ L R+IR+ R GL++ARLLGA G++L FLD HCEC GWLE L+ R+
Sbjct: 182 ENYISGLK-KVRLIRATKREGLVRARLLGASITTGDVLTFLDCHCECHEGWLEPLLHRIK 240
Query: 310 EDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
E+ + VVCPVID+I F Y+ + E G F+W L F W++ + +RK T+
Sbjct: 241 EEPSAVVCPVIDVIDWNNFQYLGNAGEPQIGGFDWRLVFTWHSIPEYEQK-RRKSPTDVI 299
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + YF ++G YD M+VWGGENLE SFR+WQCGGS+E+ PCSHV H+F
Sbjct: 300 RSPTMAGGLFAVSKNYFHYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEVHPCSHVGHVF 359
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR++
Sbjct: 360 PKKAPYSRNKALA-----NSVRAAEVWMDEYKEIYYHRNPHA-RLEAYGDVTERRKLREK 413
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRH-VQTHKCVE 527
L C F+W+L +V+P +P D FG +++ +T+ C +
Sbjct: 414 LGCKGFRWFLDNVYPDLHIPEDKPGRFGMLKNRGKTNHCFD 454
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/241 (34%), Positives = 126/241 (52%), Gaps = 42/241 (17%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
Q ++ N+ +SD++ ++R LP+ P C+ ++ LP +S++I F+NE WS LLRTVHS
Sbjct: 93 QKHQINIYISDKVSLHRRLPERWNPLCRQLKYDYRSLPTTSVIIAFYNEGWSTLLRTVHS 152
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +LKE++LVDD S R LK L+ Y++ L R+IR+ R GL++ARLLGA
Sbjct: 153 VLETSPDILLKEVVLVDDYSDRAHLKEPLENYISGLK-KVRLIRATKREGLVRARLLGAS 211
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 232
G++L FLD HCEC +E W L + I P +++
Sbjct: 212 ITTGDVLTFLDCHCEC-------------------HEGW---LEPLLHRIKEEPSAVVCP 249
Query: 233 ILLVDDASTREFLKS------------------SLDEYVAK-LSVPTRVIRSPGRVGLIK 273
++ V D + ++L + S+ EY K PT VIRSP G +
Sbjct: 250 VIDVIDWNNFQYLGNAGEPQIGGFDWRLVFTWHSIPEYEQKRRKSPTDVIRSPTMAGGLF 309
Query: 274 A 274
A
Sbjct: 310 A 310
>gi|432852860|ref|XP_004067421.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Oryzias latipes]
Length = 556
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 209/332 (62%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SS+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S +S D +
Sbjct: 120 LPASSVVITFHNEARSALLRTVVSVLKKSPPQLVKEIILVDDYS-----DNSEDGALLGK 174
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 175 IEKVRVLRNDRREGLMRSRVRGADAATAPVLTFLDSHCECNDHWLEPLLERVAEDKTRVV 234
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 235 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTLEQRRARQGNPIAPIKTPMIAGG 294
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 295 LFVMDKEYFELLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 354
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLE++K+L C FKW
Sbjct: 355 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLEMKKRLGCKPFKW 413
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 414 YLDNVYPELRVPDHQDIAFGALQ--QGGNCLD 443
Score = 119 bits (297), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/142 (45%), Positives = 90/142 (63%), Gaps = 6/142 (4%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R CK K +N + LP SS+VI FHNEA SALL
Sbjct: 80 GQDPYARNKFNQVESDKLRMDRAVPDTRHDHCKHKQWNSD-LPASSVVITFHNEARSALL 138
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S +S D + RV+R+ R GL+++R
Sbjct: 139 RTVVSVLKKSPPQLVKEIILVDDYS-----DNSEDGALLGKIEKVRVLRNDRREGLMRSR 193
Query: 168 LLGARQAEGEILVFLDAHCECT 189
+ GA A +L FLD+HCEC
Sbjct: 194 VRGADAATAPVLTFLDSHCECN 215
>gi|113931290|ref|NP_001039091.1| polypeptide N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|89268082|emb|CAJ83416.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|111305589|gb|AAI21348.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
gi|134026192|gb|AAI35810.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Xenopus
(Silurana) tropicalis]
Length = 562
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 157/383 (40%), Positives = 225/383 (58%), Gaps = 17/383 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V ++ LP +S++I FHNEA S LLRT+ SV+ RSP ++++EI+LVDD ST
Sbjct: 117 HYRCTSVHHDNDLPSTSVIITFHNEARSTLLRTIKSVLIRSPGNLIQEIILVDDFST--- 173
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
D+ +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 174 ---DPDDCQLLTKIPKVKCLRNNRREGLIRSRVRGAELAAAPVLTFLDSHCEVNNEWLQP 230
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV +D TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 231 LLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 289
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T +TP +AGG+F ID+++F +G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 290 PTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 349
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R VWMDE+ ++Y++ P A + +V R+
Sbjct: 350 VGHVFRKRHPYEFPDGNALTYIKNTKRTVEVWMDEYKQYYYQARPSA-IGKSYGSVADRV 408
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMN------Q 537
ELRK+L C SF+WYL +V+P +P + G I+ Q C+E + N Q
Sbjct: 409 ELRKKLSCKSFQWYLQNVYPELKIPEKEVISGLIK--QGGNCMESQTRDTTGNIPVMLTQ 466
Query: 538 ASGPASLLPCTHLPVLTQMFVMK 560
G A+ P L++ + +
Sbjct: 467 CKGSANSAPAAQEWALSENVIKQ 489
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 61/148 (41%), Positives = 89/148 (60%), Gaps = 11/148 (7%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FN L SD++ R + D R +C T V ++ LP +S++I FHNEA S LLRT+
Sbjct: 93 YRQHAFNQLESDKLSSERPIRDTRHYRC-TSVHHDNDLPSTSVIITFHNEARSTLLRTIK 151
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV+ RSP ++++EI+LVDD ST D+ +P + +R+ R GLI++R+ G
Sbjct: 152 SVLIRSPGNLIQEIILVDDFST------DPDDCQLLTKIPKVKCLRNNRREGLIRSRVRG 205
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLP 198
A A +L FLD+HCE V NE P
Sbjct: 206 AELAAAPVLTFLDSHCE---VNNEWLQP 230
>gi|348573294|ref|XP_003472426.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Cavia
porcellus]
Length = 556
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 163/417 (39%), Positives = 241/417 (57%), Gaps = 14/417 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A IL FLD+HCE + WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAAAILTFLDSHCEVNVEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+A+F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E + + G
Sbjct: 402 EQRKKMDCKSFRWYLENVYPELTVPVKEVLPGIIK--QGLNCLETQGQDTAGDFLLGMGI 459
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
P Q ++ TD + + CL SP + + S ++Q+W
Sbjct: 460 CRGSAKNPPPAQAWLF---TDHLIQQQGRCLAATSVSPPGSPVILQVCNSKESKQKW 513
Score = 99.4 bits (246), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 93/155 (60%), Gaps = 8/155 (5%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A IL FLD+HCE + + + L
Sbjct: 191 LIRSRVRGADVAAAAILTFLDSHCEVNVEWLQPML 225
>gi|348513276|ref|XP_003444168.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oreochromis niloticus]
Length = 575
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/341 (45%), Positives = 220/341 (64%), Gaps = 11/341 (3%)
Query: 190 LVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSL 249
L ++ LP +S+VI F+NEAWS LLRTVHSV+ SP +LKE++LVDD S + LK L
Sbjct: 120 LKYDYRSLPTTSVVIAFYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDKAHLKEPL 179
Query: 250 DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVA 309
D+Y++ L+ R+IR+ R GL++ARLLGA GE+L FLD HCEC GWLE ++ R+
Sbjct: 180 DKYISGLN-KVRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPVLHRIK 238
Query: 310 EDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
E+ VVCPVID+I TF Y+ + E G F+W L F W++ + +R+ +
Sbjct: 239 EEPKAVVCPVIDVIDWNTFQYLGHAGEPQIGGFDWRLVFTWHSIPDYEQK-RRRSPVDVI 297
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLFA+ + +F ++G YD M+VWGGENLE SFR+WQCGGS+E+ PCSHV H+F
Sbjct: 298 RSPTMAGGLFAVRKDFFHYLGTYDTGMEVWGGENLEFSFRIWQCGGSLEVHPCSHVGHVF 357
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
K +PYS ++ N R A VW+DE+ E Y+ NP A + R+ LR++
Sbjct: 358 PKKAPYSRSKALA-----NSVRAAEVWLDEFKEIYYHRNPHARLEAFGDVTERRM-LREK 411
Query: 489 LKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRH-VQTHKCVE 527
L C SFKWYL +++P +P D FG +++ +T+ C +
Sbjct: 412 LGCKSFKWYLDNIYPDIHVPHDRPGMFGMLKNRGKTNYCFD 452
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 88/237 (37%), Positives = 126/237 (53%), Gaps = 42/237 (17%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
+ ++ N VSD+I ++R LP+ P CK ++ LP +S+VI F+NEAWS LLRTVHS
Sbjct: 91 KAHQINTYVSDKISLHRRLPERWNPLCKELKYDYRSLPTTSVVIAFYNEAWSTLLRTVHS 150
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +LKE++LVDD S + LK LD+Y++ L+ R+IR+ R GL++ARLLGA
Sbjct: 151 VLETSPDILLKEVVLVDDYSDKAHLKEPLDKYISGLN-KVRLIRATKREGLVRARLLGAS 209
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKE 232
GE+L FLD HCEC +E W L V I P++++
Sbjct: 210 ITTGEVLTFLDCHCEC-------------------HEGW---LEPVLHRIKEEPKAVVCP 247
Query: 233 ILLVDDASTREFLK------------------SSLDEYVAK-LSVPTRVIRSPGRVG 270
++ V D +T ++L S+ +Y K P VIRSP G
Sbjct: 248 VIDVIDWNTFQYLGHAGEPQIGGFDWRLVFTWHSIPDYEQKRRRSPVDVIRSPTMAG 304
>gi|154152027|ref|NP_001093795.1| polypeptide N-acetylgalactosaminyltransferase 12 [Bos taurus]
gi|151553796|gb|AAI48122.1| GALNT12 protein [Bos taurus]
Length = 579
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 157/330 (47%), Positives = 215/330 (65%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
+N + LP +S+VI F+NEAWS LLRTV+SV+ SP ++L+E++LVDD S RE LK L
Sbjct: 128 YNYDELPTTSVVIAFYNEAWSTLLRTVYSVLETSPDTLLEEVILVDDYSDREHLKERLAT 187
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+A L R+IR+ R GL++ARLLGA A+G++L FLD HCEC GWLE L+ R+ E
Sbjct: 188 ELAGLP-KVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQRIHEK 246
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ + E G F+W L F W+T + I R + ++
Sbjct: 247 ESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTVPEKERIRMRSPI-DVIRS 305
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++EI PCSHV H+F +
Sbjct: 306 PTMAGGLFAVSKKYFEYLGSYDIGMEVWGGENLEFSFRIWQCGGTLEIHPCSHVGHVFPR 365
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP+A + V R +LR +L
Sbjct: 366 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPQA-RLEPFGDVTERRQLRARLG 419
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 420 CKDFKWFLNTVYPELHVPEDRPGFFGMLQN 449
Score = 139 bits (350), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 87/238 (36%), Positives = 132/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K +N + LP +S+VI F+NEAWS LLRTV+S
Sbjct: 97 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYNYDELPTTSVVIAFYNEAWSTLLRTVYS 156
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP ++L+E++LVDD S RE LK L +A L R+IR+ R GL++ARLLGA
Sbjct: 157 VLETSPDTLLEEVILVDDYSDREHLKERLATELAGLP-KVRLIRANKREGLVRARLLGAS 215
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A+G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 216 VAKGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEKES----AVVC 252
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L ++ + + ++ P VIRSP G
Sbjct: 253 PVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTVPEKERIRMRSPIDVIRSPTMAG 310
>gi|109079467|ref|XP_001111603.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 5 [Macaca mulatta]
Length = 603
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 473
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASVATGDVITFLDSHCEANV 244
>gi|345321967|ref|XP_001514624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2 [Ornithorhynchus anatinus]
Length = 484
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 151/357 (42%), Positives = 231/357 (64%), Gaps = 14/357 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+I FHNEAWS LLRTVHS++ PR LKEI+LVDD S ++ LKS+L +YV+KL
Sbjct: 18 LPSASIIICFHNEAWSTLLRTVHSILDTVPRPHLKEIILVDDLSQQDHLKSALSDYVSKL 77
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++RS R+G+I+ R+LGA +A G++LVF+D+HCEC GWLE L++R+A +R RVV
Sbjct: 78 D-GVKLLRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECHRGWLEPLLSRIASNRNRVV 136
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P++D+I TF Y S +L G F+W+L F W +++ + P ++P + GG
Sbjct: 137 TPILDVIDWKTFQYFHSEDLQQGVFDWKLDFHWELLPEQKRKVRQSPIS-PIRSPVVPGG 195
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
+ A+DR YF + GAYD M +WGGENLE+S RVW CGGS+E+ PCS V H++R + +
Sbjct: 196 VMAMDRHYFQNTGAYDSLMTLWGGENLELSIRVWLCGGSVEVLPCSRVGHVYRNQASDTL 255
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR--DKQAVRSRLELRKQLKCHSF 494
P E + N R+A W+ + E +++ +PEA R +K RL+L+++L C +F
Sbjct: 256 PN--QEAILRNKIRIAETWLGSFKEIFYQHSPEAFSLRKVEKPDCSERLQLQRRLGCRTF 313
Query: 495 KWYLTHVWPHHFLPMDDK--FFGRIRHVQTHKCVEKPLAKGSMNQASGPA-SLLPCT 548
W+L++++P + P + + F G++ + CV+ GS + G + +LLPC+
Sbjct: 314 HWFLSNIYPELY-PSERRPGFSGKLFSTRVGFCVD----GGSKGKIPGSSITLLPCS 365
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/122 (49%), Positives = 89/122 (72%), Gaps = 2/122 (1%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C+ + + + LP +SI+I FHNEAWS LLRTVHS++ PR LKEI+LVDD S ++ L
Sbjct: 8 RCRQQHYGDN-LPSASIIICFHNEAWSTLLRTVHSILDTVPRPHLKEIILVDDLSQQDHL 66
Query: 138 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
KS+L +YV+KL +++RS R+G+I+ R+LGA +A G++LVF+D+HCEC + E L
Sbjct: 67 KSALSDYVSKLD-GVKLLRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECHRGWLEPLL 125
Query: 198 PK 199
+
Sbjct: 126 SR 127
>gi|296193322|ref|XP_002744461.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Callithrix jacchus]
Length = 667
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 196 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 255
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 256 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 314
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 315 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 371
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 372 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 431
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 432 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVTAQK 488
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++ + W P + P++ +G IR+V T C +
Sbjct: 489 KLRSSLNCKSFKWFMMKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 537
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 162 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 220
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 221 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 279
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 280 GLIRTRMLGASVATGDVITFLDSHCEANV 308
>gi|410214072|gb|JAA04255.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410214074|gb|JAA04256.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295440|gb|JAA26320.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410295442|gb|JAA26321.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|410336845|gb|JAA37369.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 217/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S L++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDLEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 443
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 95/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S L++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDLEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|357606408|gb|EHJ65055.1| putative UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase
[Danaus plexippus]
Length = 389
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 216/332 (65%), Gaps = 13/332 (3%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++ + LP++S+VI F NEAWS LLRT+HSV++RSP +L+E+LL+DD S + +K
Sbjct: 44 CLERYSSKLLPQASVVICFFNEAWSTLLRTLHSVLNRSPPHLLREVLLIDDFSDMDHIKV 103
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L+ Y K +IR+ R GLI+AR++GA++A +LVFLD+HCECT GWLE L+ R
Sbjct: 104 RLENYTRKFP-NVILIRTSQREGLIRARIVGAKKASAPVLVFLDSHCECTEGWLEPLLER 162
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSF--ELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
+ E+ V PVID I TF Y+ +++ G FNW L F W + KR++F
Sbjct: 163 LVENPKIVASPVIDHIDPNTFEYISQNPKDIYIGGFNWNLKFIWRSIE-----YKRENFL 217
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
P KTP +AGGLFAID+ +F+ IG YDE VWGGENLE+SF+VW CGGS+EI PCSHV
Sbjct: 218 LPIKTPTIAGGLFAIDKEFFYSIGYYDEGFDVWGGENLELSFKVWMCGGSLEIVPCSHVG 277
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLEL 485
H+FR+ PY G E N AR+A VW+D++A+ +++ A+ V ++ EL
Sbjct: 278 HIFRENFPYYTSG---ETFKRNAARLAEVWLDDYAKIFYERIGNADVSLGD--VTAQKEL 332
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRI 517
RK+LKC SF WYL +V+P +P + G++
Sbjct: 333 RKKLKCKSFNWYLRNVYPEKKIPKSNVASGQV 364
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/157 (43%), Positives = 104/157 (66%), Gaps = 5/157 (3%)
Query: 33 LGDSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKS 92
+ +S+ G L + ++ N FN VSD IP++R L D+R C + ++ + LP++
Sbjct: 1 MANSIKGTLRLIIKKG---WEENAFNQFVSDLIPIDRPLLDLRDKWCLER-YSSKLLPQA 56
Query: 93 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
S+VI F NEAWS LLRT+HSV++RSP +L+E+LL+DD S + +K L+ Y K
Sbjct: 57 SVVICFFNEAWSTLLRTLHSVLNRSPPHLLREVLLIDDFSDMDHIKVRLENYTRKFP-NV 115
Query: 153 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
+IR+ R GLI+AR++GA++A +LVFLD+HCECT
Sbjct: 116 ILIRTSQREGLIRARIVGAKKASAPVLVFLDSHCECT 152
>gi|443726011|gb|ELU13353.1| hypothetical protein CAPTEDRAFT_91056 [Capitella teleta]
Length = 426
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 199/312 (63%), Gaps = 2/312 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK S++I F E+WS LLR+VHSV++R+P +L+EILLVDD S RE L + LD+Y+ +L
Sbjct: 102 LPKMSVIICFTEESWSTLLRSVHSVLNRTPPDLLEEILLVDDFSQREHLHAKLDDYLTRL 161
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
T +IR P R GLI+ARL A G +L FLD+H EC +GW E L+ R++ +R +V
Sbjct: 162 PKVT-LIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVIV 220
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVID IS F+Y+ G FNW + F+W + D T P +TP +AGG
Sbjct: 221 APVIDAISSRDFSYIPISANQRGGFNWAMLFKWMPVPDYEKSRTGGDPTAPVRTPTIAGG 280
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI + +F +G YD + +WG ENLE+SF+ W CGGS+E+ PC+ V H++R PYSF
Sbjct: 281 LFAIHQGFFRSLGFYDPGLHIWGSENLELSFKAWMCGGSMEMIPCARVGHVYRSTQPYSF 340
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +V N RVA VWMD++ + ++ PE + + SR+ELR +LKCH FKW
Sbjct: 341 PGGNVKVFMRNNLRVANVWMDDYVDLFYLMKPELRNEPFGD-ISSRVELRHRLKCHDFKW 399
Query: 497 YLTHVWPHHFLP 508
YL +V P +P
Sbjct: 400 YLENVIPELNIP 411
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 69/153 (45%), Positives = 100/153 (65%), Gaps = 1/153 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ +++ FN SD + RT+PD R P+C+ K ++ LPK S++I F E+WS LLR
Sbjct: 62 KEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYSSLPKMSVIICFTEESWSTLLR 121
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
+VHSV++R+P +L+EILLVDD S RE L + LD+Y+ +L T +IR P R GLI+ARL
Sbjct: 122 SVHSVLNRTPPDLLEEILLVDDFSQREHLHAKLDDYLTRLPKVT-LIRLPSRQGLIRARL 180
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
A G +L FLD+H EC + + E L + S
Sbjct: 181 RAIEIARGPVLTFLDSHVECNVGWAEPLLQRIS 213
>gi|297477445|ref|XP_002689374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
gi|296485129|tpg|DAA27244.1| TPA: polypeptide N-acetylgalactosaminyltransferase 10-like [Bos
taurus]
Length = 620
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 149 HPNCKSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 208
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 209 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 267
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 268 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 324
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 325 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 384
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 385 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVTAQK 441
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 442 KLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPPAAAWGEIRNVGTGLCAD 490
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 75/141 (53%), Positives = 104/141 (73%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ VSD+I +NR+LPD+R P CK+K + E LP +SI+I FHNE WS+LLRT
Sbjct: 123 QAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLET-LPNTSIIIPFHNEGWSSLLRT 181
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R GLI+ R+L
Sbjct: 182 VHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKREGLIRTRML 240
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA A G+++ FLD+HCE +
Sbjct: 241 GASAATGDVITFLDSHCEANV 261
>gi|440897124|gb|ELR48890.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Bos
grunniens mutus]
Length = 499
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 157/330 (47%), Positives = 215/330 (65%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
+N + LP +S+VI F+NEAWS LLRTV+SV+ SP ++L+E++LVDD S RE LK L
Sbjct: 48 YNYDELPTTSVVIAFYNEAWSTLLRTVYSVLETSPDTLLEEVILVDDYSDREHLKERLAT 107
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+A L R+IR+ R GL++ARLLGA A+G++L FLD HCEC GWLE L+ R+ E
Sbjct: 108 ELAGLP-KVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQRIHEK 166
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ + E G F+W L F W+T + I R + ++
Sbjct: 167 ESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTVPEKERIRMRSPI-DVIRS 225
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++EI PCSHV H+F +
Sbjct: 226 PTMAGGLFAVSKKYFEYLGSYDIGMEVWGGENLEFSFRIWQCGGTLEIHPCSHVGHVFPR 285
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP+A + V R +LR +L
Sbjct: 286 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPQA-RLEPFGDVTERRQLRARLG 339
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 340 CKDFKWFLNTVYPELHVPEDRPGFFGMLQN 369
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 87/238 (36%), Positives = 132/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K +N + LP +S+VI F+NEAWS LLRTV+S
Sbjct: 17 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYNYDELPTTSVVIAFYNEAWSTLLRTVYS 76
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP ++L+E++LVDD S RE LK L +A L R+IR+ R GL++ARLLGA
Sbjct: 77 VLETSPDTLLEEVILVDDYSDREHLKERLATELAGLP-KVRLIRANKREGLVRARLLGAS 135
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A+G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 136 VAKGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEKES----AVVC 172
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L ++ + + ++ P VIRSP G
Sbjct: 173 PVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHTVPEKERIRMRSPIDVIRSPTMAG 230
>gi|148670721|gb|EDL02668.1| mCG7620, isoform CRA_b [Mus musculus]
Length = 667
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 219/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 219 HYSCPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 274
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE + WL+
Sbjct: 275 --SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQP 332
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 333 MLQRVMEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 391
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 392 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 451
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 452 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 510
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 511 EQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIK--QGVNCLE 552
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 93/155 (60%), Gaps = 8/155 (5%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 187 QLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSD-LPATSVIITFHNEAR 245
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 246 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDKREG 299
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE + + + L
Sbjct: 300 LIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPML 334
>gi|355691777|gb|EHH26962.1| hypothetical protein EGK_17053, partial [Macaca mulatta]
gi|355750353|gb|EHH54691.1| hypothetical protein EGM_15579, partial [Macaca fascicularis]
Length = 551
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 80 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 139
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 140 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 198
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 199 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 255
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 256 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 315
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 316 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAAQK 372
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 373 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 421
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 46 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 104
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 105 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 163
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 164 GLIRTRMLGASVATGDVITFLDSHCEANV 192
>gi|348534088|ref|XP_003454535.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2
[Oreochromis niloticus]
Length = 559
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 158/332 (47%), Positives = 208/332 (62%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SS+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + D +
Sbjct: 123 LPASSVVITFHNEARSALLRTVVSVLKKSPPHLVKEIILVDDYS-----DNPEDGALLGK 177
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 178 IEKVRVLRNDRREGLMRSRVRGADAATAPVLTFLDSHCECNDHWLEPLLERVAEDKTRVV 237
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 238 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTQEQRRARQGNPIAPIKTPMIAGG 297
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 298 LFVMDKEYFEQLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 357
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLE++K+L C FKW
Sbjct: 358 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLEMKKRLNCKPFKW 416
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 417 YLENVYPELRVPDHQDIAFGALQ--QGGNCLD 446
Score = 113 bits (283), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 92/152 (60%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R C+ K + + LP SS+VI FHNEA SALL
Sbjct: 83 GQDPYARNKFNQVESDKLRMDRAVPDTRHDHCRHKQWKSD-LPASSVVITFHNEARSALL 141
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + D + RV+R+ R GL+++R
Sbjct: 142 RTVVSVLKKSPPHLVKEIILVDDYS-----DNPEDGALLGKIEKVRVLRNDRREGLMRSR 196
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A +L FLD+HCEC + E L +
Sbjct: 197 VRGADAATAPVLTFLDSHCECNDHWLEPLLER 228
>gi|410949405|ref|XP_003981412.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Felis
catus]
Length = 603
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVAAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 473
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASAATGDVITFLDSHCEANV 244
>gi|291410883|ref|XP_002721722.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like 1,
partial [Oryctolagus cuniculus]
Length = 499
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 217/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 51 HYSCPSMSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 106
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A IL FLD+HCE WL+
Sbjct: 107 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAAAILTFLDSHCEVNTEWLQP 164
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 165 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKITRTD 223
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+A+F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 224 PTRPIRTPVIAGGIFVIDKAWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 283
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 284 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 342
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 343 EQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIK--QGVNCLE 384
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 19 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSLD-LPATSVIITFHNEAR 77
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 78 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 131
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A IL FLD+HCE N E+L
Sbjct: 132 LIRSRVRGADVAAAAILTFLDSHCE----VNTEWL 162
>gi|426256000|ref|XP_004021634.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2 [Ovis
aries]
Length = 674
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 161/334 (48%), Positives = 213/334 (63%), Gaps = 11/334 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 236 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 294
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAE--DRTR 314
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAE DRTR
Sbjct: 295 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNERWLEPLLERVAEGSDRTR 350
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VV P+ID+I+ F YV + G F+W L F+W ++ + P KTP +A
Sbjct: 351 VVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIA 410
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 411 GGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIVPCSRVGHVFRKQHPY 470
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C F
Sbjct: 471 TFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLNCKPF 529
Query: 495 KWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
KWYL +V+P +P D FG ++ Q C++
Sbjct: 530 KWYLENVYPELRVPDHQDIAFGALQ--QGTNCLD 561
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 196 GQDPYARNKFNQVESDKLRMDRAVPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 254
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 255 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 309
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 310 VRGADAAQAKVLTFLDSHCECNERWLEPLLER 341
>gi|354484373|ref|XP_003504363.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
6-like, partial [Cricetulus griseus]
Length = 555
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 81 HANCKHKMYLERLPNTSIIIPFHNEGWTSLLRTIHSIINRTPESLIVEIILVDDFSDREH 140
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ S R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 141 LKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGASMARGEVLTFLDSHCEVNVNWLPPL 199
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 200 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKKIPIPPE---LQRAD 256
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 257 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGEMFDVPCSR 316
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G +L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 317 VGHIYRKYVPYKVPSGT--ILARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 373
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK+LKC FKW++ V W P ++ P++ +G IR++ + CV+
Sbjct: 374 ELRKRLKCKDFKWFMAAVAWDVPKYYPPVEPLPAAWGEIRNLAANLCVD 422
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 72/139 (51%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R CK K++ E LP +SI+I FHNE W++LLRT+H
Sbjct: 57 YRENGFNIFVSNNIALERSLPDIRHANCKHKMYLER-LPNTSIIIPFHNEGWTSLLRTIH 115
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L++Y+A+ S R++R+ R GLI+ RLLGA
Sbjct: 116 SIINRTPESLIVEIILVDDFSDREHLKDKLEDYMARFS-KVRIVRTKKREGLIRTRLLGA 174
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 175 SMARGEVLTFLDSHCEVNV 193
>gi|50510795|dbj|BAD32383.1| mKIAA1130 protein [Mus musculus]
Length = 655
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 219/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 207 HYSCPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 262
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE + WL+
Sbjct: 263 --SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQP 320
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 321 MLQRVMEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 379
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 380 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 439
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 440 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVATRI 498
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 499 EQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIK--QGVNCLE 540
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/155 (36%), Positives = 93/155 (60%), Gaps = 8/155 (5%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 175 QLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSD-LPATSVIITFHNEAR 233
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 234 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDKREG 287
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE + + + L
Sbjct: 288 LIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPML 322
>gi|26338209|dbj|BAC32790.1| unnamed protein product [Mus musculus]
Length = 570
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 161/332 (48%), Positives = 212/332 (63%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ RSP ++KEI+LVDD S + +L + K+
Sbjct: 134 LPATSVVITFHNEARSALLRTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 192
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+H EC WLE L+ RVAEDRTRVV
Sbjct: 193 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHYECNERWLEPLLERVAEDRTRVV 248
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 249 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 308
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 309 LFVMDKLYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 368
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 369 PGGSGTVFARNTRRAAEVWMDEYKHFYYAAVPSA-RNVPYGNIQSRLELRKKLGCKPFKW 427
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 428 YLDNVYPELRVPDHQDIAFGALQ--QGTNCLD 457
Score = 112 bits (279), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 97/152 (63%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 94 GQDPYARNKFNQVESDKLHMDRGIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 152
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ RSP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 153 RTVVSVLKRSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 207
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+H EC + E L +
Sbjct: 208 VRGADAAQAKVLTFLDSHYECNERWLEPLLER 239
>gi|417515619|gb|JAA53628.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Sus
scrofa]
Length = 506
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 35 HPNCNSKRYLEMLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 94
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 95 LKKPLEDYMA-LFPNVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 153
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 154 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 210
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 211 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 270
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 271 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVAAQK 327
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 328 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 376
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 1 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRY-LEMLPNTSIIIPFHNE 59
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 60 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPNVRILRTKKRE 118
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 119 GLIRTRMLGASAATGDVITFLDSHCEANV 147
>gi|194669011|ref|XP_001788574.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Bos
taurus]
Length = 652
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 181 HPNCKSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 240
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 241 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 299
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 300 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 356
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 357 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 416
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 417 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVTAQK 473
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 474 KLRSSLNCKSFKWFMTKIAWDLPQFYPPVEPPAAAWGEIRNVGTGLCAD 522
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 75/141 (53%), Positives = 104/141 (73%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ VSD+I +NR+LPD+R P CK+K + E LP +SI+I FHNE WS+LLRT
Sbjct: 155 QAYRENGFNIYVSDKISLNRSLPDIRHPNCKSKRYLET-LPNTSIIIPFHNEGWSSLLRT 213
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R GLI+ R+L
Sbjct: 214 VHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKREGLIRTRML 272
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA A G+++ FLD+HCE +
Sbjct: 273 GASAATGDVITFLDSHCEANV 293
>gi|427784527|gb|JAA57715.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 612
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 171/418 (40%), Positives = 247/418 (59%), Gaps = 26/418 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S+++ FHNE W+ LLRT SV++RSP ++KEI+L DD S +E LK L++Y+AK
Sbjct: 159 LPTVSVIVPFHNEHWTTLLRTATSVLNRSPPELIKEIILADDYSNKEQLKKPLEDYIAKH 218
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GLI+ARLLGARQA G++L+FLD+H E + WL L+ +A+D VV
Sbjct: 219 WNKVRVVRATRREGLIRARLLGARQATGDVLIFLDSHTEANVNWLPPLLEPIAKDYRTVV 278
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP ID+I TFAY E G+F+WEL+++ D + TEPFK+P MAGG
Sbjct: 279 CPFIDVIDYETFAYRAQDEGARGSFDWELYYKRLPLLPEDLA----NPTEPFKSPVMAGG 334
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI R YF+ +G YDE + VWGGE E+SF++WQCGG++ APCS V H++RK +P+
Sbjct: 335 LFAISRRYFWELGGYDEGLDVWGGEQYELSFKIWQCGGTMVDAPCSRVGHIYRKFAPFPN 394
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
P G+ + + N RVA VWMDE+ E+ + P + + + ++ ELRK+L C SFKW
Sbjct: 395 P-GIGDFVGRNYRRVAEVWMDEYKEYLYMRRPHY-RNLEPGDLTAQKELRKRLNCKSFKW 452
Query: 497 YLTHV---WPHHFLPMD--DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT--- 548
++ +V P + ++ D +G IRH ++ C++ KG + SL C
Sbjct: 453 FMENVAFDQPSKYPAIEPPDYAWGEIRHEKSSLCIDTQF-KGQNER----FSLEKCIRDH 507
Query: 549 HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRIL--ACSGFN-RQRWTYD 603
Q V+ D+ +VC DV E PR ++ +C G + Q + YD
Sbjct: 508 RDQSGEQHLVLTWHKDIRPQKRTVCFDVSSSE----PRAPVVLWSCHGMHGNQLFKYD 561
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 72/141 (51%), Positives = 102/141 (72%), Gaps = 1/141 (0%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+LY++N FN L SD I +NR+LPD+R P C+ K + + LP S+++ FHNE W+ LLRT
Sbjct: 121 ELYKVNGFNALASDFIALNRSLPDIRNPGCQKKRYVSK-LPTVSVIVPFHNEHWTTLLRT 179
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
SV++RSP ++KEI+L DD S +E LK L++Y+AK RV+R+ R GLI+ARLL
Sbjct: 180 ATSVLNRSPPELIKEIILADDYSNKEQLKKPLEDYIAKHWNKVRVVRATRREGLIRARLL 239
Query: 170 GARQAEGEILVFLDAHCECTL 190
GARQA G++L+FLD+H E +
Sbjct: 240 GARQATGDVLIFLDSHTEANV 260
>gi|195386226|ref|XP_002051805.1| GJ10330 [Drosophila virilis]
gi|194148262|gb|EDW63960.1| GJ10330 [Drosophila virilis]
Length = 631
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 170/407 (41%), Positives = 239/407 (58%), Gaps = 20/407 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 197 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LA 252
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ R+IR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TR
Sbjct: 253 KID-KVRIIRNDKREGLVRSRVRGADAAVSSVLTFLDSHVECNEQWLEPLLERVREDPTR 311
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W ++ + D T +TP +A
Sbjct: 312 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPTERAARHNDPTTAIRTPMIA 371
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 372 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 431
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 432 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 490
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQA-SGPASLLPCTHLPVL 553
KWYL +V+P P + + G+ R T +C++ +M G L PC H
Sbjct: 491 KWYLENVYPDLQAP-EPQEVGQFRQ-DTTECLD------TMGHVIDGTVGLFPC-HNTGG 541
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + ++ +V + +C QRW
Sbjct: 542 NQEWAYSKRGEIKHDD--LCLTLVQFAR--GSQVVLKSCDDTENQRW 584
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 71/159 (44%), Positives = 103/159 (64%), Gaps = 14/159 (8%)
Query: 39 GGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVF 98
GGL S + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I F
Sbjct: 154 GGLRS----GEDPYIRNRFNQEASDALPSNREIPDTRNPMCRTKKYRED-LPETSVIITF 208
Query: 99 HNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSP 158
HNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ R+IR+
Sbjct: 209 HNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRIIRND 263
Query: 159 GRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
R GL+++R+ GA A +L FLD+H EC NE++L
Sbjct: 264 KREGLVRSRVRGADAAVSSVLTFLDSHVEC----NEQWL 298
>gi|109079473|ref|XP_001111560.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
isoform 4 [Macaca mulatta]
Length = 602
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 131 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 190
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 191 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 249
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 250 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 306
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 307 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 366
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 367 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAAQK 423
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 424 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 472
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 97 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 155
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 156 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 214
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 215 GLIRTRMLGASVATGDVITFLDSHCEANV 243
>gi|198474621|ref|XP_001356764.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
gi|198138471|gb|EAL33829.2| GA16973 [Drosophila pseudoobscura pseudoobscura]
Length = 639
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 170/406 (41%), Positives = 240/406 (59%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 205 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDFS--DHPEDGLE--LA 260
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ R+IR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED +R
Sbjct: 261 KID-KVRIIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEKWLEPLLERVREDPSR 319
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W ++ ++ D T +TP +A
Sbjct: 320 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERSVRHNDPTTAIRTPMIA 379
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 380 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 439
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 440 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 498
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 499 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 550
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + ++ +V + AC QRW
Sbjct: 551 QEWAYSKRGEIKHDD--LCLTLVQFAR--GSQVVLKACDESENQRW 592
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 100/155 (64%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S
Sbjct: 164 LRNGEDPYIRNRFNQEASDALPSNREIPDTRNPMCRTKKYRED-LPETSVIITFHNEARS 222
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ R+IR+ R GL+
Sbjct: 223 TLLRTIVSVLNRSPEHLIREIVLVDDFS--DHPEDGLE--LAKID-KVRIIRNDKREGLV 277
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC + E L +
Sbjct: 278 RSRVKGADAAVSSVLTFLDSHVECNEKWLEPLLER 312
>gi|195148230|ref|XP_002015077.1| GL19517 [Drosophila persimilis]
gi|194107030|gb|EDW29073.1| GL19517 [Drosophila persimilis]
Length = 638
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 170/406 (41%), Positives = 240/406 (59%), Gaps = 18/406 (4%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +A
Sbjct: 204 EDLPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDFS--DHPEDGLE--LA 259
Query: 255 KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
K+ R+IR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED +R
Sbjct: 260 KID-KVRIIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEKWLEPLLERVREDPSR 318
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VVCPVID+IS F Y+ + G F+W L F+W ++ ++ D T +TP +A
Sbjct: 319 VVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERSVRHNDPTTAIRTPMIA 378
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY
Sbjct: 379 GGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPY 438
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+FPGG V N R A VWMD++ + Y+ P A K + RL L+++L C F
Sbjct: 439 TFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPF 497
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
KWYL +V+P P D + G+ R T +C++ G + G + PC H
Sbjct: 498 KWYLENVYPDLQAP-DPQEVGQFRQDST-ECLD---TMGHL--IDGTVGIFPC-HNTGGN 549
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
Q + ++ D +CL + ++ +V + AC QRW
Sbjct: 550 QEWAYSKRGEIKHDD--LCLTLVQFAR--GSQVVLKACDESENQRW 591
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 67/155 (43%), Positives = 100/155 (64%), Gaps = 6/155 (3%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK + E+ LP++S++I FHNEA S
Sbjct: 163 LRNGEDPYIRNRFNQEASDALPSNREIPDTRNPMCRTKKYRED-LPETSVIITFHNEARS 221
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ R+IR+ R GL+
Sbjct: 222 TLLRTIVSVLNRSPEHLIREIVLVDDFS--DHPEDGLE--LAKID-KVRIIRNDKREGLV 276
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++R+ GA A +L FLD+H EC + E L +
Sbjct: 277 RSRVKGADAAVSSVLTFLDSHVECNEKWLEPLLER 311
>gi|28268676|dbj|BAC56890.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [Homo sapiens]
Length = 603
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAVQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 473
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASVATGDVITFLDSHCEANV 244
>gi|296190391|ref|XP_002743190.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Callithrix jacchus]
Length = 571
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 214/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 120 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 179
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 180 ELSGLP-KVRLIRASKREGLVRARLLGASVARGDVLTFLDCHCECHEGWLEPLLQRIHEE 238
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 239 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIQMRSP-VDVIRS 297
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 298 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 357
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 358 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRTKLQ 411
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 412 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 441
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 89 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 148
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 149 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRASKREGLVRARLLGAS 207
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 208 VARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 244
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 245 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIQMRSPVDVIRSPTMAG 302
>gi|410255362|gb|JAA15648.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410303020|gb|JAA30110.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
gi|410355291|gb|JAA44249.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10) [Pan
troglodytes]
Length = 603
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAVQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 473
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASVATGDVITFLDSHCEANV 244
>gi|38195091|ref|NP_938080.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|51315962|sp|Q86SR1.2|GLT10_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 10;
AltName: Full=Polypeptide GalNAc transferase 10;
Short=GalNAc-T10; Short=pp-GaNTase 10; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 10;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 10
gi|25809274|emb|CAD44532.1| polypeptide N-acetylgalactosaminyltransferase 10 [Homo sapiens]
gi|151556534|gb|AAI48616.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|157169754|gb|AAI53182.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10)
[synthetic construct]
gi|193785288|dbj|BAG54441.1| unnamed protein product [Homo sapiens]
gi|261858046|dbj|BAI45545.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 [synthetic
construct]
Length = 603
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAVQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 473
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASVATGDVITFLDSHCEANV 244
>gi|380800197|gb|AFE71974.1| polypeptide N-acetylgalactosaminyltransferase 10, partial [Macaca
mulatta]
Length = 565
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 94 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 153
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 154 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 212
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 213 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 269
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 270 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 329
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 330 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAVQK 386
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 387 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 435
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 60 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 118
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 119 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 177
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 178 GLIRTRMLGASVATGDVITFLDSHCEANV 206
>gi|348575151|ref|XP_003473353.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Cavia porcellus]
Length = 602
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 131 HPNCNSKRYLEVLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 190
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 191 LKKPLEDYMA-LFPSVRILRTKRREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 249
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 250 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 306
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 307 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 366
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMD++AE+ ++ PE + V ++
Sbjct: 367 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDDYAEYIYQRRPEY-RHLSAGDVVAQK 423
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 424 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 472
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/141 (52%), Positives = 103/141 (73%), Gaps = 2/141 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE WS+LLRT
Sbjct: 105 QAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRY-LEVLPNTSIIIPFHNEGWSSLLRT 163
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
VHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R GLI+ R+L
Sbjct: 164 VHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKRREGLIRTRML 222
Query: 170 GARQAEGEILVFLDAHCECTL 190
GA A G+++ FLD+HCE +
Sbjct: 223 GASAATGDVITFLDSHCEANV 243
>gi|345799489|ref|XP_546283.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Canis
lupus familiaris]
Length = 603
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPSELIAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVAAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G I +V T CV+
Sbjct: 425 KLRSALNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIHNVGTGLCVD 473
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPSELIAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASAATGDVITFLDSHCEANV 244
>gi|431918071|gb|ELK17299.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Pteropus alecto]
Length = 582
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 157/358 (43%), Positives = 228/358 (63%), Gaps = 14/358 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 111 HPNCNNKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPQLIAEIVLVDDFSDREH 170
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 171 LKKPLEDYMAHFPS-VRILRTKKREGLIRTRMLGASAASGDVITFLDSHCEANVNWLPPL 229
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 230 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 286
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 287 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 346
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 347 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEFAEHIYQRRPEY-RHLSAGDVAAQK 403
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE-KPLAKGSM 535
+LR L C SFKW++T + W P + P++ +G IR+V T C + K A GS+
Sbjct: 404 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTDTKHGAVGSL 461
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 106/149 (71%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C K + E LP +SI+I FHNE
Sbjct: 77 MTDAERVDQAYRENGFNIYVSDKIALNRSLPDIRHPNCNNKRYLET-LPNTSIIIPFHNE 135
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A R++R+ R
Sbjct: 136 GWSSLLRTVHSVLNRSPPQLIAEIVLVDDFSDREHLKKPLEDYMAHFPS-VRILRTKKRE 194
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 195 GLIRTRMLGASAASGDVITFLDSHCEANV 223
>gi|410962531|ref|XP_003987822.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1,
partial [Felis catus]
Length = 553
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 157/380 (41%), Positives = 226/380 (59%), Gaps = 24/380 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P +++EI+LVDD S
Sbjct: 106 HYSCPSVAYSADLPATSVIITFHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFS---- 161
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 162 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVATAAVLTFLDSHCEVNTEWLQP 219
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 220 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIARTD 278
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 279 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 338
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 339 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVATRI 397
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKP-------------L 530
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E +
Sbjct: 398 EQRKKMNCKSFRWYLDNVYPELTVPVKEVLPGIIK--QGVNCLESQGQDSAGNFLLGMGI 455
Query: 531 AKGSMNQASGPASLLPCTHL 550
+GS P + L HL
Sbjct: 456 CRGSAKNPPAPQAWLFSDHL 475
Score = 98.6 bits (244), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 74 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVAYSAD-LPATSVIITFHNEAR 132
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P +++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 133 STLLRTVKSVLNRTPAGLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 186
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 187 LIRSRVRGADVATAAVLTFLDSHCE----VNTEWL 217
>gi|124487253|ref|NP_001074890.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Mus musculus]
gi|341940755|sp|Q9JJ61.2|GLTL1_MOUSE RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|52851357|dbj|BAD52071.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74218446|dbj|BAE23810.1| unnamed protein product [Mus musculus]
gi|115527273|gb|AAI10635.1| Galntl1 protein [Mus musculus]
gi|115528977|gb|AAI25016.1| Galntl1 protein [Mus musculus]
Length = 558
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 219/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE + WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRGADVAGATVLTFLDSHCEVNVEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVMEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIK--QGVNCLE 443
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/157 (36%), Positives = 94/157 (59%), Gaps = 8/157 (5%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDKREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LI++R+ GA A +L FLD+HCE + + + L +
Sbjct: 191 LIRSRVRGADVAGATVLTFLDSHCEVNVEWLQPMLQR 227
>gi|119389148|pdb|2D7I|A Chain A, Crsytal Structure Of Pp-Galnac-T10 With Udp, Galnac And
Mn2+
gi|119389151|pdb|2D7R|A Chain A, Crystal Structure Of Pp-galnac-t10 Complexed With
Galnac-ser On Lectin Domain
Length = 570
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 99 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 158
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 159 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 217
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 218 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 274
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 275 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 334
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 335 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAVQK 391
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 392 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 440
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 65 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 123
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 124 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 182
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 183 GLIRTRMLGASVATGDVITFLDSHCEANV 211
>gi|441596034|ref|XP_003276624.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Nomascus leucogenys]
gi|119582046|gb|EAW61642.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
gi|119582047|gb|EAW61643.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 10 (GalNAc-T10),
isoform CRA_d [Homo sapiens]
Length = 506
Score = 302 bits (773), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 35 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 94
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 95 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 153
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 154 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 210
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 211 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 270
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 271 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAVQK 327
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 328 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 376
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 1 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 59
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 60 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 118
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 119 GLIRTRMLGASVATGDVITFLDSHCEANV 147
>gi|392347955|ref|XP_232988.5| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Rattus norvegicus]
Length = 579
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LPK+S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 125 YDYDNLPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKEPLAN 184
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+++L R+IR+ R GL++ARLLGA A GE+L FLD HCEC GWLE L+ R+ E
Sbjct: 185 ELSQLP-KVRLIRASKREGLVRARLLGASAARGEVLTFLDCHCECHEGWLEPLLQRIHEK 243
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+ + + R + ++
Sbjct: 244 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERKLMRSPI-DVIRS 302
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 303 PTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 362
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMD++ E Y+ NP+A + V R +LR +L+
Sbjct: 363 QAPYSRSKALA-----NSVRAAEVWMDDFKELYYHRNPQA-RLEPFGDVTERKKLRAKLQ 416
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG + +
Sbjct: 417 CKDFKWFLDTVYPELHVPEDRPGFFGMLEN 446
Score = 136 bits (343), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 89/238 (37%), Positives = 130/238 (54%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
Q ++ N+ +SDRI ++R LP+ P C+ ++ + LPK+S+VI F+NEAWS LLRTV+S
Sbjct: 94 QRHQINIYLSDRISLHRRLPERWNPLCREVKYDYDNLPKTSVVIAFYNEAWSTLLRTVYS 153
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L +++L R+IR+ R GL++ARLLGA
Sbjct: 154 VLETSPDILLEEVILVDDYSDREHLKEPLANELSQLP-KVRLIRASKREGLVRARLLGAS 212
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A GE+L FLD HCEC +E W LL+ +H S +++
Sbjct: 213 AARGEVLTFLDCHCEC-------------------HEGWLEPLLQRIHEKES----AVVC 249
Query: 232 EILLVDDASTREFLKSSLDEYVAKLS---------VPTR----------VIRSPGRVG 270
++ V D +T E+L +S + + VP R VIRSP G
Sbjct: 250 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERKLMRSPIDVIRSPTMAG 307
>gi|410912128|ref|XP_003969542.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Takifugu rubripes]
Length = 558
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 211/332 (63%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SS+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + L + + KL
Sbjct: 122 LPASSVVITFHNEARSALLRTVVSVLKKSPPHLVKEIILVDDYSDNPEDGALLGK-IDKL 180
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 181 ----RVLRNDRREGLMRSRVRGADAATAPVLTFLDSHCECNDHWLEPLLERVAEDKTRVV 236
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 237 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTLDQRRARQGNPIAPIKTPMIAGG 296
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 297 LFVMDKEYFEQLGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 356
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLEL+K++ C FKW
Sbjct: 357 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELKKRVGCKPFKW 415
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 416 YLENVYPELRVPDHQDIAFGALQ--QGGNCLD 445
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/152 (43%), Positives = 95/152 (62%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R C+ K + + LP SS+VI FHNEA SALL
Sbjct: 82 GQDPYARNKFNQVESDKLRMDRAVPDTRHDHCRHKQWKSD-LPASSVVITFHNEARSALL 140
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + L + + KL RV+R+ R GL+++R
Sbjct: 141 RTVVSVLKKSPPHLVKEIILVDDYSDNPEDGALLGK-IDKL----RVLRNDRREGLMRSR 195
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A +L FLD+HCEC + E L +
Sbjct: 196 VRGADAATAPVLTFLDSHCECNDHWLEPLLER 227
>gi|410039926|ref|XP_518048.4| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Pan
troglodytes]
Length = 551
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 80 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 139
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 140 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 198
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 199 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 255
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 256 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 315
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 316 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAVQK 372
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 373 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 421
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 46 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 104
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 105 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 163
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 164 GLIRTRMLGASVATGDVITFLDSHCEANV 192
>gi|395817210|ref|XP_003782067.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Otolemur garnettii]
Length = 603
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 223/349 (63%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+ Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEAYMA-LFPSVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 RLRTSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 473
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 77/149 (51%), Positives = 107/149 (71%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
+SDA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MSDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L+ Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEAYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASVATGDVITFLDSHCEANV 244
>gi|405973911|gb|EKC38600.1| Polypeptide N-acetylgalactosaminyltransferase 2 [Crassostrea gigas]
Length = 581
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 158/376 (42%), Positives = 222/376 (59%), Gaps = 15/376 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ +C ++ L +S++I FHNEA S LLRT+ SV SRSP+ ++ EI+LVDD F
Sbjct: 132 NAQCREEQHDSNLDPTSVIITFHNEARSTLLRTIVSVFSRSPKHLITEIILVDD-----F 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
D + +++R+ R GL+++R+ GA A IL FLD+HCEC +GWLE L
Sbjct: 187 SDDPSDGQELAVIKRVKILRNDKREGLMRSRVKGADAARAPILTFLDSHCECNVGWLEPL 246
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ R+ DRTRVV P+ID+I+ F Y+ + G F+W L F+W + + +
Sbjct: 247 LDRIKGDRTRVVSPIIDVINMDNFEYIGASADLKGGFDWNLVFKWDYMTPEERNKRAGNP 306
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P +TP +AGGLF+I++ +F +G YD M VWGGENLE+SFRVWQC GS+EI PCS V
Sbjct: 307 IQPIRTPMIAGGLFSIEKKWFEELGKYDRNMDVWGGENLEISFRVWQCHGSLEIIPCSRV 366
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY+FPGG V N R A VWMD + EFY+ P A K + + R++
Sbjct: 367 GHVFRKQHPYTFPGGSGNVFARNTRRAAEVWMDNYKEFYYAAVPSA-KMVNFGDISERMD 425
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
LRK+L C FKW+L HV+P +P D+ FG I+ Q + C++ N A G
Sbjct: 426 LRKRLSCKPFKWFLEHVYPELKVPGHQDQAFGSIQ--QDNNCMDT-----LGNFADGILG 478
Query: 544 LLPCTHLPVLTQMFVM 559
+ PC H Q F +
Sbjct: 479 IFPC-HFAGGNQEFSL 493
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 88/154 (57%), Gaps = 6/154 (3%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
S Q Y N+FN L SD NR +PD R +C+ + + P +S++I FHNEA S
Sbjct: 102 SAGQDCYNRNKFNQLASDNTKSNRHVPDTRNAQCREEQHDSNLDP-TSVIITFHNEARST 160
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLRT+ SV SRSP+ ++ EI+LVDD F D + +++R+ R GL++
Sbjct: 161 LLRTIVSVFSRSPKHLITEIILVDD-----FSDDPSDGQELAVIKRVKILRNDKREGLMR 215
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+R+ GA A IL FLD+HCEC + + E L +
Sbjct: 216 SRVKGADAARAPILTFLDSHCECNVGWLEPLLDR 249
>gi|355693388|gb|EHH27991.1| hypothetical protein EGK_18322, partial [Macaca mulatta]
Length = 499
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 51 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 106
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 107 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 164
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 165 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 223
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 224 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 283
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 284 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 342
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 343 EQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIK--QGPNCLE 384
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 95/159 (59%), Gaps = 12/159 (7%)
Query: 41 LHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHN 100
L L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHN
Sbjct: 16 LAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHN 74
Query: 101 EAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPG 159
EA S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+
Sbjct: 75 EARSTLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDR 128
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
R GLI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 129 REGLIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 163
>gi|344273523|ref|XP_003408571.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Loxodonta africana]
Length = 555
Score = 301 bits (772), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 152/344 (44%), Positives = 217/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A IL FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDQREGLIRSRVRGADVAVAAILTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKISRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPEKEVLPGTIK--QGVNCLE 443
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSLD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDQREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A IL FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVAVAAILTFLDSHCE----VNTEWL 221
>gi|403264517|ref|XP_003924524.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Saimiri boliviensis boliviensis]
Length = 558
Score = 301 bits (772), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 217/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSMSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V SR+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVASRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIK--QGMNCLE 443
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSLD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWL 221
>gi|391332245|ref|XP_003740546.1| PREDICTED: LOW QUALITY PROTEIN: putative polypeptide
N-acetylgalactosaminyltransferase 10-like [Metaseiulus
occidentalis]
Length = 590
Score = 301 bits (772), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 175/422 (41%), Positives = 245/422 (58%), Gaps = 23/422 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDD-ASTRE 243
H EC + LP +SIVI FHNE S LLRT+ SV+ RSP+S++KEI+LVDD +S +
Sbjct: 123 HSECQNIRYAARLPTASIVIPFHNEHLSVLLRTITSVLRRSPKSLIKEIILVDDFSSKKS 182
Query: 244 FLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
+ + L+ Y++ +++R+ R GLI+ARLLGAR AEG++L+FLD+H E + WL
Sbjct: 183 XVSTELENYLSSHFGSQVKLLRATKREGLIRARLLGARAAEGDVLIFLDSHTEANVNWLP 242
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
L+ +A +R VVCP ID+I TFAY E GAF+WEL+++ S D +KR
Sbjct: 243 PLLDPIARNRRTVVCPFIDVIHYETFAYRSQDEGARGAFDWELYYKRLPLLSED--LKRP 300
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
TEPF++P MAGGLFAIDR+YF+ +G YDE + VWGGE E+SF++WQCGG + APCS
Sbjct: 301 --TEPFRSPVMAGGLFAIDRSYFWELGGYDEGLDVWGGEQYELSFKIWQCGGQMFDAPCS 358
Query: 423 HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSR 482
V H++RK +P+ P G+ + + N RVA VWMDE+ EF + P + V +
Sbjct: 359 RVGHIYRKFAPFPNP-GIGDFVGRNYRRVAEVWMDEYKEFLYNRRPHY-RTLGYGDVSKQ 416
Query: 483 LELRKQLKCHSFKWYLTHV---WPHHFLPMD--DKFFGRIRHVQTHKCVEKPLAKGSMNQ 537
LRK+LKC FKW++ V P + P++ D +G IR+V KC++ +
Sbjct: 417 KALRKKLKCKPFKWFMETVAFDQPLRYPPVEPPDFAWGAIRNVGADKCLDT-----KFKE 471
Query: 538 ASGPASLLPCTHLP---VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG 594
SL C Q FV+ DL +VC DV E V + C G
Sbjct: 472 QGKRFSLETCISSNGDVSGEQNFVLTWHKDLRPAKRNVCFDVSSGEKKAP--VVLWTCHG 529
Query: 595 FN 596
+
Sbjct: 530 MH 531
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/144 (47%), Positives = 102/144 (70%), Gaps = 3/144 (2%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++LY++N FN VSD I +NR+LPD+R +C+ + LP +SIVI FHNE S LLR
Sbjct: 96 EKLYKVNGFNAAVSDLIALNRSLPDIRHSECQNIRYAAR-LPTASIVIPFHNEHLSVLLR 154
Query: 109 TVHSVISRSPRSMLKEILLVDD-ASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKA 166
T+ SV+ RSP+S++KEI+LVDD +S + + + L+ Y++ +++R+ R GLI+A
Sbjct: 155 TITSVLRRSPKSLIKEIILVDDFSSKKSXVSTELENYLSSHFGSQVKLLRATKREGLIRA 214
Query: 167 RLLGARQAEGEILVFLDAHCECTL 190
RLLGAR AEG++L+FLD+H E +
Sbjct: 215 RLLGARAAEGDVLIFLDSHTEANV 238
>gi|195435185|ref|XP_002065582.1| GK14594 [Drosophila willistoni]
gi|194161667|gb|EDW76568.1| GK14594 [Drosophila willistoni]
Length = 635
Score = 301 bits (772), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 168/405 (41%), Positives = 239/405 (59%), Gaps = 18/405 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S++I FHNEA S LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+
Sbjct: 203 LPETSVIITFHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKI 258
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL+++R+ GA A +L FLD+H EC WLE L+ RV ED TRVV
Sbjct: 259 D-KVRIIRNDKREGLVRSRVRGADAAVSSVLTFLDSHVECNEQWLEPLLERVREDPTRVV 317
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS F Y+ + G F+W L F+W ++ ++ D T +TP +AGG
Sbjct: 318 CPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPAERSVRHNDPTTAIRTPMIAGG 377
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF ID+AYF +G YD +M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 378 LFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKRHPYTF 437
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ + Y+ P A K + RL L+++L C FKW
Sbjct: 438 PGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLA-KNIPFGNIDDRLALKEKLHCKPFKW 496
Query: 497 YLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQM 556
YL +V+P P + + G+ R T +C++ G + G + PC H Q
Sbjct: 497 YLENVYPDLQAP-EPQEIGQFRQDGT-ECLD---TMGHL--IDGTVGIFPC-HNTGGNQE 548
Query: 557 FVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
+ ++ D +CL + ++ +V + +C QRW
Sbjct: 549 WAYSKRGEIKHDD--LCLTLVQFSR--GSQVVLKSCDDSENQRWN 589
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 100/153 (65%), Gaps = 10/153 (6%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + + Y NRFN SD +P NR +PD R P C+TK F + LP++S++I FHNEA S
Sbjct: 160 LRNGEDPYIRNRFNQEASDALPSNREIPDTRNPMCRTKKFRND-LPETSVIITFHNEARS 218
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRT+ SV++RSP +++EI+LVDD S + + L+ +AK+ R+IR+ R GL+
Sbjct: 219 TLLRTIVSVLNRSPEHLIREIVLVDDYS--DHPEDGLE--LAKID-KVRIIRNDKREGLV 273
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
++R+ GA A +L FLD+H EC NE++L
Sbjct: 274 RSRVRGADAAVSSVLTFLDSHVEC----NEQWL 302
>gi|68534728|gb|AAH98578.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|158260513|dbj|BAF82434.1| unnamed protein product [Homo sapiens]
Length = 558
Score = 301 bits (772), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADMAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 443
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 191 LIRSRVRGADMAAATVLTFLDSHCE----VNTEWLP 222
>gi|402876549|ref|XP_003902024.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Papio
anubis]
Length = 558
Score = 301 bits (772), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIK--QGPNCLE 443
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|380786811|gb|AFE65281.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Macaca mulatta]
Length = 558
Score = 301 bits (772), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIK--QGPNCLE 443
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 95/159 (59%), Gaps = 12/159 (7%)
Query: 41 LHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHN 100
L L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHN
Sbjct: 75 LAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHN 133
Query: 101 EAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPG 159
EA S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+
Sbjct: 134 EARSTLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDR 187
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
R GLI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 188 REGLIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|114625882|ref|XP_001157326.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
troglodytes]
Length = 483
Score = 301 bits (772), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 159/344 (46%), Positives = 217/344 (63%), Gaps = 13/344 (3%)
Query: 181 FLDAHCECTLV---FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD 237
FL C C ++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVD
Sbjct: 18 FLKFRCICRCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVD 77
Query: 238 DASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 297
D S RE LK L ++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC
Sbjct: 78 DYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECH 136
Query: 298 LGWLENLVARVAEDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSD 356
GWLE L+ R+ E+ + VVCPVID+I TF Y+ S E G F+W L F W+T +
Sbjct: 137 EGWLEPLLQRIHEEESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERE 196
Query: 357 AIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSI 416
I R + ++P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +
Sbjct: 197 RIRMRSP-VDVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVL 255
Query: 417 EIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDK 476
E PCSHV H+F K +PYS ++ N R A VWMDE+ E Y+ NP A +
Sbjct: 256 ETHPCSHVGHVFPKQAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPF 309
Query: 477 QAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
V R +LR +L+C FKW+L V+P +P D FFG +++
Sbjct: 310 GDVTERKQLRDKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQN 353
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 77/217 (35%), Positives = 116/217 (53%), Gaps = 44/217 (20%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+CK K ++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE L
Sbjct: 26 RCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHL 85
Query: 138 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
K L ++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC
Sbjct: 86 KERLANELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCEC--------- 135
Query: 198 PKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA-- 254
+E W LL+ +H S +++ ++ V D +T E+L +S + +
Sbjct: 136 ----------HEGWLEPLLQRIHEEES----AVVCPVIDVIDWNTFEYLGNSGEPQIGGF 181
Query: 255 -----------------KLSVPTRVIRSPGRVGLIKA 274
++ P VIRSP G + A
Sbjct: 182 DWRLVFTWHTVPERERIRMRSPVDVIRSPTMAGGLFA 218
>gi|324505926|gb|ADY42538.1| N-acetylgalactosaminyltransferase 7 [Ascaris suum]
Length = 640
Score = 301 bits (772), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 163/373 (43%), Positives = 233/373 (62%), Gaps = 22/373 (5%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC E LP +S+VIVFHNE WS LLRTVHSVI RSP ++LKEI+LVDD S +E LK
Sbjct: 182 ECKYWHYPEDLPTASVVIVFHNEGWSPLLRTVHSVILRSPPNLLKEIVLVDDFSDKEHLK 241
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD Y+ + + R++R+ R GLI+ R +GA+ A G++++FLDAHCE + WL L+A
Sbjct: 242 DRLDRYIEQFNGKVRLVRNNEREGLIRTRSIGAQHAVGDVVIFLDAHCEVNINWLPPLLA 301
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVR---SFELHW-GAFNWELHFRWYTYGSSDAIIKRK 362
+ +R + PVID I T++Y R S + H+ G F W L ++ T + + +RK
Sbjct: 302 PIRRNRKVMTVPVIDGIDMHTWSYRRVYGSADRHFRGIFEWGLLYK-ETEITKEEARRRK 360
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
+EPF++P AGGLFAID+ +F +G YD +Q+WGGE E+SF++WQCGG I PCS
Sbjct: 361 YNSEPFRSPTHAGGLFAIDKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFVPCS 420
Query: 423 HVAHLFRKASPYSFPGGVS--EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
HV H++R PY F G +S V+ N+ RV WMDE+ ++Y+ P A K R +
Sbjct: 421 HVGHVYRSHMPYGF-GKLSGKPVISTNMVRVIKTWMDEYEKYYYIREPSA-KHRSPGDIS 478
Query: 481 SRLELRKQLKCHSFKWYLTH-----VWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSM 535
++LELRK+L C SFKWY+ V+ + FLP ++ +G +++QT KC++ +M
Sbjct: 479 AQLELRKRLHCKSFKWYMEKVAYDVVYSYPFLP-ENHVWGEAKNLQTSKCID------TM 531
Query: 536 NQA-SGPASLLPC 547
+A G PC
Sbjct: 532 GRAIPGIVGATPC 544
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 108/158 (68%), Gaps = 3/158 (1%)
Query: 33 LGDSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKS 92
+G+ V+ L N + Q+ FN VSD I +NR++PDVR +CK + E+ LP +
Sbjct: 139 MGEPVNVDL--NQPEVQRAMNEFGFNTFVSDMISLNRSVPDVRMDECKYWHYPED-LPTA 195
Query: 93 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 152
S+VIVFHNE WS LLRTVHSVI RSP ++LKEI+LVDD S +E LK LD Y+ + +
Sbjct: 196 SVVIVFHNEGWSPLLRTVHSVILRSPPNLLKEIVLVDDFSDKEHLKDRLDRYIEQFNGKV 255
Query: 153 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTL 190
R++R+ R GLI+ R +GA+ A G++++FLDAHCE +
Sbjct: 256 RLVRNNEREGLIRTRSIGAQHAVGDVVIFLDAHCEVNI 293
>gi|291167742|ref|NP_001094333.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Rattus norvegicus]
Length = 558
Score = 301 bits (772), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 218/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P +++EI+LVDD S
Sbjct: 110 HYSCPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPAGLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE + WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRGADVAGASVLTFLDSHCEVNVEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVMEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIK--QGVNCLE 443
Score = 99.0 bits (245), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 92/155 (59%), Gaps = 8/155 (5%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P +++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPAGLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDKREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE + + + L
Sbjct: 191 LIRSRVRGADVAGASVLTFLDSHCEVNVEWLQPML 225
>gi|395740752|ref|XP_003780818.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12, partial [Pongo
abelii]
Length = 538
Score = 301 bits (772), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 87 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 146
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 147 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 205
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 206 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSP-VDVIRS 264
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 265 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 324
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 325 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRAKLQ 378
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 379 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 408
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 56 RLHQINIYLSDRISMHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 115
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 116 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 174
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 175 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 211
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 212 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSPVDVIRSPTMAG 269
>gi|149726707|ref|XP_001501206.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10 [Equus
caballus]
Length = 561
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 90 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 149
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 150 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 208
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 209 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 265
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 266 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 325
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 326 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVAAQK 382
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C +F+W++T + W P + P++ +G IR+V T C +
Sbjct: 383 QLRSSLNCKTFRWFMTQIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 431
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 56 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 114
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 115 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 173
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 174 GLIRTRMLGASAATGDVITFLDSHCEANV 202
>gi|344251833|gb|EGW07937.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Cricetulus
griseus]
Length = 457
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 227/369 (61%), Gaps = 13/369 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP++S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 6 YDYENLPRTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 65
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+++L R+IR+ R GL++ARLLGA A+GE+L FLD HCEC GWLE L+ R+ E
Sbjct: 66 ELSQLPR-VRLIRASKREGLVRARLLGASVAKGEVLTFLDCHCECHEGWLEPLLQRIHEK 124
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+ + + R + ++
Sbjct: 125 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERELMRSPI-DVIRS 183
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 184 PTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 243
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 244 QAPYSRSKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKKLRDKLQ 297
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCT 548
C F+W+L V+P +P D FFG +++ + C++ + NQ G LL
Sbjct: 298 CKDFRWFLKTVYPELHVPEDRPGFFGMLQNKGLREYCLD--YNPPNENQVDGHQVLLYLC 355
Query: 549 HLPVLTQMF 557
H Q F
Sbjct: 356 HGMGQNQFF 364
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 117/216 (54%), Gaps = 44/216 (20%)
Query: 79 CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 138
C+ + ++ E LP++S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK
Sbjct: 1 CREEKYDYENLPRTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 60
Query: 139 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
L +++L R+IR+ R GL++ARLLGA A+GE+L FLD HCEC
Sbjct: 61 ERLANELSQLPR-VRLIRASKREGLVRARLLGASVAKGEVLTFLDCHCEC---------- 109
Query: 199 KSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 257
+E W LL+ +H S +++ ++ V D +T E+L +S + +
Sbjct: 110 ---------HEGWLEPLLQRIHEKES----AVVCPVIDVIDWNTFEYLGNSGEPQIGGFD 156
Query: 258 ---------VPTR----------VIRSPGRVGLIKA 274
VP R VIRSP G + A
Sbjct: 157 WRLVFTWHVVPQRERELMRSPIDVIRSPTMAGGLFA 192
>gi|198415534|ref|XP_002121475.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
2, partial [Ciona intestinalis]
Length = 582
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 211/332 (63%), Gaps = 8/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+++ FHNEA S LLRTV SV++RSP S+++EI+LVDD F ++ D +
Sbjct: 145 LPATSVIVTFHNEARSTLLRTVVSVLNRSPPSLVREIILVDD-----FSDNAEDGQLLAQ 199
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A +L FLD+H EC WLE L+ R+A+DRT VV
Sbjct: 200 IEKVRVLRNNQREGLMRSRIRGADAAAAPVLTFLDSHVECNKNWLEPLLQRIADDRTAVV 259
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP+ID+I+ F Y+ + G F+W L F+W S + + + T P TP +AGG
Sbjct: 260 CPIIDVINMDNFEYIGASADLRGGFDWNLVFKWDYMSSEERRSRAGNPTAPISTPMIAGG 319
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF++D++YF +G YD M VWGGENLE+SFRVWQCGG +EI PCS V H+FRK PY+F
Sbjct: 320 LFSMDKSYFNQLGKYDTAMDVWGGENLEISFRVWQCGGRLEIIPCSRVGHVFRKQHPYTF 379
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ E+Y+ P A K + +RL++R + +C FKW
Sbjct: 380 PGGSGNVFTRNTRRAAEVWMDDYKEYYYAAVPSA-KLIPFGNIENRLQIRVRNQCKPFKW 438
Query: 497 YLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVE 527
YL +V+P +P + FG I+ +KC++
Sbjct: 439 YLENVYPELRVPSKESVAFGSIKQ-GVNKCID 469
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 91/152 (59%), Gaps = 5/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
+ Y+ N+FN SD++ +R +PD R C + ++ LP +S+++ FHNEA S LL
Sbjct: 104 GEDKYKRNKFNQQASDKLKCDRPVPDTRNGLCSSNSWDLSKLPATSVIVTFHNEARSTLL 163
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV++RSP S+++EI+LVDD F ++ D + RV+R+ R GL+++R
Sbjct: 164 RTVVSVLNRSPPSLVREIILVDD-----FSDNAEDGQLLAQIEKVRVLRNNQREGLMRSR 218
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A +L FLD+H EC + E L +
Sbjct: 219 IRGADAAAAPVLTFLDSHVECNKNWLEPLLQR 250
>gi|297298138|ref|XP_001104403.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Macaca
mulatta]
Length = 558
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTIPVKEALPGIIK--QGPNCLE 443
Score = 101 bits (252), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 95/159 (59%), Gaps = 12/159 (7%)
Query: 41 LHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHN 100
L L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHN
Sbjct: 75 LAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHN 133
Query: 101 EAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPG 159
EA S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+
Sbjct: 134 EARSTLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDR 187
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
R GLI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 188 REGLIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|417411867|gb|JAA52354.1| Putative polypeptide n-acetylgalactosaminyltransferase, partial
[Desmodus rotundus]
Length = 599
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 223/349 (63%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 128 HPNCNRKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 187
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 188 LKKPLEDYMAHFPS-VRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 246
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 247 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 303
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 304 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 363
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 364 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEFAEHIYQRRPEY-RHLSAGDVAAQK 420
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 421 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 469
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 106/149 (71%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C K + E LP +SI+I FHNE
Sbjct: 94 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNRKRYLET-LPNTSIIIPFHNE 152
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A R++R+ R
Sbjct: 153 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMAHFPS-VRILRTKKRE 211
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 212 GLIRTRMLGASAAIGDVITFLDSHCEANV 240
>gi|345328051|ref|XP_003431229.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
2 [Ornithorhynchus anatinus]
Length = 863
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 206/318 (64%), Gaps = 4/318 (1%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C LV N+ LP ++I++ F +E WS LLR++HSV++RSP +++EI+LVDD ST+E L
Sbjct: 495 CAEQLVHND--LPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKEHL 552
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K +LD+Y+A+ RV+ R GLI+ARL GA A G++L FLD+H EC +GWLE L+
Sbjct: 553 KDNLDKYMAQFP-KVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPLL 611
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
RV R +V CPVI++ISD +Y G F W ++F W + + T
Sbjct: 612 ERVRLHRKKVACPVIEVISDKDLSYQTVDNFQRGIFTWPMNFGWKSIPPEVIEKNKMKET 671
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+ + P MAGGLF+ID+ YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V
Sbjct: 672 DIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVPCSRVG 731
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE-AEKQRDKQAVRSRLE 484
H+FR +PYSFP + + NL RVA VW+DE+ + ++ E++ D + + E
Sbjct: 732 HIFRNDNPYSFPKDRVKTVERNLVRVAEVWLDEYKDLFYGHGLHLLERRSDIGNLTQQKE 791
Query: 485 LRKQLKCHSFKWYLTHVW 502
LRK+L+C SFKWYL +V+
Sbjct: 792 LRKKLQCKSFKWYLENVF 809
Score = 128 bits (321), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 102/150 (68%), Gaps = 2/150 (1%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
A++ ++ FN+ +SD IPV+R + D R C ++ + + LP ++I++ F +E WS LL
Sbjct: 464 AKKRWKEGNFNVYLSDLIPVDRAIEDTRPDGCAEQLVHND-LPTTTIIMCFVDEVWSTLL 522
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
R++HSV++RSP +++EI+LVDD ST+E LK +LD+Y+A+ RV+ R GLI+AR
Sbjct: 523 RSIHSVLNRSPPHLIQEIILVDDFSTKEHLKDNLDKYMAQFP-KVRVLHLKERHGLIRAR 581
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFL 197
L GA A G++L FLD+H EC + + E L
Sbjct: 582 LAGAEIATGDVLTFLDSHVECNVGWLEPLL 611
>gi|348569970|ref|XP_003470770.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Cavia porcellus]
Length = 579
Score = 301 bits (771), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 214/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP +S++I F+NEAWS LLRTV+SV+ SP +++E++LVDD S +E LK L
Sbjct: 128 YDYENLPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILVEEVILVDDYSDKEHLKERLAN 187
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G +L FLD HCEC GWLE L+ R+ E+
Sbjct: 188 ELSGLP-KVRLIRASKREGLVRARLLGASVARGNVLTFLDCHCECHEGWLEPLLQRIHEE 246
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ + E G F+W L F W+ D + + K + ++
Sbjct: 247 ESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPERDRL-RMKSPIDVIRS 305
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++EI PCSHV H+F K
Sbjct: 306 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLEIHPCSHVGHVFPK 365
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R ELR +L+
Sbjct: 366 QAPYSRSKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPYGDVTERRELRAKLR 419
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C F+W+L +V+P +P D FFG +++
Sbjct: 420 CRDFRWFLENVYPELHVPEDRPGFFGMLQN 449
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 83/238 (34%), Positives = 129/238 (54%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ E LP +S++I F+NEAWS LLRTV+S
Sbjct: 97 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYENLPTTSVIIAFYNEAWSTLLRTVYS 156
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +++E++LVDD S +E LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 157 VLETSPDILVEEVILVDDYSDKEHLKERLANELSGLP-KVRLIRASKREGLVRARLLGAS 215
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G +L FLD HCEC +E W LL+ +H S +++
Sbjct: 216 VARGNVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 252
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L ++ + + ++ P VIRSP G
Sbjct: 253 PVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHVVPERDRLRMKSPIDVIRSPTMAG 310
>gi|345326650|ref|XP_003431069.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 4-like
[Ornithorhynchus anatinus]
Length = 580
Score = 301 bits (771), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 157/326 (48%), Positives = 209/326 (64%), Gaps = 10/326 (3%)
Query: 187 ECTLV-FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
EC V ++ LP +S+VI F+NEAWS LLRTVHSV+ SP +LKE++LVDD S R +L
Sbjct: 125 ECRAVTYDYRRLPTTSVVIAFYNEAWSTLLRTVHSVLETSPAVLLKEVILVDDLSDRPYL 184
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K+ L++YV+ L R++R+ R GL++ARL+GA A GE+L FLD HCEC GWLE L+
Sbjct: 185 KAELEKYVSALQR-VRLVRTNRREGLVRARLIGATFATGEVLTFLDCHCECGPGWLEPLL 243
Query: 306 ARVAEDRTRVVCPVIDIISDVTFA-YVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
R+ + T VVCPVID I TF Y+++ E G F+W L F+W T + +R
Sbjct: 244 ERIGRNETAVVCPVIDTIDWNTFEFYMQTGEPMIGGFDWRLTFQWQTVPERERRRRRSRI 303
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+P +P MAGGLFA+ + YF ++G YD M+VWGGENLE+SFRVWQCGG++EI PCSHV
Sbjct: 304 -DPIPSPTMAGGLFAVGKKYFEYLGTYDMGMEVWGGENLELSFRVWQCGGTLEILPCSHV 362
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+F K +PY+ P N AR A VWMD + E ++ NP A K+ + R
Sbjct: 363 GHVFPKRAPYARPS-----FLRNTARAAEVWMDGYKEHFYNRNPPARKE-SYWDLSERTS 416
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMD 510
LR+ L C SF W +V P +P D
Sbjct: 417 LREXLNCRSFDWLPENVLPRIHVPED 442
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/151 (46%), Positives = 103/151 (68%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++L + N+ +SDRI ++R + D R P+C+ ++ LP +S+VI F+NEAWS LLR
Sbjct: 96 EELVEKYAINIHLSDRISLHRRIRDRRMPECRAVTYDYRRLPTTSVVIAFYNEAWSTLLR 155
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSV+ SP +LKE++LVDD S R +LK+ L++YV+ L R++R+ R GL++ARL
Sbjct: 156 TVHSVLETSPAVLLKEVILVDDLSDRPYLKAELEKYVSALQR-VRLVRTNRREGLVRARL 214
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+GA A GE+L FLD HCEC + E L +
Sbjct: 215 IGATFATGEVLTFLDCHCECGPGWLEPLLER 245
>gi|27370010|ref|NP_766281.1| polypeptide N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|51315979|sp|Q8BGT9.1|GLT12_MOUSE RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
AltName: Full=Polypeptide GalNAc transferase 12;
Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 12;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 12
gi|26329325|dbj|BAC28401.1| unnamed protein product [Mus musculus]
gi|26334957|dbj|BAC31179.1| unnamed protein product [Mus musculus]
gi|33991661|gb|AAH56425.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Mus musculus]
gi|52851351|dbj|BAD52068.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase [Mus musculus]
gi|74140287|dbj|BAE33836.1| unnamed protein product [Mus musculus]
Length = 576
Score = 301 bits (771), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 157/333 (47%), Positives = 214/333 (64%), Gaps = 16/333 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LPK+S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 125 YDYDNLPKTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 184
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+++L R+IR+ R GL++ARLLGA A GE+L FLD HCEC GWLE L+ R+ E
Sbjct: 185 ELSQLP-KVRLIRASRREGLVRARLLGASAARGEVLTFLDCHCECHEGWLEPLLQRIHEK 243
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF-- 368
+ VVCPVID+I TF Y+ S E G F+W L F W+ + R+ P
Sbjct: 244 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRE----RQSMRSPIDV 299
Query: 369 -KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
++P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+
Sbjct: 300 IRSPTMAGGLFAVSKRYFDYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHV 359
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
F K +PYS ++ N R A VWMDE+ E Y+ NP+A + V R +LR
Sbjct: 360 FPKQAPYSRSKALA-----NSVRAAEVWMDEFKELYYHRNPQA-RLEPFGDVTERKKLRA 413
Query: 488 QLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
+L+C FKW+L V+P +P D FFG +++
Sbjct: 414 KLQCKDFKWFLDTVYPELHVPEDRPGFFGMLQN 446
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 88/236 (37%), Positives = 129/236 (54%), Gaps = 44/236 (18%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
++ N+ +SDRI ++R LP+ P C+ ++ + LPK+S+VI F+NEAWS LLRTV+SV+
Sbjct: 96 HQINIYLSDRISLHRRLPERWNPLCREVKYDYDNLPKTSVVIAFYNEAWSTLLRTVYSVL 155
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +L+E++LVDD S RE LK L +++L R+IR+ R GL++ARLLGA A
Sbjct: 156 ETSPDILLEEVILVDDYSDREHLKERLANELSQLP-KVRLIRASRREGLVRARLLGASAA 214
Query: 175 EGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEI 233
GE+L FLD HCEC +E W LL+ +H S +++ +
Sbjct: 215 RGEVLTFLDCHCEC-------------------HEGWLEPLLQRIHEKES----AVVCPV 251
Query: 234 LLVDDASTREFLKSSLDEYVAKLS---------VPTR----------VIRSPGRVG 270
+ V D +T E+L +S + + VP R VIRSP G
Sbjct: 252 IDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERQSMRSPIDVIRSPTMAG 307
>gi|291387688|ref|XP_002710374.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10
[Oryctolagus cuniculus]
Length = 603
Score = 301 bits (771), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 225/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 132 HPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREH 191
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 192 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 250
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 251 LDRIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKVD 307
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 308 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 367
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 368 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAAQK 424
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 425 KLRSSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCAD 473
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 108/149 (72%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K + E LP +SI+I FHNE
Sbjct: 98 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLET-LPNTSIIIPFHNE 156
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 157 GWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 215
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 216 GLIRTRMLGASVAIGDVITFLDSHCEANV 244
>gi|410897066|ref|XP_003962020.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Takifugu rubripes]
Length = 600
Score = 301 bits (771), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 154/329 (46%), Positives = 218/329 (66%), Gaps = 10/329 (3%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C TLV ++ LP +S++ F +E WS LLR+VHSV++RSP +L+EI+LVDD ST+E+L
Sbjct: 148 CAQTLVHDD--LPSTSVIFCFVDEVWSTLLRSVHSVLNRSPPHLLEEIILVDDFSTKEYL 205
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K+ LD+Y+++ R+IR R GLI+ARL GA A+GE+L FLD+H EC +GWLE L+
Sbjct: 206 KAPLDKYMSQFP-KVRIIRLRERQGLIRARLAGAAAAKGEVLTFLDSHVECNVGWLEPLL 264
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
R+ DR +V CPVI++I+D +Y+ G F W L F W +A IK+ + T
Sbjct: 265 ERIYMDRRKVPCPVIEVINDKDMSYMLVDNFQRGIFRWPLVFGWSPL--PEAYIKKHNLT 322
Query: 366 --EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+P + P MAGGLF+ID+ YF+ +GAYD + VWGGEN+E+SF++W CGG IEI PCS
Sbjct: 323 ISDPIRCPVMAGGLFSIDKKYFYELGAYDSGLDVWGGENMEISFKIWMCGGEIEIIPCSR 382
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR---DKQAVR 480
V H+FR +PY FP + + NLARVA VW+DE+ + ++ + D +
Sbjct: 383 VGHIFRGQNPYKFPKDRQKTVERNLARVAEVWLDEYKDLFYGHGYHHLLDKSVIDIGDLS 442
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPM 509
++ELR+QLKC SFKWYL +V+P P+
Sbjct: 443 EQIELRRQLKCKSFKWYLENVYPDMNAPL 471
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 66/143 (46%), Positives = 102/143 (71%), Gaps = 2/143 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN+ +S++IP++R +PD R C + +++ LP +S++ F +E WS LLR+VHSV++R
Sbjct: 126 FNVYLSNQIPLDRAIPDTRPESCAQTLVHDD-LPSTSVIFCFVDEVWSTLLRSVHSVLNR 184
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
SP +L+EI+LVDD ST+E+LK+ LD+Y+++ R+IR R GLI+ARL GA A+G
Sbjct: 185 SPPHLLEEIILVDDFSTKEYLKAPLDKYMSQFP-KVRIIRLRERQGLIRARLAGAAAAKG 243
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
E+L FLD+H EC + + E L +
Sbjct: 244 EVLTFLDSHVECNVGWLEPLLER 266
>gi|62122367|dbj|BAD93178.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|119601393|gb|EAW80987.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_b
[Homo sapiens]
gi|168269696|dbj|BAG09975.1| polypeptide N-acetylgalactosaminyltransferase-like protein 1
[synthetic construct]
Length = 542
Score = 301 bits (771), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 443
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|149639580|ref|XP_001512277.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 isoform
1 [Ornithorhynchus anatinus]
Length = 949
Score = 301 bits (770), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 206/318 (64%), Gaps = 4/318 (1%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C LV N+ LP ++I++ F +E WS LLR++HSV++RSP +++EI+LVDD ST+E L
Sbjct: 495 CAEQLVHND--LPTTTIIMCFVDEVWSTLLRSIHSVLNRSPPHLIQEIILVDDFSTKEHL 552
Query: 246 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLV 305
K +LD+Y+A+ RV+ R GLI+ARL GA A G++L FLD+H EC +GWLE L+
Sbjct: 553 KDNLDKYMAQFP-KVRVLHLKERHGLIRARLAGAEIATGDVLTFLDSHVECNVGWLEPLL 611
Query: 306 ARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
RV R +V CPVI++ISD +Y G F W ++F W + + T
Sbjct: 612 ERVRLHRKKVACPVIEVISDKDLSYQTVDNFQRGIFTWPMNFGWKSIPPEVIEKNKMKET 671
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+ + P MAGGLF+ID+ YF+ +G YD + VWGGEN+E+SF+VW CGG IEI PCS V
Sbjct: 672 DIIRCPVMAGGLFSIDKKYFYELGTYDPGLDVWGGENMEISFKVWMCGGEIEIVPCSRVG 731
Query: 426 HLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE-AEKQRDKQAVRSRLE 484
H+FR +PYSFP + + NL RVA VW+DE+ + ++ E++ D + + E
Sbjct: 732 HIFRNDNPYSFPKDRVKTVERNLVRVAEVWLDEYKDLFYGHGLHLLERRSDIGNLTQQKE 791
Query: 485 LRKQLKCHSFKWYLTHVW 502
LRK+L+C SFKWYL +V+
Sbjct: 792 LRKKLQCKSFKWYLENVF 809
Score = 128 bits (322), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 102/150 (68%), Gaps = 2/150 (1%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
A++ ++ FN+ +SD IPV+R + D R C ++ + + LP ++I++ F +E WS LL
Sbjct: 464 AKKRWKEGNFNVYLSDLIPVDRAIEDTRPDGCAEQLVHND-LPTTTIIMCFVDEVWSTLL 522
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
R++HSV++RSP +++EI+LVDD ST+E LK +LD+Y+A+ RV+ R GLI+AR
Sbjct: 523 RSIHSVLNRSPPHLIQEIILVDDFSTKEHLKDNLDKYMAQFP-KVRVLHLKERHGLIRAR 581
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFL 197
L GA A G++L FLD+H EC + + E L
Sbjct: 582 LAGAEIATGDVLTFLDSHVECNVGWLEPLL 611
>gi|390361781|ref|XP_790897.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Strongylocentrotus purpuratus]
Length = 521
Score = 301 bits (770), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 169/422 (40%), Positives = 252/422 (59%), Gaps = 27/422 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S++I FHNEA S L RTVHS+ +RSP ++ EI+LVDD S R +LK LD+Y++
Sbjct: 58 LPNVSVIIPFHNEALSTLKRTVHSIFNRSPPELIHEIILVDDFSDRAYLKGPLDDYMSAF 117
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR R GLI+ RLLGA A G++++FLD+HCE WL L+ R+A +R R+V
Sbjct: 118 P-KVKIIRLEKREGLIRTRLLGAGPATGDVVLFLDSHCEANYNWLPPLLERIALNRRRIV 176
Query: 317 CPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF-TEPFKTPAMA 374
CP+ID+IS+ F Y ++ ++ GAF+WEL+++ S+A KR+ ++PF+TP MA
Sbjct: 177 CPMIDVISNEDFHYESQAGDVMRGAFDWELYYKRIPI--SEAENKRRSHESDPFRTPIMA 234
Query: 375 GGLFAIDRAYFF-HIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
GGLFA+DR YF +G YDE +++WGGE ++SF+VW CGG +E PCS V H++RK
Sbjct: 235 GGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVWMCGGEMEEIPCSRVGHIYRKFMS 294
Query: 434 YSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHS 493
Y+ PGG V+ NL RV VWMDEW +++++ P + Q D + +L LR++L+C +
Sbjct: 295 YTVPGGAG-VINKNLLRVVEVWMDEWGKYFYERRPYLKGQ-DYGDISKQLALRERLQCKN 352
Query: 494 FKWYLTHVWP---HHFLPMDDKFF--GRIRHVQTHKCVEKPLAKGSMN----QASGPASL 544
F W+LT V P ++ P++ + G I H T KC+ L++G + Q P S+
Sbjct: 353 FTWFLTEVAPDILQYYPPVEPEGGAKGHITHTSTGKCLT--LSQGGKDELRVQECNPRSM 410
Query: 545 LPCTHLPVLTQMFVMKLPTDLIATDES--VCLDVPEYENDISPRVRILACSGFNRQRWTY 602
+Q + + D + +S C+D P P + G N Q W Y
Sbjct: 411 KQGG-----SQFWELTWHDDFRPSSKSRKQCVDFPYGREGAEPILYPCHHGGGN-QLWVY 464
Query: 603 DK 604
D+
Sbjct: 465 DE 466
Score = 131 bits (330), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 69/144 (47%), Positives = 95/144 (65%), Gaps = 6/144 (4%)
Query: 55 NRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
N FN VSD I ++R LPD+R P+CK + + LP S++I FHNEA S L RTVHS+
Sbjct: 25 NGFNERVSDMISMDRALPDIRNPRCKEITYLAK-LPNVSVIIPFHNEALSTLKRTVHSIF 83
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
+RSP ++ EI+LVDD S R +LK LD+Y++ ++IR R GLI+ RLLGA A
Sbjct: 84 NRSPPELIHEIILVDDFSDRAYLKGPLDDYMSAFP-KVKIIRLEKREGLIRTRLLGAGPA 142
Query: 175 EGEILVFLDAHCECTLVFNEEFLP 198
G++++FLD+HCE N +LP
Sbjct: 143 TGDVVLFLDSHCEA----NYNWLP 162
>gi|354475881|ref|XP_003500155.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Cricetulus griseus]
Length = 559
Score = 301 bits (770), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 227/369 (61%), Gaps = 13/369 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP++S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 108 YDYENLPRTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 167
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+++L R+IR+ R GL++ARLLGA A+GE+L FLD HCEC GWLE L+ R+ E
Sbjct: 168 ELSQLPR-VRLIRASKREGLVRARLLGASVAKGEVLTFLDCHCECHEGWLEPLLQRIHEK 226
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+ + + R + ++
Sbjct: 227 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPQRERELMRSPI-DVIRS 285
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 286 PTMAGGLFAVSKRYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 345
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 346 QAPYSRSKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKKLRDKLQ 399
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCT 548
C F+W+L V+P +P D FFG +++ + C++ + NQ G LL
Sbjct: 400 CKDFRWFLKTVYPELHVPEDRPGFFGMLQNKGLREYCLD--YNPPNENQVDGHQVLLYLC 457
Query: 549 HLPVLTQMF 557
H Q F
Sbjct: 458 HGMGQNQFF 466
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 80/217 (36%), Positives = 118/217 (54%), Gaps = 44/217 (20%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C+ + ++ E LP++S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE L
Sbjct: 102 RCREEKYDYENLPRTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHL 161
Query: 138 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
K L +++L R+IR+ R GL++ARLLGA A+GE+L FLD HCEC
Sbjct: 162 KERLANELSQLPR-VRLIRASKREGLVRARLLGASVAKGEVLTFLDCHCEC--------- 211
Query: 198 PKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
+E W LL+ +H S +++ ++ V D +T E+L +S + +
Sbjct: 212 ----------HEGWLEPLLQRIHEKES----AVVCPVIDVIDWNTFEYLGNSGEPQIGGF 257
Query: 257 S---------VPTR----------VIRSPGRVGLIKA 274
VP R VIRSP G + A
Sbjct: 258 DWRLVFTWHVVPQRERELMRSPIDVIRSPTMAGGLFA 294
>gi|397499979|ref|XP_003820707.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Pan
paniscus]
Length = 568
Score = 301 bits (770), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 117 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 176
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 177 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 235
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 236 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSP-VDVIRS 294
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 295 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 354
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 355 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRDKLQ 408
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 409 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 438
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 86 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 145
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 146 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 204
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 205 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 241
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 242 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSPVDVIRSPTMAG 299
>gi|344235750|gb|EGV91853.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Cricetulus griseus]
Length = 797
Score = 301 bits (770), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 218/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 94 HYSCPSLSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 149
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE + WL+
Sbjct: 150 --SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQP 207
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 208 MLQRVMEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 266
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 267 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 326
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 327 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 385
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P + G I+ Q C+E
Sbjct: 386 EQRKKMDCKSFRWYLENVYPELTVPAKEVLPGVIK--QGVNCLE 427
Score = 99.4 bits (246), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 106/192 (55%), Gaps = 21/192 (10%)
Query: 6 VVQLLKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQL------YQINRFNL 59
++QL KL N Q+ ++ V G G A+QL Y+ + FN
Sbjct: 25 LLQLWKLVTNLQNGVEAEEV-------TGTPSKGFDEKAYLSAKQLKPGEDPYRQHAFNQ 77
Query: 60 LVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 119
L SD++ +R + D R C + ++ + LP +S++I FHNEA S LLRTV SV++R+P
Sbjct: 78 LESDKLSPDRPIRDTRHYSCPSLSYSLD-LPATSVIITFHNEARSTLLRTVKSVLNRTPA 136
Query: 120 SMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEI 178
++++EI+LVDD S S ++ + +P + +R+ R GLI++R+ GA A +
Sbjct: 137 NLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRGADVAGATV 190
Query: 179 LVFLDAHCECTL 190
L FLD+HCE +
Sbjct: 191 LTFLDSHCEVNI 202
>gi|332228990|ref|XP_003263671.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Nomascus leucogenys]
Length = 558
Score = 301 bits (770), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 443
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|6329812|dbj|BAA86444.1| KIAA1130 protein [Homo sapiens]
Length = 575
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 143 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 198
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 199 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 256
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 257 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 315
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 316 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 375
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 376 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 434
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 435 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 476
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 111 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 169
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 170 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 223
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 224 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 255
>gi|270265820|ref|NP_065743.2| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|270265827|ref|NP_001161840.1| putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Homo sapiens]
gi|332842578|ref|XP_522885.3| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
gi|51316024|sp|Q8N428.2|GLTL1_HUMAN RecName: Full=Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1;
AltName: Full=Polypeptide GalNAc transferase-like
protein 1; Short=GalNAc-T-like protein 1;
Short=pp-GaNTase-like protein 1; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase-like
protein 1; AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase-like protein 1
gi|51490858|emb|CAD44534.1| polypeptide N-acetylgalactosaminyltransferase 16 [Homo sapiens]
gi|112180422|gb|AAH36812.2| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|112818460|gb|AAI22546.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Homo sapiens]
gi|119601392|gb|EAW80986.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|119601394|gb|EAW80988.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1, isoform CRA_a
[Homo sapiens]
gi|164691113|dbj|BAF98739.1| unnamed protein product [Homo sapiens]
gi|410265456|gb|JAA20694.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pan
troglodytes]
Length = 558
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 443
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|158300689|ref|XP_320549.4| AGAP011984-PA [Anopheles gambiae str. PEST]
gi|157013282|gb|EAA00339.4| AGAP011984-PA [Anopheles gambiae str. PEST]
Length = 585
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 171/421 (40%), Positives = 248/421 (58%), Gaps = 28/421 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP S++I+F+NE WSALLRTV+SV++RSP ++LKEI+LV+D ST+ FL + L E+V ++
Sbjct: 122 LPTVSVIIIFYNEHWSALLRTVYSVLNRSPPALLKEIILVNDHSTKPFLWTPLREFVESE 181
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L+ R++ P R GLI AR+ GAR+A G++L+ LD+H E WL L+ +AED
Sbjct: 182 LAPKVRLVDLPERSGLIVARMAGAREARGDVLIVLDSHTEVNTNWLPPLLEPIAEDYRTC 241
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCP ID+I+ TF Y E GAF+W+ +++ D D T+PF +P MAG
Sbjct: 242 VCPFIDVIAHDTFQYRSQDEGKRGAFDWKFYYKRLPLLPGDL----DDPTKPFNSPVMAG 297
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAI +F+ +G YDE + +WGGE E+SF++WQCGG + APCS V H++R +P+
Sbjct: 298 GLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSRVGHVYRGYAPFG 357
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P GV+ V+ N RVA VWMDE+++F ++ NP+ K D + ++ ELR++L+C FK
Sbjct: 358 NPRGVNFVVR-NFKRVAEVWMDEYSQFLYERNPQFAKT-DPGDLSAQRELRERLQCKPFK 415
Query: 496 WYLTHVWPH---HFLPMDDKFF--GRIRHVQTHKCVEKPLAKGSMN-QASGPASLLPC-- 547
W+L V P + P D + F GR++ V + L S+N QA P L C
Sbjct: 416 WFLEVVAPDLLVRYPPRDPQPFASGRVQSVANPR-----LCLDSLNHQAKEPIGLYACAF 470
Query: 548 --THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGFNRQRWTYDK 604
TH P Q F + D+ CLD + ++I + +C Q W YD
Sbjct: 471 NKTH-PQNNQFFTLSYHRDIRVRSNDKCLDAAKLNDEIV----LFSCHESQGNQMWRYDY 525
Query: 605 E 605
E
Sbjct: 526 E 526
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/150 (43%), Positives = 100/150 (66%), Gaps = 18/150 (12%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+L++ N ++ +VSD I +NR+ + E LP S++I+F+NE WSALLRT
Sbjct: 96 KLFKENGYSAVVSDMIALNRS-----------EYLKE--LPTVSVIIIFYNEHWSALLRT 142
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARL 168
V+SV++RSP ++LKEI+LV+D ST+ FL + L E+V ++L+ R++ P R GLI AR+
Sbjct: 143 VYSVLNRSPPALLKEIILVNDHSTKPFLWTPLREFVESELAPKVRLVDLPERSGLIVARM 202
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLP 198
GAR+A G++L+ LD+H E N +LP
Sbjct: 203 AGAREARGDVLIVLDSHTEV----NTNWLP 228
>gi|302563901|ref|NP_001181506.1| polypeptide N-acetylgalactosaminyltransferase 12 [Macaca mulatta]
Length = 581
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 130 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 189
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 190 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 248
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 249 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSP-VDVIRS 307
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 308 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 367
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 368 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 421
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 422 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 451
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 99 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 158
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 159 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 217
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 218 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 254
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 255 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSPVDVIRSPTMAG 312
>gi|426222263|ref|XP_004005316.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12,
partial [Ovis aries]
Length = 467
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 156/330 (47%), Positives = 214/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
+N + LP +S+VI F+NEAWS LLRTV+SV+ SP ++L+E++LVDD S RE LK L
Sbjct: 16 YNYDQLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDTLLEEVILVDDYSDREHLKERLAT 75
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
+A L R+IR+ R GL++ARLLGA A+G++L FLD HCEC GWLE L+ R+ E
Sbjct: 76 ELAGLP-KVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCECHEGWLEPLLQRIHEK 134
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ + E G F+W L F W+ + I R + ++
Sbjct: 135 ESAVVCPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHMVPEKERIRMRSPI-DVIRS 193
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++EI PCSHV H+F +
Sbjct: 194 PTMAGGLFAVSKKYFEYLGSYDIGMEVWGGENLEFSFRIWQCGGTLEIHPCSHVGHVFPR 253
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP+A + V R +LR +L
Sbjct: 254 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPQA-RLEPFGDVTERRQLRARLG 307
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 308 CKDFKWFLNTVYPELHVPEDRPGFFGMLQN 337
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 83/226 (36%), Positives = 121/226 (53%), Gaps = 44/226 (19%)
Query: 69 RTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLV 128
R LP+ P CK K +N + LP +S+VI F+NEAWS LLRTV+SV+ SP ++L+E++LV
Sbjct: 1 RRLPERWNPLCKEKKYNYDQLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDTLLEEVILV 60
Query: 129 DDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
DD S RE LK L +A L R+IR+ R GL++ARLLGA A+G++L FLD HCEC
Sbjct: 61 DDYSDREHLKERLATELAGLP-KVRLIRANKREGLVRARLLGASVAKGDVLTFLDCHCEC 119
Query: 189 TLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
+E W LL+ +H S +++ ++ V D +T E+L +
Sbjct: 120 -------------------HEGWLEPLLQRIHEKES----AVVCPVIDVIDWNTFEYLGN 156
Query: 248 SLDEYVA-------------------KLSVPTRVIRSPGRVGLIKA 274
+ + + ++ P VIRSP G + A
Sbjct: 157 AGEPQIGGFDWRLVFTWHMVPEKERIRMRSPIDVIRSPTMAGGLFA 202
>gi|297695402|ref|XP_002824932.1| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1 [Pongo abelii]
Length = 558
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 443
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 222
>gi|112807221|ref|NP_078918.3| polypeptide N-acetylgalactosaminyltransferase 12 [Homo sapiens]
gi|84028209|sp|Q8IXK2.3|GLT12_HUMAN RecName: Full=Polypeptide N-acetylgalactosaminyltransferase 12;
AltName: Full=Polypeptide GalNAc transferase 12;
Short=GalNAc-T12; Short=pp-GaNTase 12; AltName:
Full=Protein-UDP acetylgalactosaminyltransferase 12;
AltName: Full=UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 12
gi|25815116|emb|CAC80100.2| UDP-GalNAc-transferase 12 [Homo sapiens]
gi|151555601|gb|AAI48740.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
[synthetic construct]
gi|151556464|gb|AAI48410.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12)
[synthetic construct]
gi|261858036|dbj|BAI45540.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [synthetic
construct]
Length = 581
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 214/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 130 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 189
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 190 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 248
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I + + + ++
Sbjct: 249 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERI-RMQSPVDVIRS 307
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 308 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 367
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 368 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRDKLQ 421
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 422 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 451
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 99 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 158
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 159 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 217
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 218 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 254
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 255 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMQSPVDVIRSPTMAG 312
>gi|190358441|ref|NP_001121823.1| polypeptide N-acetylgalactosaminyltransferase 2 [Danio rerio]
Length = 559
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 208/332 (62%), Gaps = 9/332 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SS+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + D +
Sbjct: 123 LPASSVVITFHNEARSALLRTVISVLKKSPPHLVKEIILVDDYS-----DNPEDGALLGK 177
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A +L FLD+HCEC WLE L+ RVAED+TRVV
Sbjct: 178 IEKVRVLRNDRREGLMRSRVRGADAATASVLTFLDSHCECNEHWLEPLLERVAEDKTRVV 237
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 238 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTLEQRRARQGNPIAPIKTPMIAGG 297
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF +G YD M VWGGENLE+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 298 LFVMDKDYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 357
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMD++ FY+ P A + ++SRLE++K+L C FKW
Sbjct: 358 PGGSGTVFARNTRRAAEVWMDDFKNFYYAAVPSA-RNVPYGNIQSRLEMKKRLGCKPFKW 416
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 417 YLENVYPELRVPDHQDIAFGALQ--QGQNCLD 446
Score = 114 bits (284), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 92/152 (60%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R C+ K + + LP SS+VI FHNEA SALL
Sbjct: 83 GQDPYARNKFNQVESDKLRMDRNIPDTRHDHCRHKQWRTD-LPASSVVITFHNEARSALL 141
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + D + RV+R+ R GL+++R
Sbjct: 142 RTVISVLKKSPPHLVKEIILVDDYS-----DNPEDGALLGKIEKVRVLRNDRREGLMRSR 196
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A +L FLD+HCEC + E L +
Sbjct: 197 VRGADAATASVLTFLDSHCECNEHWLEPLLER 228
>gi|153792142|ref|NP_001093363.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 16 [Xenopus laevis]
gi|148744516|gb|AAI42582.1| LOC100101309 protein [Xenopus laevis]
Length = 563
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 156/372 (41%), Positives = 218/372 (58%), Gaps = 17/372 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT V + LP +S++I FHNEA S LLRT+ SV+ RSP ++++EI+LVDD ST
Sbjct: 118 HYRCTSVHYDNDLPSTSVIITFHNEARSTLLRTIKSVLIRSPGNLIQEIILVDDFST--- 174
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
D+ +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 175 ---DPDDCQLLTKIPKVKCLRNNRREGLIRSRVRGAELAAAPVLTFLDSHCEVNNEWLQP 231
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV +D TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 232 LLQRVKDDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTD 290
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T +TP +AGG+F ID+++F +G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 291 PTSSIRTPVIAGGIFVIDKSWFNQLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSR 350
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R VWMDE+ ++Y++ P A + +V R
Sbjct: 351 VGHVFRKRHPYEFPDGNALTYIKNTKRTVEVWMDEYKQYYYQARPSA-IGKSYGSVADRA 409
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMN------Q 537
ELRK+L C SF+WYL +V+P +P + G I+ Q C+E + N Q
Sbjct: 410 ELRKKLSCKSFQWYLQNVYPELKVPEKEVISGLIK--QGGNCLESQTRDTTGNNPIMLTQ 467
Query: 538 ASGPASLLPCTH 549
G A+ P
Sbjct: 468 CKGSANSAPAAQ 479
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/148 (41%), Positives = 88/148 (59%), Gaps = 11/148 (7%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FN L SD++ R + D R +C T V + LP +S++I FHNEA S LLRT+
Sbjct: 94 YRQHAFNQLESDKLSSERPIRDTRHYRC-TSVHYDNDLPSTSVIITFHNEARSTLLRTIK 152
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV+ RSP ++++EI+LVDD ST D+ +P + +R+ R GLI++R+ G
Sbjct: 153 SVLIRSPGNLIQEIILVDDFST------DPDDCQLLTKIPKVKCLRNNRREGLIRSRVRG 206
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLP 198
A A +L FLD+HCE V NE P
Sbjct: 207 AELAAAPVLTFLDSHCE---VNNEWLQP 231
>gi|22122074|dbj|BAC07181.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 [Homo sapiens]
Length = 581
Score = 301 bits (770), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 214/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 130 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 189
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 190 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 248
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I + + + ++
Sbjct: 249 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERI-RMQSPVDVIRS 307
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 308 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 367
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 368 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRDKLQ 421
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 422 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 451
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 130/238 (54%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 99 RLHQINIYLSDRISLHRRLPVRWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 158
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 159 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 217
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 218 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 254
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 255 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMQSPVDVIRSPTMAG 312
>gi|397507535|ref|XP_003824250.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 [Pan
paniscus]
Length = 529
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 81 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 136
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 137 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 194
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 195 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 253
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 254 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 313
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 314 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 372
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 373 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 414
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 49 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 107
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 108 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 161
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 162 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 193
>gi|402896867|ref|XP_003911504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Papio
anubis]
Length = 581
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 130 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 189
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 190 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 248
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 249 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSP-VDVIRS 307
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 308 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 367
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 368 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 421
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 422 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 451
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 99 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 158
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 159 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 217
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 218 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 254
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 255 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSPVDVIRSPTMAG 312
>gi|345304811|ref|XP_001505904.2| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Ornithorhynchus anatinus]
Length = 555
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 166/420 (39%), Positives = 240/420 (57%), Gaps = 21/420 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H CT LP +SIVI FHNEA S LLRTV SV++R+P ++++EI+LVDD F
Sbjct: 110 HYRCTSAHYPSDLPVTSIVITFHNEARSTLLRTVKSVLNRTPANLVREIILVDD-----F 164
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
D + + + + R GLI++R+ GA A +IL FLD+HCE WL+ L
Sbjct: 165 SADPEDCQLLTRIPKVKCLHNNQREGLIRSRVRGAEVATADILTFLDSHCEVNSEWLQPL 224
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 225 LQRVKEDYTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMSRTDP 283
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+ +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS V
Sbjct: 284 TQSIRTPVIAGGIFVIDKSWFNHLGKYDTQMDIWGGENFELSFRVWMCGGSLEIVPCSRV 343
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY FP G + N R A VWMD++ ++Y++ P A + +V R+E
Sbjct: 344 GHVFRKRHPYDFPEGNALTYIKNTKRAAEVWMDDYKQYYYEARPSA-IGKAFGSVAERVE 402
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASL 544
R+++ C SF+WYL +V+P +P + G IR Q C+E ++G PA +
Sbjct: 403 QRQKMNCKSFQWYLENVYPELKVPEKEPAPGIIR--QGASCLE---SRGRDAAGDSPAGV 457
Query: 545 LPCTHL---PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN-RQRW 600
C P TQ +V +D + + CL + + +V + C+ + RQ+W
Sbjct: 458 GGCRGTAGGPAGTQEWVF---SDPLIRQQDQCLSITSFS--AGSQVTLERCNQKDGRQKW 512
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 89/153 (58%), Gaps = 10/153 (6%)
Query: 45 LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWS 104
L + Y+ + FN L SD++ +R + D R +C + + + LP +SIVI FHNEA S
Sbjct: 79 LKPGEDPYRQHAFNQLESDKLSSDRAIRDTRHYRCTSAHYPSD-LPVTSIVITFHNEARS 137
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV SV++R+P ++++EI+LVDD F D + + + + R GLI
Sbjct: 138 TLLRTVKSVLNRTPANLVREIILVDD-----FSADPEDCQLLTRIPKVKCLHNNQREGLI 192
Query: 165 KARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
++R+ GA A +IL FLD+HCE N E+L
Sbjct: 193 RSRVRGAEVATADILTFLDSHCE----VNSEWL 221
>gi|395849607|ref|XP_003797413.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Otolemur garnettii]
Length = 558
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 217/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A IL FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVATAAILTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIIK--QGVNCLE 443
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSLD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A IL FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVATAAILTFLDSHCE----VNTEWL 221
>gi|351699369|gb|EHB02288.1| Polypeptide N-acetylgalactosaminyltransferase 12 [Heterocephalus
glaber]
Length = 570
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/330 (46%), Positives = 212/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP +S++I F+NEAWS LLRTV+SV+ SP +++E++LVDD S +E LK L E
Sbjct: 119 YDYENLPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILVEEVILVDDYSDKEHLKERLAE 178
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E
Sbjct: 179 ELSALP-KVRLIRATKREGLVRARLLGASVARGDVLTFLDCHCECHEGWLEPLLQRIHEK 237
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G FNW+L F W+ D ++ + + ++
Sbjct: 238 ESAVVCPVIDVIDWNTFEYLGNSREPQVGGFNWQLVFTWHVVPERDRLLMKSPI-DVIRS 296
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 297 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 356
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ P A + V R +LR +L
Sbjct: 357 QAPYSRSKALA-----NSVRAAEVWMDEFKELYYHRTPRA-RLEPYGDVTERRQLRAKLH 410
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C F+W+L V+P +P D FFG +++
Sbjct: 411 CKDFRWFLETVYPELHVPEDRPGFFGMLQN 440
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 91/244 (37%), Positives = 135/244 (55%), Gaps = 47/244 (19%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
++ +L+QIN + +SDRI ++R LP+ P CK K ++ E LP +S++I F+NEAWS L
Sbjct: 85 ESVRLHQINTY---LSDRISLHRRLPERWNPLCKEKKYDYENLPTTSVIIAFYNEAWSTL 141
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LRTV+SV+ SP +++E++LVDD S +E LK L E ++ L R+IR+ R GL++A
Sbjct: 142 LRTVYSVLETSPDILVEEVILVDDYSDKEHLKERLAEELSALP-KVRLIRATKREGLVRA 200
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRS 225
RLLGA A G++L FLD HCEC +E W LL+ +H S
Sbjct: 201 RLLGASVARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEKES-- 239
Query: 226 PRSMLKEILLVDDASTREFLKSSLDEYVAKLS---------VPTR----------VIRSP 266
+++ ++ V D +T E+L +S + V + VP R VIRSP
Sbjct: 240 --AVVCPVIDVIDWNTFEYLGNSREPQVGGFNWQLVFTWHVVPERDRLLMKSPIDVIRSP 297
Query: 267 GRVG 270
G
Sbjct: 298 TMAG 301
>gi|444509912|gb|ELV09433.1| Putative polypeptide N-acetylgalactosaminyltransferase-like protein
1 [Tupaia chinensis]
Length = 566
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 217/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 118 HYSCPSMSYSVDLPATSVIITFHNEARSTLLRTVRSVLNRTPANLIQEIILVDDFS---- 173
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 174 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAAAVLTFLDSHCEVNTEWLQP 231
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W D + R D
Sbjct: 232 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLDQKMTRTD 290
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 291 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 350
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 351 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVATRI 409
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G ++ Q C+E
Sbjct: 410 EQRKKMNCKSFRWYLENVYPELTVPVKEVLPGIMK--QGVNCLE 451
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 86 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSMSYSVD-LPATSVIITFHNEAR 144
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 145 STLLRTVRSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 198
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 199 LIRSRVRGADVAAAAVLTFLDSHCE----VNTEWL 229
>gi|118090108|ref|XP_420520.2| PREDICTED: UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 6 [Gallus gallus]
Length = 601
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HPNCKHKVYLEKLPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+ + + R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKEKLEEYMVRFA-KVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMFDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK LKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKHLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 103/139 (74%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R P CK KV+ E+ LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEK-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+ + + R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMVRFA-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SLARGEVLTFLDSHCEVNV 239
>gi|119579301|gb|EAW58897.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 12 (GalNAc-T12) [Homo
sapiens]
Length = 517
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 214/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 66 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 125
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 126 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 184
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I + + + ++
Sbjct: 185 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERI-RMQSPVDVIRS 243
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 244 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 303
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 304 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRDKLQ 357
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 358 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 387
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 35 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 94
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 95 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 153
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 154 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 190
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 191 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMQSPVDVIRSPTMAG 248
>gi|344276552|ref|XP_003410072.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Loxodonta africana]
Length = 527
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 162/308 (52%), Positives = 205/308 (66%), Gaps = 5/308 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALLRTVHSV R+P +L EI+LVDD S + LK LDEYV K
Sbjct: 150 LPAASVVICFYNEAFSALLRTVHSVTDRTPAHLLHEIILVDDDSDLDDLKGELDEYVQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L T+VIR+ R GLI+ R++GA QA GE+LVFLD+HCE WL+ L+A V ED V
Sbjct: 210 LPGKTKVIRNKKREGLIRGRMIGAAQATGEVLVFLDSHCEVNEMWLQPLLAAVREDPHTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCPVIDIIS T Y S + G FNW LHF+W D + + T P K+P MAG
Sbjct: 270 VCPVIDIISADTLLYSSS-PIVRGGFNWGLHFKW-DLVPFDELGGPEGATAPIKSPTMAG 327
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA++R YF +G YD M +WGGENLE+SFR+W CGG + I PCS V H+FRK PY
Sbjct: 328 GLFAMNRHYFSELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYG 387
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + +L R+A VW+DE+ E YF P+ K R + R+ELRK+L C SFK
Sbjct: 388 SPEGQDTMTHNSL-RLAHVWLDEYKEQYFSLRPDL-KTRSYGNISERVELRKKLGCKSFK 445
Query: 496 WYLTHVWP 503
WYL +++P
Sbjct: 446 WYLDNIYP 453
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 76/147 (51%), Positives = 103/147 (70%), Gaps = 2/147 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ +R +PD R CK K + + LP +S+VI F+NEA+SALLRTVH
Sbjct: 114 YQKHAFNMLISNRLGYHRDVPDTRNAACKEKSYPLD-LPAASVVICFYNEAFSALLRTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV R+P +L EI+LVDD S + LK LDEYV K L T+VIR+ R GLI+ R++G
Sbjct: 173 SVTDRTPAHLLHEIILVDDDSDLDDLKGELDEYVQKYLPGKTKVIRNKKREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFL 197
A QA GE+LVFLD+HCE ++ + L
Sbjct: 233 AAQATGEVLVFLDSHCEVNEMWLQPLL 259
>gi|449268007|gb|EMC78887.1| Polypeptide N-acetylgalactosaminyltransferase 14, partial [Columba
livia]
Length = 514
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 183/425 (43%), Positives = 242/425 (56%), Gaps = 23/425 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRS-----MLKEILLVDDA 239
H CT + + LP +S++I FHNEA S LLRT+ S + S ++ EI+LVDD
Sbjct: 54 HHRCTTLHYRQDLPPTSVIITFHNEARSTLLRTIRSTVMHFLSSFFTVHLVHEIILVDDF 113
Query: 240 STREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLG 299
S + KL + +R+ R GLI++R+ GA A+ +L FLD+HCE
Sbjct: 114 SD----DPDDCRLLGKLP-KVKCLRNGRREGLIRSRIRGADVAQAGVLTFLDSHCEVNKD 168
Query: 300 WLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAII 359
WL L+ R+ ED TRVV PVIDII+ TFAYV + G F+W LHF+W S +
Sbjct: 169 WLLPLLQRIKEDPTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQL-SPEQKA 227
Query: 360 KRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
KR D TEP KTP +AGGLF ID+A+F H+G YD M +WGGEN E+SFRVW CGGS+EI
Sbjct: 228 KRLDPTEPIKTPIIAGGLFMIDKAWFNHLGKYDSAMDIWGGENFEISFRVWMCGGSLEII 287
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
PCS V H+FRK PY FP G + N R A VWMDE+ +Y+ P A+ R V
Sbjct: 288 PCSRVGHVFRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKRYYYAARPAAQG-RPYGNV 346
Query: 480 RSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQA 538
+SR+ELRK+LKCHSFKWYL +V+P +P + + G IR Q+ C+E + S +Q
Sbjct: 347 QSRVELRKRLKCHSFKWYLENVYPELRIPEESLYQTGMIRQRQS--CLE---SHKSEDQE 401
Query: 539 SGPASLLPCTHLPVLTQMFVMK--LPTDLIATDESVCLDVPEYENDISPRVRILACS-GF 595
SL PCT T + + + +CL V Y +V + C G
Sbjct: 402 FPVLSLNPCTGSKGTTAATAQEWTYTYNHQVRQQQLCLSV--YTLFPGSQVLLSPCKEGD 459
Query: 596 NRQRW 600
N+QRW
Sbjct: 460 NKQRW 464
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 81/141 (57%), Gaps = 11/141 (7%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+++ FN S+RIP +R + D R +C T + ++ LP +S++I FHNEA S LLRT+
Sbjct: 30 YRLHAFNQRESERIPSDRAVRDTRHHRCTTLHYRQD-LPPTSVIITFHNEARSTLLRTIR 88
Query: 112 SVISRSPRS-----MLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
S + S ++ EI+LVDD S + KL + +R+ R GLI++
Sbjct: 89 STVMHFLSSFFTVHLVHEIILVDDFSD----DPDDCRLLGKLP-KVKCLRNGRREGLIRS 143
Query: 167 RLLGARQAEGEILVFLDAHCE 187
R+ GA A+ +L FLD+HCE
Sbjct: 144 RIRGADVAQAGVLTFLDSHCE 164
>gi|426362479|ref|XP_004048391.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Gorilla gorilla gorilla]
Length = 633
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 182 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 241
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 242 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 300
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 301 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSP-VDVIRS 359
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 360 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 419
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 420 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRAKLQ 473
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 474 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 503
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 151 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 210
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 211 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 269
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 270 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 306
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 307 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSPVDVIRSPTMAG 364
>gi|224049734|ref|XP_002187605.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
1 [Taeniopygia guttata]
gi|449500484|ref|XP_004176221.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 17 isoform
2 [Taeniopygia guttata]
Length = 601
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE W++LLRT+HS+I+R+P S++ EI+LVDD S RE
Sbjct: 127 HPNCKHKVYLEALPNTSIIIPFHNEGWTSLLRTIHSIINRTPDSLIAEIILVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L+EY+ + + R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 187 LKEKLEEYMLRFA-KVRIVRTKKREGLIRTRLLGASLARGEVLTFLDSHCEVNVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A + +VCP+ID+I F Y ++ + GAF+WE++++ ++R D
Sbjct: 246 LNQIALNHKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQRAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGG + PCS
Sbjct: 303 PSDPFESPVMAGGLFAVNRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGGMYDVPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+AE+ ++ PE + + ++
Sbjct: 363 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEFAEYIYQRRPEY-RHLSTGDISAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK LKC FKW++ V W P ++ P++ +G IR+V + CV+
Sbjct: 420 ELRKHLKCKDFKWFMAAVAWDVPKYYPPVEPPPAAWGEIRNVAANLCVD 468
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 73/139 (52%), Positives = 102/139 (73%), Gaps = 2/139 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ N FN+ VS+ I + R+LPD+R P CK KV+ E LP +SI+I FHNE W++LLRT+H
Sbjct: 103 YRENGFNIFVSNNIALERSLPDIRHPNCKHKVYLEA-LPNTSIIIPFHNEGWTSLLRTIH 161
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S+I+R+P S++ EI+LVDD S RE LK L+EY+ + + R++R+ R GLI+ RLLGA
Sbjct: 162 SIINRTPDSLIAEIILVDDFSDREHLKEKLEEYMLRFA-KVRIVRTKKREGLIRTRLLGA 220
Query: 172 RQAEGEILVFLDAHCECTL 190
A GE+L FLD+HCE +
Sbjct: 221 SLARGEVLTFLDSHCEVNV 239
>gi|301763571|ref|XP_002917213.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like,
partial [Ailuropoda melanoleuca]
Length = 598
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 127 HPNCNGKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 186
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 187 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 245
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A++R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 246 LDRIAQNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 302
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 303 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 362
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 363 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVAAQK 419
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+LR L C +FKW++T + W P + P++ +G I +V T C +
Sbjct: 420 KLRSSLNCKTFKWFMTKIAWDLPKFYPPVEPPAAAWGEIHNVGTGLCAD 468
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 107/149 (71%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C K + E LP +SI+I FHNE
Sbjct: 93 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLET-LPNTSIIIPFHNE 151
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 152 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 210
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 211 GLIRTRMLGASAATGDVITFLDSHCEANV 239
>gi|62148924|dbj|BAD93346.1| UDP-GalNAc: polypeptide N-acetylgalactosaminyltransferase [Homo
sapiens]
Length = 581
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 214/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 130 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 189
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 190 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 248
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I + + + ++
Sbjct: 249 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERI-RMQSPVDVIRS 307
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 308 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 367
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 368 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRDKLQ 421
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 422 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 451
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 99 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 158
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 159 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 217
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 218 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 254
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 255 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMQSPVDVIRSPTMAG 312
>gi|355567593|gb|EHH23934.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
mulatta]
Length = 457
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 6 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 65
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 66 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 124
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 125 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSP-VDVIRS 183
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 184 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 243
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 244 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 297
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 298 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 327
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 76/212 (35%), Positives = 113/212 (53%), Gaps = 44/212 (20%)
Query: 79 CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 138
CK K ++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK
Sbjct: 1 CKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 60
Query: 139 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
L ++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC
Sbjct: 61 ERLANELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCEC---------- 109
Query: 199 KSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA--- 254
+E W LL+ +H S +++ ++ V D +T E+L +S + +
Sbjct: 110 ---------HEGWLEPLLQRIHEEES----AVVCPVIDVIDWNTFEYLGNSGEPQIGGFD 156
Query: 255 ----------------KLSVPTRVIRSPGRVG 270
++ P VIRSP G
Sbjct: 157 WRLVFTWHTVPERERIRMRSPVDVIRSPTMAG 188
>gi|321469963|gb|EFX80941.1| hypothetical protein DAPPUDRAFT_224457 [Daphnia pulex]
Length = 498
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 147/314 (46%), Positives = 206/314 (65%), Gaps = 8/314 (2%)
Query: 199 KSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL---KSSLDEYVAK 255
+S++I F+NE S L RTVHSV+ ++P +L E+LLVDD+S E L + ++E+V K
Sbjct: 23 NTSVIICFYNEDPSTLFRTVHSVLDQTPAELLHEVLLVDDSSDGELLGNIHNEVEEFVGK 82
Query: 256 LSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTR 314
P +++++ R GLI+AR+ GA++A G++L+FLD+HCE W++ L+AR+ E+RT
Sbjct: 83 NFPPKVQLLKTMKREGLIRARIFGAKKATGQVLIFLDSHCEVNREWVQPLIARIQENRTF 142
Query: 315 VVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
VV P+IDII+ TF Y S L G FNW LHF+W + D + +DF +P +P MA
Sbjct: 143 VVTPIIDIINSDTFQYTSS-PLVRGGFNWGLHFKWDSL-PDDTLKTNEDFVKPILSPTMA 200
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAI+R YFF IG YD M VWGGENLE+SFR+W CGG +EI PCS V H+FR+ PY
Sbjct: 201 GGLFAIEREYFFDIGEYDAGMNVWGGENLEISFRIWMCGGRLEIIPCSRVGHVFRRRRPY 260
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
P G + Y +L R A VW+D++ E +F P+A + V R LR+ +KC SF
Sbjct: 261 GSPNGEDTMTYNSL-RAAHVWLDDYIEHFFHVRPDA-RHVSYGDVGPRQRLRRLMKCQSF 318
Query: 495 KWYLTHVWPHHFLP 508
WYL +V+P +P
Sbjct: 319 DWYLKNVYPELTVP 332
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 48/113 (42%), Positives = 80/113 (70%), Gaps = 5/113 (4%)
Query: 79 CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL- 137
C+T+ + + +L +S++I F+NE S L RTVHSV+ ++P +L E+LLVDD+S E L
Sbjct: 12 CQTQEYPKLYL-NTSVIICFYNEDPSTLFRTVHSVLDQTPAELLHEVLLVDDSSDGELLG 70
Query: 138 --KSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ ++E+V K P +++++ R GLI+AR+ GA++A G++L+FLD+HCE
Sbjct: 71 NIHNEVEEFVGKNFPPKVQLLKTMKREGLIRARIFGAKKATGQVLIFLDSHCE 123
>gi|432936506|ref|XP_004082149.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oryzias latipes]
Length = 533
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 147/334 (44%), Positives = 207/334 (61%), Gaps = 9/334 (2%)
Query: 176 GEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILL 235
GE + H C + + LP ++++I FHNEA S LLRTV SV+ RSP S+++E+LL
Sbjct: 82 GERAIRDTRHYRCAALSYDADLPSTTVIITFHNEARSTLLRTVKSVLMRSPPSLIQEVLL 141
Query: 236 VDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC 294
+DD S S L++ +P R +R+ R GLI++R+ GA A IL FLD+HC
Sbjct: 142 IDDFS------SDLEDCQLLAQIPKVRCLRNSRREGLIRSRVKGANSASAPILTFLDSHC 195
Query: 295 ECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGS 354
E WL+ ++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W
Sbjct: 196 EVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-P 254
Query: 355 SDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGG 414
+ + R D T P +TP +AGG+F +D+++F H+G YD M +WGGEN E+SFRVW CGG
Sbjct: 255 IEQKMARSDPTLPIRTPVIAGGIFVMDKSWFNHLGQYDTHMDIWGGENFELSFRVWMCGG 314
Query: 415 SIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR 474
S+EI PCS V H+FRK PY FP G + N R A VWMDE+ +FY+ P A+ +
Sbjct: 315 SLEILPCSRVGHVFRKRHPYDFPEGNALTYIKNTRRAAEVWMDEYKQFYYSARPSAQG-K 373
Query: 475 DKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLP 508
++ RL LR++L C F+WY+ +V+P +P
Sbjct: 374 AFGSITERLSLRRKLNCKPFRWYMENVYPELRVP 407
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 85/137 (62%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FNL SDR+ R + D R +C ++ + LP ++++I FHNEA S LLRTV
Sbjct: 67 YREHAFNLQESDRLGGERAIRDTRHYRCAALSYDAD-LPSTTVIITFHNEARSTLLRTVK 125
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV+ RSP S+++E+LL+DD S S L++ +P R +R+ R GLI++R+ G
Sbjct: 126 SVLMRSPPSLIQEVLLIDDFS------SDLEDCQLLAQIPKVRCLRNSRREGLIRSRVKG 179
Query: 171 ARQAEGEILVFLDAHCE 187
A A IL FLD+HCE
Sbjct: 180 ANSASAPILTFLDSHCE 196
>gi|8918932|dbj|BAA97985.1| unnamed protein product [Mus musculus]
Length = 558
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 218/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSLSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ A A +L FLD+HCE + WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRRADVAGATVLTFLDSHCEVNVEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVMEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 LTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 402 EQRKKMDCKSFRWYLENVYPELTVPVKEVLPGVIK--QGVNCLE 443
Score = 97.1 bits (240), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 93/157 (59%), Gaps = 8/157 (5%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSSDRPIRDTRHYSCPSLSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDKREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LI++R+ A A +L FLD+HCE + + + L +
Sbjct: 191 LIRSRVRRADVAGATVLTFLDSHCEVNVEWLQPMLQR 227
>gi|194225536|ref|XP_001494993.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Equus
caballus]
Length = 460
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 151/321 (47%), Positives = 210/321 (65%), Gaps = 9/321 (2%)
Query: 191 VFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD 250
++ E LP +S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 8 TYDYERLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLA 67
Query: 251 EYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAE 310
+++L R+IR+ R GL++ARLLGA A+G++L FLD HCEC GWLE L+ R+ E
Sbjct: 68 SELSRLP-KVRLIRANKREGLVRARLLGASAAKGDVLTFLDCHCECHEGWLEPLLQRIHE 126
Query: 311 DRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFK 369
+ + VVCPVID+I TF Y+ S E G F+W L F W+ + + R T+ +
Sbjct: 127 EESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPERERLRMRSP-TDVIR 185
Query: 370 TPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFR 429
+P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F
Sbjct: 186 SPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFP 245
Query: 430 KASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQL 489
K +PYS ++ N R A VWMD + E Y+ NP A + V R +LR++L
Sbjct: 246 KQAPYSRSKALA-----NSVRAAEVWMDGYKELYYHRNPHA-RLEPFGDVTERKQLREKL 299
Query: 490 KCHSFKWYLTHVWPHHFLPMD 510
+C F+W+L +V+P +P D
Sbjct: 300 RCKDFRWFLENVYPELHVPED 320
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 80/216 (37%), Positives = 117/216 (54%), Gaps = 44/216 (20%)
Query: 79 CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 138
CK K ++ E LP +S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK
Sbjct: 4 CKKKTYDYERLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 63
Query: 139 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
L +++L R+IR+ R GL++ARLLGA A+G++L FLD HCEC
Sbjct: 64 ERLASELSRLP-KVRLIRANKREGLVRARLLGASAAKGDVLTFLDCHCEC---------- 112
Query: 199 KSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA--- 254
+E W LL+ +H S +++ ++ V D +T E+L +S + +
Sbjct: 113 ---------HEGWLEPLLQRIHEEES----AVVCPVIDVIDWNTFEYLGNSGEPQIGGFD 159
Query: 255 ----------------KLSVPTRVIRSPGRVGLIKA 274
++ PT VIRSP G + A
Sbjct: 160 WRLVFTWHVVPERERLRMRSPTDVIRSPTMAGGLFA 195
>gi|358332241|dbj|GAA27774.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 584
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 170/414 (41%), Positives = 242/414 (58%), Gaps = 23/414 (5%)
Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLS 257
P +++VI FHNE WS LLR+VHSV+ P ++LKEI+LVDD ST E+LKS LD Y+ +L
Sbjct: 135 PATAVVICFHNECWSTLLRSVHSVLDTVPENLLKEIVLVDDFSTYEYLKSPLDLYMKQLK 194
Query: 258 VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVC 317
+VI + R GLI+AR++G + EIL FLD+H EC GWLE L+ + ++++ VV
Sbjct: 195 -KVKVIHTDKREGLIRARMIGMNASTAEILTFLDSHIECNKGWLEPLLDCIQKNQSTVVS 253
Query: 318 PVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
PVID I+D TFAY + ++ G F+W++ + W+ D FT P + P +AG
Sbjct: 254 PVIDRINDDTFAYEPLLLSQIQVGGFDWDMTYNWHVPPKRDLERPGAPFT-PIRAPTIAG 312
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++ R +F ++G YD +M VWGGENLE+SF+ W CGG++++ PCSHV H+FR SPYS
Sbjct: 313 GLFSVHRDFFAYLGYYDPQMDVWGGENLELSFKTWMCGGTLQVHPCSHVGHVFRTKSPYS 372
Query: 436 FPGGVSEVLYGNLARVALVWMDEW-AEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ L NL R+A VWMDE+ FY +F + K D V R LR++LKC SF
Sbjct: 373 AKNNTGDTLRHNLVRLAEVWMDEYKGYFYERF---SFKLGDYGDVSERKALRERLKCRSF 429
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL---PCTHLP 551
KWYL +V+P F+P + G I + C++ S + LL PC L
Sbjct: 430 KWYLNNVFPELFVPSNSLANGDIESFKMAICLD-----ASADDHQPELHLLRGYPCHRLG 484
Query: 552 VLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
Q++ P I D C V E S + + C G ++Q++ Y +E
Sbjct: 485 G-NQLWYWT-PDKEIRRDNR-CWSVDE----ASGFIGMAKCGGTDKQKFNYTEE 531
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/141 (49%), Positives = 93/141 (65%), Gaps = 2/141 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN +SDRI V R L DVR+ C+T+ F P +++VI FHNE WS LLR+VHSV+
Sbjct: 103 FNQYISDRISVRRYLTDVREGLCRTQKFPAN-QPATAVVICFHNECWSTLLRSVHSVLDT 161
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
P ++LKEI+LVDD ST E+LKS LD Y+ +L +VI + R GLI+AR++G +
Sbjct: 162 VPENLLKEIVLVDDFSTYEYLKSPLDLYMKQLK-KVKVIHTDKREGLIRARMIGMNASTA 220
Query: 177 EILVFLDAHCECTLVFNEEFL 197
EIL FLD+H EC + E L
Sbjct: 221 EILTFLDSHIECNKGWLEPLL 241
>gi|281345023|gb|EFB20607.1| hypothetical protein PANDA_005411 [Ailuropoda melanoleuca]
Length = 551
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 224/349 (64%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 80 HPNCNGKRYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 139
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 140 LKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAATGDVITFLDSHCEANVNWLPPL 198
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A++R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 199 LDRIAQNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKAD 255
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 256 PSDPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 315
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE ++ PE + V ++
Sbjct: 316 VGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEHIYQRRPEY-RHLSAGDVAAQK 372
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMDD--KFFGRIRHVQTHKCVE 527
+LR L C +FKW++T + W P + P++ +G I +V T C +
Sbjct: 373 KLRSSLNCKTFKWFMTKIAWDLPKFYPPVEPPAAAWGEIHNVGTGLCAD 421
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 76/149 (51%), Positives = 107/149 (71%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C K + E LP +SI+I FHNE
Sbjct: 46 MTDAERVDQAYRENGFNIYVSDKISLNRSLPDIRHPNCNGKRYLET-LPNTSIIIPFHNE 104
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A L R++R+ R
Sbjct: 105 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKRE 163
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 164 GLIRTRMLGASAATGDVITFLDSHCEANV 192
>gi|354472196|ref|XP_003498326.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Cricetulus griseus]
Length = 513
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 218/344 (63%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 65 HYSCPSLSYSLDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 120
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE + WL+
Sbjct: 121 --SDPEDCLLLTRIPKVKCLRNDKREGLIRSRVRGADVAGATVLTFLDSHCEVNIEWLQP 178
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 179 MLQRVMEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 237
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 238 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 297
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 298 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 356
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P + G I+ Q C+E
Sbjct: 357 EQRKKMDCKSFRWYLENVYPELTVPAKEVLPGVIK--QGVNCLE 398
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 93/155 (60%), Gaps = 8/155 (5%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 33 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSLSYSLD-LPATSVIITFHNEAR 91
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 92 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDKREG 145
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE + + + L
Sbjct: 146 LIRSRVRGADVAGATVLTFLDSHCEVNIEWLQPML 180
>gi|426377334|ref|XP_004055422.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Gorilla gorilla gorilla]
Length = 598
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 216/344 (62%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 150 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 205
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL
Sbjct: 206 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPP 263
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 264 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 322
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 323 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 382
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 383 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 441
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
E RK++ C SF+WYL +V+P +P+ + G I+ Q C+E
Sbjct: 442 EQRKKMNCKSFRWYLENVYPELTVPVKEALPGIIK--QGVNCLE 483
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/156 (37%), Positives = 94/156 (60%), Gaps = 12/156 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 118 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 176
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 177 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 230
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI++R+ GA A +L FLD+HCE N E+LP
Sbjct: 231 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWLP 262
>gi|390345015|ref|XP_787987.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 2
[Strongylocentrotus purpuratus]
gi|390345017|ref|XP_003726244.1| PREDICTED: N-acetylgalactosaminyltransferase 7-like isoform 1
[Strongylocentrotus purpuratus]
Length = 670
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 163/430 (37%), Positives = 238/430 (55%), Gaps = 16/430 (3%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC LP +S+VIVFH E WS L+RT+HSV + SP+ +L E+LLVDD S + LK
Sbjct: 202 ECKHWHYPTNLPNTSVVIVFHQEGWSTLIRTIHSVFNTSPKELLAEVLLVDDYSDKVHLK 261
Query: 247 SSLDEYV--AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LD+Y+ + S R++R+ R GLI++R +GAR+A G++L FLDAHCEC WL L
Sbjct: 262 KKLDDYIRDPRFSGKIRIVRNKKREGLIRSRTIGARKAIGQVLTFLDAHCECGPNWLPPL 321
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFE-LHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+A +A DR+ +VCP +D IS TFAY + L GAF+W+ ++ + K
Sbjct: 322 LAEIAVDRSTIVCPTVDAISSDTFAYTSQGDGLCRGAFDWDFWYKRIPVKPYWHRLGLKQ 381
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++P+ +P MAGGL A+DR+YFF +G YD +Q+WGGEN E+SF+VW CGGS++ PCS
Sbjct: 382 RSQPYPSPVMAGGLLALDRSYFFELGGYDPGLQIWGGENFEISFKVWMCGGSLKFVPCSR 441
Query: 424 VAHLFRKASPYSFPG----GVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
V H++RK PYS+P GVS V+ N RVA VW+DE+ + ++ P E + +
Sbjct: 442 VGHVYRKQVPYSYPSSGVEGVS-VVDLNYMRVAEVWLDEYKDSFYATKPLLEG-KPCGNI 499
Query: 480 RSRLELRKQLKCHSFKWYLTHV----WPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSM 535
++E RKQ SFKW++ + + P +K +G IR V + C++ G+
Sbjct: 500 SEQVEYRKQNCPKSFKWFMEEIAFDNLQEYPPPAKNKIWGEIRSVDSELCLD---TNGAQ 556
Query: 536 NQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF 595
N + + H Q+F + L+ D P E I +
Sbjct: 557 NDPTHATMGMEGCHGMGGNQLFRINTRGQLVVEDGFCLYARPNTEKLIFAKCTAEKSFTS 616
Query: 596 NRQRWTYDKE 605
W YD +
Sbjct: 617 ETTMWDYDSD 626
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 66/134 (49%), Positives = 95/134 (70%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN++VSDRI ++R + D+R +CK + LP +S+VIVFH E WS L+RT+HSV +
Sbjct: 181 FNMVVSDRISLDRIVADLRDKECKHWHYPTN-LPNTSVVIVFHQEGWSTLIRTIHSVFNT 239
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYV--AKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP+ +L E+LLVDD S + LK LD+Y+ + S R++R+ R GLI++R +GAR+A
Sbjct: 240 SPKELLAEVLLVDDYSDKVHLKKKLDDYIRDPRFSGKIRIVRNKKREGLIRSRTIGARKA 299
Query: 175 EGEILVFLDAHCEC 188
G++L FLDAHCEC
Sbjct: 300 IGQVLTFLDAHCEC 313
>gi|322792015|gb|EFZ16120.1| hypothetical protein SINV_06269 [Solenopsis invicta]
Length = 433
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 178/433 (41%), Positives = 251/433 (57%), Gaps = 41/433 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S LK L++Y+ +
Sbjct: 11 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLKRQLEDYM--M 68
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+ P R+IR+ R GLI+ARLLGA A+ +L +LD+HCECT GWLE L+ R+A D T V
Sbjct: 69 NYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECTEGWLEPLLDRIARDPTTV 128
Query: 316 VCPVIDIISDVTFAYVRSFELHW--------GAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
VCPVID+I D T Y HW G F+W L F W+ + + K+ EP
Sbjct: 129 VCPVIDVIDDTTLEY------HWRDSGGVNVGGFDWNLQFNWHAVPERERK-RHKNPAEP 181
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+P MAGGLF+IDRA+F IG YD +WGGENLE+SF+ W CGG++EI PCSHV H+
Sbjct: 182 VWSPTMAGGLFSIDRAFFERIGTYDSGFDIWGGENLELSFKTWMCGGTLEIVPCSHVGHI 241
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK SPY + GV+ VL N R++ VW+DE+A++Y++ + + V R LRK
Sbjct: 242 FRKRSPYKWRSGVN-VLKRNSIRLSEVWLDEYAKYYYQRI--GHDKGNYGDVSERKALRK 298
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKP-------LAK-------- 532
+L C SFKWYL +V+P F+P + G + ++ + + L+K
Sbjct: 299 KLGCKSFKWYLDNVYPELFIPGEAVASGEASPCRIYRGINRDRLSQYWMLSKTGEIRRDE 358
Query: 533 GSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDES-VCLDVPEYENDISPRVRILA 591
++ + L PC H Q ++ T+ I S CL + E + + V
Sbjct: 359 SCLDYSGSDVILYPC-HGSKGNQQWIYNPQTNHIRHGSSDKCLAITESKQQL---VMEEC 414
Query: 592 CSGFNRQRWTYDK 604
S RQRW+++
Sbjct: 415 SSSAPRQRWSFEN 427
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/157 (41%), Positives = 95/157 (60%), Gaps = 25/157 (15%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP ++++I FHNEAWS LLRTVHSV+ RSP +++EI+LVDD S LK L++Y+ +
Sbjct: 11 LPPTAVIICFHNEAWSVLLRTVHSVLDRSPEHLIQEIILVDDFSDMPHLKRQLEDYM--M 68
Query: 149 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFH 207
+ P R+IR+ R GLI+ARLLGA A+ +L +LD+HCECT
Sbjct: 69 NYPKVRIIRANKREGLIRARLLGAAAAKAPVLTYLDSHCECT------------------ 110
Query: 208 NEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
E W L + I+R P +++ ++ V D +T E+
Sbjct: 111 -EGW---LEPLLDRIARDPTTVVCPVIDVIDDTTLEY 143
>gi|296215364|ref|XP_002754093.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1
[Callithrix jacchus]
Length = 558
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 146/328 (44%), Positives = 210/328 (64%), Gaps = 9/328 (2%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P ++++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPANLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKMTRTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTRPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V SR+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIG-KAFGSVASRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDD 511
E RK++ C SF+WYL +V+P +P+ +
Sbjct: 402 EQRKKMNCKSFRWYLENVYPELTVPVKE 429
Score = 98.6 bits (244), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P ++++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPANLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVAAATVLTFLDSHCE----VNTEWL 221
>gi|334348942|ref|XP_001380115.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Monodelphis domestica]
Length = 642
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 159/418 (38%), Positives = 241/418 (57%), Gaps = 16/418 (3%)
Query: 190 LVFNEEF---LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
L E++ LP +S++I FH+EAWS LLRTVHS++ +PR LKEI+LVDD S + LK
Sbjct: 182 LCLQEDYGDHLPTASVIICFHDEAWSTLLRTVHSILDTAPRPFLKEIILVDDLSQQGHLK 241
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
S+L ++VA+L R++RS R+G I+ R+LGA +A G++LVF+D+HCEC GWLE L++
Sbjct: 242 SALSDHVAQLE-GVRLLRSNKRLGAIRGRMLGAARAAGDVLVFMDSHCECHKGWLEPLLS 300
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R+A DR+R+V P+ID+I F Y S +L G F+WEL+F W + +++ +
Sbjct: 301 RIAGDRSRLVSPIIDVIDWKNFQYYHSMDLQRGVFDWELNFHWRPLPEQERKMRQSPIS- 359
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++P + GG+ AIDR YF + GAYD M +WG ENLE+S RVW CGGS+EI PCS V H
Sbjct: 360 PIRSPVLPGGVLAIDRHYFQNTGAYDSLMSIWGSENLELSIRVWLCGGSVEIIPCSRVGH 419
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA--EKQRDKQAVRSRLE 484
++R P + P E N R+ W+ + + +++ +P+A Q +KQ RL+
Sbjct: 420 VYRHQPPNASPD--PEAALKNKIRIVETWLGSFKDTFYQHSPKAFSLSQAEKQDCSERLQ 477
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDK--FFGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
L+++L C +F W+L ++ P + P + K F G++ C E +G G
Sbjct: 478 LQRRLGCRTFHWFLANLSPELY-PSEHKPGFSGKLYSSGVGSCAECVSGQG---LPGGWV 533
Query: 543 SLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRW 600
L PC+ + + +C DV + E ++ + A RQ W
Sbjct: 534 MLSPCSDSRQPQHLEYTSKKQISCGKNPKLCFDVNQ-EGRVTLQNCTEAALSLQRQHW 590
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 60/130 (46%), Positives = 89/130 (68%), Gaps = 2/130 (1%)
Query: 70 TLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVD 129
LP + C + + + LP +S++I FH+EAWS LLRTVHS++ +PR LKEI+LVD
Sbjct: 174 ALPPSKMKLCLQEDYGDH-LPTASVIICFHDEAWSTLLRTVHSILDTAPRPFLKEIILVD 232
Query: 130 DASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
D S + LKS+L ++VA+L R++RS R+G I+ R+LGA +A G++LVF+D+HCEC
Sbjct: 233 DLSQQGHLKSALSDHVAQLE-GVRLLRSNKRLGAIRGRMLGAARAAGDVLVFMDSHCECH 291
Query: 190 LVFNEEFLPK 199
+ E L +
Sbjct: 292 KGWLEPLLSR 301
>gi|443715013|gb|ELU07165.1| hypothetical protein CAPTEDRAFT_143879 [Capitella teleta]
Length = 390
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 199/317 (62%), Gaps = 2/317 (0%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
F+ LPK S++I F E+WS LLR+VHSV++R+P +L+EI+LVDD S R L + LD
Sbjct: 61 FDYSSLPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDN 120
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
Y+ +L T +IR P R GLI+ARL A G +L FLD+H EC +GW E L+ R++ +
Sbjct: 121 YLTRLPKVT-LIRLPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHN 179
Query: 312 RTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
R +V PVID IS F+Y+ G FNW + F+W + + D T P +TP
Sbjct: 180 RRVIVAPVIDAISSRDFSYIPISANQRGGFNWAMLFKWMPVPNYEKSRTGGDPTAPVRTP 239
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
+AGGLFAI + +F +G YD + +WG ENLE+SF+ W CGGS+E+ PCS V H++R
Sbjct: 240 TIAGGLFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPCSRVGHVYRST 299
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKC 491
PYSFPGG +V N RVA VWMD + ++ PE + + SR+ELR++LKC
Sbjct: 300 QPYSFPGGNVKVFMRNNLRVANVWMDGYVNLFYLMKPELRNEPFGD-ISSRVELRRRLKC 358
Query: 492 HSFKWYLTHVWPHHFLP 508
H FKWYL +V P +P
Sbjct: 359 HDFKWYLENVIPELNIP 375
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 68/153 (44%), Positives = 98/153 (64%), Gaps = 1/153 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ +++ FN SD + RT+PD R P+C+ K F+ LPK S++I F E+WS LLR
Sbjct: 26 KEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSFDYSSLPKMSVIICFTEESWSTLLR 85
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
+VHSV++R+P +L+EI+LVDD S R L + LD Y+ +L T +IR P R GLI+ARL
Sbjct: 86 SVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTRLPKVT-LIRLPSRQGLIRARL 144
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
A G +L FLD+H EC + + E L + S
Sbjct: 145 RAIEIARGPVLTFLDSHVECNVGWAEPLLQRIS 177
>gi|441593636|ref|XP_003260599.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 isoform
2 [Nomascus leucogenys]
Length = 495
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 212/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 44 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 103
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 104 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 162
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+ + I R + ++
Sbjct: 163 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHVVPERERIRMRSP-VDVIRS 221
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 222 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 281
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 282 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPRA-RLEPFGDVTERKQLRAKLQ 335
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C FKW+L V+P +P D FFG +++
Sbjct: 336 CKDFKWFLETVYPELHVPEDRPGFFGMLQN 365
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 77/215 (35%), Positives = 115/215 (53%), Gaps = 44/215 (20%)
Query: 76 KPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 135
K +CK K ++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE
Sbjct: 36 KARCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDRE 95
Query: 136 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEE 195
LK L ++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC
Sbjct: 96 HLKERLANELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCEC------- 147
Query: 196 FLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
+E W LL+ +H S +++ ++ V D +T E+L +S + +
Sbjct: 148 ------------HEGWLEPLLQRIHEEES----AVVCPVIDVIDWNTFEYLGNSGEPQIG 191
Query: 255 -------------------KLSVPTRVIRSPGRVG 270
++ P VIRSP G
Sbjct: 192 GFDWRLVFTWHVVPERERIRMRSPVDVIRSPTMAG 226
>gi|395824312|ref|XP_003785413.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Otolemur garnettii]
Length = 508
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 163/364 (44%), Positives = 223/364 (61%), Gaps = 13/364 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
+PK+S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L ++ L
Sbjct: 62 MPKTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSGL 121
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A G +L FLD HCEC GWLE L+ R+ E+ + VV
Sbjct: 122 P-KVRLIRANKREGLVRARLLGASAARGNVLTFLDCHCECHEGWLEPLLQRIHEEESAVV 180
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ S E G F+W L F W+T + + K + ++P MAG
Sbjct: 181 CPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERQ-RMKSPIDVIRSPTMAG 239
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGGS+E PCSHV H+F K +PYS
Sbjct: 240 GLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLETHPCSHVGHVFPKQAPYS 299
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMD++ E Y+ NP A + V R +LR++L+C FK
Sbjct: 300 RNKALA-----NSVRAAEVWMDDYKELYYHRNPRA-RLEPFGDVTERRQLREKLQCKDFK 353
Query: 496 WYLTHVWPHHFLPMD-DKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCTHLPVL 553
W+L V+P +P D FFG +++ K C + NQ +G +L H
Sbjct: 354 WFLETVFPELHVPEDRPGFFGMLQNKGLKKYCFD--YNPPDENQVAGRQVILYLCHGLGQ 411
Query: 554 TQMF 557
Q F
Sbjct: 412 NQFF 415
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 113/216 (52%), Gaps = 44/216 (20%)
Query: 79 CKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 138
C+ K ++ +PK+S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK
Sbjct: 52 CREKKYDYANMPKTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLK 111
Query: 139 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
L ++ L R+IR+ R GL++ARLLGA A G +L FLD HCEC
Sbjct: 112 ERLANELSGLP-KVRLIRANKREGLVRARLLGASAARGNVLTFLDCHCEC---------- 160
Query: 199 KSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA--- 254
+E W LL+ +H S +++ ++ V D +T E+L +S + +
Sbjct: 161 ---------HEGWLEPLLQRIHEEES----AVVCPVIDVIDWNTFEYLGNSGEPQIGGFD 207
Query: 255 ----------------KLSVPTRVIRSPGRVGLIKA 274
++ P VIRSP G + A
Sbjct: 208 WRLVFTWHTVPERERQRMKSPIDVIRSPTMAGGLFA 243
>gi|345317797|ref|XP_001520970.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Ornithorhynchus anatinus]
Length = 467
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/325 (47%), Positives = 212/325 (65%), Gaps = 10/325 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S+VI F+NEA S LLRTVHSV+ SP +L E++LVDD S + LK L+ ++A L
Sbjct: 22 LPRTSVVIAFYNEARSTLLRTVHSVLETSPDVLLNEVILVDDYSDKGHLKEPLENHLAGL 81
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+ + VV
Sbjct: 82 P-KVRLIRASKREGLVRARLLGASIATGQVLTFLDCHCECHEGWLEPLLERIREEESAVV 140
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ + E G F+W L F W+ + +R+ + ++P MAG
Sbjct: 141 CPVIDVIDWNTFEYLGNAGEPQIGGFDWRLVFTWHPIPEREQK-RRRSKVDVIRSPTMAG 199
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGGS+EI PCSHV H+F K +PYS
Sbjct: 200 GLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGSLEIHPCSHVGHVFPKQAPYS 259
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMD + E Y+ NP A + V +R +LR +LKC FK
Sbjct: 260 RSKALA-----NSVRAAEVWMDGYKELYYHRNPHA-RLEPYGDVTARRDLRSKLKCRDFK 313
Query: 496 WYLTHVWPHHFLPMD-DKFFGRIRH 519
W+L +V+P +P D +FG +++
Sbjct: 314 WFLENVYPELHVPEDRPGYFGMLKN 338
Score = 113 bits (282), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 56/111 (50%), Positives = 77/111 (69%), Gaps = 1/111 (0%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+C K ++ LP++S+VI F+NEA S LLRTVHSV+ SP +L E++LVDD S + L
Sbjct: 11 RCNEKKYDYRRLPRTSVVIAFYNEARSTLLRTVHSVLETSPDVLLNEVILVDDYSDKGHL 70
Query: 138 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCEC 188
K L+ ++A L R+IR+ R GL++ARLLGA A G++L FLD HCEC
Sbjct: 71 KEPLENHLAGLP-KVRLIRASKREGLVRARLLGASIATGQVLTFLDCHCEC 120
>gi|332027983|gb|EGI68034.1| Polypeptide N-acetylgalactosaminyltransferase 1 [Acromyrmex
echinatior]
Length = 597
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 163/353 (46%), Positives = 225/353 (63%), Gaps = 20/353 (5%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C V ++ LP +SI+I+F+NE WS LLRTVHSV+ SP ++LKEI+LVDD S E L+
Sbjct: 132 CANVTYDKLLPSASIIIIFYNEPWSVLLRTVHSVLKGSPPNLLKEIILVDDHSEEEELQG 191
Query: 248 SLDEYVA-KLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD Y++ +L +++R P R GLI+ARL GA+ A G++LVFLDAHCE WL+ L+
Sbjct: 192 QLDYYLSTRLPAKVKLLRLPYRQGLIRARLHGAKNAVGDVLVFLDAHCEVIKDWLQPLLQ 251
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYV---RSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
R+ +++ V+ P+ID IS+ T Y +F G F W HF W T + + R
Sbjct: 252 RIKDNKNAVLMPIIDNISEETLEYFHDNEAFFFQVGGFTWSGHFTWITIQKHE-VESRFS 310
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
P ++P MAGGLFAI+R YF+ IG+YD++M WGGENLE+SFR+WQCGG++EI PCS
Sbjct: 311 PISPTRSPTMAGGLFAINRKYFWEIGSYDDKMDGWGGENLEISFRIWQCGGTLEIIPCSR 370
Query: 424 VAHLFRKASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFY------FKFNPEAEKQRDK 476
V H+FR PY FP + +G N AR+A VWMDE+ + FK NPE
Sbjct: 371 VGHIFRNFHPYKFPN--DKDTHGINTARLAFVWMDEYKRLFLLHRSEFKDNPELIGD--- 425
Query: 477 QAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEK 528
+ RL+LRK+LKC SFKWYL +V+P F+P ++ +GR+R C++
Sbjct: 426 --ISERLKLRKKLKCKSFKWYLNNVYPEKFIPDENAIAYGRVRLRNRRLCLDN 476
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 68/132 (51%), Positives = 96/132 (72%), Gaps = 2/132 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N+++S++I + R LPDVR P C + ++ LP +SI+I+F+NE WS LLRTVHSV+
Sbjct: 110 LNVILSNKISLTRKLPDVRNPLCANVTY-DKLLPSASIIIIFYNEPWSVLLRTVHSVLKG 168
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVA-KLSVPTRVIRSPGRVGLIKARLLGARQAE 175
SP ++LKEI+LVDD S E L+ LD Y++ +L +++R P R GLI+ARL GA+ A
Sbjct: 169 SPPNLLKEIILVDDHSEEEELQGQLDYYLSTRLPAKVKLLRLPYRQGLIRARLHGAKNAV 228
Query: 176 GEILVFLDAHCE 187
G++LVFLDAHCE
Sbjct: 229 GDVLVFLDAHCE 240
>gi|355753170|gb|EHH57216.1| Polypeptide N-acetylgalactosaminyltransferase 12, partial [Macaca
fascicularis]
Length = 542
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 155/326 (47%), Positives = 210/326 (64%), Gaps = 10/326 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ + LP++S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 99 YDYDNLPRTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 158
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A G++L FLD HCEC GWLE L+ R+ E+
Sbjct: 159 ELSGLP-KVRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEE 217
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ S E G F+W L F W+T + I R + ++
Sbjct: 218 ESAVVCPVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSP-VDVIRS 276
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG +E PCSHV H+F K
Sbjct: 277 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 336
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR +L+
Sbjct: 337 QAPYSRNKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 390
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFG 515
C FKW+L V+P +P D FFG
Sbjct: 391 CKDFKWFLETVYPELHVPEDRPGFFG 416
Score = 138 bits (347), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 44/238 (18%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ + LP++S++I F+NEAWS LLRTV+S
Sbjct: 68 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEAWSTLLRTVYS 127
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 128 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 186
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW-SALLRTVHSVISRSPRSMLK 231
A G++L FLD HCEC +E W LL+ +H S +++
Sbjct: 187 AARGDVLTFLDCHCEC-------------------HEGWLEPLLQRIHEEES----AVVC 223
Query: 232 EILLVDDASTREFLKSSLDEYVA-------------------KLSVPTRVIRSPGRVG 270
++ V D +T E+L +S + + ++ P VIRSP G
Sbjct: 224 PVIDVIDWNTFEYLGNSGEPQIGGFDWRLVFTWHTVPERERIRMRSPVDVIRSPTMAG 281
>gi|291382916|ref|XP_002708201.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12
[Oryctolagus cuniculus]
Length = 476
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 162/358 (45%), Positives = 223/358 (62%), Gaps = 19/358 (5%)
Query: 165 KARLLGARQAEGEILVFLDAHCECTLV-FNEEFLPKSSIVIVFHNEAWSALLRTVHSVIS 223
+ RLL A + E E L C V ++ + LPK+S++I F+NEAWS LLRTV+SV+
Sbjct: 6 RHRLLVAGRWEAEPL--------CKEVKYDYDHLPKTSVIIAFYNEAWSTLLRTVYSVLE 57
Query: 224 RSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAE 283
SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA A+
Sbjct: 58 TSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRATKREGLVRARLLGASAAK 116
Query: 284 GEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFN 342
G++L FLD HCEC GWLE L+ R+ E + VVCPVID+I TF Y+ E G F+
Sbjct: 117 GDVLTFLDCHCECHEGWLEPLLHRIHEKESAVVCPVIDVIDWNTFEYLGNPGEPQIGGFD 176
Query: 343 WELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGEN 402
W L F W+ + + R + ++P MAGGLFA+ + YF ++G+YD M+VWGGEN
Sbjct: 177 WRLVFTWHVVPERERLRMRSPI-DVIRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGEN 235
Query: 403 LEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEF 462
LE SFR+WQCGG++E PCSHV H+F K +PYS ++ N R A VWMDE+ E
Sbjct: 236 LEFSFRIWQCGGTLETHPCSHVGHVFPKQAPYSRNKALA-----NSVRAAEVWMDEFKEL 290
Query: 463 YFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
Y+ NP A + V R +LR +L+C FKW+L V+P +P D FFG +++
Sbjct: 291 YYHRNPRA-RLEPFGDVTERRQLRAKLQCKDFKWFLETVYPELHVPEDRPGFFGMLQN 347
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 64/135 (47%), Positives = 88/135 (65%), Gaps = 9/135 (6%)
Query: 54 INRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSV 113
I R LLV+ R +P CK ++ + LPK+S++I F+NEAWS LLRTV+SV
Sbjct: 4 IGRHRLLVAGRWEA--------EPLCKEVKYDYDHLPKTSVIIAFYNEAWSTLLRTVYSV 55
Query: 114 ISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQ 173
+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 56 LETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRATKREGLVRARLLGASA 114
Query: 174 AEGEILVFLDAHCEC 188
A+G++L FLD HCEC
Sbjct: 115 AKGDVLTFLDCHCEC 129
>gi|56756104|gb|AAW26230.1| SJCHGC09400 protein [Schistosoma japonicum]
Length = 737
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 173/412 (41%), Positives = 242/412 (58%), Gaps = 15/412 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+I FHNEAWS LLR+VHSVI RSP +L EI+LVDD S R LK +L+EY+ L
Sbjct: 244 LPSASIIICFHNEAWSVLLRSVHSVIDRSPSYLLHEIILVDDFSDRPHLKEALEEYMKML 303
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+V +++R+ R GLI+AR+LGA Q+ G++LVFLD+H ECT GWLE L+ R+A + + VV
Sbjct: 304 NV-VKIVRTKRREGLIRARMLGAAQSSGKVLVFLDSHIECTTGWLEPLLDRIAYNSSIVV 362
Query: 317 CPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
PVI +I+D T Y + G F+W L F W+ ++ P ++P MA
Sbjct: 363 VPVITVINDKTLKYDLPSPSRVQIGGFDWSLSFIWHEQTERHKNRPGAPYS-PVQSPTMA 421
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLFAI R YF H+G YD M+VWGGENLE+SF++W CGGS+EI CS V H+FR SPY
Sbjct: 422 GGLFAISREYFNHLGMYDPGMEVWGGENLELSFKIWMCGGSLEIVICSQVGHIFRDRSPY 481
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
+ V + L NL R+A VW+D++ FY + D V R LR++LKCHSF
Sbjct: 482 IWDVDVKDPLKRNLLRLADVWLDDYKRFYHAR--IGFEMVDIGNVSERKALREKLKCHSF 539
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLT 554
WYLT+++P F+P G I C++ PL + + +S PC H
Sbjct: 540 DWYLTNIYPELFVPSKALASGDIESAAGPHCLDAPLPSEN-DSSSVIIKTRPC-HKQGGN 597
Query: 555 QMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSG-FNRQRWTYDKE 605
Q +++ ++ D + +Y + + C G Q +TY+K+
Sbjct: 598 QFWLLSSENEIRRDDYCFDSGIQKYS------IGLYHCHGSHGNQEFTYEKD 643
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 81/161 (50%), Positives = 111/161 (68%), Gaps = 7/161 (4%)
Query: 44 NLSDAQQL-----YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVF 98
++S A+Q+ ++ N FN L SDRI V R LPD R+ CK ++ LP +SI+I F
Sbjct: 195 DISPAEQVIFDKGWKDNAFNQLASDRISVRRYLPDYREGTCKDNKYSRN-LPSASIIICF 253
Query: 99 HNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSP 158
HNEAWS LLR+VHSVI RSP +L EI+LVDD S R LK +L+EY+ L+V +++R+
Sbjct: 254 HNEAWSVLLRSVHSVIDRSPSYLLHEIILVDDFSDRPHLKEALEEYMKMLNV-VKIVRTK 312
Query: 159 GRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
R GLI+AR+LGA Q+ G++LVFLD+H ECT + E L +
Sbjct: 313 RREGLIRARMLGAAQSSGKVLVFLDSHIECTTGWLEPLLDR 353
>gi|327277504|ref|XP_003223504.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 10-like
[Anolis carolinensis]
Length = 612
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 166/439 (37%), Positives = 251/439 (57%), Gaps = 36/439 (8%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 138 HPNCNSKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 197
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L+ L++Y+A+ + R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 198 LRKRLEDYMAQFT-KVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPL 256
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A + +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 257 LDRIARNHKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKPD 313
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 314 PSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 373
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 374 VGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVATQK 430
Query: 484 ELRKQLKCHSFKWYLTHV-WP-HHFLPMDD---KFFGRIRHVQTHKCVE-------KPLA 531
ELR L C SF+W++ V W F P + +G I +V T CV+ PL
Sbjct: 431 ELRSNLNCKSFRWFMNEVAWDLRKFYPPVEPPAAAWGEIHNVGTSLCVDTKHGALGSPLK 490
Query: 532 KGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATD----ESVCLDVPEYENDISPRV 587
+ ++ G A+ Q+F D+ D + C D + + ++
Sbjct: 491 IETCVKSRGEAAW-------NNVQVFTFSWREDIRPGDPQHTKKFCFDAVSHNSPVT--- 540
Query: 588 RILACSGF-NRQRWTYDKE 605
+ C G Q W Y K+
Sbjct: 541 -LYDCHGMKGNQLWKYRKD 558
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 73/149 (48%), Positives = 111/149 (74%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P C +K++ E+ LP +S++I FHNE
Sbjct: 104 MTDAERVDQAYRENGFNIFVSDKISLNRSLPDIRHPNCNSKLYLEK-LPNTSVIIPFHNE 162
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE L+ L++Y+A+ + R++R+ R
Sbjct: 163 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLRKRLEDYMAQFT-KVRILRTKKRE 221
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 222 GLIRTRMLGASAAIGDVITFLDSHCEANV 250
>gi|170065987|ref|XP_001868085.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
gi|167862691|gb|EDS26074.1| N-acetylgalactosaminyltransferase [Culex quinquefasciatus]
Length = 639
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 150/325 (46%), Positives = 209/325 (64%), Gaps = 5/325 (1%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C ++ LP +SI++ F+NE LLR+V+SV+ R+P +L EI+LVDD S +
Sbjct: 161 HKLCPEQSYDKVLPSASIIMCFYNEHLQTLLRSVNSVLGRTPAYLLHEIILVDDCSDFDD 220
Query: 245 LKSSLDEYVAKLS-VPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L L+ + K + R+IR+ R GL+++R+ GAR A G++LVFLD+H E + W+E
Sbjct: 221 LGDDLEVGLKKFNNSKIRLIRNRDREGLMRSRVYGARNATGDVLVFLDSHIEVNVDWIEP 280
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ R+ +RT + PVIDII+ TFAY S L G FNW LHF+W ++ K D
Sbjct: 281 LLQRIKVNRTILAMPVIDIINSDTFAYTSS-PLVRGGFNWGLHFKWDNL-PKGSLAKETD 338
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F PF++P MAGGLFA+DR YF +G YD M VWGGENLE+SFR WQCGGSIE+ PCS
Sbjct: 339 FVGPFQSPTMAGGLFAMDRKYFKELGEYDMGMDVWGGENLEISFRAWQCGGSIELLPCSR 398
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
+ H+FRK PY P G ++ +L R+A VWMD++ +++F+ P A K D + R
Sbjct: 399 IGHVFRKRRPYGSPDGTDTMIRNSL-RLARVWMDDYIKYFFENQPHANKL-DAGDLSERQ 456
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLP 508
ELR +L C SF+WYL +++P LP
Sbjct: 457 ELRNRLNCKSFEWYLKNIYPQLRLP 481
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 94/149 (63%), Gaps = 2/149 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FN+LVS +I R +PD R C + + ++ LP +SI++ F+NE LLR+V+
Sbjct: 137 YRKHAFNVLVSSKIGPFREVPDTRHKLCPEQSY-DKVLPSASIIMCFYNEHLQTLLRSVN 195
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL-SVPTRVIRSPGRVGLIKARLLG 170
SV+ R+P +L EI+LVDD S + L L+ + K + R+IR+ R GL+++R+ G
Sbjct: 196 SVLGRTPAYLLHEIILVDDCSDFDDLGDDLEVGLKKFNNSKIRLIRNRDREGLMRSRVYG 255
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPK 199
AR A G++LVFLD+H E + + E L +
Sbjct: 256 ARNATGDVLVFLDSHIEVNVDWIEPLLQR 284
>gi|359320847|ref|XP_532008.3| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12 [Canis
lupus familiaris]
Length = 578
Score = 298 bits (764), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 213/330 (64%), Gaps = 10/330 (3%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP +S+VI F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L
Sbjct: 128 YDYENLPTTSVVIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLAN 187
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A+GE+L FLD HCEC GWLE L+ R+ E+
Sbjct: 188 ELSGLP-KVRLIRANKREGLVRARLLGASAAKGEVLTFLDCHCECHEGWLEPLLQRIHEE 246
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKT 370
+ VVCPVID+I TF Y+ E G F+W L F W+ + + R + ++
Sbjct: 247 ESAVVCPVIDVIDWNTFEYLGNPREPQIGGFDWRLVFTWHVVPERERMRMRSPI-DVIRS 305
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K
Sbjct: 306 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPK 365
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
+PYS ++ N R A VWMD++ E Y+ NP A + V R +LR +L+
Sbjct: 366 QAPYSRNKALA-----NSVRAAEVWMDDFKELYYHRNPHA-RLEPFGDVTERKQLRAKLQ 419
Query: 491 CHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
C F+W+L +V+P +P D FFG +++
Sbjct: 420 CKDFRWFLENVYPELHVPEDRPGFFGMLQN 449
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 69/136 (50%), Positives = 97/136 (71%), Gaps = 1/136 (0%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ E LP +S+VI F+NEAWS LLRTV+S
Sbjct: 97 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYENLPTTSVVIAFYNEAWSTLLRTVYS 156
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 157 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANKREGLVRARLLGAS 215
Query: 173 QAEGEILVFLDAHCEC 188
A+GE+L FLD HCEC
Sbjct: 216 AAKGEVLTFLDCHCEC 231
>gi|427779849|gb|JAA55376.1| Putative polypeptide n-acetylgalactosaminyltransferase
[Rhipicephalus pulchellus]
Length = 683
Score = 298 bits (763), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 174/447 (38%), Positives = 244/447 (54%), Gaps = 57/447 (12%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+++ FHNEAWS LLRTVHS+I RSP +L EI+LVDD S LK L++YVA
Sbjct: 199 LPSTSVIVCFHNEAWSVLLRTVHSIIDRSPPKLLHEIILVDDYSDMPHLKQKLEDYVAHF 258
Query: 257 SVPTRVIRSPGRVGLIK-----------------------------------ARLLGARQ 281
+++R+ R GLI+ AR
Sbjct: 259 P-KVKIVRAQKREGLIRARLLGAAAATAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVX 317
Query: 282 AEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFA--YVRSFELHWG 339
A +L +LD+HCECT GWLE L+ R+A + T VVCPVID+ISD TF Y S ++ G
Sbjct: 318 ATAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVVCPVIDVISDSTFEYHYRDSGGVNVG 377
Query: 340 AFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWG 399
F+W L F W+ + +RK +P +P MAGGLF+ID+A+F +G YD +WG
Sbjct: 378 GFDWNLQFSWHAVPERER-QRRKHSWDPVWSPTMAGGLFSIDKAFFEKLGTYDSGFDIWG 436
Query: 400 GENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEW 459
GENLE+SF+ W CGG++EI PCSHV H+FRK SPY + GV+ VL N R+A VW+DE+
Sbjct: 437 GENLELSFKTWMCGGTLEIVPCSHVGHIFRKRSPYKWRSGVN-VLRRNSVRLAEVWLDEY 495
Query: 460 AEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH 519
++Y++ + D V +R LR LKC SF WY+ ++P F+P D G +R+
Sbjct: 496 KQYYYQRI--GDDLGDFGDVSARKRLRDNLKCRSFDWYVRTIYPELFVPGDAVASGEVRN 553
Query: 520 VQT--HKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVP 577
C++ P + +M++ P + PC H Q +++ + I DE+ CLD
Sbjct: 554 KGQGGSSCLDSPSGRDNMHK---PVGMYPC-HGQGGNQYWMLSKEGE-IRRDEA-CLDYA 607
Query: 578 EYENDISPRVRILACSGF-NRQRWTYD 603
+ V + C G Q W YD
Sbjct: 608 GSD------VILYPCHGSKGNQLWIYD 628
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 82/209 (39%), Positives = 125/209 (59%), Gaps = 15/209 (7%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+Q + N FN +SD I ++R+LPDVR +CK + + ++ LP +S+++ FHNEAWS LLR
Sbjct: 160 KQGWDRNAFNQYISDMISLHRSLPDVRDSECKDERYLKD-LPSTSVIVCFHNEAWSVLLR 218
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHS+I RSP +L EI+LVDD S LK L++YVA +++R+ R GLI+ARL
Sbjct: 219 TVHSIIDRSPPKLLHEIILVDDYSDMPHLKQKLEDYVAHFP-KVKIVRAQKREGLIRARL 277
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK----------SSIVIVF---HNEAWSALL 215
LGA A +L +LD+HCECT + E L + ++ V+ + H E L
Sbjct: 278 LGAAAATAPVLTYLDSHCECTEGWLEPLLDRIARNSTTVXATAPVLTYLDSHCECTEGWL 337
Query: 216 RTVHSVISRSPRSMLKEILLVDDASTREF 244
+ I+R+ +++ ++ V ST E+
Sbjct: 338 EPLLDRIARNSTTVVCPVIDVISDSTFEY 366
>gi|156364641|ref|XP_001626455.1| predicted protein [Nematostella vectensis]
gi|156213331|gb|EDO34355.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 298 bits (763), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 141/307 (45%), Positives = 204/307 (66%), Gaps = 5/307 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SIVI F+NEAW+ LLRT+HSV+ R+P L EI+LVDD S LKS LD Y++ +
Sbjct: 54 LPVASIVICFYNEAWTILLRTIHSVLDRTPHQFLHEIILVDDFSNMLELKSKLDRYLSTM 113
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R++R+ R GLI+ R++GA A G++LVFLD+HCE + WL+ L+ + +D+ V
Sbjct: 114 P-KIRIVRNNKREGLIRGRIIGAEAATGQVLVFLDSHCEVNINWLQPLLQHIHDDQKAVA 172
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVID+IS TF Y S + G FNW LHF W S ++K +D+ +P ++P MAGG
Sbjct: 173 CPVIDVISSDTFEYSSSPMVR-GGFNWGLHFTWEPIPPS-LLVKPEDYVKPIRSPTMAGG 230
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFA+DR YF +G YD M +WG ENLE+SFR+W CGGS++I PCS V HLFR+ PY
Sbjct: 231 LFAVDREYFTQLGKYDSGMDIWGAENLEISFRIWMCGGSLDILPCSRVGHLFRRFRPYG- 289
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
+ + N R+A VW+D + +++++ + E ++ + R++LRK L+C SF+W
Sbjct: 290 SDSKGDTMSRNSMRLAEVWLDGYKKYFYQIRHDLEGKKFGD-ISQRIKLRKSLQCKSFEW 348
Query: 497 YLTHVWP 503
YL +++P
Sbjct: 349 YLKNIYP 355
Score = 135 bits (339), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 101/149 (67%), Gaps = 2/149 (1%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ Y + FNLL+S+R+ ++RT+ D R C+ K + + LP +SIVI F+NEAW+ LLR
Sbjct: 15 EEGYSAHGFNLLISNRLSLHRTIKDTRHELCRGKTYPKN-LPVASIVICFYNEAWTILLR 73
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T+HSV+ R+P L EI+LVDD S LKS LD Y++ + R++R+ R GLI+ R+
Sbjct: 74 TIHSVLDRTPHQFLHEIILVDDFSNMLELKSKLDRYLSTMP-KIRIVRNNKREGLIRGRI 132
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
+GA A G++LVFLD+HCE + + + L
Sbjct: 133 IGAEAATGQVLVFLDSHCEVNINWLQPLL 161
>gi|156375693|ref|XP_001630214.1| predicted protein [Nematostella vectensis]
gi|156217230|gb|EDO38151.1| predicted protein [Nematostella vectensis]
Length = 575
Score = 298 bits (762), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 168/412 (40%), Positives = 236/412 (57%), Gaps = 23/412 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++I+I FHNE SALLRTV S ++RSP +LKEI+LVDD S+ L L
Sbjct: 139 LPTTTIIICFHNEGRSALLRTVISALNRSPPHLLKEIILVDDFSSDPKDGRRL------L 192
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
+P ++IR+ R GLI++R+ GA A GE+L FLD+HCEC WLE L+ R+ E +
Sbjct: 193 KLPKVKLIRNTKREGLIRSRVKGANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTI 252
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
V P+ID+I+ TF Y+ S G F W L+F+W ++ T P K+P +AG
Sbjct: 253 VSPIIDVINLDTFDYLGSSADLRGGFGWNLNFKWDFLPPHILAERQGKPTLPIKSPVIAG 312
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++ + +F +G YD +M VWGGENLE+SFR WQCGG++EI PCS V H+FR PY
Sbjct: 313 GLFSVAKKWFETLGKYDMQMDVWGGENLEISFRTWQCGGAMEIIPCSRVGHVFRNRHPYQ 372
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
FPGG V N R VWMD++ +Y+ P A K + R+ELR++L+C FK
Sbjct: 373 FPGGSMNVFQKNTRRAVEVWMDDYKRYYYAAVPYA-KNTPYGDIEERVELRRKLRCRPFK 431
Query: 496 WYLTHVWPHHFLPMDD--KFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVL 553
WY+ +V+P LP D+ K FG I+ Q ++CV+ ++ G L H
Sbjct: 432 WYVQNVYPELKLPSDESTKSFGEIK--QGNQCVD------TLGHMRGQTIGLFECHGAGG 483
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNR-QRWTYDK 604
QM+ L + E++CL V + + V++L C N Q W Y+K
Sbjct: 484 NQMW--SLTKSSLLKHETMCLGV--NDGKATEPVQLLDCDENNSMQHWEYEK 531
Score = 130 bits (326), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 84/217 (38%), Positives = 123/217 (56%), Gaps = 38/217 (17%)
Query: 37 VDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
V+ G H + Y N +N+ SD++PV+R +PDVR +CK++V+ + LP ++I+I
Sbjct: 92 VENGFH----EGDDAYAKNAYNIKKSDQLPVDREVPDVRDQQCKSQVWPHD-LPTTTIII 146
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVI 155
FHNE SALLRTV S ++RSP +LKEI+LVDD S+ L L +P ++I
Sbjct: 147 CFHNEGRSALLRTVISALNRSPPHLLKEIILVDDFSSDPKDGRRL------LKLPKVKLI 200
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAW--SA 213
R+ R GLI++R+ GA A GE+L FLD+HCEC N+ W
Sbjct: 201 RNTKREGLIRSRVKGANLARGEVLTFLDSHCEC-------------------NKNWLEPL 241
Query: 214 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLD 250
LLR I SP++++ I+ V + T ++L SS D
Sbjct: 242 LLR-----IKESPKTIVSPIIDVINLDTFDYLGSSAD 273
>gi|443704264|gb|ELU01402.1| hypothetical protein CAPTEDRAFT_127533 [Capitella teleta]
Length = 390
Score = 298 bits (762), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 144/312 (46%), Positives = 197/312 (63%), Gaps = 2/312 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK S++I F E+WS LLR+VHSV++R+P +L+EI+LVDD S R L + LD Y+ +L
Sbjct: 66 LPKMSVIICFTEESWSTLLRSVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTRL 125
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
T +IR P R GLI+ARL A G +L FLD+H EC +GW E L+ R++ +R +V
Sbjct: 126 PKVT-LIRFPSRQGLIRARLRAIEIARGPVLTFLDSHVECNVGWAEPLLQRISHNRRVIV 184
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVID IS F+Y+ G FNW + F+W + + D T P +TP +AGG
Sbjct: 185 APVIDAISSRDFSYIPISANQRGGFNWAMLFKWMPVPNYEKSRTGGDPTAPVRTPTIAGG 244
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI + +F +G YD + +WG ENLE+SF+ W CGGS+E+ PCS V H++R PYSF
Sbjct: 245 LFAIHQRFFRSLGFYDPGLDIWGSENLELSFKAWMCGGSMEMIPCSRVGHVYRSTQPYSF 304
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG +V N RVA VWMD + ++ PE + + SR+ELR++LKCH FKW
Sbjct: 305 PGGNVKVFMRNNLRVANVWMDGYVNLFYLMKPELRNEPFGD-ISSRVELRRRLKCHDFKW 363
Query: 497 YLTHVWPHHFLP 508
YL +V P +P
Sbjct: 364 YLENVIPELNIP 375
Score = 132 bits (332), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 67/153 (43%), Positives = 98/153 (64%), Gaps = 1/153 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++ +++ FN SD + RT+PD R P+C+ K ++ LPK S++I F E+WS LLR
Sbjct: 26 KEGFRLASFNQHASDLVSFERTIPDSRPPRCRDKSYDYSSLPKMSVIICFTEESWSTLLR 85
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
+VHSV++R+P +L+EI+LVDD S R L + LD Y+ +L T +IR P R GLI+ARL
Sbjct: 86 SVHSVLNRTPPELLEEIILVDDFSQRGHLHAKLDNYLTRLPKVT-LIRFPSRQGLIRARL 144
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
A G +L FLD+H EC + + E L + S
Sbjct: 145 RAIEIARGPVLTFLDSHVECNVGWAEPLLQRIS 177
>gi|410968689|ref|XP_003990834.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 5 [Felis catus]
Length = 939
Score = 298 bits (762), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 169/418 (40%), Positives = 244/418 (58%), Gaps = 25/418 (5%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
L++ R ++PG+ G K + R EG V+L A C
Sbjct: 427 LTLSPRDPKAPGQFGRPVVVPRGKEKEAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 486
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST+++LK
Sbjct: 487 EQLVHNN--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKD 544
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 545 NLDKYMSQFP-KVRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLER 603
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
V R +V CPVI++I+D +Y+ G F W ++F W T R T+
Sbjct: 604 VYLSRKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVVAKNRIKETDI 663
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+ P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H+
Sbjct: 664 IRCPVMAGGLFSIDKNYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHI 723
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLELR 486
FR +PY+FP + + NL RVA VW+DE+ E FY + ++ D + + +LR
Sbjct: 724 FRNDNPYTFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTEQRKLR 783
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASL 544
K+LKC SFKWYL +V+P P+ + G + +V KC+ ++ G + L
Sbjct: 784 KKLKCKSFKWYLENVFPDLKAPI-VRANGVLINVALGKCISIENTTATLEDCDGSSKL 840
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 66/151 (43%), Positives = 105/151 (69%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A++ ++ FN+ +SD IPV+R + D R C ++ + LP +S+++ F +E WS L
Sbjct: 453 EAERRWKEGNFNVYLSDLIPVDRAIEDTRPAGCAEQLVHNN-LPTTSVIMCFVDEVWSTL 511
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+A
Sbjct: 512 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFP-KVRILRLKERHGLIRA 570
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 571 RLAGAQNATGDVLTFLDSHVECNVGWLEPLL 601
>gi|349732170|ref|NP_001231847.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 1-like [Sus
scrofa]
Length = 557
Score = 298 bits (762), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 165/421 (39%), Positives = 240/421 (57%), Gaps = 21/421 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C V LP +S++I FHNEA S LLRTV SV++R+P S+++EI+LVDD S
Sbjct: 110 HYSCPSVSYSSDLPATSVIITFHNEARSTLLRTVKSVLNRTPASLIQEIILVDDFS---- 165
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P + +R+ R GLI++R+ GA A +L FLD+HCE WL+
Sbjct: 166 --SDPEDCLLLTRIPKVKCLRNDRREGLIRSRVRGADVAAAGVLTFLDSHCEVNTEWLQP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + I D
Sbjct: 224 MLQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PLEQKIAWTD 282
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F ID+++F H+G YD +M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 283 PTKPIRTPVIAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSR 342
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ ++Y++ P A + +V +R+
Sbjct: 343 VGHVFRKRHPYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSA-IGKAFGSVATRI 401
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
E RK++ C +F+WYL +V+P +P+ + I+ Q C+E + N G
Sbjct: 402 EQRKKMNCKTFRWYLENVYPELTVPVKEVLPSIIK--QGANCLETQGQDTAGNFLLGMGI 459
Query: 544 LLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFN----RQR 599
P Q ++ +D + + CL ISP ++ G N RQR
Sbjct: 460 CRGSAKNPPAAQAWLF---SDHLIQQQGKCLAA--TSTSISPG-SLVVLQGCNPREGRQR 513
Query: 600 W 600
W
Sbjct: 514 W 514
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 12/155 (7%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FN L SD++ +R + D R C + ++ + LP +S++I FHNEA
Sbjct: 78 QLKPGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSD-LPATSVIITFHNEAR 136
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 162
S LLRTV SV++R+P S+++EI+LVDD S S ++ + +P + +R+ R G
Sbjct: 137 STLLRTVKSVLNRTPASLIQEIILVDDFS------SDPEDCLLLTRIPKVKCLRNDRREG 190
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
LI++R+ GA A +L FLD+HCE N E+L
Sbjct: 191 LIRSRVRGADVAAAGVLTFLDSHCE----VNTEWL 221
>gi|449267121|gb|EMC78087.1| Polypeptide N-acetylgalactosaminyltransferase 10, partial [Columba
livia]
Length = 560
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 153/360 (42%), Positives = 224/360 (62%), Gaps = 24/360 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S++I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE
Sbjct: 75 HPNCKNKLYLEKLPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREH 134
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L++Y+A+ R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L
Sbjct: 135 LKKRLEDYMAQFPN-VRILRTKKREGLIRTRMLGASVAIGDVITFLDSHCEANVNWLPPL 193
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D
Sbjct: 194 LDRIARNRKTIVCPMIDVIDHDHFGYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKLD 250
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS
Sbjct: 251 PSDPFESPVMAGGLFAVDRKWFWELGGYDAGLEIWGGEQYEISFKVWMCGGRMEDIPCSR 310
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ PE + V ++
Sbjct: 311 VGHIYRKYVPYKVPTGVS--LARNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVAAQK 367
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD-------------DKFFGRIRHVQTHKCVE 527
ELR L C SFKW++ V W P + P++ +IR+V T CV+
Sbjct: 368 ELRNNLNCKSFKWFMNEVAWDLPKFYPPVEPPAAAWGEARDSATSLLFQIRNVGTGLCVD 427
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 75/149 (50%), Positives = 110/149 (73%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA+ Q Y+ N FN+ VSD+I +NR+LPD+R P CK K++ E+ LP +S++I FHNE
Sbjct: 41 MTDAERVDQAYRENGFNIFVSDKISLNRSLPDIRHPNCKNKLYLEK-LPNTSVIIPFHNE 99
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK L++Y+A+ R++R+ R
Sbjct: 100 GWSSLLRTVHSVLNRSPPELIAEIVLVDDFSDREHLKKRLEDYMAQFPN-VRILRTKKRE 158
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A G+++ FLD+HCE +
Sbjct: 159 GLIRTRMLGASVAIGDVITFLDSHCEANV 187
>gi|301758254|ref|XP_002914993.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 12-like
[Ailuropoda melanoleuca]
Length = 540
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 211/325 (64%), Gaps = 10/325 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L ++ L
Sbjct: 95 LPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSGL 154
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A+GE+L FLD HCEC GWLE L+ R+ E+ + VV
Sbjct: 155 P-KVRLIRANRREGLVRARLLGASAAKGEVLTFLDCHCECHEGWLEPLLQRIHEEESAVV 213
Query: 317 CPVIDIISDVTFAYVRS-FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ + E G F+W L F W+ + + R + ++P MAG
Sbjct: 214 CPVIDVIDWNTFEYLGNPGEPQIGGFDWRLVFTWHVVPERERMRMRSP-VDVIRSPTMAG 272
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K +PYS
Sbjct: 273 GLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPKQAPYS 332
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMDE+ E Y+ NP A + V R +LR +L+C F+
Sbjct: 333 RNKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRARLQCKDFR 386
Query: 496 WYLTHVWPHHFLPMD-DKFFGRIRH 519
W+L +V+P +P D FFG +++
Sbjct: 387 WFLENVYPELHVPEDRPGFFGMLQN 411
Score = 136 bits (342), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 96/136 (70%), Gaps = 1/136 (0%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ LP +S++I F+NEAWS LLRTV+S
Sbjct: 59 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYNNLPTTSVIIAFYNEAWSTLLRTVYS 118
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 119 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANRREGLVRARLLGAS 177
Query: 173 QAEGEILVFLDAHCEC 188
A+GE+L FLD HCEC
Sbjct: 178 AAKGEVLTFLDCHCEC 193
>gi|345323153|ref|XP_001510349.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Ornithorhynchus anatinus]
Length = 479
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 211/347 (60%), Gaps = 10/347 (2%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
CT LP +SI+I FHNEA S LLRT+ SV++R+P ++ EI+LVDD S
Sbjct: 28 CTASRYRSDLPSTSIIITFHNEARSTLLRTIRSVLNRTPMHLVHEIILVDDFSDDPDDCQ 87
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L + +R+ R GLI++R+ GA A+ +L FLD+HCE WL L+ R
Sbjct: 88 LLGPLPK-----VKCLRNGQREGLIRSRIRGADLAKAGVLTFLDSHCEVNKDWLLPLLQR 142
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ ED TRVV PVIDII+ TFAYV + G F+W LHF+W S + KR D T+P
Sbjct: 143 IKEDPTRVVSPVIDIINLDTFAYVAASSDLRGGFDWSLHFKWEQL-SPEQKAKRTDPTQP 201
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
KTP +AGGLF ID+++F H+G YD M +WGGEN E+SFRVW CGG++EI PCS V H+
Sbjct: 202 IKTPIIAGGLFVIDKSWFNHLGKYDTAMDIWGGENFEISFRVWMCGGTLEIVPCSRVGHV 261
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK PY FP G + N R A VWMDE+ ++Y+ P A+ R ++SR+EL+K
Sbjct: 262 FRKKHPYVFPEGNANTYIKNTKRTAEVWMDEFKQYYYAARPAAQG-RPYGDIQSRVELKK 320
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKG 533
LKC FKWYL V+P +P + G IR Q KC+E +G
Sbjct: 321 SLKCRPFKWYLETVYPELRIPEESLAQTGIIR--QRQKCLESQRLEG 365
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/99 (43%), Positives = 61/99 (61%), Gaps = 5/99 (5%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP +SI+I FHNEA S LLRT+ SV++R+P ++ EI+LVDD S L
Sbjct: 37 LPSTSIIITFHNEARSTLLRTIRSVLNRTPMHLVHEIILVDDFSDDPDDCQLLGPLPK-- 94
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ +R+ R GLI++R+ GA A+ +L FLD+HCE
Sbjct: 95 ---VKCLRNGQREGLIRSRIRGADLAKAGVLTFLDSHCE 130
>gi|348510947|ref|XP_003443006.1| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Oreochromis niloticus]
Length = 567
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 206/334 (61%), Gaps = 9/334 (2%)
Query: 176 GEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILL 235
GE + H C + + LP +SIVI FHNEA S LLRT+ SV+ RSP S+++EI+L
Sbjct: 116 GERAIRDTRHYRCAALTYDTDLPSTSIVITFHNEARSTLLRTIKSVLMRSPPSLIQEIIL 175
Query: 236 VDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHC 294
+DD S S ++ +P R +R+ R GLI++R+ GA A IL FLD+HC
Sbjct: 176 IDDFS------SDPEDCQLLAQIPKVRCLRNGRREGLIRSRVRGANMASASILTFLDSHC 229
Query: 295 ECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGS 354
E WL+ ++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W
Sbjct: 230 EVNTDWLQPMIQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-P 288
Query: 355 SDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGG 414
+ + R D T+ +TP +AGG+F +DR++F H+G YD M +WGGEN E+SFRVW CGG
Sbjct: 289 IEQKMARSDPTQAIRTPVIAGGIFVMDRSWFNHLGQYDTHMDIWGGENFELSFRVWLCGG 348
Query: 415 SIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR 474
S+EI PCS V H+FRK PY FP G + N R A VWMDE+ ++Y+ P A+ +
Sbjct: 349 SLEILPCSRVGHVFRKRHPYDFPEGNALTYIKNTRRAAEVWMDEYKQYYYSARPSAQG-K 407
Query: 475 DKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLP 508
+V RL LR++L C F+WY+ +V+P +P
Sbjct: 408 AFGSVTDRLALRRKLNCKPFRWYMENVYPELRVP 441
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/137 (43%), Positives = 84/137 (61%), Gaps = 8/137 (5%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FNL SDR+ R + D R +C ++ + LP +SIVI FHNEA S LLRT+
Sbjct: 101 YKDHAFNLQESDRLGGERAIRDTRHYRCAALTYDTD-LPSTSIVITFHNEARSTLLRTIK 159
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV+ RSP S+++EI+L+DD S S ++ +P R +R+ R GLI++R+ G
Sbjct: 160 SVLMRSPPSLIQEIILIDDFS------SDPEDCQLLAQIPKVRCLRNGRREGLIRSRVRG 213
Query: 171 ARQAEGEILVFLDAHCE 187
A A IL FLD+HCE
Sbjct: 214 ANMASASILTFLDSHCE 230
>gi|47228720|emb|CAG07452.1| unnamed protein product [Tetraodon nigroviridis]
Length = 611
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 168/365 (46%), Positives = 226/365 (61%), Gaps = 25/365 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ SP +LKEI+LVDDAS + LK L+ +V +L
Sbjct: 170 LPTTSVIIVFHNEAWSTLLRTVYSVLHTSPAILLKEIILVDDASAADHLKEQLEVFVHQL 229
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+ RV+R P R GLI ARLLGA A+GE+L FLDAHCEC GWLE L+AR+ E+ T VV
Sbjct: 230 KI-VRVVRQPERKGLITARLLGASVAQGEVLTFLDAHCECFHGWLEPLLARIVEEPTAVV 288
Query: 317 CPVIDIISDVTFAYVR----SFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + S + G F+W L F W + A RKD T P KTP
Sbjct: 289 SPEITTIDLETFQFNKPVASSHAYNRGNFDWGLTFGWEQIPEA-ARKLRKDETYPVKTPT 347
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I ++YF HIG YD++M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 348 FAGGLFSILKSYFEHIGTYDDKMEIWGGENIEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 407
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQ-------AVRSRLEL 485
P++FP G ++V+ N R+A VWMD++ + +++ N AE + AVR L
Sbjct: 408 PHTFPKG-TDVITRNQVRLAEVWMDDYKKIFYRRNRNAENMAKEDLTPEKYGAVRHTF-L 465
Query: 486 RKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
L+ SF L +V P L + D + +I++ + C L G N+ P +
Sbjct: 466 SITLERSSF---LKNVTP---LFIFDPYVAQIQNQGSKTC----LDVGENNKGGKPVIMY 515
Query: 546 PCTHL 550
C ++
Sbjct: 516 QCHNM 520
Score = 142 bits (357), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 75/134 (55%), Positives = 95/134 (70%), Gaps = 3/134 (2%)
Query: 57 FNLLVSDRIPVNRTL-PDVRKPKC-KTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVI 114
FN SDRI ++R+L D R P+C + K LP +S++IVFHNEAWS LLRTV+SV+
Sbjct: 136 FNQFASDRISLSRSLGDDTRPPECVERKFLRCPALPTTSVIIVFHNEAWSTLLRTVYSVL 195
Query: 115 SRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQA 174
SP +LKEI+LVDDAS + LK L+ +V +L + RV+R P R GLI ARLLGA A
Sbjct: 196 HTSPAILLKEIILVDDASAADHLKEQLEVFVHQLKI-VRVVRQPERKGLITARLLGASVA 254
Query: 175 EGEILVFLDAHCEC 188
+GE+L FLDAHCEC
Sbjct: 255 QGEVLTFLDAHCEC 268
>gi|260836667|ref|XP_002613327.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
gi|229298712|gb|EEN69336.1| hypothetical protein BRAFLDRAFT_118726 [Branchiostoma floridae]
Length = 545
Score = 297 bits (761), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 159/419 (37%), Positives = 249/419 (59%), Gaps = 25/419 (5%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
E+LP +S+++ F +EA+SA++R+VHS+I+R+P +L E++LVDD STR LK LD+YV
Sbjct: 95 EYLPPTSVIMCFTDEAFSAVMRSVHSIINRTPPHLLAEVILVDDNSTRAELKGHLDDYVR 154
Query: 255 KLSV--PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDR 312
+ +V+ R GLI+ RL GA +A G +L FLDAH EC +GW+E L+ R+ E+R
Sbjct: 155 RQVGWDKVKVVHLEKREGLIRCRLRGAEKAVGPVLTFLDAHIECNVGWVEPLLHRIWENR 214
Query: 313 TRVVCPVIDIISDVTFAYVRSFE----LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF 368
+ VV P+I+ I D TF Y + G F+WELHF W + + D T P
Sbjct: 215 SNVVMPIIEAIDDKTFEYHGGVQSSRYAQRGGFSWELHFDWRVIPEYEIKRWKGDETTPI 274
Query: 369 KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLF 428
++P MAGGLF+ID++YF+ +G YD++M WGGENLE+SF++W CGG++E PCS V H+F
Sbjct: 275 RSPTMAGGLFSIDKSYFYELGTYDDKMDTWGGENLELSFKIWMCGGTLEQPPCSKVGHVF 334
Query: 429 RKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQ 488
R ++PYS P G + N RV VW+D + + ++ NP + + V R +R++
Sbjct: 335 RSSAPYSNPSG-PKTFIRNTLRVVEVWLDSYKDLFYALNPHMQGE-PYGDVSERKRIRER 392
Query: 489 LKCHSFKWYLTHVWPHHFLPMDDKFF---GRIRHVQTHKCVEKPLAKGSMNQASGPASLL 545
L+C SF W+L +++P LP+ DK G ++++ +KC++ +M + + L
Sbjct: 393 LQCKSFDWFLENIFPE--LPIPDKNVQGRGELKNLGGNKCMD------TMGEHAPYTGLY 444
Query: 546 PCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDK 604
C H Q+F ++I+ E CL V + D R+ + C + +W ++K
Sbjct: 445 SC-HGMGGNQVFSYTW-KNVISYQER-CLAVSRNKPD---RISLYPCGDRDILKWKHEK 497
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 57/125 (45%), Positives = 85/125 (68%), Gaps = 3/125 (2%)
Query: 75 RKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 134
R+ +CKTK + E+LP +S+++ F +EA+SA++R+VHS+I+R+P +L E++LVDD STR
Sbjct: 84 RQVRCKTKKY-PEYLPPTSVIMCFTDEAFSAVMRSVHSIINRTPPHLLAEVILVDDNSTR 142
Query: 135 EFLKSSLDEYVAKLSV--PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVF 192
LK LD+YV + +V+ R GLI+ RL GA +A G +L FLDAH EC + +
Sbjct: 143 AELKGHLDDYVRRQVGWDKVKVVHLEKREGLIRCRLRGAEKAVGPVLTFLDAHIECNVGW 202
Query: 193 NEEFL 197
E L
Sbjct: 203 VEPLL 207
>gi|344249957|gb|EGW06061.1| Polypeptide N-acetylgalactosaminyltransferase 10 [Cricetulus
griseus]
Length = 494
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 150/351 (42%), Positives = 224/351 (63%), Gaps = 15/351 (4%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
C E LP +SI+I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S RE LK
Sbjct: 19 NCNSKLYLETLPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELVAEIVLVDDFSDREHLK 78
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L++Y+A L R++R+ R GLI+ R+LGA A G+++ FLD+HCE + WL L+
Sbjct: 79 KPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAAIGDVITFLDSHCEANVNWLPPLLD 137
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFT 365
R+A +R +VCP+ID+I F Y ++ + GAF+WE++++ +++ D +
Sbjct: 138 RIARNRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYKRIPIPPE---LQKADPS 194
Query: 366 EPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVA 425
+PF++P MAGGLFA+DR +F+ +G YD +++WGGE E+SF+VW CGG +E PCS V
Sbjct: 195 DPFESPVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVG 254
Query: 426 HLFRKASPYSFPGGVSE----VLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
H++RK+ PY P G ++ + NL RVA VWMDE+AE+ ++ PE + V +
Sbjct: 255 HIYRKSVPYKVPAGPADPCNCLSLQNLKRVAEVWMDEYAEYIYQRRPEY-RHLSAGDVVA 313
Query: 482 RLELRKQLKCHSFKWYLTHV-W--PHHFLPMDDK--FFGRIRHVQTHKCVE 527
+ LR L C SFKW++T + W P + P++ +G IR+V T C +
Sbjct: 314 QKRLRGSLNCKSFKWFMTKIAWDLPKFYPPVEPPAAAWGEIRNVGTGLCTD 364
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/128 (48%), Positives = 89/128 (69%), Gaps = 2/128 (1%)
Query: 63 DRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSML 122
D +P N T ++ C +K++ E LP +SI+I FHNE WS+LLRTVHSV++RSP ++
Sbjct: 4 DIVPPNWTHQLLQNLNCNSKLYLET-LPNTSIIIPFHNEGWSSLLRTVHSVLNRSPPELV 62
Query: 123 KEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFL 182
EI+LVDD S RE LK L++Y+A L R++R+ R GLI+ R+LGA A G+++ FL
Sbjct: 63 AEIVLVDDFSDREHLKKPLEDYMA-LFPSVRILRTKKREGLIRTRMLGASAAIGDVITFL 121
Query: 183 DAHCECTL 190
D+HCE +
Sbjct: 122 DSHCEANV 129
>gi|281341254|gb|EFB16838.1| hypothetical protein PANDA_002911 [Ailuropoda melanoleuca]
Length = 496
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 210/325 (64%), Gaps = 10/325 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I F+NEAWS LLRTV+SV+ SP +L+E++LVDD S RE LK L ++ L
Sbjct: 51 LPTTSVIIAFYNEAWSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSGL 110
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR+ R GL++ARLLGA A+GE+L FLD HCEC GWLE L+ R+ E+ + VV
Sbjct: 111 P-KVRLIRANRREGLVRARLLGASAAKGEVLTFLDCHCECHEGWLEPLLQRIHEEESAVV 169
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CPVID+I TF Y+ E G F+W L F W+ + + R + ++P MAG
Sbjct: 170 CPVIDVIDWNTFEYLGNPGEPQIGGFDWRLVFTWHVVPERERMRMRSP-VDVIRSPTMAG 228
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+F K +PYS
Sbjct: 229 GLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHVFPKQAPYS 288
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
++ N R A VWMDE+ E Y+ NP A + V R +LR +L+C F+
Sbjct: 289 RNKALA-----NSVRAAEVWMDEFKELYYHRNPHA-RLEPFGDVTERKQLRARLQCKDFR 342
Query: 496 WYLTHVWPHHFLPMD-DKFFGRIRH 519
W+L +V+P +P D FFG +++
Sbjct: 343 WFLENVYPELHVPEDRPGFFGMLQN 367
Score = 135 bits (341), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 96/136 (70%), Gaps = 1/136 (0%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
++++ N+ +SDRI ++R LP+ P CK K ++ LP +S++I F+NEAWS LLRTV+S
Sbjct: 15 RLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYNNLPTTSVIIAFYNEAWSTLLRTVYS 74
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ SP +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 75 VLETSPDILLEEVILVDDYSDREHLKERLANELSGLP-KVRLIRANRREGLVRARLLGAS 133
Query: 173 QAEGEILVFLDAHCEC 188
A+GE+L FLD HCEC
Sbjct: 134 AAKGEVLTFLDCHCEC 149
>gi|312371733|gb|EFR19844.1| hypothetical protein AND_21714 [Anopheles darlingi]
Length = 637
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 167/426 (39%), Positives = 242/426 (56%), Gaps = 18/426 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C LP S++++F+NE WSALLRTV+SV++RSP S+LKE++LV+D ST+ F
Sbjct: 141 HISCRTKAYLRELPTVSVIVIFYNEHWSALLRTVYSVLNRSPASLLKEVILVNDHSTKPF 200
Query: 245 LKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L + L E+V ++L+ R+I P R GLI AR+ GAR+A G++L+ LD+H E WL
Sbjct: 201 LWAPLREFVESELAPKVRLIDLPERSGLILARMAGAREARGDVLIVLDSHTEVNNNWLPP 260
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ +AED VCP ID+I+ TF Y E GAF+W+ +++ D D
Sbjct: 261 LLEPIAEDYRTCVCPFIDVIAHDTFQYRAQDEGKRGAFDWKFYYKRLPLLPGDL----DD 316
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+PF +P MAGGLFAI +F+ +G YDE + +WGGE E+SF++WQCGG + APCS
Sbjct: 317 PTKPFNSPVMAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRLVDAPCSR 376
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++R +P+ P GV+ V+ N RVA VWMDE+A+F ++ NP EK D + ++
Sbjct: 377 VGHVYRGYAPFGNPRGVNFVVR-NFKRVAEVWMDEYAKFLYERNPLFEKT-DPGDLTAQR 434
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMN-QASGPA 542
ELR++L+C FKW+L + P + + + ++ L S+N QA P
Sbjct: 435 ELRERLQCRPFKWFLEEIAPDLLIRYPVREPQPFASGRVQSVADRRLCLDSLNHQAKQPI 494
Query: 543 SLLPC----THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGFNR 597
L C TH P Q F + D+ CLD ++ V + +C
Sbjct: 495 GLYTCASNQTH-PQNNQFFTLSFHRDIRVRSNDKCLDASRLNDE----VILFSCHESQGN 549
Query: 598 QRWTYD 603
Q W YD
Sbjct: 550 QMWRYD 555
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/150 (47%), Positives = 109/150 (72%), Gaps = 6/150 (4%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+L++ N ++ +VSD I +NR++PD+R C+TK + E LP S++++F+NE WSALLRT
Sbjct: 115 KLFKENGYSAVVSDMIALNRSVPDIRHISCRTKAYLRE-LPTVSVIVIFYNEHWSALLRT 173
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARL 168
V+SV++RSP S+LKE++LV+D ST+ FL + L E+V ++L+ R+I P R GLI AR+
Sbjct: 174 VYSVLNRSPASLLKEVILVNDHSTKPFLWAPLREFVESELAPKVRLIDLPERSGLILARM 233
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLP 198
GAR+A G++L+ LD+H E N +LP
Sbjct: 234 AGAREARGDVLIVLDSHTEV----NNNWLP 259
>gi|221130543|ref|XP_002162500.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Hydra magnipapillata]
Length = 578
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 166/414 (40%), Positives = 238/414 (57%), Gaps = 24/414 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++I+I FHNE +ALLRTV S +++SP +LKEI+LVDD S L L
Sbjct: 141 LPTTTIIICFHNEGRAALLRTVVSALNKSPEHLLKEIILVDDFSDNPLDGEEL------L 194
Query: 257 SVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
++P ++IR+ R GLI++R+ GA A GE+L FLD+HCEC WLE L+ + ++R V
Sbjct: 195 ALPRVKLIRNNQREGLIRSRVKGADMAVGEVLTFLDSHCECNEMWLEPLLQAIKDNRKIV 254
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
P+ID+I F Y+ S G F W L+F+W + I ++D T +P +AG
Sbjct: 255 ASPIIDVIGHEDFKYLSSSSDLRGGFGWNLNFKWDFLPPNHLIKHQQDGTAFILSPVIAG 314
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF+I +++F +G YD +M VWGGENLE+SFR WQCGG + I PCS V H+FR PY
Sbjct: 315 GLFSIHKSWFEELGKYDPQMDVWGGENLEISFRTWQCGGEMYIIPCSRVGHVFRDRHPYK 374
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
FPGG V N R A VWMD++ ++YF P A +R RL+LRK L C SFK
Sbjct: 375 FPGGSMNVFQKNTRRAAEVWMDDYKKYYFAAVPSARYSLFGD-IRDRLQLRKDLNCKSFK 433
Query: 496 WYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPA-SLLPCTHLPVL 553
WYL +++P +P DD +G+I++ + C++ +M G L PC H
Sbjct: 434 WYLENIYPELKVPDDDVIKYGQIKYKVSEDCLD------TMGHIKGEGIGLFPC-HGQGG 486
Query: 554 TQMFVMKLPTDLIATDESVCLD-VPEYENDISPRVRILAC-SGFNRQRWTYDKE 605
Q + + ES+CL + + +I VR++ C + N Q+W YD++
Sbjct: 487 NQDWSWTKSNQI--KHESLCLSGISKKSEEI---VRMVPCVATDNFQKWKYDEK 535
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 66/158 (41%), Positives = 97/158 (61%), Gaps = 8/158 (5%)
Query: 41 LHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHN 100
L L + Q N +N S+ +P +R +PDVR P C+ K +++ LP ++I+I FHN
Sbjct: 94 LRGALKPGEDKNQRNAYNQEASELLPWDRVVPDVRDPGCRKKSYDKN-LPTTTIIICFHN 152
Query: 101 EAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPG 159
E +ALLRTV S +++SP +LKEI+LVDD S L L++P ++IR+
Sbjct: 153 EGRAALLRTVVSALNKSPEHLLKEIILVDDFSDNPLDGEEL------LALPRVKLIRNNQ 206
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
R GLI++R+ GA A GE+L FLD+HCEC ++ E L
Sbjct: 207 REGLIRSRVKGADMAVGEVLTFLDSHCECNEMWLEPLL 244
>gi|351714773|gb|EHB17692.1| Polypeptide N-acetylgalactosaminyltransferase 11 [Heterocephalus
glaber]
Length = 608
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 180/433 (41%), Positives = 247/433 (57%), Gaps = 39/433 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +S+VI F+NEA+SALL TVHSV+ ++P +L EI+LVDD S + LK LDEY+ K
Sbjct: 150 LPVASVVIYFYNEAFSALLPTVHSVLDQTPAYLLHEIILVDDDSDFDDLKGELDEYIQKY 209
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L VIR+P R GLI+ R++GA A GE+LVFL +HCE + WL+ L+A V D V
Sbjct: 210 LPAKIEVIRNPRREGLIRGRMIGAAHATGEVLVFLVSHCEVNVMWLQLLLAAVHGDPHTV 269
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRW-----YTYGSSDAIIKRKDFTEPFKT 370
VCPVI IIS T AY S + G FNW LHF+W G +D+ T P K+
Sbjct: 270 VCPVIYIISADTLAYSSSLVVQ-GGFNWGLHFKWDLVLLSELGGADSA------TAPIKS 322
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRK 430
P MAGGLFA++R YF +G YD M V GGENLE+SFR+W CGG + I PCS V H+F+K
Sbjct: 323 PTMAGGLFAVNRQYFNELGQYDSGMDVRGGENLEISFRIWMCGGKLFIIPCSRVGHIFQK 382
Query: 431 ASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY G + + +L R+ +W+DE+ E YF P+ K + + +ELRK+L
Sbjct: 383 RWPYGSLEGQDTMTHNSL-RLVHIWLDEYKEQYFSLRPDL-KTKSYGNISEHVELRKKLG 440
Query: 491 CHSFKWYLTHVWPHHFLP-----MDDKFF-------------GRIRHVQTHKCVEKPLAK 532
C SFKWYL +++P +P F GR+ H QT+KC+ +A+
Sbjct: 441 CQSFKWYLDNIYPEMQIPGPSAKAQQPVFVNRGPKHPRILQRGRLYHFQTNKCL---VAQ 497
Query: 533 GSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC 592
G +Q G +L + TQ+++ +L+ + CLD+ E + PR+
Sbjct: 498 GRPSQKGG-LLVLKAGNYGDPTQVWIYNEKHELVLNN-LFCLDMSETCSSDPPRLMKCHG 555
Query: 593 SGFNRQRWTYDKE 605
SG Q+WT K
Sbjct: 556 SG-GSQQWTLGKN 567
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/142 (48%), Positives = 100/142 (70%), Gaps = 2/142 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
YQ + FN+L+S+R+ ++ +PD R CK K ++ + LP +S+VI F+NEA+SALL TVH
Sbjct: 114 YQKHAFNMLISNRLGYHKDVPDTRNAVCKEKSYSTD-LPVASVVIYFYNEAFSALLPTVH 172
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLG 170
SV+ ++P +L EI+LVDD S + LK LDEY+ K L VIR+P R GLI+ R++G
Sbjct: 173 SVLDQTPAYLLHEIILVDDDSDFDDLKGELDEYIQKYLPAKIEVIRNPRREGLIRGRMIG 232
Query: 171 ARQAEGEILVFLDAHCECTLVF 192
A A GE+LVFL +HCE +++
Sbjct: 233 AAHATGEVLVFLVSHCEVNVMW 254
>gi|119620893|gb|EAX00488.1| UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 14 (GalNAc-T14),
isoform CRA_c [Homo sapiens]
Length = 519
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 166/395 (42%), Positives = 235/395 (59%), Gaps = 22/395 (5%)
Query: 213 ALLRTVH-SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVG 270
A+ T H SV++R+P +++EI+LVDD S + D+ + +P + +R+ R G
Sbjct: 92 AIPDTRHLSVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQG 145
Query: 271 LIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAY 330
L+++R+ GA A+G L FLD+HCE WL+ L+ RV ED TRVVCPVIDII+ TF Y
Sbjct: 146 LVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHRVKEDYTRVVCPVIDIINLDTFTY 205
Query: 331 VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGA 390
+ S G F+W LHF+W S + +R D TEP +TP +AGGLF ID+A+F ++G
Sbjct: 206 IESASELRGGFDWSLHFQWEQL-SPEQKARRLDPTEPIRTPIIAGGLFVIDKAWFDYLGK 264
Query: 391 YDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLAR 450
YD +M +WGGEN E+SFRVW CGGS+EI PCS V H+FRK PY FP G + N R
Sbjct: 265 YDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKHPYVFPDGNANTYIKNTKR 324
Query: 451 VALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD 510
A VWMDE+ ++Y+ P A +R V SRL+LRK L+C SFKWYL +++P +P +
Sbjct: 325 TAEVWMDEYKQYYYAARPFA-LERPFGNVESRLDLRKNLRCQSFKWYLENIYPELSIPKE 383
Query: 511 DKF-FGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLP---VLTQMFVMKLPTDLI 566
G IR Q KC+E ++ NQ + L PC + +Q++ ++
Sbjct: 384 SSIQKGNIR--QRQKCLE---SQRQNNQETPNLKLSPCAKVKGEDAKSQVWAFTYTQQIL 438
Query: 567 ATDESVCLDVPEYENDISPRVRILACSGFNRQRWT 601
E +CL V +P V +L +G +RQ+WT
Sbjct: 439 Q--EELCLSVITLFPG-APVVLVLCKNGDDRQQWT 470
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/137 (29%), Positives = 64/137 (46%), Gaps = 41/137 (29%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y++ FN S+RI NR +PD R
Sbjct: 74 YKLYAFNQRESERISSNRAIPDTRH----------------------------------L 99
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLG 170
SV++R+P +++EI+LVDD S + D+ + +P + +R+ R GL+++R+ G
Sbjct: 100 SVLNRTPTHLIREIILVDDFS------NDPDDCKQLIKLPKVKCLRNNERQGLVRSRIRG 153
Query: 171 ARQAEGEILVFLDAHCE 187
A A+G L FLD+HCE
Sbjct: 154 ADIAQGTTLTFLDSHCE 170
>gi|47216191|emb|CAG01225.1| unnamed protein product [Tetraodon nigroviridis]
Length = 586
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/344 (43%), Positives = 211/344 (61%), Gaps = 11/344 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + + LP +SI+I FHNEA S LLRTV SV+ RSP S+++EI+L+DD S
Sbjct: 149 HYRCASISYDPELPSTSIIITFHNEARSTLLRTVKSVLMRSPPSLIQEIILIDDFS---- 204
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
S ++ + +P R +R+ R GLI++R+ GA A IL FLD+HCE WL+
Sbjct: 205 --SDPEDCQLLVHIPKVRCLRNVRREGLIRSRVRGANAASAPILTFLDSHCEVNTDWLQP 262
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
++ RV ED TRVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 263 MIQRVKEDHTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMARSD 321
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
T+P +TP +AGG+F +D+++F +G YD M +WGGEN E+SFRVW CGGS+EI PCS
Sbjct: 322 PTQPIRTPVIAGGIFVMDKSWFNRLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSR 381
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY FP G + N R A VWMDE+ ++Y+ P A+ + ++ R+
Sbjct: 382 VGHVFRKRHPYEFPEGNALTYIRNTRRAAEVWMDEYKQYYYSARPSAQG-KAFGSITDRV 440
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE 527
LRK+L C F+WY+ +V+P +P + +R Q C+E
Sbjct: 441 SLRKKLNCKPFRWYMENVYPELRVPEQEAVTSVLR--QGGLCLE 482
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/161 (40%), Positives = 94/161 (58%), Gaps = 10/161 (6%)
Query: 30 GHYLGDSVDGGLHSN--LSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEE 87
GH LG + + L YQ + FN+L SDR+ R + D R +C + ++ E
Sbjct: 101 GHLLGSFDEKAYLTEKLLKPGVDPYQDHAFNVLESDRVGSERAIRDTRHYRCASISYDPE 160
Query: 88 FLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK 147
LP +SI+I FHNEA S LLRTV SV+ RSP S+++EI+L+DD S S ++
Sbjct: 161 -LPSTSIIITFHNEARSTLLRTVKSVLMRSPPSLIQEIILIDDFS------SDPEDCQLL 213
Query: 148 LSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ +P R +R+ R GLI++R+ GA A IL FLD+HCE
Sbjct: 214 VHIPKVRCLRNVRREGLIRSRVRGANAASAPILTFLDSHCE 254
>gi|118404262|ref|NP_001072444.1| polypeptide N-acetylgalactosaminyltransferase 10 [Xenopus
(Silurana) tropicalis]
gi|113197915|gb|AAI21701.1| GalNAc transferase 10 [Xenopus (Silurana) tropicalis]
Length = 603
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/350 (42%), Positives = 228/350 (65%), Gaps = 15/350 (4%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 242
+++C+ F++ LP +S++I FHNE WS+LLRTVHSV++RSP ++ EI+LVDD S +
Sbjct: 129 NSNCKNKFYFSK--LPNTSVIIPFHNEGWSSLLRTVHSVLNRSPPELIAEIVLVDDYSDK 186
Query: 243 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
LKS L++Y+A +++R+ R GLI+ R+LGA A GE+L FLD+HCE + WL
Sbjct: 187 AHLKSRLEKYMANFP-KVKIVRTKKREGLIRTRMLGATVASGEVLTFLDSHCEANVNWLP 245
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
L+ + ++ VVCP+ID+I F YV ++ + GAF+WE+ ++ +++
Sbjct: 246 PLLDPLVQNYKTVVCPMIDVIDSDNFGYVTQAGDAMRGAFDWEMFYKRIPIPPE---LQK 302
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
D ++PF +P MAGGLFAI+R +F+ +G YD +++WGGE E+SF+VW CGG + +PC
Sbjct: 303 GDPSDPFDSPVMAGGLFAINREWFWQLGGYDPGLEIWGGEQYEISFKVWMCGGRMVDSPC 362
Query: 422 SHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
S V H++RK PY P GVS L NL RVA VWMDE+AE+ ++ P+ + V +
Sbjct: 363 SRVGHIYRKYVPYKVPAGVS--LARNLKRVAEVWMDEYAEYIYQRRPDY-RHLSVGDVAA 419
Query: 482 RLELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCV 526
+ ELR++L+C FKW++ + W P ++ P++ +G IR+V+ CV
Sbjct: 420 QKELRRKLQCRDFKWFMNEIAWDLPKYYPPVELPPAAWGEIRNVRMDMCV 469
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 73/149 (48%), Positives = 105/149 (70%), Gaps = 5/149 (3%)
Query: 45 LSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNE 101
++DA Q Y+ N FN+ VSD+I +NR+LPD+R CK K + + LP +S++I FHNE
Sbjct: 95 MTDADHVDQAYRENGFNIFVSDKISLNRSLPDIRNSNCKNKFYFSK-LPNTSVIIPFHNE 153
Query: 102 AWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRV 161
WS+LLRTVHSV++RSP ++ EI+LVDD S + LKS L++Y+A +++R+ R
Sbjct: 154 GWSSLLRTVHSVLNRSPPELIAEIVLVDDYSDKAHLKSRLEKYMANFP-KVKIVRTKKRE 212
Query: 162 GLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R+LGA A GE+L FLD+HCE +
Sbjct: 213 GLIRTRMLGATVASGEVLTFLDSHCEANV 241
>gi|449281639|gb|EMC88675.1| Polypeptide N-acetylgalactosaminyltransferase-like protein 2
[Columba livia]
Length = 640
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 153/374 (40%), Positives = 232/374 (62%), Gaps = 15/374 (4%)
Query: 190 LVFNEEF---LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
L +E+ LP +S++I FH+EAWS LLRTVHS++ +P++ LK+I+LVDD S + LK
Sbjct: 183 LCLQQEYDSSLPTASVIICFHDEAWSTLLRTVHSIMDTAPKASLKDIILVDDLSQQGPLK 242
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
S+L EY++KL ++IRS R+G+I+ R+LGA +A G++LVF+D+HCEC GWLE L+A
Sbjct: 243 SALSEYISKLD-GVKLIRSNKRLGVIRGRMLGAARATGDVLVFMDSHCECQKGWLEPLLA 301
Query: 307 RVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTE 366
R++ +R VV PVID+I TF Y S LH G F+W+L F W + +++ +
Sbjct: 302 RLSSNRNSVVSPVIDVIDWKTFQYYHSVGLHRGVFDWKLDFHWEPVPEREEKVRQSPIS- 360
Query: 367 PFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAH 426
P ++P +AG + A+DR YF + GAYD +M +WG ENLE+S R W CGGS+EI PCS V H
Sbjct: 361 PIRSPVVAGAVVAMDRHYFQNTGAYDSDMTMWGAENLELSIRTWLCGGSVEIIPCSRVGH 420
Query: 427 LFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA--EKQRDKQAVRSRLE 484
++R P +F E + N R+A W+ + + ++K + A + +K RL+
Sbjct: 421 VYRNHFPRAF--SYEEAIVRNKIRIAETWLGSFKDNFYKHDTVAFLISKAEKPDCSERLQ 478
Query: 485 LRKQLKCHSFKWYLTHVWPH-HFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPAS 543
L+K+L C SF+W++++V+P L +F G++ + C + A + A G
Sbjct: 479 LQKRLGCRSFQWFISNVYPELSQLEDTPRFSGKLYNTGVGFCADYRPASAA---AEGSIE 535
Query: 544 LLPCTHLPVLTQMF 557
L PC+ LTQ F
Sbjct: 536 LSPCSD--SLTQHF 547
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 69/145 (47%), Positives = 104/145 (71%), Gaps = 2/145 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN +S+RI + R LP+VR P C + ++ LP +S++I FH+EAWS LLRTVHS++
Sbjct: 162 FNEALSERISLRRDLPEVRHPLCLQQEYDSS-LPTASVIICFHDEAWSTLLRTVHSIMDT 220
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+P++ LK+I+LVDD S + LKS+L EY++KL ++IRS R+G+I+ R+LGA +A G
Sbjct: 221 APKASLKDIILVDDLSQQGPLKSALSEYISKLD-GVKLIRSNKRLGVIRGRMLGAARATG 279
Query: 177 EILVFLDAHCECTLVFNEEFLPKSS 201
++LVF+D+HCEC + E L + S
Sbjct: 280 DVLVFMDSHCECQKGWLEPLLARLS 304
>gi|224496010|ref|NP_001139074.1| polypeptide N-acetylgalactosaminyltransferase-like 6 [Danio rerio]
Length = 600
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 221/349 (63%), Gaps = 13/349 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +SI+I FHNE WS+LLRT+HS+ +R+P ++ EI+LVDD S RE
Sbjct: 126 HPNCKQKLYLENLPNTSIIIPFHNEGWSSLLRTLHSISNRTPDHLIAEIILVDDYSDREH 185
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK+ L EY+++ R++R+ R GLI+ RLLGA A GE+L FLD+HCE + WL L
Sbjct: 186 LKAHLAEYMSRFP-KVRIVRTKKREGLIRTRLLGASVARGEVLTFLDSHCEANINWLPPL 244
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
+ ++A++ +VCP+ID+I F Y ++ + GAF+WE++++ ++ D
Sbjct: 245 LDQIAQNPKTIVCPMIDVIDHNHFGYEAQAGDAMRGAFDWEMYYKRIPIPPE---LQGPD 301
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
++P+++P MAGGLFA++R +F+ +G YD +++WGGE E+SF+VW CGGS+ PCS
Sbjct: 302 PSDPYQSPVMAGGLFAVNRQWFWELGGYDTGLEIWGGEQFEISFKVWMCGGSMYDVPCSR 361
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++RK PY P G S L NL RVA WMDE+ E+ ++ PE + + ++
Sbjct: 362 VGHIYRKYVPYKVPSGTS--LARNLKRVAETWMDEYTEYIYQRRPEY-RHLSTGDLTAQK 418
Query: 484 ELRKQLKCHSFKWYLTHV-W--PHHFLPMD--DKFFGRIRHVQTHKCVE 527
ELRK LKC FKWY+ V W P ++ P++ +G IR+ + CV+
Sbjct: 419 ELRKHLKCKDFKWYMNTVAWDLPKYYPPVEPLPAAWGEIRNAASGLCVD 467
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 71/140 (50%), Positives = 102/140 (72%), Gaps = 2/140 (1%)
Query: 51 LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
+Y+ N FN+ VS+ I ++R+LPD+R P CK K++ E LP +SI+I FHNE WS+LLRT+
Sbjct: 101 VYKENGFNIYVSNNIALDRSLPDIRHPNCKQKLYLEN-LPNTSIIIPFHNEGWSSLLRTL 159
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLG 170
HS+ +R+P ++ EI+LVDD S RE LK+ L EY+++ R++R+ R GLI+ RLLG
Sbjct: 160 HSISNRTPDHLIAEIILVDDYSDREHLKAHLAEYMSRFP-KVRIVRTKKREGLIRTRLLG 218
Query: 171 ARQAEGEILVFLDAHCECTL 190
A A GE+L FLD+HCE +
Sbjct: 219 ASVARGEVLTFLDSHCEANI 238
>gi|340371807|ref|XP_003384436.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 2-like
[Amphimedon queenslandica]
Length = 350
Score = 295 bits (755), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 198/309 (64%), Gaps = 8/309 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++I FHNEA SALLRT++SV+S P ++KEI+LVDD F D + +
Sbjct: 45 LPTTSVIICFHNEARSALLRTIYSVLSHEPAKLIKEIILVDD-----FSDDINDGEILSV 99
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IR R GLI+ARL GAR A+GE+L FLD+HCE T GWLE L+AR+ EDR VV
Sbjct: 100 IPKVKLIRLNERQGLIRARLTGARAAQGEVLTFLDSHCEVTPGWLEPLLARIKEDRRHVV 159
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+IDII Y ++ G F L F+W + +R+D T P TPA+AGG
Sbjct: 160 SPIIDIIRKDDMKYNQANANIKGGFGHNLLFKWDNLNWQELQRRRQDNTAPIPTPAIAGG 219
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF+IDR YF IG+YDEEM++WGGEN+E S RVW CGG +EI PCSHV H+FR + PYSF
Sbjct: 220 LFSIDRGYFKEIGSYDEEMEIWGGENVEFSIRVWMCGGRLEIMPCSHVGHIFRSSMPYSF 279
Query: 437 PGGVS--EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
G S + NL R+A VWMDE+ ++ N A + V SR+E+RK+L+C F
Sbjct: 280 GKGKSYHTTVTRNLRRIAEVWMDEYKYLFYNANA-AARSIPFGNVSSRVEIRKRLQCKPF 338
Query: 495 KWYLTHVWP 503
W++ V+P
Sbjct: 339 SWFVEKVYP 347
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/152 (44%), Positives = 95/152 (62%), Gaps = 7/152 (4%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEF--LPKSSIVIVFHNEAWSALL 107
++Y+I++FN L SD P NR+L DVR +C + LP +S++I FHNEA SALL
Sbjct: 4 EMYRIHQFNQLASDNTPYNRSLWDVRTHQCMRRYDGSSASELPTTSVIICFHNEARSALL 63
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RT++SV+S P ++KEI+LVDD F D + + ++IR R GLI+AR
Sbjct: 64 RTIYSVLSHEPAKLIKEIILVDD-----FSDDINDGEILSVIPKVKLIRLNERQGLIRAR 118
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
L GAR A+GE+L FLD+HCE T + E L +
Sbjct: 119 LTGARAAQGEVLTFLDSHCEVTPGWLEPLLAR 150
>gi|157117587|ref|XP_001658839.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108875983|gb|EAT40208.1| AAEL008037-PA [Aedes aegypti]
Length = 662
Score = 295 bits (755), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 210/325 (64%), Gaps = 5/325 (1%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C ++ LP +SI++ F+NE L+R+V S+I R+P +L EI+LVDD S +
Sbjct: 184 HKLCHEQSYDKVLPSASIIMCFYNEHLETLVRSVTSIIRRTPSYLLHEIILVDDCSDLDD 243
Query: 245 LKSSLD-EYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L+ +L+ E A + R+IR+ R GL+++R+ GAR A G++L+FLD+H E + W+E
Sbjct: 244 LRDNLEHELNALKNSKVRLIRNAEREGLMRSRVYGARNATGDVLIFLDSHIEVNVDWVEP 303
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ R+ ++T + PVIDII+ TF Y S L G FNW LHF+W + K D
Sbjct: 304 LLQRIKTNKTILAMPVIDIINSDTFIYSSS-PLVRGGFNWGLHFKWDNL-PKGTLAKESD 361
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F PF++P MAGGLFA+DR YF +G YD M VWGGENLE+SFR WQCGGSIE+ PCS
Sbjct: 362 FVGPFQSPTMAGGLFAVDRQYFKDLGEYDMGMDVWGGENLEISFRTWQCGGSIELVPCSR 421
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
+ H+FRK PY P G S+ + N R++ VWMD++ +++ + P+A K+ D + R
Sbjct: 422 IGHVFRKRRPYGSPDG-SDTMIRNSLRLSRVWMDDYIKYFLENQPQA-KKVDPGDLTDRH 479
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLP 508
+LRK+L C SF+WYL +++P LP
Sbjct: 480 DLRKRLNCKSFEWYLKNIYPQLKLP 504
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 96/149 (64%), Gaps = 2/149 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FN+LVS++I R +PD R C + + ++ LP +SI++ F+NE L+R+V
Sbjct: 160 YRKHAFNVLVSNKIGPFRGVPDTRHKLCHEQSY-DKVLPSASIIMCFYNEHLETLVRSVT 218
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLD-EYVAKLSVPTRVIRSPGRVGLIKARLLG 170
S+I R+P +L EI+LVDD S + L+ +L+ E A + R+IR+ R GL+++R+ G
Sbjct: 219 SIIRRTPSYLLHEIILVDDCSDLDDLRDNLEHELNALKNSKVRLIRNAEREGLMRSRVYG 278
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPK 199
AR A G++L+FLD+H E + + E L +
Sbjct: 279 ARNATGDVLIFLDSHIEVNVDWVEPLLQR 307
>gi|344271584|ref|XP_003407617.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 12-like [Loxodonta
africana]
Length = 576
Score = 295 bits (754), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 153/333 (45%), Positives = 212/333 (63%), Gaps = 16/333 (4%)
Query: 192 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 251
++ E LP++S++I F+NEAWS LLRTV+SV+ S +L+E++LVDD S RE LK L
Sbjct: 125 YDYENLPRTSVIIAFYNEAWSTLLRTVYSVLETSSDMLLEEVILVDDYSDREHLKERLAT 184
Query: 252 YVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED 311
++ L R+IR+ R GL++ARLLGA A+G +L FLD HCEC GWLE L+ R+ E+
Sbjct: 185 ELSGLP-KVRLIRANKREGLVRARLLGASVAKGNVLTFLDCHCECHEGWLEPLLERIHEE 243
Query: 312 RTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPF-- 368
+ VVCPVID+I TF Y+ E G F+W L F W+T + R+ P
Sbjct: 244 ESAVVCPVIDVIDWDTFEYLGNPGEPQIGGFDWRLVFTWHTVPERE----RRRMRSPIDV 299
Query: 369 -KTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
++P MAGGLFA+ + YF ++G+YD M+VWGGENLE SFR+WQCGG++E PCSHV H+
Sbjct: 300 IRSPTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGTLETHPCSHVGHV 359
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
F K +PYS ++ N R A VWMDE+ E Y+ NP A + V R +LR
Sbjct: 360 FPKKAPYSRNKALA-----NSVRAAEVWMDEYKELYYHRNPHA-RLEPFGDVTERRQLRA 413
Query: 488 QLKCHSFKWYLTHVWPHHFLPMD-DKFFGRIRH 519
+L+C F+W+L V+P +P D FFG +++
Sbjct: 414 KLQCKDFRWFLETVYPELHVPEDRPGFFGMLQN 446
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 67/136 (49%), Positives = 96/136 (70%), Gaps = 1/136 (0%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
Q+++ N+ +SDRI ++R LP+ R P C K ++ E LP++S++I F+NEAWS LLRTV+S
Sbjct: 94 QLHQINVYLSDRISLHRRLPERRNPLCMEKKYDYENLPRTSVIIAFYNEAWSTLLRTVYS 153
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ S +L+E++LVDD S RE LK L ++ L R+IR+ R GL++ARLLGA
Sbjct: 154 VLETSSDMLLEEVILVDDYSDREHLKERLATELSGLP-KVRLIRANKREGLVRARLLGAS 212
Query: 173 QAEGEILVFLDAHCEC 188
A+G +L FLD HCEC
Sbjct: 213 VAKGNVLTFLDCHCEC 228
>gi|156351115|ref|XP_001622369.1| hypothetical protein NEMVEDRAFT_v1g141560 [Nematostella vectensis]
gi|156208888|gb|EDO30269.1| predicted protein [Nematostella vectensis]
Length = 494
Score = 295 bits (754), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 149/365 (40%), Positives = 217/365 (59%), Gaps = 16/365 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C LP +SI+I FHNEA S LLRTV S+++++P +++ EI+LVDD S
Sbjct: 45 HSHCRYEAYPSTLPATSIIITFHNEARSTLLRTVKSILNKTPPNLVNEIILVDDFS---- 100
Query: 245 LKSSLDEYVAKLSVP-TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
++ + + +P +V+R+ R GLI++R+ G+ A+ ++L FLD+HCEC WL+
Sbjct: 101 --DDAEDGLLLMGLPKVKVLRNNKRQGLIRSRVKGSDTAKSDVLTFLDSHCECNTDWLQP 158
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ RV +++ VV P+ID+I+ F+Y+ + G F+W LHF+W +R
Sbjct: 159 LLKRVVQNKKAVVSPIIDVINMDDFSYIGASADIKGGFDWSLHFKWDNLTPEQKQSRRST 218
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
P KTP +AGGLF + +++F +G YD M +WGGEN E+SFR WQCGGS+EI PCS
Sbjct: 219 PIAPIKTPMIAGGLFVVTKSWFEEMGKYDTMMDIWGGENFEISFRTWQCGGSMEIIPCSR 278
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+FP G + N R A VWMDE+ FY+ P A +++SR
Sbjct: 279 VGHVFRKRHPYTFPDGNANTYMKNTRRTAEVWMDEYKRFYYAARPMARSAL-YGSIKSRK 337
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPA 542
ELRK+L+C FKWYL +V+P +P D FG ++ Q C++ GS QA G
Sbjct: 338 ELRKRLQCKPFKWYLQNVYPELQIPDSQDVSFGELK--QGKSCLD---TLGS--QAGGSV 390
Query: 543 SLLPC 547
+ C
Sbjct: 391 GMFDC 395
>gi|327274929|ref|XP_003222227.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Anolis carolinensis]
Length = 605
Score = 294 bits (753), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 211/318 (66%), Gaps = 7/318 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FH+EAWS LLRTVHSV+ +PR LKEI+LVDD ST+E+LKSSL EY++KL
Sbjct: 196 LPTASVVICFHDEAWSTLLRTVHSVLDTAPRDFLKEIILVDDLSTQEYLKSSLSEYISKL 255
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
++IRS R+G+I+ R+LGA +A GE++VF+D+HCEC GWLE L+ R+A DR+R+V
Sbjct: 256 P-GVKLIRSNRRLGVIQGRMLGAARATGEVVVFMDSHCECHNGWLEPLLERLASDRSRIV 314
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
PVID+I TF Y + EL G F+W+L F W + + R P ++PA+ GG
Sbjct: 315 SPVIDVIDWKTFQYHHTMELQRGVFDWKLDFHWKPLTEHEKKV-RPSPVSPIRSPAVPGG 373
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
+ A+ R +F + G YD +M + GGEN+E+S + W CGGS+EI PCS V H++R PY+F
Sbjct: 374 VIAVHRHHFQNTGGYDSDMTLLGGENIELSIKAWLCGGSVEILPCSRVGHVYRTGMPYNF 433
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA--EKQRDKQAVRSRLELRKQLKCHSF 494
+ + N R+A W+D + +++ + A + +K +R++L+K+L C F
Sbjct: 434 SD--EKAIERNKIRIAETWLDSFKHLFYQHDRLACLISKAEKPNCTARIQLKKRLGCKGF 491
Query: 495 KWYLTHVWPHHFLPMDDK 512
W++++V+P P+ D+
Sbjct: 492 LWFISNVYP-ELRPVQDR 508
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 74/136 (54%), Positives = 103/136 (75%), Gaps = 2/136 (1%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
+I FN +S +IP++R LP+VR P C + + LP +S+VI FH+EAWS LLRTVHS
Sbjct: 161 EIYGFNEALSKQIPLHRELPEVRHPLCLQQEPSPN-LPTASVVICFHDEAWSTLLRTVHS 219
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ +PR LKEI+LVDD ST+E+LKSSL EY++KL ++IRS R+G+I+ R+LGA
Sbjct: 220 VLDTAPRDFLKEIILVDDLSTQEYLKSSLSEYISKLP-GVKLIRSNRRLGVIQGRMLGAA 278
Query: 173 QAEGEILVFLDAHCEC 188
+A GE++VF+D+HCEC
Sbjct: 279 RATGEVVVFMDSHCEC 294
>gi|395507115|ref|XP_003757873.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Sarcophilus harrisii]
Length = 633
Score = 294 bits (753), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 158/347 (45%), Positives = 217/347 (62%), Gaps = 10/347 (2%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +SI+I FHNEA S LLRT+ SV +R+P ++ EI+LVDD S
Sbjct: 210 CATLHYGPNLPPTSIIITFHNEARSTLLRTIRSVSNRTPVHLVHEIILVDDFSD----DP 265
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+ ++KL + +R+ R GLI++R+ GA A+ IL FLD+HCE WL L+ R
Sbjct: 266 DDCQLLSKLPK-VKCLRNEQREGLIRSRIRGADLAQASILTFLDSHCEVNKDWLLPLLHR 324
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ ED TRVVCPVIDII+ TFAYV S G F+W LHF+W + + R D +P
Sbjct: 325 IKEDPTRVVCPVIDIINRDTFAYVSSSPDMRGGFDWTLHFKWEELSLREKAL-RVDPIQP 383
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
KTP ++GGLF +++++F H+G YD M +WGGEN E+SFRVW CGGS+EI PCS V H+
Sbjct: 384 IKTPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEILPCSRVGHV 443
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK PY+FP G N R A VWMDE+ +++ P A+ R +++R+ELRK
Sbjct: 444 FRKKHPYTFPEGNLNTYIKNTKRTAEVWMDEFKHYFYAARPVAQG-RPFGNIQARVELRK 502
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKG 533
+LKCH+FKWYL +V+P +P + F G IR Q KC++ +G
Sbjct: 503 RLKCHTFKWYLENVYPELRIPEESLFQKGIIR--QKQKCLQSRKQEG 547
Score = 98.2 bits (243), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/140 (42%), Positives = 84/140 (60%), Gaps = 6/140 (4%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y++ FN SDRIP NR + D R +C T + LP +SI+I FHNEA S LL
Sbjct: 179 GQDPYKLYSFNQRESDRIPSNRAIRDTRPDRCATLHYGPN-LPPTSIIITFHNEARSTLL 237
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RT+ SV +R+P ++ EI+LVDD S + ++KL + +R+ R GLI++R
Sbjct: 238 RTIRSVSNRTPVHLVHEIILVDDFSD----DPDDCQLLSKLPK-VKCLRNEQREGLIRSR 292
Query: 168 LLGARQAEGEILVFLDAHCE 187
+ GA A+ IL FLD+HCE
Sbjct: 293 IRGADLAQASILTFLDSHCE 312
>gi|156544564|ref|XP_001602677.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Nasonia vitripennis]
Length = 637
Score = 294 bits (753), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 149/330 (45%), Positives = 210/330 (63%), Gaps = 12/330 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C ++ LP +SIVI F+NE ++ LLR+++S++ R+P+ +L EI+L++D S +
Sbjct: 159 HKLCKNQTYDQKLPNASIVICFYNEHYNTLLRSLYSILDRTPKHLLHEIILINDFSDSKS 218
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L + +YV + + R+ R GLI+AR+ GA++A GE+LVFLD+H E WLE
Sbjct: 219 LHEQVRDYVKQNFDNKVKYYRTERREGLIRARMFGAKKATGEVLVFLDSHIEVNKMWLEP 278
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+AR++ RT V PVIDII+ TF Y S L G FNW LHF+W + ++ +D
Sbjct: 279 LLARISHSRTIVPMPVIDIINADTFQYSSS-PLVRGGFNWGLHFKWDSLPIGTLSLE-QD 336
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F +P K+P MAGGLFA+DR YFF +G YD M VWGGENLE+SFR+W CGGSIE+ PCS
Sbjct: 337 FVKPIKSPTMAGGLFAMDRKYFFELGEYDAGMDVWGGENLEISFRIWMCGGSIELIPCSR 396
Query: 424 VAHLFRKASPYSFPGG--VSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
V H+FR+ PY GG + + N RVA VWMD++ +++ K K+ D +
Sbjct: 397 VGHVFRRRRPY---GGNDQQDTMLKNSLRVAYVWMDQYKKYFLK----NVKKIDYGDITE 449
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDD 511
R +LR++L C F WYL HV+P LP D+
Sbjct: 450 RQQLRQKLHCKDFAWYLEHVYPELTLPDDN 479
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 112/179 (62%), Gaps = 8/179 (4%)
Query: 27 VVVGHYLGDSVDGGLHSNLSDAQQL---YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKV 83
V VG L + GL N+ + ++ Y+ FN+LVSD + ++R +PD R CK +
Sbjct: 110 VTVGQGLDEL---GLVKNMDEQKKREEGYKSFAFNVLVSDNLSLHRDIPDTRHKLCKNQT 166
Query: 84 FNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDE 143
++++ LP +SIVI F+NE ++ LLR+++S++ R+P+ +L EI+L++D S + L + +
Sbjct: 167 YDQK-LPNASIVICFYNEHYNTLLRSLYSILDRTPKHLLHEIILINDFSDSKSLHEQVRD 225
Query: 144 YVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
YV + + R+ R GLI+AR+ GA++A GE+LVFLD+H E ++ E L + S
Sbjct: 226 YVKQNFDNKVKYYRTERREGLIRARMFGAKKATGEVLVFLDSHIEVNKMWLEPLLARIS 284
>gi|242005043|ref|XP_002423384.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
gi|212506428|gb|EEB10646.1| UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, putative
[Pediculus humanus corporis]
Length = 573
Score = 294 bits (753), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 163/415 (39%), Positives = 237/415 (57%), Gaps = 25/415 (6%)
Query: 201 SIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPT 260
S+V+ FHNE WS LLRTV+SV++RSP +LKEI+LVDD S++ FLK LD YV +
Sbjct: 126 SVVVPFHNEHWSTLLRTVYSVLNRSPSHLLKEIILVDDYSSKPFLKKKLDIYVDRHLPKV 185
Query: 261 RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVI 320
++IR P R+GLI+ARL GA++A+ ++L+FLD+H E + WL L+ +AE+ VCP I
Sbjct: 186 KIIRLPERMGLIRARLAGAKKAKAQVLLFLDSHTEANVNWLPPLLEPIAENYKTCVCPFI 245
Query: 321 DIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAI 380
D+I+ TF Y E GAF+WE ++ D K TEPF++P MAGGLFAI
Sbjct: 246 DVIAHDTFEYRAQDEGRRGAFDWEFFYKRLPLLPEDL----KHPTEPFQSPVMAGGLFAI 301
Query: 381 DRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGV 440
+F+ +G YDE + +WGGE E+SF++WQCGG + APCS V H++RK +P+ P G+
Sbjct: 302 SAKFFWELGGYDEGLAIWGGEQYELSFKIWQCGGKMVDAPCSRVGHIYRKFAPFPNP-GI 360
Query: 441 SEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTH 500
+ + N RVA VWMDE+AE+ +K P + D + + +R++L C FKW++ +
Sbjct: 361 GDFVGKNYRRVAEVWMDEYAEYLYKRRPHY-RNIDPGDLTVQKAVRERLNCKPFKWFIEN 419
Query: 501 V---WPHHFLPMD--DKFFGRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPC-----TH 549
+ P + P++ D G IR + CV+ + L PC +
Sbjct: 420 IAFDLPLKYPPIEPPDLAEGEIRSIADPGLCVDT-----ERKEPEDTFGLKPCEKNFKSK 474
Query: 550 LPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYD 603
Q F++ D+ +VC DV +N S V + C G Q W YD
Sbjct: 475 NTRTEQYFILTWHEDIRPKGRNVCWDVSSIDNKAS--VNLYKCHGMKGNQYWHYD 527
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/142 (50%), Positives = 103/142 (72%), Gaps = 2/142 (1%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+ LY +N FN L+SD+I +N +LPD+R P CK K + + L S+V+ FHNE WS LLR
Sbjct: 84 EALYAVNGFNALLSDKIYLN-SLPDIRHPGCKEKKYRKN-LNTVSVVVPFHNEHWSTLLR 141
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV+SV++RSP +LKEI+LVDD S++ FLK LD YV + ++IR P R+GLI+ARL
Sbjct: 142 TVYSVLNRSPSHLLKEIILVDDYSSKPFLKKKLDIYVDRHLPKVKIIRLPERMGLIRARL 201
Query: 169 LGARQAEGEILVFLDAHCECTL 190
GA++A+ ++L+FLD+H E +
Sbjct: 202 AGAKKAKAQVLLFLDSHTEANV 223
>gi|326923175|ref|XP_003207815.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like
[Meleagris gallopavo]
Length = 709
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 148/332 (44%), Positives = 214/332 (64%), Gaps = 5/332 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP ++I++ F +E WS LLR+VHSV+SRSP +L+E++LVDD ST+++LK LD Y+++
Sbjct: 267 LPTTTIIMCFVDEVWSTLLRSVHSVLSRSPPHLLQELILVDDFSTKDYLKEKLDAYMSQF 326
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+V+ R GLI+ARL GA+ A G +L FLD+H EC +GWLE L+ RV R RV
Sbjct: 327 P-KVKVLHLRERHGLIRARLAGAQMATGTVLTFLDSHVECNVGWLEPLLERVRLHRARVA 385
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK-DFTEPFKTPAMAG 375
CPVI++ISD +Y+ G F W ++F W + I K K T+ + P MAG
Sbjct: 386 CPVIEVISDKDMSYMTVDNFQRGIFTWPMNFGWKQI-PQEVIEKNKLKETDIIRCPVMAG 444
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLF++++ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H+FR +PYS
Sbjct: 445 GLFSVEKKYFFELGTYDSGLDVWGGENMELSFKVWMCGGEIEIVPCSRVGHIFRNDNPYS 504
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE-LRKQLKCHSF 494
FP + NLARVA VW+D + E ++ ++R + S+ + LR++L+C SF
Sbjct: 505 FPKDRVRTVERNLARVAEVWLDGYKELFYGHAYHLLQRRAELGDLSQQKALRERLRCRSF 564
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
+WYL +V+P P+ K G + +V T +C+
Sbjct: 565 RWYLENVYPDLAAPL-VKAGGLLVNVATARCI 595
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 64/153 (41%), Positives = 102/153 (66%), Gaps = 2/153 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+A+ ++ FN+ +SD IPV+R + D R C + +++ LP ++I++ F +E WS L
Sbjct: 226 EAKSRWKEGNFNVFLSDLIPVDRAIADTRPAGCLEQQVHDD-LPTTTIIMCFVDEVWSTL 284
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV+SRSP +L+E++LVDD ST+++LK LD Y+++ +V+ R GLI+A
Sbjct: 285 LRSVHSVLSRSPPHLLQELILVDDFSTKDYLKEKLDAYMSQFP-KVKVLHLRERHGLIRA 343
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
RL GA+ A G +L FLD+H EC + + E L +
Sbjct: 344 RLAGAQMATGTVLTFLDSHVECNVGWLEPLLER 376
>gi|260817709|ref|XP_002603728.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
gi|229289050|gb|EEN59739.1| hypothetical protein BRAFLDRAFT_126865 [Branchiostoma floridae]
Length = 501
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 169/444 (38%), Positives = 250/444 (56%), Gaps = 34/444 (7%)
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLR 216
SPG G L Q +LV + C + LP+ S+V++FHNEA S +LR
Sbjct: 19 SPGHFGPKSRTLRPQVQDTSALLVPV-----CLHEMYPDDLPQVSVVMIFHNEALSVVLR 73
Query: 217 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 276
++HSVIS+SP ++ EI+LVDD+S + + ++ KL +++R+P R GLI++RL
Sbjct: 74 SIHSVISQSPPHLIGEIILVDDSSDKPHISTN-----TKLRDKVKILRNPKREGLIRSRL 128
Query: 277 LGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFEL 336
G +QA+G +VFLD+H ECT GWLE L+AR+ ++ + V CPVID I TFAY +
Sbjct: 129 KGVQQAQGPAIVFLDSHIECTPGWLEPLLARIRKNNSTVACPVIDHIDTKTFAY-EQLKF 187
Query: 337 HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQ 396
G F W+L+F W Y + + + +RK +P + P MAGGLFAI + YF HIGAYD+ M+
Sbjct: 188 LAGGFTWDLNFMW-IYVNKEEMARRKSAIDPVRCPVMAGGLFAIYKDYFQHIGAYDQAME 246
Query: 397 VWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF--PGGVSEV-LYGNLARVAL 453
++GGEN+EMSFRVWQCGG IE PCS V H+ R PY + ++ + N ARVA
Sbjct: 247 IYGGENVEMSFRVWQCGGRIETVPCSRVGHIERTDKPYLYVRSNDTKDINIEVNKARVAE 306
Query: 454 VWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHF-LPMDDK 512
VWMDE+ + + P+ K + R LRK+L C SF+WY+ +V+P +++
Sbjct: 307 VWMDEYKRYLYAREPQL-KNISYGDISERQALRKRLGCQSFQWYMENVYPDRLEQTVENG 365
Query: 513 FF---GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPT-----D 564
++ G +R++Q C++ +G L C H Q F ++ P
Sbjct: 366 YYRAWGELRNLQAGLCLDLMDGRG--------VGLWDC-HGQGGQQFFALRRPEKRKALQ 416
Query: 565 LIATDESVCLDVPEYENDISPRVR 588
I T + C+ E SP ++
Sbjct: 417 TIGTGDMQCMGTEGTERFESPIIK 440
Score = 105 bits (262), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 93/141 (65%), Gaps = 13/141 (9%)
Query: 66 PVNRTL-PDVRK------PKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSP 118
P +RTL P V+ P C +++ ++ LP+ S+V++FHNEA S +LR++HSVIS+SP
Sbjct: 25 PKSRTLRPQVQDTSALLVPVCLHEMYPDD-LPQVSVVMIFHNEALSVVLRSIHSVISQSP 83
Query: 119 RSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEI 178
++ EI+LVDD+S + + ++ KL +++R+P R GLI++RL G +QA+G
Sbjct: 84 PHLIGEIILVDDSSDKPHISTN-----TKLRDKVKILRNPKREGLIRSRLKGVQQAQGPA 138
Query: 179 LVFLDAHCECTLVFNEEFLPK 199
+VFLD+H ECT + E L +
Sbjct: 139 IVFLDSHIECTPGWLEPLLAR 159
>gi|195115752|ref|XP_002002420.1| GI12891 [Drosophila mojavensis]
gi|193912995|gb|EDW11862.1| GI12891 [Drosophila mojavensis]
Length = 622
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/328 (46%), Positives = 209/328 (63%), Gaps = 6/328 (1%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
CE E LP++SI++ F+NE L+R++ +V+ R+P +LKEI+LVDD S L
Sbjct: 131 CERGETLESEQLPQASIIMCFYNEHKMTLMRSIKTVLERTPAHLLKEIVLVDDNSDLPEL 190
Query: 246 K-SSLDEYVAKLSVPT-RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+ L + A+L R +R+ R GLI++R++GAR A G++LVFLD+H E WLE
Sbjct: 191 EFHLLADLHARLKYDNLRYVRNEQREGLIRSRVIGARDASGDVLVFLDSHIEVNRQWLEP 250
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ V + + + PVID+I+ TF Y S L G FNW LHFRW + +D
Sbjct: 251 LLRLVKAENSTLAVPVIDLINADTFEYTPS-PLVRGGFNWGLHFRWENLPEGTLKVP-ED 308
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F PF++P MAGGLFA++R YF HIG YD M +WGGEN+E+SFRVWQCGGSI+I PCS
Sbjct: 309 FKGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFRVWQCGGSIKIVPCSR 368
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+ P G + +L +L R+A VWMD++ EFY K A K D + +RL
Sbjct: 369 VGHIFRKRRPYTAPDGANTMLKNSL-RLAHVWMDKYKEFYLKHEKVA-KDYDYGDISARL 426
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDD 511
+LR++L C F WYL HV+P LP D+
Sbjct: 427 QLRERLHCKDFGWYLKHVYPELRLPGDE 454
Score = 102 bits (253), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 59/149 (39%), Positives = 92/149 (61%), Gaps = 3/149 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKC-KTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
Y+ + FN LVS+ I + R +PD R C + + E LP++SI++ F+NE L+R++
Sbjct: 104 YKHHAFNALVSNNIGLFRDIPDTRHKVCERGETLESEQLPQASIIMCFYNEHKMTLMRSI 163
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLK-SSLDEYVAKLSVPT-RVIRSPGRVGLIKARL 168
+V+ R+P +LKEI+LVDD S L+ L + A+L R +R+ R GLI++R+
Sbjct: 164 KTVLERTPAHLLKEIVLVDDNSDLPELEFHLLADLHARLKYDNLRYVRNEQREGLIRSRV 223
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
+GAR A G++LVFLD+H E + E L
Sbjct: 224 IGARDASGDVLVFLDSHIEVNRQWLEPLL 252
>gi|292623437|ref|XP_001339749.3| PREDICTED: putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1-like
[Danio rerio]
Length = 567
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 159/419 (37%), Positives = 240/419 (57%), Gaps = 25/419 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + + LP +SI+I FHNEA S LLRT+ SV+ RSP ++ EI+LVDD S+ +
Sbjct: 126 HYRCASMTFDPDLPPTSIIITFHNEARSTLLRTIKSVLMRSPPHLILEIILVDDFSS-DP 184
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L + K+ R +R+ R GLI++R+ GA A IL FLD+HCE WL+ +
Sbjct: 185 EDCRLLSQIPKV----RCLRNERREGLIRSRVRGASAASASILTFLDSHCEVNTDWLQPM 240
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ RV ED +RVV P+ID+IS FAY+ + G F+W LHF+W + + R D
Sbjct: 241 IQRVKEDHSRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQI-PIEQKMARNDP 299
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T+P +TP +AGG+F I++ +F H+G YD M +WGGEN E+SFRVW CGGS+EI PCS V
Sbjct: 300 TQPIRTPVIAGGIFVIEKGWFNHLGQYDTHMDIWGGENFELSFRVWMCGGSLEILPCSRV 359
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H+FRK PY FP G + N R A VWMD++ ++Y+ P A+ + ++ RL
Sbjct: 360 GHVFRKRHPYDFPEGNALTYIKNTRRAAEVWMDDYKQYYYAARPSAQG-KAFGSIADRLA 418
Query: 485 LRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASL 544
L+++L C+SF+WYL +V+P +P ++ + ++ Q C+E + + L
Sbjct: 419 LKRKLNCNSFRWYLENVYPELKIPEQEEAYSLLK--QGGLCLE--------SHGTDSLGL 468
Query: 545 LPCTHLPVL--TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS-GFNRQRW 600
C P + +Q + + P +CL + + +VR+ C+ +RQ+W
Sbjct: 469 AECRSTPSIPASQKWTLIEPQ---IRQHDLCLAITAF--TAGSKVRLEPCNIKESRQKW 522
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 82/144 (56%), Gaps = 6/144 (4%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
L + Y+ + FNL SDR+ R + D R +C + F+ + LP +SI+I FHNEA
Sbjct: 94 QLRPGEDPYREHAFNLQESDRLGSERAIRDTRHYRCASMTFDPD-LPPTSIIITFHNEAR 152
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
S LLRT+ SV+ RSP ++ EI+LVDD F D + R +R+ R GL
Sbjct: 153 STLLRTIKSVLMRSPPHLILEIILVDD-----FSSDPEDCRLLSQIPKVRCLRNERREGL 207
Query: 164 IKARLLGARQAEGEILVFLDAHCE 187
I++R+ GA A IL FLD+HCE
Sbjct: 208 IRSRVRGASAASASILTFLDSHCE 231
>gi|170039457|ref|XP_001847550.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167863027|gb|EDS26410.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 619
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 171/433 (39%), Positives = 246/433 (56%), Gaps = 28/433 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C + LP S++I+F+NE WSALLRTV+SV++RSP +LKEI+LV+D ST+ F
Sbjct: 145 HPQCRKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPPHLLKEIILVNDHSTKPF 204
Query: 245 LKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L L E+V ++LS ++I P R GLI ARL GA+ A G++L+ LD+H E + WL
Sbjct: 205 LWKPLQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPP 264
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ +A+D VCP+ID+I TF Y E GAF+W+ +++ D D
Sbjct: 265 LLEPIAQDYRTCVCPLIDVIVHDTFEYRSQDEGKRGAFDWKFYYKRLPLRPGDL----DD 320
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
TEPF++P MAGGLFAI +F+ +G YDE + +WGGE E+SF++WQCGG + APCS
Sbjct: 321 PTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSR 380
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++R SP+ P GV+ V N RVA VWMDE+ +F ++ NP+ +K + +
Sbjct: 381 VGHVYRGYSPFPNPRGVNFVTR-NFKRVAEVWMDEYKQFLYERNPQFDKTNPGDLTKQK- 438
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFL--PMDDKF---FGRIRHVQTHKCVEKPLAKGSMN-Q 537
LR++LKC FKW+L V P + P+ + GR++ V K L ++N +
Sbjct: 439 ALREKLKCKPFKWFLEEVAPDLLVRYPLREPLPFASGRVQSVANPK-----LCLDTLNHK 493
Query: 538 ASGPASLLPC----THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC- 592
A P + C TH P Q F + D+ A CLD + + V + C
Sbjct: 494 AKEPIGVFGCAPNKTH-PQNNQFFTLTYYRDIRAASVEKCLDA----SSDNAEVILFNCH 548
Query: 593 SGFNRQRWTYDKE 605
Q W YD+E
Sbjct: 549 ESQGNQLWRYDQE 561
Score = 139 bits (349), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 70/142 (49%), Positives = 105/142 (73%), Gaps = 2/142 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+LY+ N ++ +VSD I +NR++PD+R P+C+ K + +E LP S++I+F+NE WSALLRT
Sbjct: 119 KLYKENGYSAVVSDLIALNRSIPDIRHPQCRKKRYLQE-LPTVSVIIIFYNEHWSALLRT 177
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARL 168
V+SV++RSP +LKEI+LV+D ST+ FL L E+V ++LS ++I P R GLI ARL
Sbjct: 178 VYSVLNRSPPHLLKEIILVNDHSTKPFLWKPLQEFVESELSPKVKLIHLPERSGLIIARL 237
Query: 169 LGARQAEGEILVFLDAHCECTL 190
GA+ A G++L+ LD+H E +
Sbjct: 238 AGAKAASGDVLIVLDSHTEVNV 259
>gi|270006170|gb|EFA02618.1| hypothetical protein TcasGA2_TC008338 [Tribolium castaneum]
Length = 613
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 170/416 (40%), Positives = 238/416 (57%), Gaps = 18/416 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S+V+ FHNE W+ LLRT SV++RSP +LKE++LVDD ST+EF K LD+Y+A
Sbjct: 153 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 212
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R I P R GLI+ARL GAR A ++L+FLD+H E + WL L+ +A+D V
Sbjct: 213 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 272
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP ID+I TF Y E GAF+WE ++ D + TEPFK+P MAGG
Sbjct: 273 CPFIDVIQYETFEYRAQDEGARGAFDWEFFYKRLPLLPEDL----EHPTEPFKSPVMAGG 328
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI R +F+ +G YDE + +WGGE E+SF++WQCGG + APCS V H++RK +P+
Sbjct: 329 LFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHIYRKYAPFPN 388
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PG + + N RVA VWMDE+AE+ +K P + D + + LR++L C FKW
Sbjct: 389 PGK-GDFVGRNYRRVAEVWMDEYAEYLYKRRPHY-RDIDPGDLTKQKALREKLHCKPFKW 446
Query: 497 YLTHV---WPHHFLPMDDKFF--GRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCTHL 550
++ V P + P++ F G IR++ + CV+ +Q G A + T+
Sbjct: 447 FMEKVAFDLPLKYPPIEPGDFGVGEIRNLAAPELCVDS--GHKDRDQVIGLAECVKGTN- 503
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
Q F + DL +++CLDV + ND + V + C G Q W YD E
Sbjct: 504 KNGEQNFALTWHKDLRVKGKTLCLDVSD-PNDKADIV-LYPCHGSQGNQYWRYDVE 557
Score = 148 bits (374), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 70/142 (49%), Positives = 101/142 (71%), Gaps = 1/142 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++LY++N FN +SD+I ++R +PD+R P CK+K + ++ LP S+V+ FHNE W+ LLR
Sbjct: 114 EKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD-LPTVSVVVPFHNEHWTTLLR 172
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T SV++RSP +LKE++LVDD ST+EF K LD+Y+A R I P R GLI+ARL
Sbjct: 173 TAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAANLTKVRAIHLPERSGLIRARL 232
Query: 169 LGARQAEGEILVFLDAHCECTL 190
GAR A ++L+FLD+H E +
Sbjct: 233 AGARVATADVLIFLDSHTEANV 254
>gi|268370155|ref|NP_001161257.1| polypeptide GalNAc transferase 6-like [Tribolium castaneum]
Length = 591
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 170/416 (40%), Positives = 238/416 (57%), Gaps = 18/416 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP S+V+ FHNE W+ LLRT SV++RSP +LKE++LVDD ST+EF K LD+Y+A
Sbjct: 131 LPTVSVVVPFHNEHWTTLLRTAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAAN 190
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R I P R GLI+ARL GAR A ++L+FLD+H E + WL L+ +A+D V
Sbjct: 191 LTKVRAIHLPERSGLIRARLAGARVATADVLIFLDSHTEANVNWLPPLLEPIAQDYKTCV 250
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CP ID+I TF Y E GAF+WE ++ D + TEPFK+P MAGG
Sbjct: 251 CPFIDVIQYETFEYRAQDEGARGAFDWEFFYKRLPLLPEDL----EHPTEPFKSPVMAGG 306
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI R +F+ +G YDE + +WGGE E+SF++WQCGG + APCS V H++RK +P+
Sbjct: 307 LFAISRKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGLMVDAPCSRVGHIYRKYAPFPN 366
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PG + + N RVA VWMDE+AE+ +K P + D + + LR++L C FKW
Sbjct: 367 PGK-GDFVGRNYRRVAEVWMDEYAEYLYKRRPHY-RDIDPGDLTKQKALREKLHCKPFKW 424
Query: 497 YLTHV---WPHHFLPMDDKFF--GRIRHVQTHK-CVEKPLAKGSMNQASGPASLLPCTHL 550
++ V P + P++ F G IR++ + CV+ +Q G A + T+
Sbjct: 425 FMEKVAFDLPLKYPPIEPGDFGVGEIRNLAAPELCVDS--GHKDRDQVIGLAECVKGTN- 481
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-NRQRWTYDKE 605
Q F + DL +++CLDV + ND + V + C G Q W YD E
Sbjct: 482 KNGEQNFALTWHKDLRVKGKTLCLDVSD-PNDKADIV-LYPCHGSQGNQYWRYDVE 535
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 70/142 (49%), Positives = 101/142 (71%), Gaps = 1/142 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
++LY++N FN +SD+I ++R +PD+R P CK+K + ++ LP S+V+ FHNE W+ LLR
Sbjct: 92 EKLYKVNGFNAALSDQIAIDRAVPDIRHPGCKSKKYLKD-LPTVSVVVPFHNEHWTTLLR 150
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
T SV++RSP +LKE++LVDD ST+EF K LD+Y+A R I P R GLI+ARL
Sbjct: 151 TAASVVNRSPPHLLKEVILVDDCSTKEFSKKPLDDYLAANLTKVRAIHLPERSGLIRARL 210
Query: 169 LGARQAEGEILVFLDAHCECTL 190
GAR A ++L+FLD+H E +
Sbjct: 211 AGARVATADVLIFLDSHTEANV 232
>gi|170056949|ref|XP_001864263.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
gi|167876550|gb|EDS39933.1| N-acetyl galactosaminyl transferase 6 [Culex quinquefasciatus]
Length = 608
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 170/432 (39%), Positives = 245/432 (56%), Gaps = 26/432 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C + LP S++I+F+NE WSALLRTV+SV++RSP +LKEI+LV+D ST+ F
Sbjct: 134 HPQCRKKRYLQELPTVSVIIIFYNEHWSALLRTVYSVLNRSPSHLLKEIILVNDHSTKPF 193
Query: 245 LKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L L E+V ++LS ++I P R GLI ARL GA+ A G++L+ LD+H E + WL
Sbjct: 194 LWKPLQEFVESELSPKVKLIHLPERSGLIIARLAGAKAASGDVLIVLDSHTEVNVNWLPP 253
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ +A+D VCP+ID+I TF Y E GAF+W+ +++ D D
Sbjct: 254 LLEPIAQDYRTCVCPLIDVIVHDTFEYRSQDEGKRGAFDWKFYYKRLPLRPGDL----DD 309
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
TEPF++P MAGGLFAI +F+ +G YDE + +WGGE E+SF++WQCGG + APCS
Sbjct: 310 PTEPFESPIMAGGLFAISSKFFWELGGYDEGLDIWGGEQYELSFKIWQCGGRMVDAPCSR 369
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H++R SP+ P GV+ V N RVA VWMDE+ +F ++ NP+ +K + +
Sbjct: 370 VGHVYRGYSPFPNPRGVNFVTR-NFKRVAEVWMDEYKQFLYERNPQFDKTNPGDLTKQK- 427
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFL--PMDDKF---FGRIRHVQTHKCVEKPLAKGSMN-Q 537
LR++LKC FKW+L V P + P+ + GR++ V K L ++N +
Sbjct: 428 ALRERLKCKPFKWFLEEVAPDLLVRYPLREPLPFASGRVQSVANPK-----LCLDTLNHK 482
Query: 538 ASGPASLLPC----THLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACS 593
A P + C TH P Q F + D+ A CLD +D + +
Sbjct: 483 AKEPIGVFGCAPNKTH-PQNNQFFTLTYYRDIRAASVEKCLDA---SSDDAEVILFNCHE 538
Query: 594 GFNRQRWTYDKE 605
Q W YD+E
Sbjct: 539 SQGNQLWRYDQE 550
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/142 (49%), Positives = 105/142 (73%), Gaps = 2/142 (1%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+LY+ N ++ +VSD I +NR++PD+R P+C+ K + +E LP S++I+F+NE WSALLRT
Sbjct: 108 KLYKENGYSAVVSDLIALNRSIPDIRHPQCRKKRYLQE-LPTVSVIIIFYNEHWSALLRT 166
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVIRSPGRVGLIKARL 168
V+SV++RSP +LKEI+LV+D ST+ FL L E+V ++LS ++I P R GLI ARL
Sbjct: 167 VYSVLNRSPSHLLKEIILVNDHSTKPFLWKPLQEFVESELSPKVKLIHLPERSGLIIARL 226
Query: 169 LGARQAEGEILVFLDAHCECTL 190
GA+ A G++L+ LD+H E +
Sbjct: 227 AGAKAASGDVLIVLDSHTEVNV 248
>gi|198434303|ref|XP_002132126.1| PREDICTED: similar to polypeptide N-acetylgalactosaminyltransferase
17 [Ciona intestinalis]
Length = 870
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 162/418 (38%), Positives = 243/418 (58%), Gaps = 28/418 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +SI+I FHNE + LLRT+HS+I+R+P+ +L+EI+LVDD ST + LKSSLD+ ++K
Sbjct: 411 LPDASIIIPFHNEGRTTLLRTIHSIINRTPKILLREIILVDDCSTVDHLKSSLDQELSKY 470
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++R R GLI+ARL G QA+G +V LD+H E T WL L+ +A DR +
Sbjct: 471 RQ-VKLVRLAKREGLIRARLAGVHQAKGNTIVILDSHVEVTNNWLPPLLEPIALDRKVIT 529
Query: 317 CPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
CP+IDII+ F Y+ + + GAF+WEL+++ + KD ++PF+ P MAG
Sbjct: 530 CPMIDIINKDDFHYLTQPGDAMRGAFDWELYYKRIPIPPEKQL---KDPSDPFEDPVMAG 586
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR YF IG YD+ +++WGGE E+SF+ W CGG I APCS V H++R+ PYS
Sbjct: 587 GLFAIDRLYFKEIGEYDDGLEIWGGEQYELSFKAWMCGGKILDAPCSRVGHIYREFMPYS 646
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + + N RVA VWMDE+AE+++K P + + LR+ L+C SF
Sbjct: 647 LPPGTN--INKNFKRVAEVWMDEYAEYFYKKRPHVRGIHPGDLSKQK-ALRELLECRSFD 703
Query: 496 WYLTHVWP---HHFLPM--DDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
W++ V P H+ P+ + +G + + + +C++ G + P SL+PC
Sbjct: 704 WFMKEVAPDIIKHYPPVMPEPAAWGMLSNEGSKRCLD-----GLYKKEGAPLSLMPCREE 758
Query: 551 PVLTQMFVMKLPTDL---IATDES--VCLDVPEYENDISPRVRILACSG-FNRQRWTY 602
Q F++ D+ + D++ CLD ++ V + C G + Q W Y
Sbjct: 759 GTADQSFILTWKEDIRPGTSMDKARKFCLD----GQGLNSPVVLWQCHGQYGNQLWKY 812
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 68/142 (47%), Positives = 101/142 (71%), Gaps = 2/142 (1%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
++ Y N FN+LVS+RI +NR+LPD+R C ++ + + LP +SI+I FHNE + LL
Sbjct: 371 SRSAYSENGFNILVSNRISLNRSLPDIRHKNCASRKYLAQ-LPDASIIIPFHNEGRTTLL 429
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RT+HS+I+R+P+ +L+EI+LVDD ST + LKSSLD+ ++K +++R R GLI+AR
Sbjct: 430 RTIHSIINRTPKILLREIILVDDCSTVDHLKSSLDQELSKYRQ-VKLVRLAKREGLIRAR 488
Query: 168 LLGARQAEGEILVFLDAHCECT 189
L G QA+G +V LD+H E T
Sbjct: 489 LAGVHQAKGNTIVILDSHVEVT 510
>gi|390332219|ref|XP_781199.3| PREDICTED: N-acetylgalactosaminyltransferase 7-like
[Strongylocentrotus purpuratus]
Length = 606
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 161/429 (37%), Positives = 249/429 (58%), Gaps = 28/429 (6%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
+C E LP +S++IVFHNE WS LLRTVHSV +RSP +L EI+LVDD ST+E LK
Sbjct: 148 QCKHWHYPETLPTTSVIIVFHNEGWSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLK 207
Query: 247 SSLDEYV--AKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L++YV A+ + +++R+ R GLI+ R++GAR + G++L++LDAHCE + WL L
Sbjct: 208 ERLEDYVQEARFNGKLKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLPPL 267
Query: 305 VARVAEDRTRVVCPVIDIISDVTF-AYVR-SFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
+ +A +RT VCP+ID+I ++ + Y + + + G F+W L+++ + +R+
Sbjct: 268 LTPIAVNRTTAVCPIIDVIDNMDYRVYPQGTGDQDRGGFDWSLYWKHLPVPQFEKS-RRQ 326
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
+EP+++PAMAGGLFA+DR YFF +GAYDE +++WGGEN E+SF++W CGGS+ PCS
Sbjct: 327 HASEPYRSPAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVPCS 386
Query: 423 HVAHLFR--KASPYSFPGGVSEVLYG-NLARVALVWMDEWAEFYFKFNPEAEKQRDKQAV 479
V H++R PYS P G +L NL RV VW D++ E++++ PE+ +
Sbjct: 387 RVGHVYRILGKVPYSAPNGSMLILSERNLRRVVEVWFDDYKEYFYRSKPESLLVSTGN-I 445
Query: 480 RSRLELRKQLKCHSFKWYLTHVWP----HHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSM 535
+L R++ C SF W++ + P + LP +K++G IR + CV+ SM
Sbjct: 446 EKQLAFREKFHCKSFGWFMKEIAPDIIEKYPLPHANKYWGEIRTKKGSLCVD------SM 499
Query: 536 NQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF 595
G + H Q+F + L D+ D VR+ C G
Sbjct: 500 GSKDGGRVGMSYCHGAGGNQLFRVTENGQLRIHDQCA--------YDHYKEVRLRRCGG- 550
Query: 596 NRQRWTYDK 604
+ W++D+
Sbjct: 551 SGGGWSFDE 559
Score = 138 bits (348), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 102/147 (69%), Gaps = 3/147 (2%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
S + +L Q FN VSD+I ++R + D+R +CK + E LP +S++IVFHNE
Sbjct: 113 SEKAKVDRLIQEYGFNQYVSDQISLDRNIADLRSQQCKHWHYPET-LPTTSVIIVFHNEG 171
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV--AKLSVPTRVIRSPGR 160
WS LLRTVHSV +RSP +L EI+LVDD ST+E LK L++YV A+ + +++R+ R
Sbjct: 172 WSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLKERLEDYVQEARFNGKLKLVRNSRR 231
Query: 161 VGLIKARLLGARQAEGEILVFLDAHCE 187
GLI+ R++GAR + G++L++LDAHCE
Sbjct: 232 EGLIRTRIIGARHSTGDVLLWLDAHCE 258
>gi|126303658|ref|XP_001380711.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 14
[Monodelphis domestica]
Length = 552
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 158/347 (45%), Positives = 217/347 (62%), Gaps = 10/347 (2%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +SI+I FHNEA S LLRT+ SV +R+P ++ EI+LVDD S
Sbjct: 101 CATLHYGPDLPPTSIIITFHNEARSTLLRTIRSVSNRTPVHLVHEIILVDDFSD----DP 156
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+ ++KL + +R+ R GLI++R+ GA A+ IL FLD+HCE WL L+ R
Sbjct: 157 DDCQLLSKLP-KVKCLRNEQREGLIRSRIRGADLAQASILTFLDSHCEVNKDWLLPLLHR 215
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+ ED TRVVCPVIDII+ TFAYV S G F+W LHF+W + + R D +P
Sbjct: 216 IKEDPTRVVCPVIDIINRDTFAYVSSSPDMRGGFDWTLHFKWEELTLREKAL-RVDPIQP 274
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+TP ++GGLF +++++F H+G YD M +WGGEN E+SFRVW CGGS+EI PCS V H+
Sbjct: 275 IETPIISGGLFVMNKSWFNHLGKYDAAMDIWGGENFEISFRVWMCGGSLEILPCSRVGHV 334
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRK 487
FRK PY+FP G N R A VWMDE+ +++ P A+ R ++SR+ELRK
Sbjct: 335 FRKKHPYTFPEGNLNTYIKNTKRTAEVWMDEFKHYFYAARPVAQG-RPFGNIQSRVELRK 393
Query: 488 QLKCHSFKWYLTHVWPHHFLPMDDKF-FGRIRHVQTHKCVEKPLAKG 533
+LKCH+FKWYL +V+P +P + F G IR Q KC++ +G
Sbjct: 394 RLKCHTFKWYLENVYPELRIPEESLFQKGIIR--QKQKCLQSRKQEG 438
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/140 (42%), Positives = 85/140 (60%), Gaps = 6/140 (4%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y++ FN SDRIP NR + D R +C T + + LP +SI+I FHNEA S LL
Sbjct: 70 GQDPYKLYSFNQRESDRIPSNRAIWDTRPDRCATLHYGPD-LPPTSIIITFHNEARSTLL 128
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RT+ SV +R+P ++ EI+LVDD S + ++KL + +R+ R GLI++R
Sbjct: 129 RTIRSVSNRTPVHLVHEIILVDDFSD----DPDDCQLLSKLP-KVKCLRNEQREGLIRSR 183
Query: 168 LLGARQAEGEILVFLDAHCE 187
+ GA A+ IL FLD+HCE
Sbjct: 184 IRGADLAQASILTFLDSHCE 203
>gi|432882483|ref|XP_004074053.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Oryzias latipes]
Length = 593
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 163/434 (37%), Positives = 240/434 (55%), Gaps = 28/434 (6%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H EC E LP +S+VI FH E WS L+RTVHSV+ SPR L+E+LLVDD S
Sbjct: 167 HPECFGEKYSEILPSASVVICFHEEPWSTLMRTVHSVLDTSPRQYLQEVLLVDDLSQNRH 226
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK+ L YV+ L R+I S R+G+ R + A +A+G++LVF+D+HCEC GWLE L
Sbjct: 227 LKARLGSYVSGLDG-VRLIHSTRRLGVGGCRTMAAAKAKGQVLVFMDSHCECHEGWLEPL 285
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ RV++DR+RVV PV+D+I TF Y + G F+W L F W I ++
Sbjct: 286 LERVSQDRSRVVSPVMDVIDWKTFQYNATQWPVRGVFDWRLDFYW-ELNPPLQEIPPEEA 344
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
P ++PA+ GG+ AIDR +F +GAYD M +WG E +E+S RVW CGGS+E+ PCS V
Sbjct: 345 VLPLQSPALGGGVLAIDRNFFQSVGAYDPGMLLWGAEQIELSIRVWSCGGSMEVVPCSRV 404
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAE--KQRDKQAVRSR 482
AHL R PY FP E+L N RVA +WMD + + Y++ + A KQ + + R
Sbjct: 405 AHLSRHHLPYIFPD--QEMLRRNKIRVADIWMDAYKKIYYRRDTLAHLIKQSESPNIAER 462
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDD-KFFGRIRHVQTHKCVEKPLAKGSMNQASGP 541
L L++ L C +F W+L+ V+P ++P D G + +V T C + A+ Q +G
Sbjct: 463 LRLKRSLGCRNFHWFLSTVYPQLYVPQDKLALSGELYNVGTGNCAD--YAREEQQQGAG- 519
Query: 542 ASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLD-----------VPEYENDISPRVRIL 590
++ PC+ +L L+ +C++ E + + +
Sbjct: 520 MNIAPCSGT-------GSQLSGQLVHQQSQMCVEAVQQAGLLQSKTGEVHKPTAGGLYLK 572
Query: 591 ACSGFNRQRWTYDK 604
C+ RQ+W +++
Sbjct: 573 PCTRHPRQQWHFEQ 586
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/153 (43%), Positives = 96/153 (62%), Gaps = 2/153 (1%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q Q + FN +S+ I ++R P+ R P+C + ++E LP +S+VI FH E WS L+R
Sbjct: 140 QASVQRHGFNEALSETISLHRRPPEARHPECFGEKYSE-ILPSASVVICFHEEPWSTLMR 198
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TVHSV+ SPR L+E+LLVDD S LK+ L YV+ L R+I S R+G+ R
Sbjct: 199 TVHSVLDTSPRQYLQEVLLVDDLSQNRHLKARLGSYVSGLDG-VRLIHSTRRLGVGGCRT 257
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPKSS 201
+ A +A+G++LVF+D+HCEC + E L + S
Sbjct: 258 MAAAKAKGQVLVFMDSHCECHEGWLEPLLERVS 290
>gi|350593501|ref|XP_003359567.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5 [Sus
scrofa]
Length = 927
Score = 293 bits (749), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 164/400 (41%), Positives = 234/400 (58%), Gaps = 51/400 (12%)
Query: 148 LSVPTRVIRSPGRVGLI------KARLLGARQAEGEILVFLD--------------AHCE 187
L++ R ++PG+ G K + + R EG V+L A C
Sbjct: 441 LTLSPRDPKAPGQFGRPVVVPQGKEKEVEKRWKEGNFNVYLSDLIPVDRAIEDTRPAGCA 500
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
LV N+ LP +S+++ F +E WS LLR+VHSV++RSP ++KEILLVDD ST+++LK
Sbjct: 501 DQLVHNK--LPTTSVIMCFVDEVWSTLLRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKD 558
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
+LD+Y+++ R++R R GLI+ARL GA+ A G++L FLD+H EC +GWLE L+ R
Sbjct: 559 NLDKYMSQFPK-VRILRLKERHGLIRARLAGAQIATGDVLTFLDSHVECNVGWLEPLLER 617
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
V R +V CPVI++I+D +R E K+ + P
Sbjct: 618 VYLSRKKVACPVIEVINDKD---MRKKER-----------------------KKGNVDTP 651
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHL 427
+ P MAGGLF+ID+ YFF +G YD + VWGGEN+E+SF+VW CGG IEI PCS V H+
Sbjct: 652 IRCPVMAGGLFSIDKTYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHI 711
Query: 428 FRKASPYSFPGGVSEVLYGNLARVALVWMDEWAE-FYFKFNPEAEKQRDKQAVRSRLELR 486
FR +PYSFP + + NL RVA VW+DE+ E FY + ++ D + + ELR
Sbjct: 712 FRNDNPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQRELR 771
Query: 487 KQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCV 526
K+LKC SFKWYL +V+P P+ + G + +V KC+
Sbjct: 772 KKLKCKSFKWYLENVFPDLKAPI-VRASGVLINVALGKCI 810
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 65/151 (43%), Positives = 105/151 (69%), Gaps = 2/151 (1%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSAL 106
+ ++ ++ FN+ +SD IPV+R + D R C ++ + + LP +S+++ F +E WS L
Sbjct: 467 EVEKRWKEGNFNVYLSDLIPVDRAIEDTRPAGCADQLVHNK-LPTTSVIMCFVDEVWSTL 525
Query: 107 LRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKA 166
LR+VHSV++RSP ++KEILLVDD ST+++LK +LD+Y+++ R++R R GLI+A
Sbjct: 526 LRSVHSVLNRSPPHLIKEILLVDDFSTKDYLKDNLDKYMSQFPK-VRILRLKERHGLIRA 584
Query: 167 RLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
RL GA+ A G++L FLD+H EC + + E L
Sbjct: 585 RLAGAQIATGDVLTFLDSHVECNVGWLEPLL 615
>gi|312082212|ref|XP_003143351.1| glycosyl transferase [Loa loa]
Length = 580
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 210/339 (61%), Gaps = 26/339 (7%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 242
+A +C LP +S++I FHNEAWS LLRTVHSV+ R+P ++L EI+LVDD S
Sbjct: 142 NAFNQCKTEKYANDLPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFSDM 201
Query: 243 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
LK+SL+ Y+ + R++R R GLI+AR+ GA ++G ++ +LD+HCEC GW+E
Sbjct: 202 AHLKASLEIYMRQFP-KVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWME 260
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAY--VRSFELHWGAFNWELHFRWYTYGSSDAIIK 360
L+ R+ ++ VVCPVID+I D TF Y +++ + G F+W L F W+ D +
Sbjct: 261 PLLDRIKKNPKTVVCPVIDVIDDNTFEYHYSKAYFTNVGGFDWSLQFNWHAIPEKDRKGR 320
Query: 361 RKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAP 420
R +P K+P MAGGLF+IDR +F +G+YD + +WGGENLE+SF+ W CGG +EI P
Sbjct: 321 RD--IDPVKSPTMAGGLFSIDRTFFEKLGSYDPGLDIWGGENLELSFKTWMCGGILEIVP 378
Query: 421 CSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
CSHV H+FRK SPY + GV+ VL N R+A + D V
Sbjct: 379 CSHVGHIFRKRSPYKWLSGVN-VLKRNSVRLA--------------------EGDFGDVS 417
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRH 519
SR LR++L+C SFKWYL +V+P F+P D G IR+
Sbjct: 418 SRKALREKLQCKSFKWYLDNVYPELFVPGDAIGKGEIRN 456
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 62/167 (37%), Positives = 100/167 (59%), Gaps = 24/167 (14%)
Query: 78 KCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 137
+CKT+ + + LP +S++I FHNEAWS LLRTVHSV+ R+P ++L EI+LVDD S L
Sbjct: 146 QCKTEKYAND-LPNTSVIICFHNEAWSVLLRTVHSVLERTPENLLAEIILVDDFSDMAHL 204
Query: 138 KSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFL 197
K+SL+ Y+ + R++R R GLI+AR+ GA ++G ++ +LD+HCEC + E L
Sbjct: 205 KASLEIYMRQFP-KVRILRLEKREGLIRARIKGAAISKGSVITYLDSHCECLEGWMEPLL 263
Query: 198 PKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
+ I ++P++++ ++ V D +T E+
Sbjct: 264 DR----------------------IKKNPKTVVCPVIDVIDDNTFEY 288
>gi|347971870|ref|XP_313714.5| AGAP004429-PA [Anopheles gambiae str. PEST]
gi|333469065|gb|EAA09257.5| AGAP004429-PA [Anopheles gambiae str. PEST]
Length = 663
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 146/327 (44%), Positives = 209/327 (63%), Gaps = 7/327 (2%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C ++ LP +S+V+ F+NE L+R++H+V+ R+P +LKE++LVDD S E
Sbjct: 181 HKLCQAQVYDKVLPVASVVMCFYNEHLETLVRSIHTVLKRTPAYLLKELILVDDCSDFED 240
Query: 245 LK--SSLDEYVAKLSV-PTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
L L++ +A+L R++R+ R GLI++R+ GAR A G++L+FLD+H E + W+
Sbjct: 241 LTVGGQLEKELAQLGTNKVRLLRNTDREGLIRSRVYGARNATGQVLIFLDSHIEVNVDWI 300
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
E L+AR+ DRT + PVIDII+ TF Y S L G FNW LHF+W ++ +
Sbjct: 301 EPLLARIKHDRTILAMPVIDIINSDTFVYTAS-PLVRGGFNWGLHFKWDNL-PKGSLERD 358
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
DF PF +P MAGGLFAIDRAYF +G YD M VWGGENLE+SFR WQCGGSIE+ PC
Sbjct: 359 TDFVGPFNSPTMAGGLFAIDRAYFKELGEYDMGMDVWGGENLEISFRAWQCGGSIELLPC 418
Query: 422 SHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
S + H+FRK PY P G ++ +L R+A VWMD++ ++++ P+A V
Sbjct: 419 SRIGHVFRKRRPYGSPDGQDTMIRNSL-RLAHVWMDDYIRYFYEQQPQAH-HVPYGNVSE 476
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLP 508
R LR++L C F+WYL +++P +P
Sbjct: 477 RQRLRERLGCKPFRWYLDNIYPQLRVP 503
Score = 110 bits (276), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 58/151 (38%), Positives = 100/151 (66%), Gaps = 4/151 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y+ + FN+LVS+++ R +PD R C+ +V+ ++ LP +S+V+ F+NE L+R++H
Sbjct: 157 YRKHAFNVLVSNKLGPFRPIPDTRHKLCQAQVY-DKVLPVASVVMCFYNEHLETLVRSIH 215
Query: 112 SVISRSPRSMLKEILLVDDASTREFLK--SSLDEYVAKLSV-PTRVIRSPGRVGLIKARL 168
+V+ R+P +LKE++LVDD S E L L++ +A+L R++R+ R GLI++R+
Sbjct: 216 TVLKRTPAYLLKELILVDDCSDFEDLTVGGQLEKELAQLGTNKVRLLRNTDREGLIRSRV 275
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
GAR A G++L+FLD+H E + + E L +
Sbjct: 276 YGARNATGQVLIFLDSHIEVNVDWIEPLLAR 306
>gi|354465763|ref|XP_003495346.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2 [Cricetulus griseus]
Length = 641
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 155/396 (39%), Positives = 235/396 (59%), Gaps = 12/396 (3%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C LP +S+++ FH+EAW LLRTVHS+++ +PR++L+EI+LVDD S +E
Sbjct: 180 HPMCLQQHPASGLPTASVILCFHDEAWPMLLRTVHSILNTAPRALLQEIILVDDLSQQEQ 239
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LKS+L +YVA+L +++RS R+G I+AR+LGA +A G++LVF+DAHCEC GWLE L
Sbjct: 240 LKSALSDYVARLEA-VKLLRSNKRLGTIRARMLGATRATGDVLVFMDAHCECHPGWLEPL 298
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
++R+A+DR+RVV PVID+I T Y S EL G +W+L FRW G D
Sbjct: 299 LSRIADDRSRVVSPVIDVIDWKTLQYSASKELQRGVLDWKLDFRWEPLGEQDQKALPSPI 358
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+ P ++P + G + A+DR YF + GAYD M + G ENLEMSF+ W CGGS+EI PCS V
Sbjct: 359 S-PIRSPVVPGEVVAVDRHYFQNTGAYDPLMSLQGSENLEMSFKAWLCGGSVEILPCSRV 417
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPE--AEKQRDKQAVRSR 482
H+++ S S EV+ N R+A W+ + E +++ PE A + K R
Sbjct: 418 GHIYQ--SQDSISRTDPEVILRNKVRIAETWLGSFKETFYRHIPEAFALSKVAKPDCMER 475
Query: 483 LELRKQLKCHSFKWYLTHVWPHHFLPMDD--KFFGRIRHVQTHKCVEKPLAKGSMNQASG 540
L+L+++L C +F W+L +V+P + P + +F G++ + C + + +
Sbjct: 476 LKLQRRLGCRTFHWFLANVYPELY-PFEHRPRFSGKLHNTGLGFCAD---CQAQEDILGC 531
Query: 541 PASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDV 576
P +L PC++ + L + + +C DV
Sbjct: 532 PMTLAPCSNNRQQQNLKHTSRKEILFGSQQQLCFDV 567
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/143 (46%), Positives = 101/143 (70%), Gaps = 2/143 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
N ++S P+ R LP+VR P C + LP +S+++ FH+EAW LLRTVHS+++
Sbjct: 161 LNKVLSAGWPLRRVLPEVRHPMC-LQQHPASGLPTASVILCFHDEAWPMLLRTVHSILNT 219
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+PR++L+EI+LVDD S +E LKS+L +YVA+L +++RS R+G I+AR+LGA +A G
Sbjct: 220 APRALLQEIILVDDLSQQEQLKSALSDYVARLEA-VKLLRSNKRLGTIRARMLGATRATG 278
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
++LVF+DAHCEC + E L +
Sbjct: 279 DVLVFMDAHCECHPGWLEPLLSR 301
>gi|167523942|ref|XP_001746307.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775069|gb|EDQ88694.1| predicted protein [Monosiga brevicollis MX1]
Length = 2376
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 153/328 (46%), Positives = 198/328 (60%), Gaps = 16/328 (4%)
Query: 185 HCEC-TLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
H +C L ++ LP S++ VF+NEA S LLR++ SVI R+P S+L EI+LVDDAS
Sbjct: 1019 HKQCRALTYDLATLPDMSVIFVFYNEARSTLLRSIRSVIIRTPPSLLHEIILVDDASD-- 1076
Query: 244 FLKSSLDEYVAKLSVPTRV--IRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWL 301
DE A + ++ IR P R GLI+AR GA A GE+L FLD+H E W
Sbjct: 1077 ------DELPADIKAMDKIKYIRLPSRQGLIRARTAGADAATGEVLCFLDSHIEVNRDWA 1130
Query: 302 ENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKR 361
E L+ R+ ED VV P+ID+ISD F Y S + G F+W L F+W + S +
Sbjct: 1131 EPLLQRINEDPLHVVTPIIDVISDSNFRYSASPVVR-GGFDWGLTFKWKSVPRSQ---QS 1186
Query: 362 KDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPC 421
D T P +P MAGGLFA+ R F+ +G YD M +WG ENLEMSFR+WQCG +EI PC
Sbjct: 1187 SDPTAPIASPTMAGGLFAMKRTTFYELGTYDLGMDIWGAENLEMSFRIWQCGARLEIMPC 1246
Query: 422 SHVAHLFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
S V H+FRK PYSFPGG S V N R+A VWMDE+AEF+ A ++ D +
Sbjct: 1247 SRVGHVFRKHHPYSFPGGGSGHVFLRNSLRLAEVWMDEYAEFFKSRKGSAARKIDIGDIS 1306
Query: 481 SRLELRKQLKCHSFKWYLTHVWPHHFLP 508
R +LR+ L C FKWYL +V+P +P
Sbjct: 1307 ERQKLREDLHCKPFKWYLDNVYPELRVP 1334
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 72/196 (36%), Positives = 105/196 (53%), Gaps = 10/196 (5%)
Query: 4 ISVVQLLKLARNQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQLYQINRFNLLVSD 63
++ VQ L R Q +++ V ++ G D+ + + N F+ VS+
Sbjct: 947 VANVQDQILRRRAQRHVENARVPMLSGRERWDTNPPRDITTYDGPADAMKRNAFDEKVSN 1006
Query: 64 RIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLK 123
+P NR +PDVR +C+ ++ LP S++ VF+NEA S LLR++ SVI R+P S+L
Sbjct: 1007 SLPSNRDVPDVRHKQCRALTYDLATLPDMSVIFVFYNEARSTLLRSIRSVIIRTPPSLLH 1066
Query: 124 EILLVDDASTREFLKSSLDEYVAKLSVPTRV--IRSPGRVGLIKARLLGARQAEGEILVF 181
EI+LVDDAS DE A + ++ IR P R GLI+AR GA A GE+L F
Sbjct: 1067 EIILVDDASD--------DELPADIKAMDKIKYIRLPSRQGLIRARTAGADAATGEVLCF 1118
Query: 182 LDAHCECTLVFNEEFL 197
LD+H E + E L
Sbjct: 1119 LDSHIEVNRDWAEPLL 1134
>gi|195425502|ref|XP_002061040.1| GK10658 [Drosophila willistoni]
gi|194157125|gb|EDW72026.1| GK10658 [Drosophila willistoni]
Length = 489
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 199/313 (63%), Gaps = 5/313 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+ ++I FHNEAWS LLRTVHSV++RSP ++ +++LVDD S LK L EY + L
Sbjct: 172 LPKTDVIICFHNEAWSTLLRTVHSVLARSPEHLIGKVILVDDYSDMPHLKIQLKEYFS-L 230
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
+++R R GL++ARL G A+ ++ FLD+HCECT GWLE L+ R+A +R V
Sbjct: 231 YPKVQLVRVAKREGLVRARLFGMEYADSPVVTFLDSHCECTEGWLEPLLDRIARNRNTVA 290
Query: 317 CPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
P ID+I TF Y G F+W L F W + + +R F EP +TP +AG
Sbjct: 291 SPTIDMIDPKTFQYNYDGANDVLGVFDWNLEFYWIPIPLRE-LKRRNHFAEPIQTPTIAG 349
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAID +F +G YD +WGG+NLE+SF+ W CGG +EI PCSHV H+FR SPY
Sbjct: 350 GLFAIDLEFFRSVGTYDPGFNIWGGDNLELSFKTWMCGGILEIIPCSHVGHIFRDDSPYE 409
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
+P + ++ NLAR+A VW+D++A++Y++ + + V R +LR++L C SFK
Sbjct: 410 WPSSRAMMVESNLARLAEVWLDDYAKYYYERS--GGNKSLATDVSDRKKLREKLGCKSFK 467
Query: 496 WYLTHVWPHHFLP 508
WYL +V+P +P
Sbjct: 468 WYLDNVYPELLVP 480
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/164 (43%), Positives = 98/164 (59%), Gaps = 6/164 (3%)
Query: 41 LHSNLSDA-----QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIV 95
L +N+SDA + + N FN SD I VNR LPD R CK LPK+ ++
Sbjct: 119 LPTNMSDAMKKAVEDGWTKNAFNQYASDLISVNRKLPDPRSAWCKDTARYLTDLPKTDVI 178
Query: 96 IVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVI 155
I FHNEAWS LLRTVHSV++RSP ++ +++LVDD S LK L EY + L +++
Sbjct: 179 ICFHNEAWSTLLRTVHSVLARSPEHLIGKVILVDDYSDMPHLKIQLKEYFS-LYPKVQLV 237
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
R R GL++ARL G A+ ++ FLD+HCECT + E L +
Sbjct: 238 RVAKREGLVRARLFGMEYADSPVVTFLDSHCECTEGWLEPLLDR 281
>gi|328794283|ref|XP_001122865.2| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 5-like,
partial [Apis mellifera]
Length = 372
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 132/221 (59%), Positives = 169/221 (76%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C ++LP +SIVIVFHNEAWS LLRTV SVI+RSPR++LKEI+LVDD S ++ LK
Sbjct: 152 CKTKKYSKYLPDTSIVIVFHNEAWSTLLRTVWSVINRSPRTLLKEIILVDDKSEQDHLKQ 211
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
L++YV L VPT V R+ R GLI+ARLLGA+ +G+++ FLDAHCECT GWLE L++R
Sbjct: 212 DLEDYVKTLPVPTYVYRTEKRSGLIRARLLGAKHVKGQVITFLDAHCECTEGWLEPLLSR 271
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
+AEDRT VVCP+ID+ISD TF Y+ + ++ WG FNW+L+FRWY + + D T P
Sbjct: 272 IAEDRTTVVCPIIDVISDDTFEYIPASDMTWGGFNWKLNFRWYRVAQREMDRRLGDRTAP 331
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFR 408
+TP MAGGLF+ID+ YF+ +GAYDE M +WGGENLEMSFR
Sbjct: 332 LRTPTMAGGLFSIDKEYFYELGAYDEGMDIWGGENLEMSFR 372
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 86/151 (56%), Positives = 117/151 (77%), Gaps = 1/151 (0%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
Q+L+++N+FNL+ SD I +NR+L D+R CKTK +++ +LP +SIVIVFHNEAWS LLR
Sbjct: 122 QELFKLNQFNLMASDMISLNRSLKDIRLDGCKTKKYSK-YLPDTSIVIVFHNEAWSTLLR 180
Query: 109 TVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARL 168
TV SVI+RSPR++LKEI+LVDD S ++ LK L++YV L VPT V R+ R GLI+ARL
Sbjct: 181 TVWSVINRSPRTLLKEIILVDDKSEQDHLKQDLEDYVKTLPVPTYVYRTEKRSGLIRARL 240
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
LGA+ +G+++ FLDAHCECT + E L +
Sbjct: 241 LGAKHVKGQVITFLDAHCECTEGWLEPLLSR 271
>gi|358336356|dbj|GAA28182.2| polypeptide N-acetylgalactosaminyltransferase [Clonorchis sinensis]
Length = 592
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 170/463 (36%), Positives = 256/463 (55%), Gaps = 52/463 (11%)
Query: 181 FLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDAS 240
F + C+ T F+ + LPK++++I FHNEAWSALLR+VHSV+ SP+ +L+EI+LVDD S
Sbjct: 137 FRNGACK-TQSFSSD-LPKTAVIICFHNEAWSALLRSVHSVLDYSPKELLQEIILVDDFS 194
Query: 241 TREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGW 300
+R++LK L+ Y+ + V ++IR+ R GLI+AR++G + E+L +LD+H ECT GW
Sbjct: 195 SRDYLKEPLEIYMQQFPV-VKIIRTKRREGLIRARMVGTNVSTAEVLTYLDSHIECTPGW 253
Query: 301 LENLVARVAEDRTRVVCPVIDIISD--VTFAYVRSFELHWGAFNWELHFRWYTYGSSDAI 358
LE L+ R+ + VV PVI+II+D ++ + + G F+W L F W+ D I
Sbjct: 254 LEPLLERIKASTSNVVVPVIEIINDQDLSMKATQEASVQVGGFDWSLTFTWHLPPKRDQI 313
Query: 359 IKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEI 418
++ P ++P MAGGLFAI R +F ++G YDEEM+VWGGENLE+SF+ W CGG +E
Sbjct: 314 RLGAPYS-PIRSPTMAGGLFAIHRDFFAYLGYYDEEMEVWGGENLELSFKTWMCGGQLET 372
Query: 419 APCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYF-KFNPEAEKQRDKQ 477
CSHV H+FR SPYS+ + + NL R+A W+D++ Y+ N + D
Sbjct: 373 VVCSHVGHIFRSRSPYSWESKRTSPIKFNLVRLAETWLDDYKFLYYDSLNFDLGDYGD-- 430
Query: 478 AVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEK--------- 528
+ SR +R++ C SF+WYL ++P FLP G I ++ + C++
Sbjct: 431 -ISSRKAIRERNNCKSFQWYLDTIYPELFLPTRALASGDIENMVSPHCIDGVFNDQKTDN 489
Query: 529 -----PLAKGSMNQ---------------------ASGPASLLPCTHLPVLTQMFVMKLP 562
P + NQ G L C L TQ F +
Sbjct: 490 LVKLYPCHRQKGNQLWFYTNKNEIRRHDACFDGNVKPGHVGLFSCHGLGG-TQFF--EYT 546
Query: 563 TDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
+D + S CL E +D S V + +C+G RQ+W + ++
Sbjct: 547 SDNLIKHNSKCL---EPNSDFSDLV-LSSCNGSPRQQWKFSRK 585
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/146 (48%), Positives = 103/146 (70%), Gaps = 2/146 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
+Q N FN SDRI V R +PD R CKT+ F+ + LPK++++I FHNEAWSALLR+VH
Sbjct: 115 FQDNAFNQYASDRISVRRYIPDFRNGACKTQSFSSD-LPKTAVIICFHNEAWSALLRSVH 173
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV+ SP+ +L+EI+LVDD S+R++LK L+ Y+ + V ++IR+ R GLI+AR++G
Sbjct: 174 SVLDYSPKELLQEIILVDDFSSRDYLKEPLEIYMQQFPV-VKIIRTKRREGLIRARMVGT 232
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFL 197
+ E+L +LD+H ECT + E L
Sbjct: 233 NVSTAEVLTYLDSHIECTPGWLEPLL 258
>gi|350400046|ref|XP_003485719.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus impatiens]
Length = 643
Score = 291 bits (746), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 205/329 (62%), Gaps = 8/329 (2%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +SIVI F+NE + LLR++HS+I R+P S+L EI+LV+D S +
Sbjct: 161 HKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLHEIILVNDWSDSKA 220
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L ++ Y+A + + ++ R GLI+AR+ GAR+A GEIL+FLD+H E W+E
Sbjct: 221 LHEKIETYIANNFNGKVKFFKTEKREGLIRARMFGARKATGEILIFLDSHIEVNKRWIEP 280
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L++++A +T + PVIDII+ TF Y S L G FNW LHF+W +D
Sbjct: 281 LLSQIAHSKTIIAMPVIDIINPDTFQYTGS-PLVRGGFNWGLHFKWDNVPVG-TFAHDED 338
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F +P K+P MAGGLFA+DR YF +G YD M +WGGENLE+SFR+W CGGSIE+ PCS
Sbjct: 339 FIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPCSR 398
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FR+ PY + + N RVA VW+DE+ +++ K ++ D + RL
Sbjct: 399 VGHVFRRRRPYG-TFDQHDTMLKNSLRVAHVWLDEYKDYFLK----NVQKVDYGDISERL 453
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDK 512
LRK+LKC +F WYL V+P LP D+K
Sbjct: 454 NLRKRLKCKNFAWYLNVVYPELALPDDNK 482
Score = 124 bits (311), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/172 (40%), Positives = 107/172 (62%), Gaps = 6/172 (3%)
Query: 33 LGDSVDG-GLHSNLSDAQQL---YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEF 88
LG +D G+ N D ++ Y+ FN+LVSD I ++R LPD R C+ + ++ +
Sbjct: 114 LGQGLDELGMVKNFEDQRKRDEGYKNYSFNILVSDNIGLHRELPDTRHKLCEIQKYSSK- 172
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA-K 147
LP +SIVI F+NE + LLR++HS+I R+P S+L EI+LV+D S + L ++ Y+A
Sbjct: 173 LPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLHEIILVNDWSDSKALHEKIETYIANN 232
Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ + ++ R GLI+AR+ GAR+A GEIL+FLD+H E + E L +
Sbjct: 233 FNGKVKFFKTEKREGLIRARMFGARKATGEILIFLDSHIEVNKRWIEPLLSQ 284
>gi|47222223|emb|CAG11102.1| unnamed protein product [Tetraodon nigroviridis]
Length = 531
Score = 291 bits (746), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 158/363 (43%), Positives = 218/363 (60%), Gaps = 40/363 (11%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S+VI FHNEAWS LLRTVHSV+S +PR L+E+LLVDD S
Sbjct: 166 HPRCLQQQYSESLPSASVVICFHNEAWSTLLRTVHSVLSTAPRRHLRELLLVDDLSQHGH 225
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK L EY++ LS R++RS R+G+ R LGA +AEGE+LVF+D+HCEC GWLE L
Sbjct: 226 LKGVLSEYLSHLSR-VRLLRSARRLGVAGCRALGASKAEGELLVFMDSHCECQKGWLEPL 284
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTY--------GSSD 356
+ RVA+DRTRVV P+ID I TF Y + G FNW L FRW ++ GS+
Sbjct: 285 LERVAQDRTRVVSPIIDAIDWRTFRYNATQWPVRGVFNWRLDFRWESHTLLPDKDPGSAV 344
Query: 357 A---IIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCG 413
+ +R T F++P + G +FAIDR +F H+G +D M +WG E +E+S RVW CG
Sbjct: 345 RALRLCRRLTETARFRSPVLGGEVFAIDRHFFQHVGGFDPGMLLWGEEQIELSIRVWSCG 404
Query: 414 GSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEA--- 470
GS+E+APCS VAHL + PY+FP ++L N R+A +WM + + +++ + A
Sbjct: 405 GSMEVAPCSRVAHLDHHSLPYTFPD--QDLLENNKIRIAEIWMGAYRKIFYRRDTLAHFI 462
Query: 471 ----------------EKQRDKQAVRS-------RLELRKQLKCHSFKWYLTHVWPHHFL 507
++ R +V+S RL+L+K L C +F+WYLT V+P ++
Sbjct: 463 RQVGGRERLLFEHLTRKRNRGNVSVQSESPQITERLQLQKSLGCKNFQWYLTTVYPQLYI 522
Query: 508 PMD 510
P D
Sbjct: 523 PED 525
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/143 (51%), Positives = 99/143 (69%), Gaps = 2/143 (1%)
Query: 57 FNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISR 116
FN VS+ I V+R LP+ R P+C + ++E LP +S+VI FHNEAWS LLRTVHSV+S
Sbjct: 147 FNEAVSEGISVHRRLPEARHPRCLQQQYSES-LPSASVVICFHNEAWSTLLRTVHSVLST 205
Query: 117 SPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEG 176
+PR L+E+LLVDD S LK L EY++ LS R++RS R+G+ R LGA +AEG
Sbjct: 206 APRRHLRELLLVDDLSQHGHLKGVLSEYLSHLSR-VRLLRSARRLGVAGCRALGASKAEG 264
Query: 177 EILVFLDAHCECTLVFNEEFLPK 199
E+LVF+D+HCEC + E L +
Sbjct: 265 ELLVFMDSHCECQKGWLEPLLER 287
>gi|395728898|ref|XP_002809364.2| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 2 [Pongo abelii]
Length = 532
Score = 291 bits (746), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 209/332 (62%), Gaps = 10/332 (3%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S+VI FHNEA SALLRTV SV+ +SP ++KEI+LVDD S + +L + K+
Sbjct: 97 LPATSVVITFHNEARSALLRTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV 155
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RV+R+ R GL+++R+ GA A+ ++L FLD+HCEC WLE L+ RVAEDRTRVV
Sbjct: 156 ----RVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERVAEDRTRVV 211
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P+ID+I+ F YV + G F+W L F+W ++ + P KTP +AGG
Sbjct: 212 SPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSRQGNPVAPIKTPMIAGG 271
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LF +D+ YF G +E VWGGE E+SFRVWQCGGS+EI PCS V H+FRK PY+F
Sbjct: 272 LFVMDKFYFEDWGVRHDE-DVWGGETXEISFRVWQCGGSLEIIPCSRVGHVFRKQHPYTF 330
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
PGG V N R A VWMDE+ FY+ P A + ++SRLELRK+L C FKW
Sbjct: 331 PGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSA-RNVPYGNIQSRLELRKKLSCKPFKW 389
Query: 497 YLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
YL +V+P +P D FG ++ Q C++
Sbjct: 390 YLENVYPELRVPDHQDIAFGALQ--QGTNCLD 419
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 64/152 (42%), Positives = 98/152 (64%), Gaps = 6/152 (3%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
Q Y N+FN + SD++ ++R +PD R +C+ K + + LP +S+VI FHNEA SALL
Sbjct: 57 GQDPYARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVD-LPATSVVITFHNEARSALL 115
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV+ +SP ++KEI+LVDD S + +L + K+ RV+R+ R GL+++R
Sbjct: 116 RTVVSVLKKSPPHLIKEIILVDDYSN-DPEDGALLGKIEKV----RVLRNDRREGLMRSR 170
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ GA A+ ++L FLD+HCEC + E L +
Sbjct: 171 VRGADAAQAKVLTFLDSHCECNEHWLEPLLER 202
>gi|395823173|ref|XP_003804166.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 1 [Otolemur garnettii]
Length = 539
Score = 291 bits (746), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 167/452 (36%), Positives = 240/452 (53%), Gaps = 68/452 (15%)
Query: 188 CTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKS 247
C + LP +S+VIVFHNEAWS LLRTVHSV++RSPR M++EI+LVDDAS R
Sbjct: 106 CKTKVYPDNLPTTSVVIVFHNEAWSTLLRTVHSVVNRSPRHMIEEIVLVDDASER----- 160
Query: 248 SLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVAR 307
D++ + ++I S + A Q +IL + C +
Sbjct: 161 --DQHQRR-----QMIGSLSHADWDCSSQHVAHQGASDILKAFEI---CFYDF------- 203
Query: 308 VAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEP 367
R VVCP+ID+ISD TF Y+ ++ +G FNW+L+FRWY + ++ D T P
Sbjct: 204 ----RKTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRKGDRTLP 259
Query: 368 FKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS----- 422
+TP MAGGLF+IDR YF IG YD M +WGGENLE+SFR+WQCGG++EI CS
Sbjct: 260 VRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTCSXXXXX 319
Query: 423 -HVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS 481
V H+FRKA+PY+FPGG +++ N R+A VWMDE+ F++ +P K D + S
Sbjct: 320 XXVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKV-DYGDISS 378
Query: 482 RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGS------- 534
RL LR +L+C F WYL +++P +P G IR+V+T++C++ K +
Sbjct: 379 RLGLRHKLQCRPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFN 438
Query: 535 --------------------------MNQASGPASLLPCTHLPVLTQMFVMKLPTDLIAT 568
+++ +GP ++L C HL + L
Sbjct: 439 CHGMGGNQVFSYTANKEIRTDDLCLDVSKLNGPVTMLKCHHLKGNQLWEYDPVKLTLQHV 498
Query: 569 DESVCLDVPEYENDISPRVRILACSGFNRQRW 600
+ + CLD E+ P +R C+G Q+W
Sbjct: 499 NSNQCLDKATEEDSQVPSIR--DCNGSRSQQW 528
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 54/87 (62%), Positives = 74/87 (85%), Gaps = 1/87 (1%)
Query: 49 QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLR 108
+++++IN+FNL+ S+ I +NR+LPDVR CKTKV+ + LP +S+VIVFHNEAWS LLR
Sbjct: 76 KEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDN-LPTTSVVIVFHNEAWSTLLR 134
Query: 109 TVHSVISRSPRSMLKEILLVDDASTRE 135
TVHSV++RSPR M++EI+LVDDAS R+
Sbjct: 135 TVHSVVNRSPRHMIEEIVLVDDASERD 161
>gi|157107410|ref|XP_001649764.1| n-acetylgalactosaminyltransferase [Aedes aegypti]
gi|108884050|gb|EAT48275.1| AAEL000639-PA [Aedes aegypti]
Length = 613
Score = 291 bits (746), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 166/419 (39%), Positives = 237/419 (56%), Gaps = 24/419 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 255
LP S++++F+NE WS LLRTV+SV++RSP +LKEI+LV+D ST+EFL L ++V +
Sbjct: 151 LPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPLQDFVRTE 210
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L+ ++I P R GLI ARL GA+ A G++L+ LD+H E + WL L+ +AED
Sbjct: 211 LAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEDYRTC 270
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCP ID+I+ TF Y E GAF+W+ ++ + D + D TEPF++P MAG
Sbjct: 271 VCPFIDVIAHDTFQYRAQDEGKRGAFDWKFLYKRLPLRAQDMV----DPTEPFESPIMAG 326
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAI +F+ +G YDE + +WGGE E+SF+VWQCGG + APCS V H++R +P+
Sbjct: 327 GLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRGYAPFP 386
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G + V N RVA VWMDE+ +F ++ NP+ + Q D + + LR++L+C FK
Sbjct: 387 NPRGTNFVTR-NFKRVAEVWMDEYKQFLYERNPQFD-QTDAGDLTKQKALRERLQCKPFK 444
Query: 496 WYLTHVWPH---HFLPMDDKFF--GRIRHVQTHKCVEKPLAKGSMN-QASGPASLLPCT- 548
W+L V P + D K F GR++ K L SMN +A P + C
Sbjct: 445 WFLEEVAPDLVVRYPLRDPKPFASGRVQSAANPK-----LCLDSMNHKAKEPIGVFSCAA 499
Query: 549 --HLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYDKE 605
P Q F + D+ + CLD +D S + Q W YD E
Sbjct: 500 NRTYPQNNQFFTLTYYRDIRVSSVDKCLDA---SSDGSEVILFNCHESQGNQLWQYDTE 555
Score = 105 bits (261), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 75/103 (72%), Gaps = 1/103 (0%)
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AK 147
LP S++++F+NE WS LLRTV+SV++RSP +LKEI+LV+D ST+EFL L ++V +
Sbjct: 151 LPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPLQDFVRTE 210
Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTL 190
L+ ++I P R GLI ARL GA+ A G++L+ LD+H E +
Sbjct: 211 LAPKVKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNV 253
>gi|313230315|emb|CBY08019.1| unnamed protein product [Oikopleura dioica]
Length = 589
Score = 291 bits (745), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 156/359 (43%), Positives = 229/359 (63%), Gaps = 20/359 (5%)
Query: 184 AHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTRE 243
A C+ +++ LP S++I FHNE S LLRT+HS+ +RSP S+LKEI+LVDDAS+R
Sbjct: 142 ASCKSKKYYSD--LPDVSVIIPFHNEGLSTLLRTIHSLHNRSPESLLKEIVLVDDASSRP 199
Query: 244 FLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
K L+ +AK ++IR+P R GLI++R+ G A+G ++V LD+H E + WL
Sbjct: 200 LYKE-LESSLAKFP-KVKLIRNPTRQGLIRSRVRGVHLAKGGVVVILDSHVEVSTNWLPP 257
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYV-RSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
L+ ++ DR VVCP+IDII + F YV + + GAF+WEL+++ + + K
Sbjct: 258 LLHPISLDRKTVVCPMIDIIDNENFQYVTQPGDAMRGAFDWELYYKRIPIPNEK---RPK 314
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
D +EPF++P MAGGLFAI+R YF+ IG YDE +++WGGE E+SF+VW CGG I +PCS
Sbjct: 315 DPSEPFESPVMAGGLFAIERNYFYEIGLYDEGLEIWGGEQYELSFKVWMCGGRILDSPCS 374
Query: 423 HVAHLFRKASPYSFP--GGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
+ H++RK PY+ P GG + N RVA VWMDE+AEF+++ P K D +
Sbjct: 375 RIGHIYRKFVPYTIPNNGGPNY----NYKRVAEVWMDEYAEFFYRRRPYVRKI-DAGDLS 429
Query: 481 SRLELRKQLKCHSFKWYLTHVWP---HHFLPM--DDKFFGRIRHVQTHKCVEKPLAKGS 534
LRK+LKC SF WY+ +V P ++ P+ +GR++HV ++ C++ + KGS
Sbjct: 430 KAKALRKELKCKSFDWYIKNVIPDLVQYYPPILPPSAAWGRLKHVVSNLCIDPQVKKGS 488
Score = 129 bits (323), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 76/175 (43%), Positives = 109/175 (62%), Gaps = 7/175 (4%)
Query: 15 NQQDYIDRRGVHVVVGHYLGDSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDV 74
N ++I R +G G V H L A Y+ N FN+LVSDRI ++R+L D+
Sbjct: 84 NDHEFIRREAARTGLGEQ-GKPVTLFGHEKLHSA---YKDNGFNILVSDRISLDRSLHDI 139
Query: 75 RKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 134
R CK+K + + LP S++I FHNE S LLRT+HS+ +RSP S+LKEI+LVDDAS+R
Sbjct: 140 RHASCKSKKYYSD-LPDVSVIIPFHNEGLSTLLRTIHSLHNRSPESLLKEIVLVDDASSR 198
Query: 135 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECT 189
K L+ +AK ++IR+P R GLI++R+ G A+G ++V LD+H E +
Sbjct: 199 PLYK-ELESSLAKFP-KVKLIRNPTRQGLIRSRVRGVHLAKGGVVVILDSHVEVS 251
>gi|170038569|ref|XP_001847121.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
gi|167882320|gb|EDS45703.1| N-acetyl galactosaminyl transferase [Culex quinquefasciatus]
Length = 541
Score = 291 bits (744), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 151/352 (42%), Positives = 219/352 (62%), Gaps = 16/352 (4%)
Query: 193 NEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEY 252
N LP++++V+VF+NE WS L+RTVHSV+ RSP +L+E+LLVDD S K+ L+EY
Sbjct: 102 NLAHLPQTTVVVVFYNEPWSILVRTVHSVLERSPPQLLREVLLVDDYSYLPETKTQLNEY 161
Query: 253 VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDR 312
A+ S R++R+P R+GLI+ARL GA+ A EIL FLDAHCECTLGWL+ + RVA D
Sbjct: 162 FAR-SPKVRILRAPKRLGLIRARLFGAKNATTEILTFLDAHCECTLGWLQPQLDRVARDP 220
Query: 313 TRVVCPVIDIISDVTFAYVRSFEL-HWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTP 371
T + P ID I + A+V + L ++GA +W F W G D ++ + EPF TP
Sbjct: 221 TTIAVPTIDWIDEHNLAFVSNKSLGYYGATDWGFQFAWR--GRWDRKVQPANKLEPFPTP 278
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLF+I+R +F H+G YDE +++GGEN+E+S + W CGG IE PCS V H+ +
Sbjct: 279 IMAGGLFSINRTFFGHLGWYDEGFEIYGGENVELSLKAWMCGGRIETVPCSRVGHVQKAG 338
Query: 432 SPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKC 491
PY ++ + N RVA VW+D++A+ + + + + V SR +LR+ LKC
Sbjct: 339 HPY-LRVETTDWVRINTVRVAEVWLDQYAQVVYDMFGGPQFRGNFGDVSSRKKLRESLKC 397
Query: 492 HSFKWYLTHVWPHHFLPMDDKFFGRIRHVQ-------THKCVEKPLAKGSMN 536
HSF+WYL +V+P +DD + H + +C++ P A+G+
Sbjct: 398 HSFRWYLDNVFPE----LDDPEGRGVGHGEVINLAAGATRCLQYPTAEGTFG 445
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 79/155 (50%), Positives = 105/155 (67%), Gaps = 6/155 (3%)
Query: 41 LHSNLSDA-----QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIV 95
L ++L DA +Q ++ + FN VSD I V R LPD R CK N LP++++V
Sbjct: 53 LPADLPDAVNELIEQQFKQHGFNQYVSDLISVRRRLPDYRDDWCKQPGRNLAHLPQTTVV 112
Query: 96 IVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVI 155
+VF+NE WS L+RTVHSV+ RSP +L+E+LLVDD S K+ L+EY A+ S R++
Sbjct: 113 VVFYNEPWSILVRTVHSVLERSPPQLLREVLLVDDYSYLPETKTQLNEYFAR-SPKVRIL 171
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTL 190
R+P R+GLI+ARL GA+ A EIL FLDAHCECTL
Sbjct: 172 RAPKRLGLIRARLFGAKNATTEILTFLDAHCECTL 206
>gi|312377569|gb|EFR24376.1| hypothetical protein AND_11091 [Anopheles darlingi]
Length = 1150
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 157/405 (38%), Positives = 227/405 (56%), Gaps = 22/405 (5%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C L + LP +S+V+ F+ E WS LLRTVHSV++RSP +LKE+++VDD ST+EF
Sbjct: 270 HPSCKLQQYFKHLPTASVVVPFYEEHWSTLLRTVHSVLNRSPSHLLKEVIIVDDGSTKEF 329
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
L L YV + ++IR R GL+KARL GA+ A G++LVFLD+H E WL L
Sbjct: 330 LHGQLQNYVNQNLPKVKLIRQGERTGLMKARLAGAKLASGDVLVFLDSHTEAGYNWLPPL 389
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ +AE+ VCP+ID+I D TF + G F+W H++ SD R
Sbjct: 390 LEPIAENPKTCVCPLIDVIDDQTFNIHPQDDGGRGLFDWRFHYKRLALKESD----RVSP 445
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
T PF +P MAGGLFAI +F+ +G YDEE+ +WG E E+SF++WQCGG + APCS
Sbjct: 446 TAPFPSPVMAGGLFAIGTNFFWELGGYDEELDIWGAEQYELSFKIWQCGGRMLDAPCSRF 505
Query: 425 AHLFRKASPYSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
+H++R SP FP + + N RVA +WMDE+ ++ + +PE + D +
Sbjct: 506 SHIYRSYSP--FPNSRKYDFITRNHKRVAEIWMDEYKQYIYDRDPERYARSDAGDLTKMK 563
Query: 484 ELRKQLKCHSFKWYLTHVWP---HHFLPMDDKFF--GRIRHVQTHKCVEKPLAKGSMNQA 538
LR++L+C F+W+L V P + P++ + F G I+ + E L +M +
Sbjct: 564 ALREKLQCKPFEWFLKEVAPEILQLYPPVEPEPFASGAIQSI-----AEPTLCIDTMQRP 618
Query: 539 SG-PASLLPC----THLPVLTQMFVMKLPTDLIATDESVCLDVPE 578
G P + PC H + Q FV+ D+ + C DVPE
Sbjct: 619 RGNPIGMHPCDSDLIHPKNMNQYFVLSWHRDIQQKSDEQCFDVPE 663
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 196/311 (63%), Gaps = 9/311 (2%)
Query: 195 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 254
+ LP++S++I F++E WS LLRTV+SV+ RSP S+L EI+LVDD S + FLK LD YVA
Sbjct: 682 QHLPRTSVIIPFYDEHWSTLLRTVYSVMRRSPSSLLLEIILVDDGSMKNFLKEQLDHYVA 741
Query: 255 K-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRT 313
L ++I P R GLI ARL GA+ A+G++LVFLD+H E + WL L+ +A +
Sbjct: 742 THLKHLVKIIHLPTRSGLITARLAGAKIAKGDVLVFLDSHVEAGINWLPPLLEPIAHNPR 801
Query: 314 RVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAM 373
VCP ID+I D TF + GAF+W + ++ D +KD T+PF++P M
Sbjct: 802 TCVCPFIDVIMDDTFELTPQDQGARGAFDWNMLYKRLPLRPED----QKDPTQPFESPVM 857
Query: 374 AGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASP 433
AGGLFAI +F+ +G YDE +++WG E E+SF++WQCGG + APCS V H++R SP
Sbjct: 858 AGGLFAISSMFFWELGGYDEMLEIWGAEQYELSFKIWQCGGRMIDAPCSRVGHIYRSYSP 917
Query: 434 YSFPGGVS-EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCH 492
FP S + + N RVA VWMDE+ ++ ++ +P D + ELR++L C
Sbjct: 918 --FPNVKSYDYVAKNHKRVAEVWMDEYKKYVYRKDP-MRFSIDAGDLTKMKELRRRLNCK 974
Query: 493 SFKWYLTHVWP 503
F+W++ +V P
Sbjct: 975 PFRWFIENVAP 985
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 72/146 (49%), Positives = 102/146 (69%), Gaps = 4/146 (2%)
Query: 46 SDAQQ---LYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
SD++Q LY N FN L+SD+I +NR++ D+R P CK + + + LP +S+V+ F+ E
Sbjct: 237 SDSEQRKKLYLQNGFNALLSDKISINRSIADLRHPSCKLQQYFKH-LPTASVVVPFYEEH 295
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVG 162
WS LLRTVHSV++RSP +LKE+++VDD ST+EFL L YV + ++IR R G
Sbjct: 296 WSTLLRTVHSVLNRSPSHLLKEVIIVDDGSTKEFLHGQLQNYVNQNLPKVKLIRQGERTG 355
Query: 163 LIKARLLGARQAEGEILVFLDAHCEC 188
L+KARL GA+ A G++LVFLD+H E
Sbjct: 356 LMKARLAGAKLASGDVLVFLDSHTEA 381
Score = 112 bits (279), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 56/105 (53%), Positives = 76/105 (72%), Gaps = 1/105 (0%)
Query: 87 EFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA 146
+ LP++S++I F++E WS LLRTV+SV+ RSP S+L EI+LVDD S + FLK LD YVA
Sbjct: 682 QHLPRTSVIIPFYDEHWSTLLRTVYSVMRRSPSSLLLEIILVDDGSMKNFLKEQLDHYVA 741
Query: 147 K-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTL 190
L ++I P R GLI ARL GA+ A+G++LVFLD+H E +
Sbjct: 742 THLKHLVKIIHLPTRSGLITARLAGAKIAKGDVLVFLDSHVEAGI 786
>gi|307186272|gb|EFN71935.1| Polypeptide N-acetylgalactosaminyltransferase 35A [Camponotus
floridanus]
Length = 667
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 198/315 (62%), Gaps = 8/315 (2%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP +SI+I F+NE ++ LLR++HS++ R+P ++L EI+LV+D S + L + Y+
Sbjct: 197 LPNASIIICFYNEHYTTLLRSLHSILERTPAALLHEIILVNDFSDSDILHEKIHAYIKNN 256
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
R+ ++ R GLI+AR+ GAR+A G++L+FLD+H E W+E L++R+A +T V
Sbjct: 257 FGAKVRLFKTKKREGLIRARVFGARKATGDVLIFLDSHIEVNEIWIEPLLSRIAYSKTIV 316
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
PVIDII+ TF Y S L G FNW LHF+W + DF +P K+P MAG
Sbjct: 317 PMPVIDIINADTFQYTGS-PLVRGGFNWGLHFKWDNLPIG-TLKHENDFVKPIKSPTMAG 374
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAIDR YF IG YD M VWGGENLE+SFR+W CGGSIE+ PCS V H+FR+ PY
Sbjct: 375 GLFAIDREYFIKIGEYDTGMDVWGGENLEISFRIWMCGGSIELIPCSRVGHVFRRRRPYG 434
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
+ + N RVA VWMDE+ +++ K K D + RL LR++L+C +F
Sbjct: 435 -SDDPHDTMLKNSLRVAHVWMDEYKDYFLK----NAKAIDYGDISERLALRQKLECKTFD 489
Query: 496 WYLTHVWPHHFLPMD 510
WYL V+P LP D
Sbjct: 490 WYLKVVYPELTLPDD 504
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 103/164 (62%), Gaps = 5/164 (3%)
Query: 40 GLHSNLSDAQQL---YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
G+ N+ D Q+ Y+ FN+L+SD + V R +PD R CKT+ ++ LP +SI+I
Sbjct: 146 GMVKNMEDQQKRTIGYKNYAFNVLISDNLGVRRNVPDTRHKLCKTQKYSSN-LPNASIII 204
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA-KLSVPTRVI 155
F+NE ++ LLR++HS++ R+P ++L EI+LV+D S + L + Y+ R+
Sbjct: 205 CFYNEHYTTLLRSLHSILERTPAALLHEIILVNDFSDSDILHEKIHAYIKNNFGAKVRLF 264
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++ R GLI+AR+ GAR+A G++L+FLD+H E ++ E L +
Sbjct: 265 KTKKREGLIRARVFGARKATGDVLIFLDSHIEVNEIWIEPLLSR 308
>gi|170593939|ref|XP_001901721.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158590665|gb|EDP29280.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 645
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 150/354 (42%), Positives = 221/354 (62%), Gaps = 19/354 (5%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC E LP +SIVI FHNE W+ LLRTVHSV+ RSP ++KEI++VDD S +E LK
Sbjct: 189 ECKYWHYPEDLPTASIVIAFHNEGWTPLLRTVHSVLLRSPPHLIKEIIMVDDFSDKEHLK 248
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
LD Y+ + +++R+ R GLI+ R +GA++A G++++FLDAHCE + WL L+A
Sbjct: 249 DRLDVYLKQFDGKVKLVRNSEREGLIRTRSIGAKEAVGDVVIFLDAHCEVNVNWLPPLLA 308
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRK 362
+ ++R + PVID I ++Y S + H+ G F W L ++ T SS +++RK
Sbjct: 309 PIRQNRKVMTVPVIDGIDKNDWSYRIVYGSADKHYRGIFEWGLLYK-ETELSSQELLRRK 367
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
+EPF++P AGGLFAI++ +F +G YD +Q+WGGE E+SF++WQCGG I PCS
Sbjct: 368 HNSEPFRSPTHAGGLFAINKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFVPCS 427
Query: 423 HVAHLFRKASPYSFPGGVS--EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
HV H++R PY F G +S V+ N+ RV WMDE+ ++Y+ P A + R +
Sbjct: 428 HVGHVYRSHMPYGF-GKLSGKPVISTNMLRVIKTWMDEYDKYYYIREPSA-RHRLPGNIS 485
Query: 481 SRLELRKQLKCHSFKWYLTHV-------WPHHFLPMDDKFFGRIRHVQTHKCVE 527
S+L+LRK LKC SFKWY+ V +P P ++ +G ++ T KC++
Sbjct: 486 SQLKLRKSLKCKSFKWYMEKVAYDVVVSYP---FPPENHVWGEAKNHATGKCID 536
Score = 141 bits (356), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 68/150 (45%), Positives = 103/150 (68%), Gaps = 1/150 (0%)
Query: 41 LHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHN 100
+ L++ ++ + FN VSD I +NR++PDVR +CK + E+ LP +SIVI FHN
Sbjct: 152 IDKTLTEVKEAMREYGFNTYVSDMISLNRSIPDVRMDECKYWHYPED-LPTASIVIAFHN 210
Query: 101 EAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGR 160
E W+ LLRTVHSV+ RSP ++KEI++VDD S +E LK LD Y+ + +++R+ R
Sbjct: 211 EGWTPLLRTVHSVLLRSPPHLIKEIIMVDDFSDKEHLKDRLDVYLKQFDGKVKLVRNSER 270
Query: 161 VGLIKARLLGARQAEGEILVFLDAHCECTL 190
GLI+ R +GA++A G++++FLDAHCE +
Sbjct: 271 EGLIRTRSIGAKEAVGDVVIFLDAHCEVNV 300
>gi|410911676|ref|XP_003969316.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2-like [Takifugu rubripes]
Length = 629
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 171/465 (36%), Positives = 247/465 (53%), Gaps = 55/465 (11%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C E LP +S+VI FHNEAWS LLRTVHSV+S +P+ L+E+LLVDD S
Sbjct: 170 HPACLEQHFTESLPSASVVICFHNEAWSTLLRTVHSVLSTAPKQHLREVLLVDDLSQHGH 229
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LKS L EYV++LS R++RS R+G+ R LGA +AEG++LVF+D+HCEC GWLE L
Sbjct: 230 LKSVLSEYVSRLSR-VRLVRSARRLGVGGCRSLGAAKAEGDVLVFMDSHCECQKGWLEPL 288
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTY-----GSSDAII 359
+ +VA+DRTRVV P+ID+I TF Y + G F+W L F W +Y G DA +
Sbjct: 289 LQKVAQDRTRVVSPIIDVIDWRTFQYNATQWPVRGVFDWRLDFHWESYTWMPDGDPDAAV 348
Query: 360 KRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIA 419
+ ++P + G +FA+DR +F +G +D M +WG E +E+S RVW CGGS+E+
Sbjct: 349 ------QALRSPVLGGEVFAVDRHFFQRVGGFDPGMLLWGEEQIELSIRVWSCGGSMEVV 402
Query: 420 PCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQR----- 474
PCS VAHL R PYSFP E+L N R+A +WMD + + +++ + A R
Sbjct: 403 PCSRVAHLDRHHLPYSFPD--QELLQSNKIRIAEIWMDVYKKIFYRRDTLAHFIRQVGRG 460
Query: 475 ---------------------DKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMD-DK 512
+ + RL+L+K L C +F+WYL+ V+P ++P D
Sbjct: 461 ERSQVDHVGHMGKGVMFVVKSESPGISDRLQLKKSLGCKNFQWYLSTVYPQLYIPEDRPA 520
Query: 513 FFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLP-CTHLPVLTQMF---VMKLPTDLIAT 568
G + + + G A G +L C + F +KL L+
Sbjct: 521 LSGEHCELNSESEIRWGPQGGLCFDAIGERVVLSRCPAQRPASSRFQWKFVKLSGQLVHL 580
Query: 569 DESVCLDV----------PEYENDISPRVRILACSGFNRQRWTYD 603
+CL+ P+ N S + + C+ +Q+W +D
Sbjct: 581 QSQLCLEAVPEVGLSHSSPKEANSHSGELFLRPCTHHPKQQWHFD 625
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 82/165 (49%), Positives = 107/165 (64%), Gaps = 6/165 (3%)
Query: 35 DSVDGGLHSNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSI 94
DS D G + S Q+ FN VS+ I V+R LP+ R P C + F E LP +S+
Sbjct: 133 DSEDTGRYMEPSSVQKY----GFNEAVSEGISVHRRLPEARHPACLEQHFTES-LPSASV 187
Query: 95 VIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRV 154
VI FHNEAWS LLRTVHSV+S +P+ L+E+LLVDD S LKS L EYV++LS R+
Sbjct: 188 VICFHNEAWSTLLRTVHSVLSTAPKQHLREVLLVDDLSQHGHLKSVLSEYVSRLSR-VRL 246
Query: 155 IRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+RS R+G+ R LGA +AEG++LVF+D+HCEC + E L K
Sbjct: 247 VRSARRLGVGGCRSLGAAKAEGDVLVFMDSHCECQKGWLEPLLQK 291
>gi|344288103|ref|XP_003415790.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase-like
protein 2 [Loxodonta africana]
Length = 640
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 157/394 (39%), Positives = 231/394 (58%), Gaps = 18/394 (4%)
Query: 159 GRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTV 218
G G + AR+ R+A E+ H C + LP +S+++ FH+EAWS LLRTV
Sbjct: 160 GLTGALSARM-PLRRALPEV-----RHPLCLQQHAGDRLPMASVILCFHDEAWSTLLRTV 213
Query: 219 HSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLG 278
HS++ +PR+ LKEI+LVDD S + LKS+L EYVA+L +++RS R+G I+ R+LG
Sbjct: 214 HSILDTAPRAFLKEIILVDDLSQQGQLKSALSEYVARLEG-VKLLRSNKRLGPIQGRMLG 272
Query: 279 ARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHW 338
A +A G++LVF+D+HCEC GWLE L++R+A DR+RVV PVID+I TF Y S L
Sbjct: 273 ATRATGDVLVFMDSHCECHQGWLEPLLSRIAGDRSRVVSPVIDVIDWKTFQYYPSEALQR 332
Query: 339 GAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVW 398
G +W L F W + + + P ++P + GG+ AIDR YF + GAYD M +W
Sbjct: 333 GVLDWNLDFHWEPLPEHEKKALQSPIS-PIRSPVVPGGVVAIDRHYFQNTGAYDPLMSLW 391
Query: 399 GGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDE 458
GGENLE+S + W CGGS+EI PCS V H++R + P E N R+A W+
Sbjct: 392 GGENLELSLKTWLCGGSVEILPCSRVGHVYRNQD--AHPVLDQEATLQNKIRIAETWLAS 449
Query: 459 WAEFYFKFNPEA--EKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDD--KFF 514
+ E ++K +PEA Q +K RL+L+++L C F W+L +++P + P + +F
Sbjct: 450 FKETFYKHSPEAFSLSQAEKPDCTERLQLQRRLGCRMFHWFLANIYPELY-PSEHMPRFS 508
Query: 515 GRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
G++ + C + + + P L PC
Sbjct: 509 GKLHNTGLGFCAD---CQAEGDTLGCPMMLFPCN 539
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 67/139 (48%), Positives = 99/139 (71%), Gaps = 2/139 (1%)
Query: 61 VSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRS 120
+S R+P+ R LP+VR P C + + LP +S+++ FH+EAWS LLRTVHS++ +PR+
Sbjct: 165 LSARMPLRRALPEVRHPLCLQQHAGDR-LPMASVILCFHDEAWSTLLRTVHSILDTAPRA 223
Query: 121 MLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILV 180
LKEI+LVDD S + LKS+L EYVA+L +++RS R+G I+ R+LGA +A G++LV
Sbjct: 224 FLKEIILVDDLSQQGQLKSALSEYVARLEG-VKLLRSNKRLGPIQGRMLGATRATGDVLV 282
Query: 181 FLDAHCECTLVFNEEFLPK 199
F+D+HCEC + E L +
Sbjct: 283 FMDSHCECHQGWLEPLLSR 301
>gi|340727930|ref|XP_003402286.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Bombus terrestris]
Length = 643
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 146/329 (44%), Positives = 204/329 (62%), Gaps = 8/329 (2%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +SIVI F+NE + LLR++HS+I R+P S+L EI+LV+D S +
Sbjct: 161 HKLCEIQKYSSKLPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLHEIILVNDWSDSKA 220
Query: 245 LKSSLDEY-VAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L + Y V + + ++ R GLI+AR+ GAR+A GE+L+FLD+H E W+E
Sbjct: 221 LHEKIKTYIVNNFNGKVKFYKTEKREGLIRARMFGARKATGEVLIFLDSHIEVNKRWIEP 280
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L++++A+ +T V P+IDII+ TF Y S L G FNW LHF+W +D
Sbjct: 281 LLSQIAQSKTIVAMPIIDIINPDTFQYTGS-PLVRGGFNWGLHFKWDNVPVG-TFAHDED 338
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F +P K+P MAGGLFA+DR YF +G YD M +WGGENLE+SFR+W CGGSIE+ PCS
Sbjct: 339 FIKPIKSPTMAGGLFAMDRKYFTKLGEYDAGMDIWGGENLEISFRIWMCGGSIELIPCSR 398
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FR+ PY + + N RVA VW+DE+ +++ K ++ D + RL
Sbjct: 399 VGHVFRRRRPYG-TFDQHDTMLKNSLRVAHVWLDEYKDYFLK----NVQKVDYGDISERL 453
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDDK 512
LRK+LKC +F WYL V+P LP D+K
Sbjct: 454 NLRKRLKCKNFAWYLNVVYPELALPDDNK 482
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 105/172 (61%), Gaps = 6/172 (3%)
Query: 33 LGDSVDG-GLHSNLSDAQQL---YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEF 88
LG +D G+ N D ++ Y+ FN+LVSD I ++R +PD R C+ + ++ +
Sbjct: 114 LGQGLDELGMVKNFEDQRKRDEGYKNYSFNILVSDNIGLHREIPDTRHKLCEIQKYSSK- 172
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEY-VAK 147
LP +SIVI F+NE + LLR++HS+I R+P S+L EI+LV+D S + L + Y V
Sbjct: 173 LPNASIVICFYNEHYMTLLRSLHSIIDRTPASLLHEIILVNDWSDSKALHEKIKTYIVNN 232
Query: 148 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ + ++ R GLI+AR+ GAR+A GE+L+FLD+H E + E L +
Sbjct: 233 FNGKVKFYKTEKREGLIRARMFGARKATGEVLIFLDSHIEVNKRWIEPLLSQ 284
>gi|312370886|gb|EFR19191.1| hypothetical protein AND_22918 [Anopheles darlingi]
Length = 1204
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 170/413 (41%), Positives = 243/413 (58%), Gaps = 22/413 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SSIVIVF+NEAWS LLRTVHSV+ R+P ++++EILLVDD+ST + L L+ Y+A L
Sbjct: 126 LPPSSIVIVFYNEAWSVLLRTVHSVLDRTPSALIEEILLVDDSSTMKHLHQQLEHYIAAL 185
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R++R+P RVGLI+ARLLGA+ A I+ FLDAHCE GWLE LVA VA+ T +
Sbjct: 186 P-KVRLVRAPKRVGLIRARLLGAKAARSGIITFLDAHCEVIEGWLEALVAHVAQRETMIA 244
Query: 317 CPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYTYGSSDAIIK---RKDFTEPFKTPA 372
P ID I + T A ++ ++G+F+W L+F+W +D I++ + P+ TP
Sbjct: 245 IPAIDWIHEDTLALNAQNSVRYYGSFDWGLNFQWRV--RADRIMQPAMAGNPAAPYDTPT 302
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
MAGGLF I R++F +G YDE MQ++GGEN+E+SF+ W CGGS++I CS VAH+ ++
Sbjct: 303 MAGGLFTIHRSFFERLGWYDEGMQIYGGENMELSFKAWMCGGSMQIVGCSRVAHIQKRGH 362
Query: 433 PY--SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLK 490
PY G + ++ N RVA VW+DE+A+++++ ++ + R ELR++L
Sbjct: 363 PYLRQLSDGFA-LVRRNSIRVAEVWLDEYADYFYETFGGRARRGSFGNLTERHELRQRLA 421
Query: 491 CHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
C F+WYL V+P F P G IR K L S+ SL+ C H
Sbjct: 422 CKPFRWYLETVFPEQFDPSKAVARGEIRFADDAKATPLCLDWPSL------LSLVTC-HG 474
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQRWTYD 603
Q++ + ++ T E CLD Y+ ++ VR G R W D
Sbjct: 475 YGGHQLWYLTAKGEV--TREDHCLD---YDGELLSVVRCHGLGGNQRWVWLPD 522
Score = 284 bits (727), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 165/411 (40%), Positives = 236/411 (57%), Gaps = 17/411 (4%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LPK+SIVIVF+NEAWS L+RTVHS++ R+P ++++EI+LVDD S L++ L+EY A
Sbjct: 753 LPKASIVIVFYNEAWSVLVRTVHSILDRTPSALIEEIILVDDYSNLAHLRTQLEEYFASY 812
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAED-RTRV 315
+ R++R+P R+GLI+ARLLGAR A E L FLDAHCEC +GWLE + VA D R +
Sbjct: 813 PL-VRILRAPERLGLIRARLLGARNATSEFLTFLDAHCECMVGWLEGQLDVVARDPRHTI 871
Query: 316 VCPVIDIISDVTFAYVRS-FELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMA 374
P ID I + V + +++GA W L F+W G D + K ++ EPF TP MA
Sbjct: 872 ALPTIDWIDEKNLGLVSNKAPVYYGAMGWGLDFQWR--GRWDRVNKPENKLEPFSTPVMA 929
Query: 375 GGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPY 434
GGLF I R F +G YD+++ V+GGEN+E+S + W CGG + PCS VAH+ + PY
Sbjct: 930 GGLFTIHRKLFEWLGWYDQQLDVYGGENIELSLKAWMCGGQLLTVPCSRVAHIQKTGHPY 989
Query: 435 SFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSF 494
G +V N RVA VW+D++A + + + D V R +LR+ L C SF
Sbjct: 990 LL-GLAKDVARTNSVRVAEVWLDQYAAVLYDLFGGPQYRGDFGDVTERKQLRRALHCKSF 1048
Query: 495 KWYLTHVWPHHFLPMDDKFFGRIRHVQTHKCVE-KPLAKGSMNQASGPASLLPCTHLPVL 553
+WYL V+P P DK G R +E +P + ++G ++ PC
Sbjct: 1049 RWYLETVFP-ELAPALDKRPGHGRFENEALSMEGQPKHCLTAQSSAGLPTMEPCQAGSDA 1107
Query: 554 TQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGFNRQRWTYD 603
Q ++ L +L ++E+ CLD Y+ +R+ AC Q W Y+
Sbjct: 1108 RQHWLHNLFGEL--SNENRCLD---YDGSA---LRVYACHKARGNQEWRYN 1150
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 74/143 (51%), Positives = 101/143 (70%), Gaps = 1/143 (0%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
Y N +SD IPV+R LPD+R P C + + LPK+SIVIVF+NEAWS L+RTVH
Sbjct: 716 YAHQGLNQYISDLIPVHRRLPDLRDPWCTEEGRLQATLPKASIVIVFYNEAWSVLVRTVH 775
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
S++ R+P ++++EI+LVDD S L++ L+EY A + R++R+P R+GLI+ARLLGA
Sbjct: 776 SILDRTPSALIEEIILVDDYSNLAHLRTQLEEYFASYPL-VRILRAPERLGLIRARLLGA 834
Query: 172 RQAEGEILVFLDAHCECTLVFNE 194
R A E L FLDAHCEC + + E
Sbjct: 835 RNATSEFLTFLDAHCECMVGWLE 857
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/159 (50%), Positives = 106/159 (66%), Gaps = 13/159 (8%)
Query: 41 LHSNLSDA-----QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKT-------KVFNEEF 88
L NLS A Q+ ++ N VSD IP++R LPDVR P C++ +
Sbjct: 66 LPDNLSPAVLALVQRGWKEQGLNQYVSDMIPLHRRLPDVRDPWCRSEEQERRNRNIGPGS 125
Query: 89 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 148
LP SSIVIVF+NEAWS LLRTVHSV+ R+P ++++EILLVDD+ST + L L+ Y+A L
Sbjct: 126 LPPSSIVIVFYNEAWSVLLRTVHSVLDRTPSALIEEILLVDDSSTMKHLHQQLEHYIAAL 185
Query: 149 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
R++R+P RVGLI+ARLLGA+ A I+ FLDAHCE
Sbjct: 186 P-KVRLVRAPKRVGLIRARLLGAKAARSGIITFLDAHCE 223
>gi|326436017|gb|EGD81587.1| polypeptide N-acetylgalactosaminyltransferase 14 [Salpingoeca sp.
ATCC 50818]
Length = 645
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 169/435 (38%), Positives = 247/435 (56%), Gaps = 36/435 (8%)
Query: 183 DAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTR 242
D+ C+ + N+ +P +S+V VF NE S LLR++HSV+ R+P +L EI+LVDD S
Sbjct: 186 DSQCKKIVYPND--MPAASVVFVFFNEPLSPLLRSIHSVLDRTPPHLLHEIVLVDDGSDA 243
Query: 243 EFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE 302
++L+ L++Y+ L T+++R P R GL++AR+ GA+ A GE++VFLD+H E + WLE
Sbjct: 244 QWLQQELNDYMRWLP-KTKIVRMPQRKGLMEARVQGAKAATGEVIVFLDSHIEVSPVWLE 302
Query: 303 NLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRK 362
++AR+ EDR VV P+ID I +F Y R L F+W L G +R+
Sbjct: 303 PMLARITEDRRHVVMPIIDSIDADSFQY-RKGGLDILGFSWAL-------GQKSIGTRRR 354
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
T+P +P MAGGLFA+DR YFF +GAYD EMQ++GGE +E+SFR+WQCGG++E PCS
Sbjct: 355 TRTQPMPSPIMAGGLFAMDRKYFFDLGAYDPEMQLYGGEEMEISFRIWQCGGTLECIPCS 414
Query: 423 HVAHLFR-----KASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQ 477
V H+FR K Y+ PG +V+ N R A VWMDE+ + + + +
Sbjct: 415 RVGHVFRTGKYWKGQVYTVPG---DVIVRNKLRAAEVWMDEYKDIVKRVMSPLPRGKTLG 471
Query: 478 AVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFF---GRIRHVQTHKCVEKPLAKGS 534
+ ++RK+ C FKWYL +V+P F+P D G +R+ T C + G+
Sbjct: 472 PLDMMFDIRKRHNCKPFKWYLENVYPEMFVPYDPDHVVASGEVRNPHTDACFD---TLGA 528
Query: 535 MNQASGPASLLPCTHLPVLTQMFVMKLPTDL--IATDESVCLDVPEYENDISPRVRILAC 592
+Q + + PC H TQ FV+ D+ A D CLD + I I C
Sbjct: 529 RHQGARIGA-YPCHH-SHGTQEFVLSTGGDIRVAAMDFDTCLDRGNGDGSIG----IWPC 582
Query: 593 --SGFNRQRWTYDKE 605
+G N Q+W YD +
Sbjct: 583 HQTGGN-QKWNYDPK 596
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/148 (43%), Positives = 101/148 (68%), Gaps = 2/148 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
++ N FNL SD IP++R +PD R +CK V+ + +P +S+V VF NE S LLR++H
Sbjct: 162 HKQNCFNLRRSDSIPLDRDIPDHRDSQCKKIVYPND-MPAASVVFVFFNEPLSPLLRSIH 220
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV+ R+P +L EI+LVDD S ++L+ L++Y+ L T+++R P R GL++AR+ GA
Sbjct: 221 SVLDRTPPHLLHEIVLVDDGSDAQWLQQELNDYMRWLP-KTKIVRMPQRKGLMEARVQGA 279
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFLPK 199
+ A GE++VFLD+H E + V+ E L +
Sbjct: 280 KAATGEVIVFLDSHIEVSPVWLEPMLAR 307
>gi|443687046|gb|ELT90152.1| hypothetical protein CAPTEDRAFT_141956, partial [Capitella teleta]
Length = 351
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 202/312 (64%), Gaps = 2/312 (0%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
L K+SI+I+FHNEA S LLRT+H+++ R+P +L EIL+VDDAST +LK LD+Y+ L
Sbjct: 42 LGKTSIIIIFHNEARSTLLRTIHALLERTPILLLVEILIVDDASTHAWLKEPLDKYLQHL 101
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IR R GLI+AR GA +A+G+IL F DAH E GWL L+ R+ E+R +V
Sbjct: 102 PR-IRIIRLKQRQGLIRARTRGAEEAKGDILYFADAHTEVGEGWLPPLLQRIKENRKVLV 160
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
P +D I +F Y R+ + + GAF W + F+ Y + + + +R D T+P +P M G
Sbjct: 161 FPEMDPIQHQSFEYWRAGDEYHGAFYWHMEFK-YKFAPKEILNRRSDPTQPVPSPVMVGC 219
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
AI+R YFF GAYD +M++WGGEN+E +FR+W CGG +E+ PCS V H+F+ PYSF
Sbjct: 220 AHAIEREYFFETGAYDTDMEIWGGENIEHAFRLWMCGGRVEVIPCSRVGHVFKPRLPYSF 279
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
G + ++ NL R+A WMD++ +F++ P D ++++ R +R++L C F W
Sbjct: 280 TGDSASIIQRNLIRIAETWMDDYKKFFYATQPSTISAVDLESLQKRKNIREKLNCKDFAW 339
Query: 497 YLTHVWPHHFLP 508
YL +V+P +P
Sbjct: 340 YLKNVFPEMPIP 351
Score = 115 bits (288), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 65/132 (49%), Positives = 86/132 (65%), Gaps = 2/132 (1%)
Query: 57 FNLLVSDRIPVNR-TLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVIS 115
FN SD + R LPD R C K ++ L K+SI+I+FHNEA S LLRT+H+++
Sbjct: 9 FNHSSSDLVGNFRHELPDFRMEGCHKKTYDLTTLGKTSIIIIFHNEARSTLLRTIHALLE 68
Query: 116 RSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAE 175
R+P +L EIL+VDDAST +LK LD+Y+ L R+IR R GLI+AR GA +A+
Sbjct: 69 RTPILLLVEILIVDDASTHAWLKEPLDKYLQHLPR-IRIIRLKQRQGLIRARTRGAEEAK 127
Query: 176 GEILVFLDAHCE 187
G+IL F DAH E
Sbjct: 128 GDILYFADAHTE 139
>gi|341894191|gb|EGT50126.1| CBN-GLY-4 protein [Caenorhabditis brenneri]
Length = 584
Score = 288 bits (737), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 155/342 (45%), Positives = 211/342 (61%), Gaps = 11/342 (3%)
Query: 200 SSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVP 259
++++I +HNEA S+LLRTV SV + SP +L EI+LVDD S + + + +A++
Sbjct: 150 TTVIITYHNEARSSLLRTVFSVFNMSPEDLLHEIVLVDDNS----IDVDIGKELAQIER- 204
Query: 260 TRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVVCPV 319
+V+R+ R GLI++R+ GA+ A IL FLD+H EC WLE L+AR+AE+ VV P+
Sbjct: 205 VKVLRNNQREGLIRSRVKGAQVAGAPILTFLDSHIECNQKWLEPLLARIAENPKAVVAPI 264
Query: 320 IDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFA 379
ID+I+ F YV + G F+W L FRW + K+ T P K+P MAGGLFA
Sbjct: 265 IDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEQLRTERHKNPTAPIKSPTMAGGLFA 324
Query: 380 IDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSFPGG 439
I + +F +G YD +M+VWGGENLEMSFRVWQCGGS+EI PCS V H+FRK PY+FPGG
Sbjct: 325 ISKEWFEELGTYDLDMEVWGGENLEMSFRVWQCGGSLEILPCSRVGHVFRKKHPYTFPGG 384
Query: 440 VSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKWYLT 499
V N R A VWMDE+ Y K P A + + + RL +R +L+C SFKWYL
Sbjct: 385 SGNVFQKNTRRAAEVWMDEYKAIYLKNVPSA-RFVNFGDITDRLAIRDRLQCKSFKWYLD 443
Query: 500 HVWPHHFLPMDDKFFGRIRHVQT-HKCVEKPLAKGSMNQASG 540
V+P +P K G+ V+ H C++ K N+A G
Sbjct: 444 TVYPQLEVP--KKAAGKSVQVKMGHHCLDSMARK--ENEAPG 481
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 110/203 (54%), Gaps = 27/203 (13%)
Query: 48 AQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALL 107
+ Y+ N FN SD I R +PD R+P C+ +++ + ++++I +HNEA S+LL
Sbjct: 106 GEDKYKANSFNQEASDSISPIRKIPDSREPPCRDVDYSQFQMRPTTVIITYHNEARSSLL 165
Query: 108 RTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKAR 167
RTV SV + SP +L EI+LVDD S + + + +A++ +V+R+ R GLI++R
Sbjct: 166 RTVFSVFNMSPEDLLHEIVLVDDNS----IDVDIGKELAQIER-VKVLRNNQREGLIRSR 220
Query: 168 LLGARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPR 227
+ GA+ A IL FLD+H EC N+ W L + + I+ +P+
Sbjct: 221 VKGAQVAGAPILTFLDSHIEC-------------------NQKW---LEPLLARIAENPK 258
Query: 228 SMLKEILLVDDASTREFLKSSLD 250
+++ I+ V + ++ +S D
Sbjct: 259 AVVAPIIDVINVDNFNYVGASAD 281
>gi|383862333|ref|XP_003706638.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 35A-like
[Megachile rotundata]
Length = 637
Score = 288 bits (736), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 143/325 (44%), Positives = 198/325 (60%), Gaps = 9/325 (2%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H C + LP +SIVI F+NE + LLR++HS+I R+P+ +L EI+LV+D S +
Sbjct: 156 HKLCQMQQYPNKLPNASIVICFYNEHYMTLLRSIHSIIERTPKHLLHEIILVNDWSDSKE 215
Query: 245 LKSSLDEYVAK-LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
L + ++ + ++ R GLI+AR+ GAR+A GE+L+FLD+H E W+E
Sbjct: 216 LHEKIKAFINNNFDRKVKFFKTEKREGLIRARMFGARKATGEVLIFLDSHIEVNKMWIEP 275
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L++R+A +T V PVIDII+ TF Y S L G FNW LHF+W + ++ +D
Sbjct: 276 LLSRIAHSKTIVAMPVIDIINADTFQYTAS-PLVRGGFNWGLHFKWEQLPTK--LVHDED 332
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F +P K+P MAGGLFA+DR YF +G YD M VWGGENLE+SFR+W CGGSIE+ PCS
Sbjct: 333 FIKPIKSPTMAGGLFAMDREYFVELGEYDAGMDVWGGENLEISFRIWMCGGSIELIPCSR 392
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY + + N RVA VW+DE+ +Y K + D + RL
Sbjct: 393 VGHVFRKRRPYG-ADDKHDTMLKNSLRVAYVWLDEYKHYYLK----DVNKIDYGDITDRL 447
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLP 508
LR++LKC F WY+ V+P P
Sbjct: 448 NLRQKLKCKDFAWYVKEVYPELTFP 472
Score = 119 bits (299), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/164 (37%), Positives = 101/164 (61%), Gaps = 5/164 (3%)
Query: 40 GLHSNLSDA---QQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
G+ NL D ++ Y+ FN+L+SD I ++R LPD R C+ + + + LP +SIVI
Sbjct: 117 GMIKNLDDQRKREEGYKNYAFNVLISDNIGLDRKLPDTRHKLCQMQQYPNK-LPNASIVI 175
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYV-AKLSVPTRVI 155
F+NE + LLR++HS+I R+P+ +L EI+LV+D S + L + ++ +
Sbjct: 176 CFYNEHYMTLLRSIHSIIERTPKHLLHEIILVNDWSDSKELHEKIKAFINNNFDRKVKFF 235
Query: 156 RSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
++ R GLI+AR+ GAR+A GE+L+FLD+H E ++ E L +
Sbjct: 236 KTEKREGLIRARMFGARKATGEVLIFLDSHIEVNKMWIEPLLSR 279
>gi|158286701|ref|XP_565317.3| AGAP006881-PA [Anopheles gambiae str. PEST]
gi|157020594|gb|EAL41927.3| AGAP006881-PA [Anopheles gambiae str. PEST]
Length = 587
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 178/430 (41%), Positives = 244/430 (56%), Gaps = 41/430 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP SIVIVFH+EA S LLRTVHSV++R+P +++EILL+DD S+ LK+ LD+Y
Sbjct: 128 LPPVSIVIVFHDEALSVLLRTVHSVLNRTPPELVQEILLIDDWSSLVQLKTFLDDYFLPY 187
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLE---NLVARVAEDRT 313
S R++R+P R+GLI R+ GA++A + L+FLDAHCEC GWLE LVA E+R
Sbjct: 188 SDKVRILRTPKRLGLITGRIFGAKRASADYLLFLDAHCECLAGWLEPLLELVASNQENRK 247
Query: 314 RVVCPVIDIISDVTFAY-VRSFELHWGAFNWELHFRWYT-YGSSDAIIKRKDFTEPFKTP 371
V P ID +++ T A V + +GAF+W L F+W Y A +++ EPF TP
Sbjct: 248 VVAVPTIDWLNETTLALQVGASSGLYGAFDWNLSFQWRPRYDRLQA--PQENLLEPFDTP 305
Query: 372 AMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKA 431
MAGGLF I++A+F +G YD +QV+GGEN+E+SF+VW CGG+I PCSHVAH+ ++
Sbjct: 306 VMAGGLFCIEKAFFAQLGWYDPGLQVYGGENMELSFKVWMCGGAIRTVPCSHVAHIQKRN 365
Query: 432 SPYSFPGGVS---EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRS------- 481
+PY G + ++ N RVA VWMDE+AEF ++ +P+ + S
Sbjct: 366 NPYI--GSYTKERDLTMRNSLRVAEVWMDEYAEFLYRLHPDYRALLASRTSHSLSNVNLD 423
Query: 482 -RLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGRIRHVQT--HKCVEKPLAKGSMNQA 538
R +LR +L C SF+WYL HV+P P + + G IRH C+ P+ S+
Sbjct: 424 ARRQLRSELGCKSFRWYLQHVFPEQDDPSEAQAAGWIRHENEAGQLCLTWPMRDRSL--- 480
Query: 539 SGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGFNRQ 598
+LL C L F K T IA E CL V E V I CS
Sbjct: 481 ----ALLHCHGLGGQQIWFHRK--TGEIAR-EGHCLGVDSAE------VTIALCSSEGSS 527
Query: 599 ---RWTYDKE 605
RW Y ++
Sbjct: 528 GAYRWLYRRQ 537
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 72/146 (49%), Positives = 97/146 (66%), Gaps = 2/146 (1%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVH 111
+Q +N VSD I V R LPDVR P C+ + LP SIVIVFH+EA S LLRTVH
Sbjct: 93 WQRQGYNQFVSDLISVRRELPDVRDPWCRDR--KRSALPPVSIVIVFHDEALSVLLRTVH 150
Query: 112 SVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGA 171
SV++R+P +++EILL+DD S+ LK+ LD+Y S R++R+P R+GLI R+ GA
Sbjct: 151 SVLNRTPPELVQEILLIDDWSSLVQLKTFLDDYFLPYSDKVRILRTPKRLGLITGRIFGA 210
Query: 172 RQAEGEILVFLDAHCECTLVFNEEFL 197
++A + L+FLDAHCEC + E L
Sbjct: 211 KRASADYLLFLDAHCECLAGWLEPLL 236
>gi|195436945|ref|XP_002066406.1| GK18112 [Drosophila willistoni]
gi|194162491|gb|EDW77392.1| GK18112 [Drosophila willistoni]
Length = 588
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 160/416 (38%), Positives = 239/416 (57%), Gaps = 33/416 (7%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C + E LP++SIV+ F+NE L+R++ +V+ R+P+ +LKEI+LVDD S L
Sbjct: 104 CNRSETLESEQLPQASIVMCFYNEHKMTLMRSIKTVLERTPQYLLKEIILVDDNSDLPEL 163
Query: 246 KSSL--DEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+ L D + R IR+ R GLI++R++GAR+A G++LVFLD+H E W+E
Sbjct: 164 EFHLLADLHNRLKYDNLRYIRNEKREGLIRSRVIGAREALGDVLVFLDSHIEVNRQWVEP 223
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ + + + + PVID+I+ TF Y S L G FNW LHFRW + + +D
Sbjct: 224 LLRLIKAENSTLAVPVIDLINADTFGYTPS-PLVRGGFNWGLHFRWENL-PEGTLKQPED 281
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F PF++P MAGGLFA++R YF HIG YD M +WGGEN+E+SFR WQCGGSI+I PCS
Sbjct: 282 FRGPFRSPTMAGGLFAVNRLYFQHIGEYDMAMDIWGGENIEISFRAWQCGGSIKIVPCSR 341
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PY+ P G + +L +L R+A VWMD + ++Y K + + K D + R+
Sbjct: 342 VGHIFRKRRPYTSPDGANTMLKNSL-RLAYVWMDRFKDYYIK-HEKVSKDFDYGDISERV 399
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMD-----------------------DKFFGRIRHV 520
+LR++L+CH F WYL +++P +P + D + R+
Sbjct: 400 KLREKLQCHDFDWYLKNIYPELPIPGEEPKKTAAAAPIYQPWHSRKRNYIDSYQLRLSGT 459
Query: 521 QTHKCVEKPLAKGSMNQASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDV 576
+ V P KG + SG L PC + P Q++ +++I D+ +CL+
Sbjct: 460 ELCASVVAPKVKGFWKKGSG-LQLQPCHNSP--NQIWYETEKSEII-LDKLLCLEA 511
Score = 103 bits (257), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 93/149 (62%), Gaps = 3/149 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKC-KTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
Y+ + FN LVS+ I + R +PD R C +++ E LP++SIV+ F+NE L+R++
Sbjct: 77 YKHHAFNALVSNNIGLYRDIPDTRHKVCNRSETLESEQLPQASIVMCFYNEHKMTLMRSI 136
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSL--DEYVAKLSVPTRVIRSPGRVGLIKARL 168
+V+ R+P+ +LKEI+LVDD S L+ L D + R IR+ R GLI++R+
Sbjct: 137 KTVLERTPQYLLKEIILVDDNSDLPELEFHLLADLHNRLKYDNLRYIRNEKREGLIRSRV 196
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
+GAR+A G++LVFLD+H E + E L
Sbjct: 197 IGAREALGDVLVFLDSHIEVNRQWVEPLL 225
>gi|426226648|ref|XP_004007451.1| PREDICTED: LOW QUALITY PROTEIN: polypeptide
N-acetylgalactosaminyltransferase 6 [Ovis aries]
Length = 792
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/362 (43%), Positives = 219/362 (60%), Gaps = 36/362 (9%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP +S++IVFHNEAWS LLRTV+SV+ +P +LKEI+LVDDAST E+LK L+ +V +L
Sbjct: 367 LPATSVIIVFHNEAWSTLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERHVKQL 426
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
V +L FLDAHCEC GWLE L+AR+AED T VV
Sbjct: 427 QV----------------------VQVVRVLTFLDAHCECFHGWLEPLLARIAEDETVVV 464
Query: 317 CPVIDIISDVTFAYVRSFE----LHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPA 372
P I I TF + + + G F+W L F W + + +RKD T P K+P
Sbjct: 465 SPNIVTIDLNTFEFSKPVQRGRVQSRGNFDWSLTFGWEVLPAREKQ-RRKDETYPIKSPT 523
Query: 373 MAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKAS 432
AGGLF+I +AYF HIG YD +M++WGGEN+EMSFRVWQCGG +EI PCS V H+FR S
Sbjct: 524 FAGGLFSISKAYFEHIGTYDNQMEIWGGENVEMSFRVWQCGGQLEIIPCSVVGHVFRTKS 583
Query: 433 PYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQA---VRSRLELRKQL 489
P++FP G++ V+ N R+A VWMD + E +++ N +A + +++ + RL+LR++L
Sbjct: 584 PHTFPKGIN-VIARNQVRLAEVWMDGYKEIFYRRNLQAAQMAREKSFGDISERLQLRERL 642
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCT 548
C +F W+L +++P F+P + FFG ++++ C++ G N P L C
Sbjct: 643 NCRNFSWFLDNIYPEMFVPDLKPTFFGALKNLGVDHCLD----VGENNNGGKPLILYACH 698
Query: 549 HL 550
L
Sbjct: 699 GL 700
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/144 (43%), Positives = 83/144 (57%), Gaps = 24/144 (16%)
Query: 47 DAQQLYQINRFNLLVSDRIPVNRTL-PDVRKPKCKTKVFNE-EFLPKSSIVIVFHNEAWS 104
+ ++ Y+ + FN SDRI + R L PD R P+C + F LP +S++IVFHNEAWS
Sbjct: 323 EKEEGYKKHCFNAFASDRISLQRALGPDTRPPECVDQKFRRCPPLPATSVIIVFHNEAWS 382
Query: 105 ALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLI 164
LLRTV+SV+ +P +LKEI+LVDDAST E+LK L+ +V +L V
Sbjct: 383 TLLRTVYSVLHTTPAILLKEIILVDDASTDEYLKEPLERHVKQLQV-------------- 428
Query: 165 KARLLGARQAEGEILVFLDAHCEC 188
+L FLDAHCEC
Sbjct: 429 --------VQVVRVLTFLDAHCEC 444
>gi|170587206|ref|XP_001898369.1| glycosyl transferase, group 2 family protein [Brugia malayi]
gi|158594195|gb|EDP32781.1| glycosyl transferase, group 2 family protein [Brugia malayi]
Length = 582
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 155/361 (42%), Positives = 215/361 (59%), Gaps = 16/361 (4%)
Query: 171 ARQAEGEILVFLDAHCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSML 230
A + + +I+ +A C TL++N E LP +S+VI +HNEA S LLRT+ SV RSP +L
Sbjct: 119 ATRWDRDIIDSREASCG-TLIYNVETLPSTSVVITYHNEARSTLLRTIVSVFLRSPPQLL 177
Query: 231 KEILLVDDASTREFLKSSLDEYVAKLSVPTR---VIRSPGRVGLIKARLLGARQAEGEIL 287
EI+LVDD S + + L +P VIR+ R GLI++R+ G+ A +L
Sbjct: 178 HEIILVDDFSDDITIGTDL--------LPIENVVVIRNTKREGLIRSRVKGSTLARASVL 229
Query: 288 VFLDAHCECTLGWLENLVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHF 347
FLD+HCEC + WLE L+ARV E+ VV PVIDII TF YV + G F W L F
Sbjct: 230 TFLDSHCECNVNWLEPLLARVKENHRAVVAPVIDIIDKDTFKYVAASADLRGGFEWNLIF 289
Query: 348 RWYTYGSSDAIIKRKDFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSF 407
+W + T P +TP +AGGLF I + +F +G YDE+M VWGGENLE+SF
Sbjct: 290 KWEYLLGKLRDDRHAQPTAPIRTPVIAGGLFMIQKDWFEKLGTYDEQMDVWGGENLELSF 349
Query: 408 RVWQCGGSIEIAPCSHVAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFN 467
RVW CGGS+EI PCS V H+FRK PY+FPGG V N R A VW+ ++ Y +
Sbjct: 350 RVWLCGGSLEIIPCSRVGHVFRKQHPYTFPGGNGNVFQKNTRRAAEVWLGDYKYLYLRKV 409
Query: 468 PEAEKQRDKQAVRSRLELRKQLKCHSFKWYLTHVWPHHFLPMDDKFFGR-IRHVQTHKCV 526
P A + + + +RL+L+K+L+C F WYL ++P +P ++ GR + Q + C+
Sbjct: 410 PSA-RYVNFGDITARLDLKKRLRCKDFDWYLKEIYPELAIPSKEQ--GRYLTFRQGNVCI 466
Query: 527 E 527
+
Sbjct: 467 D 467
Score = 110 bits (274), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 90/160 (56%), Gaps = 11/160 (6%)
Query: 43 SNLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEA 102
+L + Y N+FN SD +R + D R+ C T ++N E LP +S+VI +HNEA
Sbjct: 98 GSLQPGEDRYAANKFNQAASDATRWDRDIIDSREASCGTLIYNVETLPSTSVVITYHNEA 157
Query: 103 WSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTR---VIRSPG 159
S LLRT+ SV RSP +L EI+LVDD S + + L +P VIR+
Sbjct: 158 RSTLLRTIVSVFLRSPPQLLHEIILVDDFSDDITIGTDL--------LPIENVVVIRNTK 209
Query: 160 RVGLIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLPK 199
R GLI++R+ G+ A +L FLD+HCEC + + E L +
Sbjct: 210 REGLIRSRVKGSTLARASVLTFLDSHCECNVNWLEPLLAR 249
>gi|194758571|ref|XP_001961535.1| GF14884 [Drosophila ananassae]
gi|190615232|gb|EDV30756.1| GF14884 [Drosophila ananassae]
Length = 627
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 144/328 (43%), Positives = 213/328 (64%), Gaps = 6/328 (1%)
Query: 186 CECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFL 245
C+ + F E LP +SIV+ F+NE L+R++ +V+ R+P +L+EI+LVDD S L
Sbjct: 135 CDRSETFEAENLPHASIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDNSDLPEL 194
Query: 246 KSSLD-EYVAKLSVPT-RVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLEN 303
+ L + A+L P R I++ R GLI++R++GAR+A G++LVFLD+H E WLE
Sbjct: 195 EFHLHADLRARLKYPNLRYIKNEEREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEP 254
Query: 304 LVARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKD 363
L+ + + + + PVID+I+ TF Y S L G FNW LHFRW + +D
Sbjct: 255 LLRLIKAENSTLAVPVIDLINADTFEYTPS-PLVRGGFNWGLHFRWENLPEGTLKVP-ED 312
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
F PF++P MAGGLFA++R YF H+G YD M +WGGEN+E+SFR WQCGG+I+I PC+
Sbjct: 313 FRGPFRSPTMAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCAR 372
Query: 424 VAHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRL 483
V H+FRK PYS P G + +L +L R+A VWMD++ ++Y K + + K D + R+
Sbjct: 373 VGHIFRKRRPYSSPDGANTMLKNSL-RLAHVWMDQYKDYYIK-HEKVPKDFDYGDISDRV 430
Query: 484 ELRKQLKCHSFKWYLTHVWPHHFLPMDD 511
+LR++L+CH F WYL +V+P +P ++
Sbjct: 431 KLRERLQCHDFSWYLKNVYPELRVPGEE 458
Score = 106 bits (264), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/149 (40%), Positives = 95/149 (63%), Gaps = 3/149 (2%)
Query: 52 YQINRFNLLVSDRIPVNRTLPDVRKPKC-KTKVFNEEFLPKSSIVIVFHNEAWSALLRTV 110
Y+ + FN LVS+ I + R +PD R C +++ F E LP +SIV+ F+NE L+R++
Sbjct: 108 YKHHAFNALVSNNIGLFRDIPDTRHKVCDRSETFEAENLPHASIVMCFYNEHKMTLMRSI 167
Query: 111 HSVISRSPRSMLKEILLVDDASTREFLKSSLD-EYVAKLSVPT-RVIRSPGRVGLIKARL 168
+V+ R+P +L+EI+LVDD S L+ L + A+L P R I++ R GLI++R+
Sbjct: 168 KTVLERTPSYLLREIILVDDNSDLPELEFHLHADLRARLKYPNLRYIKNEEREGLIRSRV 227
Query: 169 LGARQAEGEILVFLDAHCECTLVFNEEFL 197
+GAR+A G++LVFLD+H E + E L
Sbjct: 228 IGAREAVGDVLVFLDSHIEVNQQWLEPLL 256
>gi|260809642|ref|XP_002599614.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
gi|229284894|gb|EEN55626.1| hypothetical protein BRAFLDRAFT_217836 [Branchiostoma floridae]
Length = 432
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 218/339 (64%), Gaps = 23/339 (6%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S++I+F+NEAWS LLRTVHSV+ R+P +L+E++LVDD ST YV++L
Sbjct: 67 LPRTSVIIIFYNEAWSTLLRTVHSVLERTPSELLREVILVDDCST---FGKERQRYVSQL 123
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
R+IRSP R GLI+ARLLG + A GE+L FLD+HCEC GWLE L+ R++ + T V
Sbjct: 124 P-EVRIIRSPTRQGLIRARLLGVKHARGEVLTFLDSHCECMYGWLEPLLERISLNHTVVP 182
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFN------WELHFRWYTYGSSDAIIKRKDFTEPFKT 370
PV+D+I FAY+ H G + +L F W+ + RK +P ++
Sbjct: 183 WPVLDMIQHNDFAYL----FHGGVLSVGSVDLVDLRFNWHAVPQKE-FRARKSIIDPIRS 237
Query: 371 PAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRV-WQCGGSIEIAPCSHVAHLFR 429
P M GG+F+I + YF ++G YD+ M++WGGEN+E+SFRV WQCGG+IE+ PCSHV H+FR
Sbjct: 238 PTMPGGVFSIHKKYFEYLGGYDDGMEIWGGENIELSFRVIWQCGGTIELVPCSHVGHVFR 297
Query: 430 KASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQL 489
SPYS P V + + N R+A VWMD++ ++ +P+ K + V R LRK L
Sbjct: 298 VTSPYSAP--VDKWMKNN-KRLAEVWMDDYKNVIYRKHPDY-KTVETGNVMPRKVLRKAL 353
Query: 490 KCHSFKWYLTHVWPHHFLP-MDDKFFGRIRHVQTHKCVE 527
CH F WY+ +V+P+ ++P + +G++R T KC++
Sbjct: 354 HCHDFSWYVQNVYPNLYVPDVRPVAYGQVR--MTGKCLD 390
Score = 137 bits (345), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/150 (47%), Positives = 100/150 (66%), Gaps = 5/150 (3%)
Query: 53 QINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRTVHS 112
+++ FN S +I V+R+L D R CK K + LP++S++I+F+NEAWS LLRTVHS
Sbjct: 32 KLHSFNEWASSKISVHRSLLDFRHELCKAKKYYRP-LPRTSVIIIFYNEAWSTLLRTVHS 90
Query: 113 VISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGAR 172
V+ R+P +L+E++LVDD ST YV++L R+IRSP R GLI+ARLLG +
Sbjct: 91 VLERTPSELLREVILVDDCST---FGKERQRYVSQLP-EVRIIRSPTRQGLIRARLLGVK 146
Query: 173 QAEGEILVFLDAHCECTLVFNEEFLPKSSI 202
A GE+L FLD+HCEC + E L + S+
Sbjct: 147 HARGEVLTFLDSHCECMYGWLEPLLERISL 176
>gi|328712307|ref|XP_001942933.2| PREDICTED: putative polypeptide N-acetylgalactosaminyltransferase
10-like [Acyrthosiphon pisum]
Length = 592
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 164/414 (39%), Positives = 240/414 (57%), Gaps = 23/414 (5%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAK- 255
LP S+VI FHNE +S LLRTV+SV++RSP+ +LKEI+LVDD+ST+ LK LD +++
Sbjct: 141 LPTVSVVIPFHNEHFSTLLRTVYSVLNRSPKILLKEIILVDDSSTKTSLKRPLDNFLSNN 200
Query: 256 LSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRV 315
L+ ++I R GLI+ARL GAR+A EIL+FLD+H E WL L+ + ED
Sbjct: 201 LADTVQIIHLKKRQGLIRARLAGARKATSEILIFLDSHTEANANWLPPLLEPITEDYRTC 260
Query: 316 VCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAG 375
VCP ID+I+ TF Y E GAF+WE ++ D + T+PF++P MAG
Sbjct: 261 VCPFIDVIAFETFEYRAQDEGARGAFDWEFFYKRLPLLPEDLLYP----TKPFRSPVMAG 316
Query: 376 GLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYS 435
GLFAI +F+ +G YD + +WGGE E+SF++WQCGG+I APCS V H++RK +P+
Sbjct: 317 GLFAISAKWFWELGGYDPGLDIWGGEQYELSFKIWQCGGTILDAPCSRVGHIYRKFAPFP 376
Query: 436 FPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFK 495
P G+ + + N RVA VWMDE+AE+ + P + + + + ++R+ LKC FK
Sbjct: 377 NP-GIGDFVGKNYRRVAEVWMDEYAEYLYLRRPHY-RNINTGDITKQKKIRENLKCKPFK 434
Query: 496 WYLTHV---WPHHFLPMD--DKFFGRIRHVQTHKCVEKPLAKGSMNQASGPASLLPCTHL 550
W++ V + P++ D F G+IR T E L S N P +L C
Sbjct: 435 WFIHEVAFDLVEKYPPIEPPDVFKGKIR---TFTAPELCLDATSEN----PLNLKECIDN 487
Query: 551 PVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILAC-SGFNRQRWTYD 603
Q F++ D I T ++CLD+ ++ + ++ + C +G Q W YD
Sbjct: 488 DDENQKFILSWRND-IKTRNNMCLDIS--DSSLKAKISLYGCHNGGGNQLWHYD 538
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 77/156 (49%), Positives = 108/156 (69%), Gaps = 6/156 (3%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
N +LY++N FN L+SD I VNR++PD+R C+ K +N + LP S+VI FHNE +
Sbjct: 97 NRHKYDELYKVNGFNALLSDSISVNRSIPDIRHKLCRFKKYNSK-LPTVSVVIPFHNEHF 155
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVA-KLSVPTRVIRSPGRVG 162
S LLRTV+SV++RSP+ +LKEI+LVDD+ST+ LK LD +++ L+ ++I R G
Sbjct: 156 STLLRTVYSVLNRSPKILLKEIILVDDSSTKTSLKRPLDNFLSNNLADTVQIIHLKKRQG 215
Query: 163 LIKARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
LI+ARL GAR+A EIL+FLD+H E N +LP
Sbjct: 216 LIRARLAGARKATSEILIFLDSHTEA----NANWLP 247
>gi|312094065|ref|XP_003147897.1| hypothetical protein LOAG_12336 [Loa loa]
Length = 560
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 153/375 (40%), Positives = 228/375 (60%), Gaps = 26/375 (6%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC E LP +S+VI FHNE W+ LLRTVHSV+ RSP ++KEI+LVDD S +E LK
Sbjct: 104 ECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEIILVDDFSDKEHLK 163
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L+ Y+ + ++IR+ R GLI+ R +GA++A G+++VFLDAHCE + WL L+A
Sbjct: 164 DRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAHCEVNINWLPPLLA 223
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRK 362
+ ++R + PVID I ++Y S + H+ G F W L ++ T + +++RK
Sbjct: 224 PIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYK-ETEIPAQELLRRK 282
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
+EPF++P AGGLFAI + +F +G YD +Q+WGGE E+SF++WQCGG I PCS
Sbjct: 283 HSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFIPCS 342
Query: 423 HVAHLFRKASPYSFPGGVS--EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
HV H++R PY F G +S V+ N+ RV WMDE+ ++Y+ P A K R +
Sbjct: 343 HVGHVYRSHMPYGF-GKLSGKPVISTNMLRVIKTWMDEYEKYYYIREPSA-KHRLPGDIS 400
Query: 481 SRLELRKQLKCHSFKWYLTHV-------WPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKG 533
S+L+LR++LKC SF+WY+ V +P LP ++ +G ++ T KC++
Sbjct: 401 SQLKLRERLKCKSFEWYMEKVAYDVIVSYP---LPPENHVWGEAKNHATGKCID------ 451
Query: 534 SMNQA-SGPASLLPC 547
++ Q G +PC
Sbjct: 452 TIGQTIPGIVGAMPC 466
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 102/147 (69%), Gaps = 1/147 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
NL + ++ + FN VSD I +NR++PDVR +CK + E+ LP +S+VI FHNE W
Sbjct: 70 NLPEVKKAMREYGFNTYVSDMISLNRSIPDVRLDECKYWHYPED-LPSASVVIAFHNEGW 128
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
+ LLRTVHSV+ RSP ++KEI+LVDD S +E LK L+ Y+ + ++IR+ R GL
Sbjct: 129 TPLLRTVHSVLLRSPSQLIKEIILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGL 188
Query: 164 IKARLLGARQAEGEILVFLDAHCECTL 190
I+ R +GA++A G+++VFLDAHCE +
Sbjct: 189 IRTRSIGAKEAVGDVVVFLDAHCEVNI 215
>gi|393911317|gb|EFO16172.2| hypothetical protein LOAG_12336 [Loa loa]
Length = 562
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 153/375 (40%), Positives = 228/375 (60%), Gaps = 26/375 (6%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC E LP +S+VI FHNE W+ LLRTVHSV+ RSP ++KEI+LVDD S +E LK
Sbjct: 106 ECKYWHYPEDLPSASVVIAFHNEGWTPLLRTVHSVLLRSPSQLIKEIILVDDFSDKEHLK 165
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
L+ Y+ + ++IR+ R GLI+ R +GA++A G+++VFLDAHCE + WL L+A
Sbjct: 166 DRLERYLKQFRGKVKLIRNAEREGLIRTRSIGAKEAVGDVVVFLDAHCEVNINWLPPLLA 225
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY---VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRK 362
+ ++R + PVID I ++Y S + H+ G F W L ++ T + +++RK
Sbjct: 226 PIRQNRKVMTVPVIDGIDKDDWSYRIVYSSVDKHYRGIFEWGLLYK-ETEIPAQELLRRK 284
Query: 363 DFTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCS 422
+EPF++P AGGLFAI + +F +G YD +Q+WGGE E+SF++WQCGG I PCS
Sbjct: 285 HSSEPFRSPTHAGGLFAISKKWFEELGYYDPGLQIWGGEQYELSFKIWQCGGGILFIPCS 344
Query: 423 HVAHLFRKASPYSFPGGVS--EVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVR 480
HV H++R PY F G +S V+ N+ RV WMDE+ ++Y+ P A K R +
Sbjct: 345 HVGHVYRSHMPYGF-GKLSGKPVISTNMLRVIKTWMDEYEKYYYIREPSA-KHRLPGDIS 402
Query: 481 SRLELRKQLKCHSFKWYLTHV-------WPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKG 533
S+L+LR++LKC SF+WY+ V +P LP ++ +G ++ T KC++
Sbjct: 403 SQLKLRERLKCKSFEWYMEKVAYDVIVSYP---LPPENHVWGEAKNHATGKCID------ 453
Query: 534 SMNQA-SGPASLLPC 547
++ Q G +PC
Sbjct: 454 TIGQTIPGIVGAMPC 468
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 70/147 (47%), Positives = 102/147 (69%), Gaps = 1/147 (0%)
Query: 44 NLSDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAW 103
NL + ++ + FN VSD I +NR++PDVR +CK + E+ LP +S+VI FHNE W
Sbjct: 72 NLPEVKKAMREYGFNTYVSDMISLNRSIPDVRLDECKYWHYPED-LPSASVVIAFHNEGW 130
Query: 104 SALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGL 163
+ LLRTVHSV+ RSP ++KEI+LVDD S +E LK L+ Y+ + ++IR+ R GL
Sbjct: 131 TPLLRTVHSVLLRSPSQLIKEIILVDDFSDKEHLKDRLERYLKQFRGKVKLIRNAEREGL 190
Query: 164 IKARLLGARQAEGEILVFLDAHCECTL 190
I+ R +GA++A G+++VFLDAHCE +
Sbjct: 191 IRTRSIGAKEAVGDVVVFLDAHCEVNI 217
>gi|268370157|ref|NP_001161259.1| polypeptide GalNAc transferase 6-like [Nasonia vitripennis]
Length = 615
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 170/429 (39%), Positives = 237/429 (55%), Gaps = 19/429 (4%)
Query: 185 HCECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREF 244
H +C + L S+V+ FHNE +S L+RT SVI+RSP S+L EI+LVDDAST+
Sbjct: 134 HPDCKNKRYLKDLDPVSVVVSFHNEHFSTLMRTCWSVINRSPPSLLHEIILVDDASTKVE 193
Query: 245 LKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENL 304
LK LDEYV K +++R P R GLI+ RL GAR+A +ILVFLD+H E + WL L
Sbjct: 194 LKDKLDEYVKKNLPKVKIVRLPRRSGLIRGRLAGARKATAKILVFLDSHSEANVNWLPPL 253
Query: 305 VARVAEDRTRVVCPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDF 364
+ +A+D VCP ID+I+ TF Y E GAF+WEL+++ D K+
Sbjct: 254 LEPIAKDYKTCVCPFIDVIAYETFEYRAQDEGARGAFDWELYYKRLPLLPEDL----KNP 309
Query: 365 TEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHV 424
+EPFK+P MAGGLFAI +F+ +G YD + +WGGE E+SF++WQCGG + APCS V
Sbjct: 310 SEPFKSPVMAGGLFAISAKFFWELGGYDPGLDIWGGEQYELSFKIWQCGGQMYDAPCSRV 369
Query: 425 AHLFRKASPYSFPGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLE 484
H++RK P+ P G + L N RVA VWMDE+A+F ++ P + D + +
Sbjct: 370 GHIYRKFPPFPNP-GRGDFLGKNYKRVAEVWMDEYADFIYRRRPHL-RAMDPGDLTEQKA 427
Query: 485 LRKQLKCHSFKWY-------LTHVWPHHFLPMDDKFFGRIRHVQTHKCVEKPLAKGSMNQ 537
LR +LKC SFKW+ L V+P + DD +G +R++ KG +
Sbjct: 428 LRDKLKCKSFKWFMENIAFDLVEVYPP--IEPDDFAYGEMRNIGVPNLCLDAKGKGKDEE 485
Query: 538 ASGPASLLPCTHLPVLTQMFVMKLPTDLIATDESVCLDVPEYENDISPRVRILACSGF-N 596
+ + Q F + D+ + CLDV +D SP V + C G
Sbjct: 486 IAVDYCQKDTPKIKG-EQEFQLTWHKDIRPNRRTECLDVSR-GDDKSP-VTLYPCHGKQG 542
Query: 597 RQRWTYDKE 605
Q W Y+ E
Sbjct: 543 NQLWRYNVE 551
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/141 (51%), Positives = 100/141 (70%), Gaps = 1/141 (0%)
Query: 50 QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSALLRT 109
+LY++N FN +SD I +NR++PD+R P CK K + ++ P S+V+ FHNE +S L+RT
Sbjct: 108 RLYKVNGFNAALSDLISLNRSIPDIRHPDCKNKRYLKDLDP-VSVVVSFHNEHFSTLMRT 166
Query: 110 VHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIKARLL 169
SVI+RSP S+L EI+LVDDAST+ LK LDEYV K +++R P R GLI+ RL
Sbjct: 167 CWSVINRSPPSLLHEIILVDDASTKVELKDKLDEYVKKNLPKVKIVRLPRRSGLIRGRLA 226
Query: 170 GARQAEGEILVFLDAHCECTL 190
GAR+A +ILVFLD+H E +
Sbjct: 227 GARKATAKILVFLDSHSEANV 247
>gi|391345232|ref|XP_003746894.1| PREDICTED: polypeptide N-acetylgalactosaminyltransferase 11-like
[Metaseiulus occidentalis]
Length = 585
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 202/312 (64%), Gaps = 6/312 (1%)
Query: 197 LPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKL 256
LP++S++I F+NEAWS L+RTV+SV+ RSP ++L+EI+LVDD S L+ L +V K
Sbjct: 128 LPRASVIICFYNEAWSTLIRTVNSVLDRSPSALLQEIILVDDLSDIAELEP-LAGFVQKH 186
Query: 257 SVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVARVAEDRTRVV 316
RVIR+ R GLI+AR++GA + G++LVFLD+H E WL+ L+ + +++T V
Sbjct: 187 E-KVRVIRTREREGLIRARMIGAHNSTGDVLVFLDSHVEVNERWLQPLLVPIQQNQTTVT 245
Query: 317 CPVIDIISDVTFAYVRSFELHWGAFNWELHFRWYTYGSSDAIIKRKDFTEPFKTPAMAGG 376
CPVIDII+ TF Y S L G FNW +HFRW K+ P +P MAGG
Sbjct: 246 CPVIDIINADTFEYSPS-PLVKGGFNWGMHFRWDNL-PKGYFKSEKERIAPLPSPTMAGG 303
Query: 377 LFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSHVAHLFRKASPYSF 436
LFAI + F +G YD M VWGGENLE+SFR+W CGGS++I PCS V H+FRK PY
Sbjct: 304 LFAIHKDEFRRLGEYDWGMDVWGGENLELSFRIWMCGGSLKIMPCSRVGHVFRKRRPYGA 363
Query: 437 PGGVSEVLYGNLARVALVWMDEWAEFYFKFNPEAEKQRDKQAVRSRLELRKQLKCHSFKW 496
G + L N RVA VWMD++ ++Y++ P+ K D + +R+ELR +LKC SF W
Sbjct: 364 SNG-EDTLAKNSLRVANVWMDDYKKYYYRMRPDL-KDIDFGDISARVELRNRLKCKSFDW 421
Query: 497 YLTHVWPHHFLP 508
YL +++P LP
Sbjct: 422 YLKNIYPDLQLP 433
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/151 (46%), Positives = 103/151 (68%), Gaps = 6/151 (3%)
Query: 40 GLHSNLSDAQ---QLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVI 96
GL N D Q + Y+ + FN LVS+RI + R +PD R CK + ++++ LP++S++I
Sbjct: 77 GLIRNSQDQQIKTEGYRQHAFNTLVSERIGLRRRVPDTRDALCKQQKYSKD-LPRASVII 135
Query: 97 VFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIR 156
F+NEAWS L+RTV+SV+ RSP ++L+EI+LVDD S L+ L +V K RVIR
Sbjct: 136 CFYNEAWSTLIRTVNSVLDRSPSALLQEIILVDDLSDIAELE-PLAGFVQKHE-KVRVIR 193
Query: 157 SPGRVGLIKARLLGARQAEGEILVFLDAHCE 187
+ R GLI+AR++GA + G++LVFLD+H E
Sbjct: 194 TREREGLIRARMIGAHNSTGDVLVFLDSHVE 224
>gi|91081797|ref|XP_973938.1| PREDICTED: similar to n-acetylgalactosaminyltransferase [Tribolium
castaneum]
gi|270006291|gb|EFA02739.1| hypothetical protein TcasGA2_TC008465 [Tribolium castaneum]
Length = 583
Score = 285 bits (730), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 161/379 (42%), Positives = 225/379 (59%), Gaps = 30/379 (7%)
Query: 187 ECTLVFNEEFLPKSSIVIVFHNEAWSALLRTVHSVISRSPRSMLKEILLVDDASTREFLK 246
EC E LP +S++IVFHNE WS LLRTVHSVI+RSP +LKE+LLVDD S +E LK
Sbjct: 123 ECKHWNYPENLPSTSVIIVFHNEGWSVLLRTVHSVINRSPPKILKEVLLVDDFSDKENLK 182
Query: 247 SSLDEYVAKLSVPTRVIRSPGRVGLIKARLLGARQAEGEILVFLDAHCECTLGWLENLVA 306
+ L+ Y+ + + R+IR+ R GLI+ R GA++A GE++VFLDAHCE WL L+A
Sbjct: 183 TRLETYIERFNGKVRLIRNAQREGLIRTRSRGAKEATGEVIVFLDAHCEVNTNWLPPLLA 242
Query: 307 RVAEDRTRVVCPVIDIISDVTFAY--VRSFELHW-GAFNWELHFRWYTYGSSDAIIKRKD 363
+ DR+ + PVID I TF Y V + H+ G F W + ++ + + RK
Sbjct: 243 PIYRDRSVMTVPVIDGIDHKTFEYRPVYGEDRHFRGIFEWGMLYKENEVPQKE-LNTRKH 301
Query: 364 FTEPFKTPAMAGGLFAIDRAYFFHIGAYDEEMQVWGGENLEMSFRVWQCGGSIEIAPCSH 423
+EP+K+P AGGLFAI+R YF +GAYD + VWGGEN E+SF++WQCGGSIE PCS
Sbjct: 302 NSEPYKSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSR 361
Query: 424 VAHLFRKASPYSFPGGVSEVLYG-----NLARVALVWMDE-WAEFYFKFNPEAEKQRDKQ 477
V H++R PY+F G +++ G N RV W D+ + EF++ P A + D
Sbjct: 362 VGHVYRSFMPYNF-GKLAQKKKGPLITINYKRVIETWFDDKYKEFFYTREPMA-RFLDMG 419
Query: 478 AVRSRLELRKQLKCHSFKWYLTHV-------WPHHFLPMDDKFFGRIRHVQTHKCVEKPL 530
+ +L L+K+L C SF+W++ +V +P LP + FG +R V KC++
Sbjct: 420 DISEQLALKKRLNCKSFEWFMENVAYDVLDKYPE--LP-PNIHFGELRSVAVSKCLD--- 473
Query: 531 AKGSMNQASGPASLLPCTH 549
GP SL+ H
Sbjct: 474 -----TLGHGPPSLMAIQH 487
Score = 141 bits (356), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 75/153 (49%), Positives = 102/153 (66%), Gaps = 5/153 (3%)
Query: 46 SDAQQLYQINRFNLLVSDRIPVNRTLPDVRKPKCKTKVFNEEFLPKSSIVIVFHNEAWSA 105
+DA Q N+ SD I ++RT+ D R +CK + E LP +S++IVFHNE WS
Sbjct: 91 NDADQSESEYGMNVACSDEISLDRTILDTRLSECKHWNYPEN-LPSTSVIIVFHNEGWSV 149
Query: 106 LLRTVHSVISRSPRSMLKEILLVDDASTREFLKSSLDEYVAKLSVPTRVIRSPGRVGLIK 165
LLRTVHSVI+RSP +LKE+LLVDD S +E LK+ L+ Y+ + + R+IR+ R GLI+
Sbjct: 150 LLRTVHSVINRSPPKILKEVLLVDDFSDKENLKTRLETYIERFNGKVRLIRNAQREGLIR 209
Query: 166 ARLLGARQAEGEILVFLDAHCECTLVFNEEFLP 198
R GA++A GE++VFLDAHCE N +LP
Sbjct: 210 TRSRGAKEATGEVIVFLDAHCEV----NTNWLP 238
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.324 0.138 0.426
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 9,429,374,856
Number of Sequences: 23463169
Number of extensions: 387357667
Number of successful extensions: 1105547
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2081
Number of HSP's successfully gapped in prelim test: 1533
Number of HSP's that attempted gapping in prelim test: 1089151
Number of HSP's gapped (non-prelim): 7893
length of query: 605
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 456
effective length of database: 8,863,183,186
effective search space: 4041611532816
effective search space used: 4041611532816
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 80 (35.4 bits)